W3C home > Mailing lists > Public > www-international@w3.org > January to March 2000

RE: charset of XML (was RE: Localization of XML)

From: Langer, Paul <Paul.Langer@softwareag.com>
Date: Wed, 9 Feb 2000 23:09:50 +0100
Message-ID: <2339B88D6AA6D31187A80008C7E6F6722D9097@daemsg01.software-ag.de>
To: "'www-international@w3.org'" <www-international@w3.org>
-----Original Message-----
From: Nir Dagan [mailto:nir@nirdagan.com]
Sent: Mittwoch, 9. Februar 2000 18:47
Subject: charset of XML (was RE: Localization of XML)

<disclaimer>
I am not arguing that the encoding declaration in the XML
declaration should be omitted or wrong, if shipping XML
via HTTP.
At Software AG we try to be good citizens and always send
both, the charset parameter of the media type and the encoding
declaration in the XML declaration (we even try to keep both
identical and correct).
</disclaimer>

> I'm afraid that the very same reference (4.3.3 of XML1.0 spec.) says:
> 
> "Parsed entities which are stored in an encoding other 
> than UTF-8 or UTF-16 must begin with a text declaration 
> containing an encoding declaration:..."

Please note "stored". Things are different on the wire. HTTP
provides content negotiation. Servers may choose to transcode
the body of requests to meet desired formats/encodings/languages
and whatsoever. 
We are talking about shipping XHTML with the media type "text/xml".
See "XML Media Types" (http://www.ietf.org/rfc/rfc2376.txt) for
details.

All the best,
Paul

--------------------------------------------------------------------
Paul Langer                      E-mail   Paul.Langer@softwareag.com
Software AG                      Tel.     +49-6151-92-1912
Uhlandstr. 12                    Fax      +49-6151-92-1613
D-64297 Darmstadt
Received on Wednesday, 9 February 2000 17:10:04 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:55 GMT