Re: Encoding Requested Even When XML Decl. Is Omitted

Masayasu Ishikawa <mimasa@w3.org> wrote in
news:20020304.113022.07635268.mimasa@w3.org:

> According to RFC 3023, if an entity
> is received with the charset parameter omitted, the default
> charset value is "us-ascii" in the case of "text/xml",

And this applies even to XML documents with the encoding set in
the XML declaration. For example, if this XHTML document is
sent as 'text/xml' over HTTP with no charset parameter:

<?xml version="1.0" encoding="iso-8859-1"?>
<!DOCTYPE html
  PUBLIC "-//W3C//DTD XHTML 1.1//EN"
"http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-16" />
...
</html>

The character encoding is *neither* 'iso-8859-1' nor 'utf-16',
but 'us-ascii'. (This will no doubt lead to many
interoperability problems.)
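
To make the rule concrete, here is a rough Python sketch of how a
receiver might pick the encoding. The function name and the
'declared_encoding' argument are my own inventions for illustration,
not anything from RFC 3023, and the fallback for media types other
than 'text/xml' is a simplification (a real processor would also look
at the BOM and default to UTF-8).

```python
from email.message import Message


def rfc3023_charset(content_type, declared_encoding=None):
    """Return the encoding a receiver should assume (illustrative sketch).

    content_type      -- value of the Content-Type header
    declared_encoding -- encoding from the XML declaration, if any
                         (hypothetical parameter, for illustration only)
    """
    # Let the standard library parse the header and its parameters.
    msg = Message()
    msg["Content-Type"] = content_type
    media_type = msg.get_content_type()
    charset = msg.get_param("charset")

    # An explicit charset parameter is authoritative.
    if charset is not None:
        return str(charset).lower()

    # text/xml with no charset parameter: the default is us-ascii,
    # whatever the XML declaration or a meta element says.
    if media_type == "text/xml":
        return "us-ascii"

    # Simplified assumption for other types: fall back to the declaration.
    return declared_encoding.lower() if declared_encoding else None


# The document above, sent as plain 'text/xml':
print(rfc3023_charset("text/xml", "iso-8859-1"))
# The same document with the charset parameter supplied:
print(rfc3023_charset("text/xml; charset=iso-8859-1", "iso-8859-1"))
```

The first call ignores the 'iso-8859-1' declaration entirely, which
is exactly the surprising part.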

-- 
Karl Ove Hufthammer

Received on Monday, 4 March 2002 07:12:22 UTC