W3C home > Mailing lists > Public > public-html-bugzilla@w3.org > March 2011

[Bug 12062] UTF-8 BOM should not be forbidden in Polyglot Markup

From: <bugzilla@jessica.w3.org>
Date: Thu, 03 Mar 2011 18:29:44 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1PvDHU-0003fe-BQ@jessica.w3.org>

Eliot Graff <eliotgra@microsoft.com> changed:

           What    |Removed                     |Added
             Status|REOPENED                    |RESOLVED
         Resolution|                            |FIXED

--- Comment #5 from Eliot Graff <eliotgra@microsoft.com> 2011-03-03 18:29:43 UTC ---
Thanks, Leif.

The Editor's Draft of 3 March now says:

Polyglot markup uses either UTF-8 or UTF-16. UTF-8 is preferred. When polyglot
markup uses UTF-16, it includes the BOM indicating little-endian UTF-16 or
big-endian UTF-16.

Polyglot markup declares character encoding in the following ways, which may be
used separately or in combination (if used in combination, each approach
contains identical encoding information):

    * Within the document
          o By using the BOM of the specific encoding.
          o By relying on UTF-8 as the encoding default of XML, used in
combination with the HTML <meta charset="UTF-8"/> element.
    * In the HTTP header of the response [HTTP11], as in the following: 

I believe this takes care of the points you raised.

As they say, the devil is in the details. And if English is funny, it's
hysterical when I get to it. :-)

Thanks for your help.


Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Thursday, 3 March 2011 18:29:46 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 16:31:06 UTC