W3C home > Mailing lists > Public > public-html-bugzilla@w3.org > March 2011

[Bug 12062] UTF-8 BOM should not be forbidden in Polyglot Markup

From: <bugzilla@jessica.w3.org>
Date: Thu, 03 Mar 2011 18:29:44 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1PvDHU-0003fe-BQ@jessica.w3.org>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=12062

Eliot Graff <eliotgra@microsoft.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|REOPENED                    |RESOLVED
         Resolution|                            |FIXED

--- Comment #5 from Eliot Graff <eliotgra@microsoft.com> 2011-03-03 18:29:43 UTC ---
Thanks, Leif.

The Editor's Draft of 3 March now says:

]]
Polyglot markup uses either UTF-8 or UTF-16. UTF-8 is preferred. When polyglot
markup uses UTF-16, it includes the BOM indicating little-endian UTF-16 or
big-endian UTF-16.

Polyglot markup declares character encoding in the following ways, which may be
used separately or in combination (if used in combination, each approach
contains identical encoding information):

    * Within the document
          o By using the BOM of the specific encoding.
          o By relying on UTF-8 as the encoding default of XML, used in
combination with the HTML <meta charset="UTF-8"/> element.
    * In the HTTP header of the response [HTTP11], as in the following: 
[[

I believe this takes care of the points you raised.

As they say, the devil is in the details. And if English is funny, it's
hysterical when I get to it. :-)

Thanks for your help.

Eliot

-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Thursday, 3 March 2011 18:29:46 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 3 March 2011 18:29:47 GMT