
[Bug 13392] i18n-ISSUE-72: BOM as preferred encoding declaration

From: <bugzilla@jessica.w3.org>
Date: Thu, 12 Jul 2012 11:32:56 +0000
Message-Id: <E1SpHdg-0005WJ-T6@jessica.w3.org>
To: public-html-bugzilla@w3.org
https://www.w3.org/Bugs/Public/show_bug.cgi?id=13392

--- Comment #24 from I18n Core WG <public-i18n-core@w3.org> 2012-07-12 11:32:56 UTC ---
There seems to be convergence on the proposal to change the first two
sub-bullets to say:

- By using the Byte Order Mark (BOM) character, which is an encoding
 signature that both XML and HTML parsers are required to support.
- By using <meta charset="UTF-8"/> (the HTML encoding declaration) and
 thus, for XML parsers, relying on XML's encoding default (see above).

I just offer an editorial suggestion. There is already a paragraph immediately
after the bulleted list that says that "The HTML encoding declaration has no
effect in XML. When the HTML encoding declaration is the only encoding
declaration, the encoding default from XML makes XML parsers treat content as
UTF-8."

So why not just say:

[[
- Within the document:
  - By using the Byte Order Mark (BOM) character
  - By using <meta charset="UTF-8"/> (the HTML encoding declaration)
- Outside the document 
  ...

Both XML and HTML parsers are required to support the byte order mark. The HTML
encoding declaration has no effect in XML. When the HTML encoding declaration
is the only encoding declaration, the encoding default from XML makes XML
parsers treat content as UTF-8. 
]]
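
To illustrate the behaviour the paragraph above describes, here is a minimal
sketch of how a consumer might pick an encoding from the first bytes of a
document. The function name sniff_encoding and the labels "bom" and
"xml-default" are hypothetical, chosen only for this example. It assumes there
is no XML declaration and no out-of-band information (such as an HTTP
Content-Type header), and it only looks for UTF-8 and UTF-16 byte order marks.

```python
import codecs


def sniff_encoding(data: bytes) -> tuple[str, str]:
    """Return (encoding, reason) for the start of a document.

    Hypothetical helper: assumes no XML declaration and no external
    encoding information. Only UTF-8 and UTF-16 BOMs are checked.
    """
    # A leading BOM is an encoding signature that both kinds of parser honour.
    if data.startswith(codecs.BOM_UTF8):
        return ("utf-8", "bom")
    if data.startswith(codecs.BOM_UTF16_LE):
        return ("utf-16-le", "bom")
    if data.startswith(codecs.BOM_UTF16_BE):
        return ("utf-16-be", "bom")
    # No BOM: <meta charset="UTF-8"/> has no effect in XML, so an XML
    # parser treats the content as UTF-8 by default.
    return ("utf-8", "xml-default")


if __name__ == "__main__":
    with_bom = codecs.BOM_UTF8 + b"<!DOCTYPE html><html/>"
    meta_only = b'<html><head><meta charset="UTF-8"/></head></html>'
    print(sniff_encoding(with_bom))
    print(sniff_encoding(meta_only))
```

Both inputs come out as UTF-8, but for different reasons: the first because of
the signature, the second only because of the XML default.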

-- 
Configure bugmail: https://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Thursday, 12 July 2012 11:33:06 GMT
