- From: <bugzilla@jessica.w3.org>
- Date: Thu, 03 Mar 2011 01:48:44 +0000
- To: public-html-bugzilla@w3.org
http://www.w3.org/Bugs/Public/show_bug.cgi?id=12062
Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|RESOLVED |REOPENED
Resolution|FIXED |
--- Comment #2 from Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no> 2011-03-03 01:48:42 UTC ---
(In reply to comment #1) Some nitty gritties.
FIRSTLY: In the *below* quote, inside the parenthesis, please change the
upper-case "I" (If used ...") to lowercase:
]]
> Polyglot markup declares character encoding in the following ways, which may be
> used separately or in combination (If used in combination, each approach
> contains identical encoding information):
[[
SECONDLY, in the *below* quote, you changed my suggested wording from " XML
UTF-8 encoding default" to "default XML UTF-8 encoding". I don't know if it is
really correct to say that UTF-8 is _the_ default encoding of XML? My
intention was to say that UTF-8 is _an_ encoding default - one of two - the
other being UTF-16. (In my proposal, "XML" is an adjective - think "XML-ish",
or "of XML" or "XML's".)
> ◦By relying on the default XML UTF-8 encoding in combination with the
> use of the <meta charset="UTF-8"/> element.
It is true that XML says that if a document *DOES NOT* have an encoding
declaration (internal or external) and also does not have an encoding signature
(aka BOM), then the document *MUST* be in the UTF-8 encoding - see section
'4.3.3 Character Encoding in Entities' of XML 1.0.) From that angle it seems
correct that UTF-8 has something to do with "the default"
* But I still think that my wording was better. Feel free to go back and use
it.
* Or else, I suggest to use the follwing formulation instead, where I use the
word 'autodetection':
]] By relying on XML's autodection of the UTF-8 encoding, in combination with
the HTML <meta charset="UTF-8"/> encoding declaration. [[
Justification for the 'autodetection' variant:
* XML 1.0 has an entire section about 'autodetection of character encodings':
http://www.w3.org/TR/REC-xml/#sec-guessing
* in contrast, the word 'default' only occurs once in relation to encoding:
http://www.w3.org/TR/REC-xml/#charencoding
* the autodetection section relates to 'default' by the 'other' option in the
"Without a Byte Order Mark" table. Quot:
]] Other UTF-8 without an encoding declaration, or else the data
stream is mislabeled
(lacking a required encoding declaration), corrupt, fragmentary, or
enclosed in a wrapper
of some kind [[
Sidenote: In Norwegian, 'default' and 'automatic' are often synonyms.
--
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Thursday, 3 March 2011 01:48:46 UTC