Re: HTML5 and Unicode Normalization Form C

Andreas Prilop, Fri, 27 May 2011 16:35:11 +0200 (CEST):

> The HTML5 validator does not complain about charset=ISO-8859-15.
> Are you going to tell us that ISO-8859-15 is "better" than
> non-NFC Unicode?

The HTML5 validator could very well show a warning when a document 
uses ISO-8859-15. HTML5 explicitly allows such a warning. Mike, would 
you add such a warning? Consider it a feature request on my part!

This is what HTML5 says: [1]

]]
Authors are encouraged to use UTF-8. Conformance checkers may advise 
authors against using legacy encodings. [RFC3629]

Authoring tools should default to using UTF-8 for newly-created 
documents. [RFC3629]
  [ snip ]
Using non-UTF-8 encodings can have unexpected results on form 
submission and URL encodings, which use the document's character 
encoding by default.
[[
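
To illustrate the last quoted paragraph, here is a minimal Python 
sketch (mine, not from the spec) of how one character ends up as 
different bytes in a URL depending on the document's encoding. The 
euro sign as the sample character and the use of urllib.parse as a 
stand-in for what a browser does are my assumptions for the example.

```python
from urllib.parse import quote, unquote

# U+20AC EURO SIGN exists in both ISO-8859-15 and UTF-8,
# but the two encodings represent it with different bytes.
text = "€"

# Percent-encode the character as a page in each encoding would.
as_latin9 = quote(text, encoding="iso-8859-15")
as_utf8 = quote(text, encoding="utf-8")

print(as_latin9)  # %A4
print(as_utf8)    # %E2%82%AC

# A receiver that assumes UTF-8 cannot decode the ISO-8859-15 byte;
# with errors="replace" it becomes U+FFFD REPLACEMENT CHARACTER.
misread = unquote(as_latin9, encoding="utf-8", errors="replace")
print(misread == "\ufffd")  # True
```

So a server that expects UTF-8 would not get the euro sign back from 
a link or form on an ISO-8859-15 page unless it knows the page's 
encoding.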

[1] http://www.w3.org/TR/html5/semantics.html#charset
-- 
Leif Halvard Silli

Received on Sunday, 29 May 2011 16:54:21 UTC