- From: Martin Duerst <duerst@w3.org>
- Date: Tue, 05 Jun 2001 17:02:27 +0900
- To: Bjoern Hoehrmann <derhoermi@gmx.net>, tkinias@optimalco.com
- Cc: "'www-validator@w3.org'" <www-validator@w3.org>
At 04:32 01/06/05 +0200, Bjoern Hoehrmann wrote: >* Thanasis Kinias wrote: > >The validator complains about "non-SGML character" references (e.g., “ > >instead of the correct “) only when validating as XHTML. That implies > >that “ and the other Microsoft characters from decimal 128-159 (hex > >80-9f) _are_ valid in HTML. > >They are, they just refer to non-printing control characters. No, sorry, they are not. See http://www.w3.org/TR/REC-html40/sgml/sgmldecl.html CHARSET BASESET "ISO Registration Number 177//CHARSET ISO/IEC 10646-1:1993 UCS-4 with implementation level 3//ESC 2/5 2/15 4/6" DESCSET 0 9 UNUSED 9 2 9 11 2 UNUSED 13 1 13 14 18 UNUSED 32 95 32 127 1 UNUSED 128 32 UNUSED 160 55136 160 55296 2048 UNUSED -- SURROGATES -- 57344 1056768 57344 The line "128 32 UNUSED" excludes them, or doesn't it? Actually, these code positions are valid (though rather useless) in XML, but they are invalid in HTML. So I'm not sure what the result is for XHTML. Regards, Martin.
Received on Tuesday, 5 June 2001 04:03:05 UTC