Re: Non-SGML Char Refs

* Martin Duerst wrote:
>At 04:32 01/06/05 +0200, Bjoern Hoehrmann wrote:
>>* Thanasis Kinias wrote:
>> >The validator complains about "non-SGML character" references (e.g., “
>> >instead of the correct “) only when validating as XHTML.  That implies
>> >that “ and the other Microsoft characters from decimal 128-159 (hex
>> >80-9f) _are_ valid in HTML.
>>
>>They are, they just refer to non-printing control characters.

The other way round, valid XML, invalid HTML.

>  CHARSET
>           BASESET  "ISO Registration Number 177//CHARSET
>                     ISO/IEC 10646-1:1993 UCS-4 with
>                     implementation level 3//ESC 2/5 2/15 4/6"
>          DESCSET 0       9       UNUSED
>                  9       2       9
>                  11      2       UNUSED
>                  13      1       13
>                  14      18      UNUSED
>                  32      95      32
>                  127     1       UNUSED
>                  128     32      UNUSED
>                  160     55136   160
>                  55296   2048    UNUSED  -- SURROGATES --
>                  57344   1056768 57344

>Actually, these code positions are valid (though rather useless)
>in XML, but they are invalid in HTML. So I'm not sure what the
>result is for XHTML.

x'posted to www-html@w3.org. Are the as unused declared characters from
HTML 4.0 valid in XHTML 1.0?
-- 
Björn Höhrmann { mailto:bjoern@hoehrmann.de } http://www.bjoernsworld.de
am Badedeich 7 } Telefon: +49(0)4667/981028 { http://bjoern.hoehrmann.de
25899 Dagebüll { PGP Pub. KeyID: 0xA4357E78 } http://www.learn.to/quote/

Received on Sunday, 15 July 2001 19:08:16 UTC