Re: Non-SGML Char Refs

From: Bjoern Hoehrmann (derhoermi@gmx.net)
Date: Sun, Jul 15 2001

  • Next message: Bjoern Hoehrmann: "Re: CSS/XHTML Validators weirdness"

    From: Bjoern Hoehrmann <derhoermi@gmx.net>
    To: Martin Duerst <duerst@w3.org>
    Cc: tkinias@optimalco.com, "'www-validator@w3.org'" <www-validator@w3.org>, www-html@w3.org
    Date: Mon, 16 Jul 2001 01:07:11 +0200
    Message-ID: <ug74lto5khroivak7p8k25evra2fcdte0e@4ax.com>
    Subject: Re: Non-SGML Char Refs
    
    * Martin Duerst wrote:
    >At 04:32 01/06/05 +0200, Bjoern Hoehrmann wrote:
    >>* Thanasis Kinias wrote:
    >> >The validator complains about "non-SGML character" references (e.g., &#147;
    >> >instead of the correct &#8220;) only when validating as XHTML.  That implies
    >> >that &#147; and the other Microsoft characters from decimal 128-159 (hex
    >> >80-9f) _are_ valid in HTML.
    >>
    >>They are, they just refer to non-printing control characters.
    
    The other way round, valid XML, invalid HTML.
    
    >  CHARSET
    >           BASESET  "ISO Registration Number 177//CHARSET
    >                     ISO/IEC 10646-1:1993 UCS-4 with
    >                     implementation level 3//ESC 2/5 2/15 4/6"
    >          DESCSET 0       9       UNUSED
    >                  9       2       9
    >                  11      2       UNUSED
    >                  13      1       13
    >                  14      18      UNUSED
    >                  32      95      32
    >                  127     1       UNUSED
    >                  128     32      UNUSED
    >                  160     55136   160
    >                  55296   2048    UNUSED  -- SURROGATES --
    >                  57344   1056768 57344
    
    >Actually, these code positions are valid (though rather useless)
    >in XML, but they are invalid in HTML. So I'm not sure what the
    >result is for XHTML.
    
    x'posted to www-html@w3.org. Are the as unused declared characters from
    HTML 4.0 valid in XHTML 1.0?
    -- 
    Björn Höhrmann { mailto:bjoern@hoehrmann.de } http://www.bjoernsworld.de
    am Badedeich 7 } Telefon: +49(0)4667/981028 { http://bjoern.hoehrmann.de
    25899 Dagebüll { PGP Pub. KeyID: 0xA4357E78 } http://www.learn.to/quote/