Re: XHTML validator doesn't completely support Unicode

From: Bertilo Wennergren (bertilow@bertilo.se.fm)
Date: Sun, Apr 29 2001

    Date: Sun, 29 Apr 2001 07:38:34 -0400 (EDT)
    Message-ID: <01ab01c0d0a1$fa3281e0$536859d5@chello.se>
    From: "Bertilo Wennergren" <bertilow@bertilo.se.fm>
    To: <psheerin@cmp.com>, <www-validator@w3.org>
    Subject: Re: XHTML validator doesn't completely support Unicode
    
    Peter Sheerin:
    
    > Take a look at http://www.petesguide.com/style/index.html, and then
    > follow the icon link to the validator, and watch what it reports. The
    > text file is encoded in UTF-8, and uses the DOS end of line
    > conventions, but has the Unicode string "U+FEFF" as the first character.
    
    Are you sure it's the end-of-line characters that cause the problem?
    
    I'd guess it's the BOM (U+FEFF) that's the culprit. A BOM is rarely
    used in UTF-8 files, and some even say it's not allowed there. It's
    certainly not necessary: UTF-8 has no byte-order ambiguity to resolve.
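
    One way to test that guess is to look at the first bytes of the file.
    In UTF-8, U+FEFF is encoded as the three bytes EF BB BF. The following
    is only a rough Python sketch, not anything the validator is known to
    do; the function name strip_utf8_bom and the sample bytes are made up
    for illustration.

```python
import codecs


def strip_utf8_bom(data: bytes) -> tuple:
    """Return (data without a leading UTF-8 BOM, True if a BOM was found)."""
    # codecs.BOM_UTF8 is b"\xef\xbb\xbf", i.e. U+FEFF encoded as UTF-8.
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):], True
    return data, False


# Hypothetical sample: a file saved with a BOM and DOS line endings.
sample = codecs.BOM_UTF8 + b'<?xml version="1.0"?>\r\n<html></html>\r\n'
cleaned, had_bom = strip_utf8_bom(sample)
```

    If had_bom is true for the real file, re-uploading the cleaned bytes
    (DOS line endings left as they are) would show whether the BOM or the
    line endings are what the validator objects to.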
    
    -- 
    #####################################################################
                             Bertilo Wennergren
                     <http://purl.oclc.org/net/bertilo>
                            <bertilow@bertilo.se.fm>
    #####################################################################