Re: XHTML validator doesn't completely support Unicode

From: Bertilo Wennergren <bertilow@bertilo.se.fm>
Date: Sun, 29 Apr 2001 07:38:34 -0400 (EDT)
Message-ID: <01ab01c0d0a1$fa3281e0$536859d5@chello.se>
To: <psheerin@cmp.com>, <www-validator@w3.org>
Peter Sheerin:

> Take a look at http://www.petesguide.com/style/index.html, and then
> follow the icon link to the validator, and watch what it reports. The
> text file is encoded in UTF-8, and uses the DOS end of line
> conventions, but has the Unicode string "U+FEFF" as the first character.

Are you sure it's the end of line characters that give the problem?

I'd guess it's the BOM (U+FEFF, which UTF-8 encodes as the bytes
EF BB BF) that's the culprit. A BOM is not very common in UTF-8 files,
and some even say it's not allowed there. It's certainly not necessary:
UTF-8 has no byte-order ambiguity for a BOM to resolve.
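
If you want to check whether the file really starts with a BOM, and
strip it before uploading, something like the following Python sketch
should do. The function name strip_utf8_bom and the sample bytes are
made up for illustration; only codecs.BOM_UTF8 comes from the standard
library. I'm assuming the file is small enough to read into memory.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (EF BB BF), if present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


# Hypothetical sample: a BOM followed by a line with DOS (CRLF) line ending.
sample = codecs.BOM_UTF8 + b"<html>\r\n"
cleaned = strip_utf8_bom(sample)
```

Note that this leaves the DOS line endings alone, so if the validator
accepts the cleaned file, the line endings were not the problem.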

-- 
#####################################################################
                         Bertilo Wennergren
                 <http://purl.oclc.org/net/bertilo>
                        <bertilow@bertilo.se.fm>
#####################################################################
Received on Sunday, 29 April 2001 21:18:05 GMT