W3C home > Mailing lists > Public > public-xml-core-wg@w3.org > March 2005

Re: Minutes for XML Core WG telcon of 2005 March 9

From: François Yergeau <francois@yergeau.com>
Date: Wed, 09 Mar 2005 18:22:03 -0800
To: Paul Grosso <pgrosso@arbortext.com>
Cc: public-xml-core-wg@w3.org
Message-id: <422FAF4B.1040002@yergeau.com>

Paul Grosso a écrit :
> JohnC noted:
> 
> In PEX6: whether or not an initial U+FFEF is part of a 
> text/plain document (or in our case, a document which is 
> being treated as text/plain) depends on the character encoding.  
> If it's UTF-{8,16,32}, then no, it's a BOM and should be 
> discarded.  If it's UTF-{16,32}{LE,BE}, then yes, it's a
> ZWNBSP character and should be kept.

FWIW, I agree with John that here the initial U+FEFF should be 
considered a BOM and discarded, but I find some confusion in the above. 
  In the case presented in PEX6, it has been determined (through 
defaulting) that the encoding is UTF-8, so all the rest about 
UTF-{16,32} and UTF-{16,32}{LE,BE} is non sequitur.

-- 
François
Received on Thursday, 10 March 2005 02:22:05 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 8 January 2008 14:21:32 GMT