- From: François Yergeau <francois@yergeau.com>
- Date: Wed, 09 Mar 2005 18:22:03 -0800
- To: Paul Grosso <pgrosso@arbortext.com>
- Cc: public-xml-core-wg@w3.org
Paul Grosso a écrit : > JohnC noted: > > In PEX6: whether or not an initial U+FFEF is part of a > text/plain document (or in our case, a document which is > being treated as text/plain) depends on the character encoding. > If it's UTF-{8,16,32}, then no, it's a BOM and should be > discarded. If it's UTF-{16,32}{LE,BE}, then yes, it's a > ZWNBSP character and should be kept. FWIW, I agree with John that here the initial U+FEFF should be considered a BOM and discarded, but I find some confusion in the above. In the case presented in PEX6, it has been determined (through defaulting) that the encoding is UTF-8, so all the rest about UTF-{16,32} and UTF-{16,32}{LE,BE} is non sequitur. -- François
Received on Thursday, 10 March 2005 02:22:05 UTC