- From: Martin Duerst <duerst@w3.org>
- Date: Fri, 18 Oct 2002 09:48:07 +0900
- To: Markus Scherer <markus.scherer@jtcsv.com>, charsets <ietf-charsets@iana.org>
At 09:12 02/10/17 -0700, Markus Scherer wrote: >Patrik F$BgM(Btstr$B‹N(B wrote: > >>What I hear on this list is that the consensus is that BOM SHOULD NOT be >>used. I would like it to be MUST NOT be used in Internet protocols, which >>leads to tagged UTF-8 text be illegal if the BOM exists in the text. > > >That would violate the Unicode standard. Hello Markus, Can you give the details of why and how (in terms e.g. of conformance clauses in the Unicode Standard)? Regards, Martin. >If UTF-8 is clearly indicated with some charset label, then an initial >sequence of ef bb bf must be interpreted as the character U+feff ZWNBSP. >Since that is not a very useful character at the beginning of a text, it >can usually be ignored. > >Personally, I find Fran$BmP(Bis' text very clear. It acknowledges existing, >reasonable and useful practice. > >Best regards, >markus > > >-- >Opinions expressed here may not reflect my company's positions unless >otherwise noted. >
Received on Thursday, 17 October 2002 21:46:47 UTC