At 9:08 PM +0900 9/21/03, MURATA Makoto wrote: >UTF-8 has its own technical problems (the Unicode signature, representation >of non-BMP characters, etc.). By Unicode signature, I'm guessing you mean the BOM? That problem seems to have been easily dealt with by simply deciding to allow it in UTF-8. It doesn't appear to have caused any problems in practice today. I don't know what you problems you refer to with "representation of non-BMP characters". UTF-8 precisely specifies how these characters are represented. There's no issue here. Did you mean something else? -- Elliotte Rusty Harold elharo@metalab.unc.edu Processing XML with Java (Addison-Wesley, 2002) http://www.cafeconleche.org/books/xmljava http://www.amazon.com/exec/obidos/ISBN%3D0201771861/cafeaulaitAReceived on Sunday, 21 September 2003 10:08:38 GMT
This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 7 December 2009 10:55:49 GMT