* Sigurd Lerstad wrote:
>In an XML file that says utf-8 in the xml declaration. There could be 4 byte
>characters later in the file. How should those be treated to convert them to
>utf-16?

Just like any other sequence. U+10000 is F0 90 80 80 in UTF-8 and D8 00 DC 00 or 00 D8 00 DC (depending on byte order) in UTF-16.

>Is there some spec which says what to do?

Unicode.

Received on Saturday, 26 July 2003 17:02:17 GMT
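For illustration, the U+10000 example from the reply can be checked with a few lines of Python. This is only a sketch: the helper name `utf16_code_units` is made up for this example, and the surrogate arithmetic is the standard UTF-16 rule as I understand it from the Unicode Standard, not something quoted from the thread.

```python
def utf16_code_units(code_point):
    """Return the UTF-16 code units for one Unicode scalar value.

    Hypothetical helper for illustration only.
    """
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if code_point < 0x10000:
        # Characters in the Basic Multilingual Plane take a single code unit.
        return [code_point]
    # Supplementary characters (4 bytes in UTF-8) become a surrogate pair.
    offset = code_point - 0x10000
    high = 0xD800 + (offset >> 10)    # top 10 bits of the offset
    low = 0xDC00 + (offset & 0x3FF)   # bottom 10 bits of the offset
    return [high, low]


# The 4-byte UTF-8 sequence from the reply decodes to U+10000.
four_bytes = bytes.fromhex("F0908080")
char = four_bytes.decode("utf-8")

units = utf16_code_units(ord(char))

# Python's built-in codecs give the two byte orders directly.
utf16_be = char.encode("utf-16-be")
utf16_le = char.encode("utf-16-le")
```

The decode step is the same for a 4-byte sequence as for any shorter one; only the UTF-16 side differs, because code points at or above U+10000 need two code units instead of one.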