Re: utf-8

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Sat, 26 Jul 2003 23:02:00 +0200
To: "Sigurd Lerstad" <sigler@bredband.no>
Cc: <www-svg@w3.org>
Message-ID: <3f3aea73.362859975@smtp.bjoern.hoehrmann.de>

* Sigurd Lerstad wrote:
>In an XML file that says utf-8 in the XML declaration, there could be 4-byte
>characters later in the file. How should those be treated to convert them to
>UTF-16?

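Just like any other sequence: decode the bytes to a code point, then encode
that code point in UTF-16. Code points above U+FFFF become a surrogate pair.
U+10000 is F0 90 80 80 in UTF-8 and D8 00 DC 00 (big-endian) or 00 D8 00 DC
(little-endian) in UTF-16.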
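
To make the arithmetic concrete, here is a minimal Python sketch of that
conversion for a single 4-byte sequence. The function names are made up for
illustration, and it handles only the 4-byte case; a real decoder would also
cover 1- to 3-byte sequences and reject more malformed input.

```python
def utf8_4byte_to_code_point(b):
    """Decode one 4-byte UTF-8 sequence: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx."""
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("not a 4-byte UTF-8 lead byte")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("bad continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    cp = ((b[0] & 0x07) << 18) | ((b[1] & 0x3F) << 12) \
        | ((b[2] & 0x3F) << 6) | (b[3] & 0x3F)
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong encoding or out of Unicode range")
    return cp


def code_point_to_surrogates(cp):
    """Split a code point above U+FFFF into a (high, low) surrogate pair."""
    v = cp - 0x10000                      # 20-bit value
    return 0xD800 | (v >> 10), 0xDC00 | (v & 0x3FF)


def surrogates_to_bytes(high, low, byteorder):
    """Serialize the pair; byteorder is 'big' or 'little'."""
    return high.to_bytes(2, byteorder) + low.to_bytes(2, byteorder)


cp = utf8_4byte_to_code_point(bytes([0xF0, 0x90, 0x80, 0x80]))
high, low = code_point_to_surrogates(cp)
print(hex(cp), hex(high), hex(low))
print(surrogates_to_bytes(high, low, "big").hex(" "))
print(surrogates_to_bytes(high, low, "little").hex(" "))
```

In practice Python's built-in codecs do the same job, e.g.
data.decode("utf-8").encode("utf-16-be"); the manual version is only there to
show the bit manipulation.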

>Is there some spec which says what to do?

The Unicode Standard; it defines both UTF-8 and UTF-16.
Received on Saturday, 26 July 2003 17:02:17 GMT