Re: utf-8

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Fri, 25 Jul 2003 16:55:22 +0200
To: "Sigurd Lerstad" <sigler@bredband.no>
Cc: <www-svg@w3.org>
Message-ID: <3f294495.254862793@smtp.bjoern.hoehrmann.de>

* Sigurd Lerstad wrote:
>DOM is always 2 bytes, what happens in an utf-8 file when you encounter a
>character that uses 4 bytes (UCS-4), just ignore the two last bytes?

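Nothing is ignored. Characters above U+FFFF are encoded in UTF-16 as a
surrogate pair, i.e. two 16-bit code units, so such a character simply
occupies two units in a DOMString.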
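
To make the surrogate mechanism concrete, here is a minimal sketch of the arithmetic for one supplementary character. The helper name to_surrogate_pair and the choice of U+1D11E as the sample are illustrative assumptions, not anything taken from the DOM or SVG specifications.

```python
def to_surrogate_pair(code_point):
    """Split a code point above U+FFFF into (high, low) UTF-16 surrogates.

    Hypothetical helper for illustration only.
    """
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("only U+10000..U+10FFFF are encoded with surrogates")
    offset = code_point - 0x10000        # leaves a 20-bit value
    high = 0xD800 + (offset >> 10)       # top 10 bits -> high surrogate
    low = 0xDC00 + (offset & 0x3FF)      # bottom 10 bits -> low surrogate
    return high, low


# U+1D11E (MUSICAL SYMBOL G CLEF) lies above U+FFFF.
ch = "\U0001D11E"

utf8_len = len(ch.encode("utf-8"))             # bytes used in UTF-8
utf16_units = len(ch.encode("utf-16-be")) // 2  # 16-bit units used in UTF-16
high, low = to_surrogate_pair(ord(ch))

print(utf8_len, utf16_units, hex(high), hex(low))
```

The four UTF-8 bytes decode to a single code point, which is then re-encoded as two 16-bit units; no bits are dropped along the way.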
Received on Friday, 25 July 2003 10:55:46 GMT
