W3C home > Mailing lists > Public > www-international@w3.org > April to June 2004

Re: Windows and Mac character encoding questions

From: by way of Martin Duerst <nobody@xyzzy.claranet.de>
Date: Wed, 07 Apr 2004 10:58:46 +0900
Message-Id: <4.2.0.58.J.20040407105829.037f43e8@localhost>
To: www-international@w3.org




Chris Lilley wrote:

 >>> the bytes 0x80..0x9F in iso-8859-1 are so rarely used
 >> ...they're even illegal in XML 1.0, aren't they ?

 > The characters at those positions in the UCS are illegal.
 > Bytes with those values in a given encoding are not illegal.

That's what I wanted to say, sorry for the confusion.  0x80 in
windows-1252 is an Euro, and U20AC is a legal Unicode character.

Ox80 in Latin-1 is a control character, it's the very same
control character in Unicode, and that's illegal in XML 1.0

All &#127; up to &#159; are illegal in XML 1.0   Bye, Frank
Received on Wednesday, 7 April 2004 00:00:23 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:03 GMT