RE: [CSS21] out of range unicode escapes

Any data outside the range of valid Unicode is not defined. To be consistent with handling bad UTF-8, we should probably specify changing it into the replacement character.


-----Original Message-----
From: [] On Behalf Of Bjoern Hoehrmann
Sent: Friday, January 12, 2007 6:52 AM
Subject: [CSS21] out of range unicode escapes


  The current CSS 2.1 draft does not address handling of Unicode escapes
that appear to be above U+10FFFF like \FFFFFF. Such a sequence could be
interpreted as 5-digit escape followed by 'F', or be considered invalid,
or handled as if it was the replacement character \FFFD, or in other
ways. Implementations do not agree on how to handle this case.

Björn Höhrmann · ·
Weinh. Str. 22 · Telefon: +49(0)621/4309674 ·
68309 Mannheim · PGP Pub. KeyID: 0xA4357E78 · 

Received on Friday, 12 January 2007 15:35:29 UTC