Re: UTC Agenda Item: Recommendations for handling ill-formed sequences

�istein E. Andersen (quoted by Mark Davis) scripsit:

> One notable difference is that overlong sequences as well as UTF-8
> sequences representing surrogates and characters outside Unicode
> (>10FFFF) will typically map to several replacement characters according
> to your proposal, but to only one in Markus Kuhn's system

I agree that overlong sequences, surrogates, and old-10646 sequences
should become a single FFFD.

-- 
The first thing you learn in a lawin' family    John Cowan
is that there ain't no definite answers         cowan@ccil.org
to anything.  --Calpurnia in To Kill A Mockingbird

Received on Friday, 11 April 2008 23:13:10 UTC