Re: UTC Agenda Item: Recommendations for handling ill-formed sequences

Øistein E. Andersen (quoted by Mark Davis) scripsit:

> One notable difference is that overlong sequences as well as UTF-8
> sequences representing surrogates and characters outside Unicode
> (>10FFFF) will typically map to several replacement characters according
> to your proposal, but to only one in Markus Kuhn's system

I agree that overlong sequences, surrogates, and old-10646 sequences
(those encoding values above 10FFFF) should each become a single FFFD.
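
To make the policy concrete, here is a minimal sketch of a decoder that
behaves this way. It is an illustration under stated assumptions, not
the text of either proposal: the names decode_lenient and
sequence_length are hypothetical, lead bytes are classified by the old
ISO 10646 scheme (up to six bytes), and a truncated sequence or a stray
byte also yields one FFFD, which is my assumption rather than something
settled in this thread.

```python
REPLACEMENT = "\ufffd"

# Smallest code point that legitimately needs each sequence length;
# anything smaller encoded at that length is overlong.
MIN_FOR_LENGTH = {2: 0x80, 3: 0x800, 4: 0x10000, 5: 0x200000, 6: 0x4000000}


def sequence_length(lead):
    """Length implied by a lead byte (old ISO 10646 scheme), or 0 if none."""
    if lead < 0x80:
        return 1
    if 0xC0 <= lead <= 0xDF:
        return 2
    if 0xE0 <= lead <= 0xEF:
        return 3
    if 0xF0 <= lead <= 0xF7:
        return 4
    if 0xF8 <= lead <= 0xFB:
        return 5
    if 0xFC <= lead <= 0xFD:
        return 6
    return 0  # continuation byte (80..BF) or FE/FF


def decode_lenient(data):
    """Decode bytes, emitting one U+FFFD per ill-formed sequence."""
    out = []
    i = 0
    while i < len(data):
        lead = data[i]
        n = sequence_length(lead)
        if n == 1:
            out.append(chr(lead))
            i += 1
            continue
        if n == 0:
            # Stray continuation byte or FE/FF (assumption: one FFFD each).
            out.append(REPLACEMENT)
            i += 1
            continue
        # Keep the payload bits of the lead byte, then gather up to
        # n - 1 continuation bytes.
        cp = lead & (0x7F >> n)
        j = i + 1
        while j < i + n and j < len(data) and 0x80 <= data[j] <= 0xBF:
            cp = (cp << 6) | (data[j] & 0x3F)
            j += 1
        ill_formed = (
            j != i + n                  # truncated sequence
            or cp < MIN_FOR_LENGTH[n]   # overlong
            or 0xD800 <= cp <= 0xDFFF   # surrogate
            or cp > 0x10FFFF            # outside Unicode (old 10646)
        )
        # The whole sequence consumed so far maps to a single FFFD.
        out.append(REPLACEMENT if ill_formed else chr(cp))
        i = j
    return "".join(out)
```

With this sketch, the overlong C0 AF, the surrogate ED A0 80, and the
out-of-range F4 90 80 80 each decode to exactly one FFFD, while
well-formed input passes through unchanged.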

Received on Friday, 11 April 2008 23:13:10 UTC