- From: <Misha.Wolf@reuters.com>
- Date: Thu, 11 Apr 2002 19:37:19 +0100
- To: Markus Scherer <markus.scherer@jtcsv.com>
- Cc: ietf-charsets@iana.org
On 11/04/2002 18:45:56 Markus Scherer wrote:
> RFC 2279 still describes encodings for code points >U+10ffff.
> That should be removed. Among other changes, this results in "The octet values
> F5..FF never appear." (instead of "FE and FF", in the Intro)
>
> Why not just point to the definition in the Unicode Standard, Version 3.2?
That is a possibility. It never was before, as prior to
Unicode 3.2, the Unicode definition of UTF-8 was seriously
flawed, allowing irregular code unit sequences. On the
other hand, the definition of UTF-8 in Unicode 3.2 is made
up of amendments to existing text in Unicode 3.0, is it not?
That isn't a suitable format for a normative reference.
Misha
------------------------------------------------------------- ---
Visit our Internet site at http://www.reuters.com
Any views expressed in this message are those of the individual
sender, except where the sender specifically states them to be
the views of Reuters Ltd.
Received on Thursday, 11 April 2002 14:39:49 UTC