- From: <Misha.Wolf@reuters.com>
- Date: Thu, 11 Apr 2002 19:37:19 +0100
- To: Markus Scherer <markus.scherer@jtcsv.com>
- Cc: ietf-charsets@iana.org
On 11/04/2002 18:45:56 Markus Scherer wrote: > RFC 2279 still describes encodings for code points >U+10ffff. > That should be removed. Among other changes, this results in "The octet values > F5..FF never appear." (instead of "FE and FF", in the Intro) > > Why not just point to the definition in the Unicode Standard, Version 3.2? That is a possibility. It never was before, as prior to Unicode 3.2, the Unicode definition of UTF-8 was seriously flawed, allowing irregular code unit sequences. On the other hand, the definition of UTF-8 in Unicode 3.2 is made up of amendments to existing text in Unicode 3.0, is it not? That isn't a suitable format for a normative reference. Misha ------------------------------------------------------------- --- Visit our Internet site at http://www.reuters.com Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Reuters Ltd.
Received on Thursday, 11 April 2002 14:39:49 UTC