Re: RFC 2279 (UTF-8) to Full Standard from Misha.Wolf@reuters.com on 2002-04-11 (ietf-charsets@w3.org from April to June 2002)

From: <Misha.Wolf@reuters.com>
Date: Thu, 11 Apr 2002 19:37:19 +0100
To: Markus Scherer <markus.scherer@jtcsv.com>
Cc: ietf-charsets@iana.org
Message-id: <T5a32f4b1d2c407b7074ec@reuters.com>

On 11/04/2002 18:45:56 Markus Scherer wrote:
> RFC 2279 still describes encodings for code points >U+10ffff.
> That should be removed. Among other changes, this results in "The octet values
> F5..FF never appear." (instead of "FE and FF", in the Intro)
>
> Why not just point to the definition in the Unicode Standard, Version 3.2?
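
The octet claim quoted above can be checked mechanically. The following is a minimal sketch, not text from RFC 2279 or the Unicode Standard: it assumes the U+10FFFF ceiling proposed in the quote and relies on Python's built-in UTF-8 encoder, which already enforces that ceiling. The names `MAX_CODE_POINT` and `octets_used` are illustrative only.

```python
# Assumption: the highest code point is U+10FFFF, as proposed in the quote.
MAX_CODE_POINT = 0x10FFFF


def octets_used(limit=MAX_CODE_POINT):
    """Collect every octet value that occurs in UTF-8 for code points 0..limit."""
    used = set()
    for cp in range(limit + 1):
        if 0xD800 <= cp <= 0xDFFF:
            continue  # surrogate code points have no UTF-8 encoding of their own
        used.update(chr(cp).encode("utf-8"))
    return used


# Octet values that never occur in any encoded sequence.
unused = sorted(set(range(256)) - octets_used())
print(" ".join(f"{octet:02X}" for octet in unused))
```

Under these assumptions the unused set contains F5..FF, matching the quoted sentence. It also contains C0 and C1, which could only begin non-shortest forms of ASCII characters.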

That is a possibility.  It was not an option before, because
prior to Unicode 3.2 the Unicode definition of UTF-8 was
seriously flawed, allowing irregular code unit sequences.  On the
other hand, the definition of UTF-8 in Unicode 3.2 is made
up of amendments to existing text in Unicode 3.0, is it not?
That isn't a suitable format for a normative reference.
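
To make "irregular code unit sequences" concrete, here is a small sketch. It assumes the term refers to a supplementary character written as a surrogate pair with each half encoded as its own three-octet sequence, and it uses Python's strict UTF-8 decoder as a stand-in for a conforming decoder. The sample bytes and the helper name `is_strict_utf8` are illustrative, not taken from either specification.

```python
# Assumed example of an irregular sequence: U+10000 expressed as the
# surrogate pair D800 DC00, each half encoded as three octets.
irregular = bytes([0xED, 0xA0, 0x80, 0xED, 0xB0, 0x80])

# The regular form: U+10000 as a single four-octet sequence (F0 90 80 80).
regular = "\U00010000".encode("utf-8")


def is_strict_utf8(data):
    """Return True if Python's strict UTF-8 decoder accepts the bytes."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(regular.hex(), is_strict_utf8(regular))
print(irregular.hex(), is_strict_utf8(irregular))
```

A strict decoder accepts only the four-octet form. A definition that also admits the six-octet form gives one character two distinct encodings.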

Misha

Received on Thursday, 11 April 2002 14:39:49 UTC