W3C home > Mailing lists > Public > ietf-charsets@w3.org > January to March 2003

RE: Proposed changes to UTF-8 draft

From: Martin Duerst <duerst@w3.org>
Date: Mon, 13 Jan 2003 13:26:59 -0500
To: Francois Yergeau <FYergeau@alis.com>, ietf-charsets@iana.org
Message-id: <4.2.0.58.J.20030113132640.02e27130@localhost>

At 11:03 03/01/13 -0500, Francois Yergeau wrote:
>Martin Duerst wrote:
> > Just to be sure: Is a 4-byte sequence that encodes a codepoint
> > beyond 10FFFF legal in your new version of the draft or not?
>
>Good question, I had not thought about it.  Thinking about it now, my take
>is that the essence of the proposal is to restrict UTF-8 to the
>UTF-16-accessible range, which means 0-10FFFF.  Anything beyond that would
>be out of bounds.

I think that makes a lot of sense.

Regards,    Martin.


>Note that my "new version of the draft" doesn't exist yet, I just floated a
>proposal that I think makes sense.  Probably I should produce a new draft
>ASAP to have something concrete to discuss and to root out any devilish
>details, such as the above.
>
>--
>Fran輟is
Received on Monday, 13 January 2003 16:14:59 GMT

This archive was generated by hypermail 2.2.0 + w3c-0.30 : Monday, 12 September 2005 15:53:29 GMT