W3C home > Mailing lists > Public > public-script-coord@w3.org > January to March 2012

Re: New full Unicode for ES6 idea

From: Tab Atkins Jr. <jackalmage@gmail.com>
Date: Tue, 21 Feb 2012 15:15:50 -0800
Message-ID: <CAAWBYDAsEfbhU9PvdNoL874j9+4wU6zDvg1Sf7nnVa72d8bBGQ@mail.gmail.com>
To: Brendan Eich <brendan@mozilla.com>
Cc: Mark Davis ☕ <mark@macchiato.com>, "public-script-coord@w3.org" <public-script-coord@w3.org>, Anne van Kesteren <annevk@opera.com>, mranney@voxer.com, es-discuss <es-discuss@mozilla.org>
On Tue, Feb 21, 2012 at 3:11 PM, Brendan Eich <brendan@mozilla.com> wrote:
> Hi Mark, thanks for this post.
> Mark Davis ☕ wrote:
>>
>> UTF-8 represents a code point as 1-4 8-bit code units
>
> "1-6".
...
> Lock up your encoders, I am so not a Unicode guru but this is what my
> reptile coder brain remembers.

Only theoretically.  UTF-8 has been locked down to the same range that
UTF-16 has (RFC 3629), so the largest real character you'll see is 4
bytes, as that gives you exactly 21 bits of data.

~TJ
Received on Tuesday, 21 February 2012 23:16:38 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 8 May 2013 19:30:05 UTC