W3C home > Mailing lists > Public > public-script-coord@w3.org > January to March 2012

RE: New full Unicode for ES6 idea

From: Phillips, Addison <addison@lab126.com>
Date: Tue, 21 Feb 2012 15:18:27 -0800
To: Brendan Eich <brendan@mozilla.com>, Mark Davis ☕ <mark@macchiato.com>
CC: Cameron McCormack <cam@mcc.id.au>, "public-script-coord@w3.org" <public-script-coord@w3.org>, Anne van Kesteren <annevk@opera.com>, "mranney@voxer.com" <mranney@voxer.com>, es-discuss <es-discuss@mozilla.org>
Message-ID: <131F80DEA635F044946897AFDA9AC3476AA7E21ECB@EX-SEA31-D.ant.amazon.com>
> 
> Hi Mark, thanks for this post.
> 
> Mark Davis ☕ wrote:
> > UTF-8 represents a code point as 1-4 8-bit code units
> 
> "1-6".

No. 1 to *4*. Five and six byte "UTF-8" sequences are illegal and invalid. 

> 
> > UTF-16 represents a code point  as 2 or 4 16-bit code units
> 
> "1 or 2".

Yes, 1 or 2 16-bit code units (that's 2 or 4 bytes, of course).

Addison

Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)

Internationalization is not a feature.
It is an architecture.




Received on Tuesday, 21 February 2012 23:18:59 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 8 May 2013 19:30:05 UTC