- From: Brendan Eich <brendan@mozilla.com>
- Date: Tue, 21 Feb 2012 17:06:35 -0800
- To: "Phillips, Addison" <addison@lab126.com>, Anne van Kesteren <annevk@opera.com>, "Tab Atkins Jr." <jackalmage@gmail.com>
- CC: Mark Davis ☕ <mark@macchiato.com>, Cameron McCormack <cam@mcc.id.au>, "public-script-coord@w3.org" <public-script-coord@w3.org>, "mranney@voxer.com" <mranney@voxer.com>, es-discuss <es-discuss@mozilla.org>
Thanks, all! That's a relief to know, six bytes always seemed too long but my reptile coder brain was also reptile-coder-lazy and I never dug into it.

/be

Phillips, Addison wrote:
>> Hi Mark, thanks for this post.
>>
>> Mark Davis ☕ wrote:
>>> UTF-8 represents a code point as 1-4 8-bit code units
>> "1-6".
>
> No. 1 to *4*. Five and six byte "UTF-8" sequences are illegal and invalid.
>
>>> UTF-16 represents a code point as 2 or 4 16-bit code units
>> "1 or 2".
>
> Yes, 1 or 2 16-bit code units (that's 2 or 4 bytes, of course).
>
> Addison
>
> Addison Phillips
> Globalization Architect (Lab126)
> Chair (W3C I18N WG)
>
> Internationalization is not a feature.
> It is an architecture.
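
For anyone who wants to check the counts quoted above, here is a minimal Python sketch. It is not from the thread; the helper names `utf8_len`, `utf16_units`, and `is_valid_utf8` are made up for illustration, and the sample characters are arbitrary picks, one from each UTF-8 length class. It relies on Python's built-in codecs, which should reject five-byte forms as the quoted reply describes.

```python
def utf8_len(ch: str) -> int:
    """Number of 8-bit code units (bytes) UTF-8 uses for one code point."""
    return len(ch.encode("utf-8"))


def utf16_units(ch: str) -> int:
    """Number of 16-bit code units UTF-16 uses for one code point."""
    # "utf-16-le" avoids the byte order mark that plain "utf-16" prepends.
    return len(ch.encode("utf-16-le")) // 2


def is_valid_utf8(data: bytes) -> bool:
    """True if the bytes decode as UTF-8 under Python's strict decoder."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    # Arbitrary samples: U+0041, U+00E9, U+20AC, U+1F600.
    for ch in ["A", "\u00e9", "\u20ac", "\U0001F600"]:
        print(f"U+{ord(ch):04X}: {utf8_len(ch)} UTF-8 byte(s), "
              f"{utf16_units(ch)} UTF-16 code unit(s)")

    # A five-byte form in the old, pre-restriction style (lead byte 0xF8).
    print(is_valid_utf8(b"\xf8\x88\x80\x80\x80"))
```

The UTF-16 count comes out as 1 for everything in the Basic Multilingual Plane and 2 (a surrogate pair) for U+1F600, which matches the "1 or 2 16-bit code units (that's 2 or 4 bytes)" correction.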
Received on Wednesday, 22 February 2012 01:07:04 UTC