W3C home > Mailing lists > Public > ietf-charsets@w3.org > January to March 2002

Re: Registration of new charset CESU-8

From: Paul Hoffman / IMC <phoffman@imc.org>
Date: Wed, 02 Jan 2002 10:01:40 -0800
To: toby_phipps@peoplesoft.com, ietf-charsets@iana.org
Message-id: <p05101001b858fab1e331@[]>
At 1:48 AM -0800 1/2/02, toby_phipps@peoplesoft.com wrote:
>Published specification(s):
>    Unicode Technical Report #26
>    "Compatibility Encoding Scheme for UTF-16: 8-bit (CESU-8)"
>    http://www.unicode.org/unicode/reports/tr26

The summary in that TR says:

>This document specifies an 8-bit Compatibility Encoding Scheme for 
>UTF-16 (CESU) that is intended for internal use within systems 
>processing Unicode in order to provide an ASCII-compatible 8-bit 
>encoding that is similar to UTF-8 but preserves UTF-16 binary 
>collation. It is not intended nor recommended as an encoding used 
>for open information exchange. The Unicode Consortium, does not 
>encourage the use of CESU-8, but does recognize the existence of 
>data in this encoding and supplies this technical report to clearly 
>define the format and to distinguish it from UTF-8. This encoding 
>does not replace or amend the definition of UTF-8.

The statement "It is not intended nor recommended as an encoding used 
for open information exchange." is underlined in the TR for emphasis.

Charset labels are used for exchanging information. Thus, CESU-8 is 
*not* a candidate for having a charset label.

--Paul Hoffman, Director
--Internet Mail Consortium
Received on Wednesday, 2 January 2002 13:02:12 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:52:17 UTC