Registration of new charset: UTF-32

From: Mark Davis <markdavis34@home.com>
Date: Fri, 11 May 2001 07:29:45 -0700
To: ietf-charsets@iana.org
Message-id: <007501c0da26$d3984cc0$0c680b41@c1340594a>
Charset aliases:


Suitability for use in MIME text:


Published specification(s):


The IETF registration imposes one additional constraint: if there is no
initial BOM, then the byte orientation must be big-endian. That is, in any
stream that does not begin with the (hex) byte sequence <00 00 FE FF>, all of
the bytes are interpreted as big-endian.
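
As an illustration only, the rule above might be sketched as follows in
Python. The function names utf32_byte_order and decode_utf32 are hypothetical
and not part of the registration. The sketch also assumes that an initial
<FF FE 00 00> is honored as the little-endian BOM, as the Unicode Standard
defines it; the sentence above does not mention that case.

```python
import codecs
from typing import Tuple


def utf32_byte_order(data: bytes) -> Tuple[str, int]:
    """Return (byte order, BOM length in bytes) for a stream labelled UTF-32.

    Hypothetical helper, not taken from the registration text.
    """
    if data.startswith(codecs.BOM_UTF32_BE):  # bytes 00 00 FE FF
        return "big", 4
    if data.startswith(codecs.BOM_UTF32_LE):  # bytes FF FE 00 00 (assumed handling)
        return "little", 4
    # No initial BOM: the IETF registration requires big-endian.
    return "big", 0


def decode_utf32(data: bytes) -> str:
    """Decode a UTF-32 stream, dropping the BOM if one is present."""
    order, bom_length = utf32_byte_order(data)
    codec = "utf-32-be" if order == "big" else "utf-32-le"
    return data[bom_length:].decode(codec)
```

For example, decode_utf32(b"\x00\x00\x00A") treats the four BOM-less bytes as
big-endian and yields the single character "A".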

Note: This is parallel to the IETF registration of UTF-16. As defined by the
Unicode Standard Version 3.1, without a BOM the byte orientation of UTF-32
and UTF-16 could be either little-endian or big-endian. The choice of byte
orientation would be determined by a higher-level protocol. The IETF
registration is such a protocol, and constrains the byte orientation to be
big-endian so that interpretation is deterministic.

ISO 10646 equivalency table:

Also in http://www.unicode.org/unicode/reports/tr19/

Additional information:

Mark Davis
2509 Alpine Road, Menlo Park, CA 94025

Intended usage:

Received on Friday, 11 May 2001 12:37:44 UTC
