Definition of UTF-8 in the XML specification

From: <Misha.Wolf@reuters.com>
Date: Mon, 02 Jul 2001 20:34:20 +0100
Message-ID: <T5481c184a4c407b706454@>
To: w3c-xml-core-wg@w3.org, xml-editor@w3.org
Cc: w3c-i18n-ig@w3.org

Dear XML Core WG,

Please see the following extract from the I18N WG minutes at:
http://lists.w3.org/Archives/Member/w3c-i18n-wg/2001Jun/0157

  AGREED: We will ask the XML Core WG to make the definition of UTF-8 in
  the XML specification be the one in Unicode 3.1, with the added
  restriction that "irregular code unit sequences" be treated as fatal
  errors.

  ACTION: Misha to send the XML Core WG our position on UTF-8.

We would be happy to discuss this with you.  Owing to the inclusion, in
Unicode 3.1, of many characters outside of plane 0, this has become very
topical.  Any ambiguity in the interpretation of UTF-8 has the potential
to allow serious security breaches.
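
To make the request concrete, here is a minimal sketch of "treat as a fatal
error", assuming Python's built-in strict UTF-8 decoder as a stand-in for an
XML processor's decoder. The helper names (decode_strict, is_accepted) and the
sample byte strings are illustrative assumptions and do not come from the
minutes. The "irregular" sample follows what I understand Unicode 3.1 to mean
by the term: a supplementary character written as two separately encoded
surrogates.

```python
def decode_strict(data: bytes) -> str:
    """Decode UTF-8, raising on any ill-formed sequence (no U+FFFD fallback)."""
    # errors="strict" makes the codec raise UnicodeDecodeError rather than
    # silently repairing or skipping the offending bytes.
    return data.decode("utf-8", errors="strict")


def is_accepted(data: bytes) -> bool:
    """Return True only if the whole byte string decodes without error."""
    try:
        decode_strict(data)
        return True
    except UnicodeDecodeError:
        return False


# U+10000, the first character outside plane 0, in its regular 4-byte form.
REGULAR = b"\xf0\x90\x80\x80"

# The same character written as two 3-byte encoded surrogates
# (U+D800 then U+DC00): a six-byte "irregular" sequence.
IRREGULAR = b"\xed\xa0\x80\xed\xb0\x80"

# A non-shortest-form encoding of "/" (U+002F), the kind of ambiguity
# that can slip past a filter that only looks for the byte 0x2F.
OVERLONG_SLASH = b"\xc0\xaf"

if __name__ == "__main__":
    for name, sample in [("regular", REGULAR),
                         ("irregular", IRREGULAR),
                         ("overlong", OVERLONG_SLASH)]:
        print(name, "accepted" if is_accepted(sample) else "fatal error")
```

A decoder that accepted the second or third sample would give one character
two or more byte representations, which is the ambiguity referred to above.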

Many thanks,
Misha

Received on Monday, 2 July 2001 15:36:41 GMT