W3C home > Mailing lists > Public > ietf-charsets@w3.org > January to March 1999

Re: draft-hoffman-utf16-01.txt available

From: MURATA Makoto <murata@apsdc.ksp.fujixerox.co.jp>
Date: Fri, 05 Feb 1999 10:52:26 +0900
To: ietf charsets <ietf-charsets@iana.org>
Message-id: <199902050152.AA03416@murata.apsdc.ksp.fujixerox.co.jp>
Larry Masinter wrote:
> Appendix F of the XML specification
> 
> (http://www.w3.org/TR/1998/REC-xml-19980210#sec-guessing)
> 
> makes it clear that BOM isn't actually necessary for auto-guessing
> of charset between UTF-16LE and UTF-16BE, because of the differing
> encodings of "<" and "?" in the initial "<?".

Actually, external DTD subsets, external parsed entities, and external 
parameter entites do not have to begin with "<".  In particular, 
external parsed entities can begin with any character.  To allow 
autodetection, XML 1.0 requires that external parsed entities must begin 
either the BOM or text declarations, if not in UTF-8.

Makoto
 
Fuji Xerox Information Systems
 
Tel: +81-44-812-7230   Fax: +81-44-812-7231
E-mail: murata@apsdc.ksp.fujixerox.co.jp
Received on Thursday, 4 February 1999 20:54:34 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 5 June 2006 15:10:50 GMT