Re: byte order mark article

From: John Cowan <cowan@mercury.ccil.org>
Date: Wed, 21 Nov 2012 20:45:15 -0500
To: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>
Cc: Anne van Kesteren <annevk@annevk.nl>, www-international@w3.org
Message-ID: <20121122014515.GF13361@mercury.ccil.org>
Leif Halvard Silli scripsit:

> > Note also that an XML document in UTF-16BE or UTF-16LE must have an
> > XML declaration saying so.
> 
> That requirement exists only when there is no external protocol:  

True.

> Second: When there is an external declaration which says "UTF-16",
>         then the requirement to include a BOM is relaxed. The parser
>         could e.g. default to UTF-16LE, as Unicode says.

It does not default to the UTF-16LE encoding, but to the UTF-16 encoding
with little-endian interpretation.  These are two different things, though
often confused.
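
A rough Python sketch of the difference, in case it helps.  The function
names are made up for illustration, and the little-endian fallback is simply
the default hypothesized above, not something the sketch claims any
particular parser does.  Under the "UTF-16" label a leading FF FE or FE FF is
a byte order mark: it selects the byte order and is not part of the text.
Under the "UTF-16LE" label there is no BOM at all, so the same leading bytes
are decoded as an ordinary character.

```python
import codecs


def decode_utf16_default_le(data: bytes) -> str:
    """Decode bytes labelled "UTF-16", assuming little-endian if no BOM.

    Hypothetical helper: a BOM, if present, wins over the default and is
    stripped from the result.
    """
    if data.startswith(codecs.BOM_UTF16_BE):
        return data[2:].decode("utf-16-be")
    if data.startswith(codecs.BOM_UTF16_LE):
        return data[2:].decode("utf-16-le")
    # No BOM: fall back to the little-endian interpretation.
    return data.decode("utf-16-le")


def decode_utf16le(data: bytes) -> str:
    """Decode bytes labelled "UTF-16LE": always little-endian, no BOM handling."""
    return data.decode("utf-16-le")


# Sample inputs (illustrative only).
le_with_bom = codecs.BOM_UTF16_LE + "hi".encode("utf-16-le")
be_with_bom = codecs.BOM_UTF16_BE + "hi".encode("utf-16-be")
le_without_bom = "hi".encode("utf-16-le")
```

With these inputs, decode_utf16_default_le() yields "hi" for all three,
whereas decode_utf16le() keeps U+FEFF as the first character of le_with_bom
and misreads be_with_bom entirely.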

-- 
It was impossible to inveigle           John Cowan <cowan@ccil.org>
Georg Wilhelm Friedrich Hegel           http://www.ccil.org/~cowan
Into offering the slightest apology
For his Phenomenology.                      --W. H. Auden, from "People" (1953)
Received on Thursday, 22 November 2012 01:45:39 GMT
