W3C home > Mailing lists > Public > www-international@w3.org > October to December 2012

Re: byte order mark article

From: John Cowan <cowan@mercury.ccil.org>
Date: Wed, 21 Nov 2012 18:33:53 -0500
To: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>
Cc: Anne van Kesteren <annevk@annevk.nl>, www-international@w3.org
Message-ID: <20121121233352.GD13361@mercury.ccil.org>
Leif Halvard Silli scripsit:

> UTF-16LE and UTF-16BE theoretical ability to let a leading FF FE or FE 
> FF represent a ZERO WIDTH NO-BREAK SPACE rather than a BOM, seems to be 
> withot value for mark-up languages. 

Well, that's true of XML documents, because their content is always
preceded and followed by markup.  But this is not necessarily true of
HTML documents, nor XML external entities, nor LMNL documents.

Note also that an XML document in UTF16-BE or UTF16-LE must have an
XML declaration saying so.  If there is no XML declaration in a 16-bit
format document, it is necessarily UTF-16, and XML requires a BOM in
that case.

-- 
                Si hoc legere scis, nimium eruditionis habes.
Received on Wednesday, 21 November 2012 23:34:16 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 21 November 2012 23:34:17 GMT