W3C home > Mailing lists > Public > www-international@w3.org > October to December 2012

Re: byte order mark article

From: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>
Date: Thu, 22 Nov 2012 00:26:43 +0100
To: John Cowan <cowan@mercury.ccil.org>
Cc: Anne van Kesteren <annevk@annevk.nl>, www-international@w3.org
Message-ID: <20121122002643324428.ee081b9a@xn--mlform-iua.no>
John Cowan, Wed, 21 Nov 2012 16:27:27 -0500:
> http://www.w3.org/mid/20121121212726.GA13361@mercury.ccil.org

> 
> Per Unicode,  [ … ]  If a UTF-16LE document begins FF FE, that means the
> first character is U+FEFF, ZERO BASED NON-BREAKING SPACE; likewise if
> a UTF-16BE document begins FE FF.

Note that the draft in question says: [1]

]] This is because the Unicode Standard says you should not use a BOM 
when the text is labeled as one of those encodings. [[

If the above is an accurate reflection of what Unicode says, then it 
doesn’t sound as if it is considered as very safe to let a leading FF 
FE/FE FF for anything but the BOM - not even when using 
UTF-16LE/UTF-16BE.

[1] 
http://www.w3.org/International/questions/new/qa-byte-order-mark-new.en.php#bomhow

-- 
leif halvard silli
Received on Wednesday, 21 November 2012 23:27:11 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 21 November 2012 23:27:11 GMT