Re: byte order mark article

John Cowan, Wed, 21 Nov 2012 16:27:27 -0500:
> http://www.w3.org/mid/20121121212726.GA13361@mercury.ccil.org

> 
> Per Unicode,  [ … ]  If a UTF-16LE document begins FF FE, that means the
> first character is U+FEFF, ZERO BASED NON-BREAKING SPACE; likewise if
> a UTF-16BE document begins FE FF.

Note that the draft in question says: [1]

]] This is because the Unicode Standard says you should not use a BOM 
when the text is labeled as one of those encodings. [[

If the above is an accurate reflection of what Unicode says, then it 
doesn’t sound as if it is considered as very safe to let a leading FF 
FE/FE FF for anything but the BOM - not even when using 
UTF-16LE/UTF-16BE.

[1] 
http://www.w3.org/International/questions/new/qa-byte-order-mark-new.en.php#bomhow

-- 
leif halvard silli

Received on Wednesday, 21 November 2012 23:27:11 UTC