Re: byte order mark article

Leif Halvard Silli scripsit:

> UTF-16LE and UTF-16BE theoretical ability to let a leading FF FE or FE 
> FF represent a ZERO WIDTH NO-BREAK SPACE rather than a BOM, seems to be 
> withot value for mark-up languages. 

Well, that's true of XML documents, because their content is always
preceded and followed by markup.  But this is not necessarily true of
HTML documents, nor XML external entities, nor LMNL documents.

Note also that an XML document in UTF16-BE or UTF16-LE must have an
XML declaration saying so.  If there is no XML declaration in a 16-bit
format document, it is necessarily UTF-16, and XML requires a BOM in
that case.

-- 
                Si hoc legere scis, nimium eruditionis habes.

Received on Wednesday, 21 November 2012 23:34:16 UTC