W3C home > Mailing lists > Public > whatwg@whatwg.org > September 2009

[whatwg] U+FEFF (BOM) stripping in UTF-16BE and UTF-16LE

From: Ian Hickson <ian@hixie.ch>
Date: Tue, 15 Sep 2009 03:42:31 +0000 (UTC)
Message-ID: <Pine.LNX.4.62.0909150342250.14605@hixie.dreamhostps.com>
On Wed, 9 Sep 2009, ?istein E. Andersen wrote:
>
> ? 9.2.2.2 "Preprocessing the input stream" requires that a leading 
> U+FEFF (byte order mark) be stripped irrespective of encoding, contra 
> Unicode, which says that a leading U+FEFF is part of the document when 
> the byte order is already established by other means.  This is probably 
> harmless and potentially useful to deal with bislabelled documents, but 
> it might be worth adding an explanatory note.

Fixed.

-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'
Received on Monday, 14 September 2009 20:42:31 UTC

This archive was generated by hypermail 2.3.1 : Monday, 13 April 2015 23:08:52 UTC