Re: Should the UTF-8 BOM trump overriding via HTTP or by users? from John Cowan on 2011-06-08 (www-international@w3.org from April to June 2011)

From: John Cowan <cowan@mercury.ccil.org>
Date: Wed, 8 Jun 2011 16:28:19 -0400
To: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>
Cc: Bjoern Hoehrmann <derhoermi@gmx.net>, www-international <www-international@w3.org>
Message-ID: <20110608202819.GA32536@mercury.ccil.org>

Leif Halvard Silli scripsit:

> So, really, I don't know if Firefox uses your algorithm for the
> file:// protocol. All I know is that its *parser* fails to return
> 'fatal error' when the BOM and the declaration differ. Based on the
> XML parsers I have used recently (Webkit, Gecko, Opera, 'oXygen XML
> editor', 'XMLmind XML editor'), it is the *exception* (only Webkit
> does it) rather than the rule, that file protocol parsing returns
> "fatal error" whenever encoding declaration differs from the BOM.

That's clearly a bug, then.  If the encoding declaration is *not* UTF-8,
then the BOM is not a BOM at all, but characters preceding the XML
declaration.  That means the input is not well formed.
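
To make the argument concrete, here is a minimal sketch of the check in
Python. It is an illustration only, not how any of the parsers named above
work: the function name bom_conflicts_with_declaration and the simplified
regular expression are assumptions, and it looks only at the UTF-8 BOM
followed by an ASCII-readable XML declaration.

```python
import re

# The three bytes that encode U+FEFF in UTF-8.
UTF8_BOM = b"\xef\xbb\xbf"

# Simplified pattern (an assumption, not the full XML grammar): capture the
# encoding name from an XML declaration at the very start of the input.
_DECL = re.compile(
    rb'^<\?xml\s[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)


def bom_conflicts_with_declaration(data: bytes) -> bool:
    """Return True if data starts with a UTF-8 BOM but declares another encoding.

    Under the reading given above, such input has stray characters before
    the XML declaration and so should be rejected as not well formed.
    """
    if not data.startswith(UTF8_BOM):
        return False  # no UTF-8 BOM, so this particular conflict cannot arise
    match = _DECL.match(data[len(UTF8_BOM):])
    if match is None:
        return False  # no encoding declaration to disagree with the BOM
    # Encoding names are compared case-insensitively.
    return match.group(1).decode("ascii").lower() != "utf-8"


if __name__ == "__main__":
    doc = UTF8_BOM + b'<?xml version="1.0" encoding="ISO-8859-1"?><r/>'
    print(bom_conflicts_with_declaration(doc))
```

A conforming parser would report a fatal error in the case where this
function returns True, rather than silently picking one of the two
encodings.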

-- 
John Cowan                                <cowan@ccil.org>
Yakka foob mog.  Grug pubbawup zink wattoom gazork.  Chumble spuzz.
    --Calvin, giving Newton's First Law "in his own words"

Received on Wednesday, 8 June 2011 20:28:50 UTC