[whatwg] Charset sniffing from XML prolog

From: Kartikaya Gupta <lists.whatwg@stakface.com>
Date: Thu, 08 Oct 2009 04:26:14 +0000
Message-ID: <20091008042615.80C8C8DB001D@zapata.dreamhost.com>
> In this case, though, it's totally my fault: I just forgot that I had 
> the HTML5 parser turned on locally.  Turning that off, I do get UTF-8, 
> because of 
> http://hg.mozilla.org/mozilla-central/file/603759afc77a/parser/htmlparser/src/nsParser.cpp#l2553 
> and following.  That code is just bogus, in my somewhat biased 
> opinion.... ;)

So is this behavior getting axed or specced? The site in question that relies on it is http://bell.mobi/primary. It's not as noticeable in the English-locale version, but if you switch to a French locale you get a bunch of French text encoded as UTF-8. Browsers that sniff the charset from the XML prolog will render it fine, but others will show garbage.
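
To be concrete about what I mean by prolog sniffing, here is a rough Python sketch. It is not what the Gecko code linked above actually does; the function name, the regex, and the 1024-byte limit are all my own assumptions for illustration.

```python
import re

# Hypothetical pattern: an XML declaration at the very start of the byte
# stream, with an encoding pseudo-attribute in matching quotes.
_XML_DECL = re.compile(
    rb'^<\?xml\s[^>]*?encoding\s*=\s*(["\'])([A-Za-z][A-Za-z0-9._-]*)\1'
)

def sniff_xml_prolog_charset(head: bytes):
    """Return the encoding named in the XML prolog (lowercased), or None."""
    # Only look at the first 1024 bytes; that limit is an arbitrary assumption.
    match = _XML_DECL.match(head[:1024])
    if match is None:
        return None
    return match.group(2).decode("ascii").lower()
```

A browser doing something like this would decode the page using the sniffed name when there is one, and fall back to its usual default when the function returns None.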

I'd be happier not having to change my code to deal with this website, since it will occasionally show garbage even in UTF-8.

kats
Received on Wednesday, 7 October 2009 21:26:14 UTC