Re: several messages about content sniffing in HTML

Julian Reschke wrote:
> 
> Ian Hickson wrote:
>> ...
>> [things about defining handling of multiple Content-Type headers]
>> ...
> 
> It would be interesting to know how many pages actually have that 
> problem; chat on IRC indicates it is below 1/1000.

The only data I have is 
<http://philip.html5.org/data/multiple-content-types.txt>, which is too 
limited in scope to see if there are cases where the difference matters 
in practice.

Looking at that data anyway, there were multiple Content-Type headers on 
about 0.04% of those pages (or 0.02% of the hostnames). (For comparison, 
that's about the same number that use <del> or <var>, and about half the 
number that use "Content-Encoding: deflate"). All of them appear to work 
correctly in browsers regardless of the Content-Type processing, since a 
browser that chooses the "text/html" instead of "text/html; charset=..." 
will fall back on some other method to find the right charset.

> BR, Julian

-- 
Philip Taylor
pjt47@cam.ac.uk

Received on Saturday, 1 March 2008 01:04:22 UTC