W3C home > Mailing lists > Public > www-international@w3.org > July to September 2005

Re: Encoding in the HTML/HTTP header

From: Michael Monaghan <Michael.Monaghan@Sun.COM>
Date: Wed, 07 Sep 2005 08:59:58 -0400
To: Jon Hanna <jon@hackcraft.net>
Cc: www-international@w3.org
Message-id: <431EE44E.5040109@sun.com>

Jon Hanna wrote:
> 
> Martin Duerst wrote:
> 
>>
>> Two possibilities I can immagine:
>>
>> - The document is XML-based, the browser recognizes this, and
>>   the uses the UTF-8 default for XML documents.
>> - The browser analyses the byte sequences in the document and
>>   heuristically detects that the document looks like UTF-8.
>>   The chances for detecting UTF-8 correctly go up very quickly
>>   even with only very few non-ASCII characters.
> 
> 
> And goes up massively if the stream begins with a BOM (though using a 
> BOM with UTF-8 has other issues).
> 

OK - thanks, this makes sense. btw - my browsers are Mozilla 1.7 & IE 6 /XP.

Appreciated.

-mm
Received on Wednesday, 7 September 2005 13:00:06 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:05 GMT