Re: Encoding in the HTML/HTTP header

Martin Duerst wrote:
> 
> Two possibilities I can immagine:
> 
> - The document is XML-based, the browser recognizes this, and
>   the uses the UTF-8 default for XML documents.
> - The browser analyses the byte sequences in the document and
>   heuristically detects that the document looks like UTF-8.
>   The chances for detecting UTF-8 correctly go up very quickly
>   even with only very few non-ASCII characters.

And goes up massively if the stream begins with a BOM (though using a 
BOM with UTF-8 has other issues).

Received on Wednesday, 7 September 2005 10:02:52 UTC