Re: autodetecting character encoding

* Nick Kew wrote:
>But there's no requirement on HTML documents to start with those four
>bytes: they can be preceded by whitespace or an SGML comment.  Neither
>does HTML have a BOM to deal with multibyte character encodings, which
>I think is the key feature in XML that enables autodetection.

The BOM is allowed (but not required) in HTML, see HTML 4.01/5.2.1.

Received on Saturday, 8 February 2003 08:57:35 UTC