Wrong character encoding detect for XHTML from Karl Ove Hufthammer on 2001-12-07 (www-validator@w3.org from December 2001)

From: Karl Ove Hufthammer <huftis@bigfoot.com>
Date: Fri, 07 Dec 2001 12:13:45 +0100
To: www-validator@w3.org
Message-Id: <9uqbpn.3vundad.1@ID-99504.news.dfncis.de>

Bug report:

Testcase:
<URL: http://home.no.net/huftis/kritikk/false-encoding.html >

This document is an XHTML 1.1 document with no XML declaration.
No 'charset' parameter is sent by HTTP, therefore, the document
uses the character encoding 'UTF-8' (the default for all
X(HT)ML documents). But this document contains the following
line:

<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1" />

The W3C Validator says:
'Detected Character Encoding: iso-8859-1'

which is wrong. It should be 'UTF-8'. The validator also claims
the document is valid XHTML, which it isn't, since it contains
illegal UTF-8 byte sequences (if I change 'ISO-8859-1' to
'UTF-8', the validator (correctly) refuses to validate the
document because of this).

-- 
Karl Ove Hufthammer

Received on Friday, 7 December 2001 06:13:58 UTC