charset parameter not used for decoding

Hi,

Validating an XHTML document whose encoding information is present only
in the HTTP header fails. The validator does not use the charset
parameter and tries to decode the document as UTF-8 instead, which fails
if the document is, e.g., ISO-8859-1 encoded. For example:

  % http-head http://jigsaw.w3.org/css-validator/validator.html.de
  HTTP/1.1 200 OK
  Connection: close
  Date: Wed, 14 Aug 2002 15:17:23 GMT
  Content-Length: 2578
  Content-Language: de
  Content-Type: text/html;charset=ISO-8859-1
  Etag: "m4r72t:tnioiej8"
  Last-Modified: Sat, 18 May 2002 21:49:54 GMT
  Server: Jigsaw/2.2.0

And the validator reports:

  Target: http://jigsaw.w3.org/css-validator/validator.html.de

  Please, validate your XML document first!
  
  Line 45
  
  Column 97
  
  An invalid XML character (Unicode: 0xfc) was found in the element
  content of the document. 
  
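This is a bug: as per the XML 1.0 Second Edition Errata, encoding
information supplied by the transport protocol (here the charset
parameter of the HTTP Content-Type header) should be taken into account.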
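
To illustrate the behaviour I would expect, here is a minimal sketch in
Python. It is not the validator's own code; the helper names
charset_from_content_type and decode_body are made up for this example,
and falling back to UTF-8 when no charset parameter is present is an
assumption of the sketch.

```python
from email.message import Message


def charset_from_content_type(header_value, default="utf-8"):
    """Return the charset parameter of a Content-Type value, lowercased.

    Falls back to `default` when the header carries no charset parameter.
    """
    msg = Message()
    msg["Content-Type"] = header_value
    return msg.get_content_charset() or default


def decode_body(body, content_type):
    """Decode the response bytes using the charset named in the HTTP header."""
    return body.decode(charset_from_content_type(content_type))


# Byte 0xFC is "ü" in ISO-8859-1 but is not valid UTF-8 on its own.
text = decode_body(b"f\xfcr", "text/html;charset=ISO-8859-1")
```

With the header shown above, the byte 0xfc decodes to "ü"; decoding the
same bytes as UTF-8 raises an error, which matches the failure reported
by the validator.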

Received on Wednesday, 14 August 2002 11:21:35 UTC