W3C home > Mailing lists > Public > www-validator-cvs@w3.org > November 2011

[Bug 14680] New: Using windows-1252 instead of the declared encoding iso-8859-1

From: <bugzilla@jessica.w3.org>
Date: Wed, 02 Nov 2011 16:26:19 +0000
To: www-validator-cvs@w3.org
Message-ID: <bug-14680-169@http.www.w3.org/Bugs/Public/>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=14680

           Summary: Using windows-1252 instead of the declared encoding
                    iso-8859-1
           Product: Validator
           Version: HEAD
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: blocker
          Priority: P2
         Component: HTML5
        AssignedTo: mike+validator@w3.org
        ReportedBy: matteosistisette@gmail.com
         QAContact: www-validator-cvs@w3.org


I'm trying to validate (by url) a page that is encoded with iso-8859-1
character encoding and is declared as iso-8859-1 character encoding. It is
html5. The encoding is being declared in the http Content-Type header properly
and corrsepond to the encoding actually used AND the one declared in the html.

If I choose "detect automatically" as both the character encoding and the
doctype, I get this warning:

Using windows-1252 instead of the declared encoding iso-8859-1

First of all it is unclear: Is it telling me that the page is encoded with a
different encoding than the one declared, or is it telling me that the
validator is using a different encoding to decode it?

In both cases it doesn't make sense. In the first case, it is plain wrong,
because the page IS encoded with iso-8859-1.

In the second case, then tha automatic detection doesn't work properly.

At the top of the validation page, it says "encoding: iso-8859-1" and "doctype:
html5", so it looks like it is the first hypothesis, then the error message is
bogus.


If I select manually iso-8859-1 instead of detect automatically, then the
warning disappears and everything look correct (i do get errors but that's ok
because the page does have errors).




I can't provide a link to the page, but here's the first part of the content:

<!DOCTYPE html>
<html>
<head>
  <meta charset="iso-8859-1">
  <title>XXXX: Estás en xxxxx. </title>
  <link rel="stylesheet" type="text/css" href="includes/style.css" >
  <script type="text/javascript" src="includes/jquery162.js"></script>
  <script type="text/javascript" src="includes/general.js"></script>
  <script type="text/javascript" src="includes/swfobject.js"></script>
  <script type="text/javascript" src="includes/megafone.js"></script>

</head>
....

-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Wednesday, 2 November 2011 16:26:24 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 26 April 2012 12:55:30 GMT