[Bug 14680] Using windows-1252 instead of the declared encoding iso-8859-1

https://www.w3.org/Bugs/Public/show_bug.cgi?id=14680

transoral@yahoo.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                URL|                            |http://www.lovatasinhala.co
                   |                            |m/iso-8859-1-l.htm
                 CC|                            |transoral@yahoo.com
         OS/Version|Linux                       |Windows XP

--- Comment #1 from transoral@yahoo.com 2012-08-03 18:32:41 UTC ---
I have this exact situation. The sample web page (HTML5) is
http://www.lovatasinhala.com/iso-8859-1-l.htm
I went ahead and declared windows-1252 as the character encoding in the HTTP
header (via the .htaccess file). However, the point is that this page does not
contain any characters outside iso-8859-1, and yet the validator reports it as
windows-1252 when it is only iso-8859-1.
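
One way to check the claim that a page stays within iso-8859-1 is to look for
bytes in the range 80-9F, the only range where the two encodings disagree. The
following is a minimal sketch, not the validator's own logic (which is not
shown in this report); the function name c1_range_offsets and the sample bytes
are made up for illustration.

```python
def c1_range_offsets(data):
    """Return the offsets of all bytes in the range 0x80-0x9F.

    These are the only byte values that iso-8859-1 and windows-1252
    interpret differently, so an empty result means the choice between
    the two labels cannot change the decoded text.
    """
    return [i for i, b in enumerate(data) if 0x80 <= b <= 0x9F]


# Hypothetical sample: "cafe" with e-acute, encoded as iso-8859-1 (byte E9).
sample = b"caf\xe9"

# No bytes in 80-9F, so both decoders produce the same string.
same_text = sample.decode("latin-1") == sample.decode("cp1252")

# A windows-1252 curly quote (byte 93) is flagged at offset 1.
flagged = c1_range_offsets(b"a\x93b")
```

If c1_range_offsets returns an empty list for the raw bytes of the page, the
page decodes identically under either label.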

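As far as I can see, the difference between iso-8859-1 and windows-1252 is that
windows-1252 defines the following printable characters in the byte range
80-9F, which iso-8859-1 does not use for printable characters.
The first column is the single-byte code (hex), the second is the character,
the third is the corresponding Unicode code point, and the fourth is the
Unicode character name.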
80     €     20AC     EURO SIGN
82     ‚     201A     SINGLE LOW-9 QUOTATION MARK
83     ƒ     0192     LATIN SMALL LETTER F WITH HOOK
84     „     201E     DOUBLE LOW-9 QUOTATION MARK
85     …     2026     HORIZONTAL ELLIPSIS
86     †     2020     DAGGER
87     ‡     2021     DOUBLE DAGGER
88     ˆ     02C6     MODIFIER LETTER CIRCUMFLEX ACCENT
89     ‰     2030     PER MILLE SIGN
8A     Š     0160     LATIN CAPITAL LETTER S WITH CARON
8B     ‹     2039     SINGLE LEFT-POINTING ANGLE QUOTATION MARK
8C     Œ     0152     LATIN CAPITAL LIGATURE OE
8E     Ž     017D     LATIN CAPITAL LETTER Z WITH CARON
91     ‘     2018     LEFT SINGLE QUOTATION MARK
92     ’     2019     RIGHT SINGLE QUOTATION MARK
93     “     201C     LEFT DOUBLE QUOTATION MARK
94     ”     201D     RIGHT DOUBLE QUOTATION MARK
95     •     2022     BULLET
96     –     2013     EN DASH
97     —     2014     EM DASH
98     ˜     02DC     SMALL TILDE
99     ™     2122     TRADE MARK SIGN
9A     š     0161     LATIN SMALL LETTER S WITH CARON
9B     ›     203A     SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
9C     œ     0153     LATIN SMALL LIGATURE OE
9E     ž     017E     LATIN SMALL LETTER Z WITH CARON
9F     Ÿ     0178     LATIN CAPITAL LETTER Y WITH DIAERESIS
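
The table above can be regenerated rather than typed by hand. This is a sketch
that assumes Python's built-in cp1252 codec is an acceptable stand-in for
windows-1252; the function names cp1252_extras and format_rows are
hypothetical. Python's codec treats bytes 81, 8D, 8F, 90 and 9D as undefined,
which matches the gaps in the list.

```python
import unicodedata


def cp1252_extras():
    """Return (byte, char, code point, name) for each byte in 0x80-0x9F
    that Python's cp1252 codec maps to a character."""
    rows = []
    for b in range(0x80, 0xA0):
        try:
            ch = bytes([b]).decode("cp1252")
        except UnicodeDecodeError:
            # Byte has no assignment in Python's cp1252 codec; skip it.
            continue
        rows.append((b, ch, ord(ch), unicodedata.name(ch)))
    return rows


def format_rows(rows):
    """Format the rows in the same four-column layout as the table."""
    return [f"{b:02X}     {ch}     {cp:04X}     {name}"
            for b, ch, cp, name in rows]
```

Calling format_rows(cp1252_extras()) returns one string per table line, from
80 (EURO SIGN) through 9F (LATIN CAPITAL LETTER Y WITH DIAERESIS).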

Received on Friday, 3 August 2012 18:32:43 UTC