
[Bug 22223] Latin-1 characters (æ, þ etc.) are rejected as errors by validator

From: <bugzilla@jessica.w3.org>
Date: Tue, 25 Jun 2013 20:11:20 +0000
To: www-validator-cvs@w3.org
Message-ID: <bug-22223-169-iGHJdxpShX@http.www.w3.org/Bugs/Public/>
https://www.w3.org/Bugs/Public/show_bug.cgi?id=22223

--- Comment #5 from jc ahágama <ahangama@gmail.com> ---
I understand now. Thank you for explaining so clearly. My fault.

My problem has been that I still use HTML-Kit, which offers no choice of
encoding. I suspect the new ruling will be upgraded from SHOULD to MUST, forcing
files like mine (and all Western European pages) to be larger, unnecessarily
taking up precious bandwidth. I think Basic Latin and Latin-1 Supplement could
both be served safely as single bytes if the first 32 characters of the latter
were prohibited, or if windows-1252 were allowed instead.
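
To illustrate the point about the first 32 characters of Latin-1 Supplement: in
ISO-8859-1 the bytes 0x80 to 0x9F decode to control characters, while
windows-1252 reuses most of them for printable characters. A minimal Python
sketch, assuming Python's built-in "latin-1" and "cp1252" codecs (the function
name describe_high_bytes is made up for this example):

```python
import unicodedata

def describe_high_bytes():
    """For each byte 0x80..0x9F, report how the two encodings treat it."""
    rows = []
    for b in range(0x80, 0xA0):
        raw = bytes([b])
        # ISO-8859-1 maps every byte straight to the same code point.
        latin1_char = raw.decode("latin-1")
        try:
            cp1252_char = raw.decode("cp1252")
        except UnicodeDecodeError:
            # Python's cp1252 codec leaves a few bytes unassigned.
            cp1252_char = None
        rows.append((b, unicodedata.category(latin1_char), cp1252_char))
    return rows

for byte, latin1_category, cp1252_char in describe_high_bytes():
    print(f"0x{byte:02X}  latin-1 category={latin1_category}  cp1252={cp1252_char!r}")
```

Every row reports category "Cc" (control) on the latin-1 side, which is why
those 32 positions are the unsafe part of the single-byte range.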

Anyway, I opened the file in Notepad and saved it as UTF-8, and the page now
passes. Thank you. And thanks for telling me to use the NU page, which is
friendlier:
http://hathvenibalavegaya.com/index.htm <== utf-8 (20,038 bytes) 
http://hathvenibalavegaya.com/indexOld.htm <== windows-1252 (18,633 bytes)
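
The size difference between the two files can be worked out from the byte
counts quoted above. A small Python sketch, assuming the two files differ only
in encoding (the helper encoded_sizes and the constant names are illustrative,
and a byte-order mark added by the editor, if any, is not accounted for):

```python
def encoded_sizes(text):
    """Return (windows-1252 bytes, UTF-8 bytes) for the same text."""
    return len(text.encode("cp1252")), len(text.encode("utf-8"))

# Characters such as æ and þ take one byte in windows-1252, two in UTF-8.
print(encoded_sizes("æþ"))

# Byte counts as quoted for indexOld.htm and index.htm.
WIN1252_BYTES = 18_633
UTF8_BYTES = 20_038

extra = UTF8_BYTES - WIN1252_BYTES          # 1405
overhead_pct = 100 * extra / WIN1252_BYTES  # relative growth in percent
print(f"{extra} extra bytes, about {overhead_pct:.1f}% larger")
```

Plain ASCII characters cost one byte in both encodings, so the growth comes
only from the non-ASCII characters in the page.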

I beg the great technocrats to allow windows-1252 for the sake of the poor,
like those living in Sri Lanka, and for the public network, which has a lot of
Western European documents. <-- My proposed patch, a penny's worth.

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
Received on Tuesday, 25 June 2013 20:11:21 UTC
