
Re: Outdated link for character entity list in validator error message (non SGML character number)

From: Jukka K. Korpela <jkorpela@cs.tut.fi>
Date: Mon, 30 Jul 2012 08:45:18 +0300
Message-ID: <50161F6E.1000407@cs.tut.fi>
To: www-validator@w3.org
2012-07-30 8:13, Terje Bless wrote:

> On Sun, Jul 29, 2012 at 10:59 PM, Jukka K. Korpela <jkorpela@cs.tut.fi> wrote:
>> I have no information about such matters, no instructions on how to submit
>> proposals that might have a chance of being accepted.
>
> Oh, I'm sorry. The procedure is... to send the new text to the
> www-validator mailing list.

I think you missed the "that might have a chance of being accepted" 
part. Maybe I'm too pessimistic, but the current text has a link that 
could easily be fixed by making it point to a W3C spec instead of a 
draft that expired 17 years ago. Once that has been done, more 
complicated fixes could be suggested.

Oh well... here's what I think the explanation should really say 
(complete text):

"The document contains a character that is not allowed in HTML. In most 
cases, this error is caused by declaring the character encoding as 
ISO-8859-1, when the real encoding is windows-1252."

(I considered suggesting that the words "character encoding" link to
http://www.w3.org/International/tutorials/tutorial-char-enc/
but while that's a great tutorial, it is too UTF-8-minded for this issue.
What people really need to do in almost all cases, when getting the "non
SGML character number ..." message, is to declare windows-1252, or
possibly some other 8-bit encoding.)
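
To illustrate the mismatch described above, here is a minimal Python sketch. It is not how the validator itself works; the function name non_sgml_chars and the sample string are made up for the illustration, and it only checks the range 128-159, which is where windows-1252 places printable characters such as curly quotes while ISO-8859-1 has only control characters.

```python
def non_sgml_chars(raw: bytes, declared: str = "iso-8859-1"):
    """Decode raw bytes with the declared encoding and return
    (offset, code point) pairs for characters in the range 128-159.

    Hypothetical helper for illustration; both encodings used here are
    single-byte, so the character offset equals the byte offset.
    """
    text = raw.decode(declared)
    return [(i, ord(ch)) for i, ch in enumerate(text) if 0x80 <= ord(ch) <= 0x9F]


# Curly quotes as an authoring tool on Windows would typically save them:
# the bytes 0x93 and 0x94 in windows-1252.
sample = "\u201cquoted\u201d".encode("windows-1252")

# Declared as ISO-8859-1, those bytes become code points 147 and 148.
wrong = non_sgml_chars(sample, "iso-8859-1")

# Declared as windows-1252, they become U+201C and U+201D, so nothing is flagged.
right = non_sgml_chars(sample, "windows-1252")

print(wrong)
print(right)
```

With the ISO-8859-1 declaration the two quote bytes show up as characters 147 and 148, the kind of numbers the "non SGML character number" message reports; with the windows-1252 declaration the list is empty.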

> Or, since you are somewhat more technically inclined than most, you
> could file an issue in the W3C Bugzilla (which is easier to track).

I can't find a section for improving error message explanations there.

Yucca
Received on Monday, 30 July 2012 05:45:47 GMT
