errors.html from Clemens Radl on 2001-04-15 (www-validator@w3.org from April 2001)

From: Clemens Radl <clemens.radl@uni-tuebingen.de>
Date: Sun, 15 Apr 2001 19:55:51 +0200 (CEST)
To: www-validator@w3.org
Message-ID: <Pine.LNX.4.21.0104151914580.17049-100000@hsmau1.histo.geschichtswissenschaft.un>

Hi,

I have two minor suggestions for the validator errors page 
<http://validator.w3.org/docs/errors.html>:

1. Section about "general entity `FOO' not defined and no default entity"
<http://validator.w3.org/docs/errors.html#bad-entity>

I am referring to the problem with ampersands in URLs.  The error
explanation reads "The only way to get around this is for the author of
the CGI program to use a different value between arguments [...]".  

Another workaround is to use "&amp;" instead of '&' in URLs, as mentioned
in <http://www.htmlhelp.com/tools/validator/problems.html#amp>, which in
turn is linked to from <http://validator.w3.org/feedback.html>.

Maybe one should include this reference into the error messages page,
because this offers a chance for web authors to create valid _and_ working
documents.  Of course, it is nevertheless a good idea to ask maintainers
of cgi-programs to accept other separators besides '&'.


2. Section about "non SGML character number ###"
<http://validator.w3.org/docs/errors.html#bad-char>

"HTML uses the standard ISO8859-1 character encoding, and ISO8859-1 leaves
undefined 65 character codes (0 to 31 inclusive and 127 to 159
inclusive); The Validator has found one of these undefined characters in
your document."

This passage suggests that tabs (character code 9), line feeds (character
code 10) and carriage returns (character code 13) are not valid characters
to be used in an iso8859-1 encoded html document.  Of course, no one would
misunderstand this passage, but still it might be a good idea to indicate
more precisely which character codes are illegal.

By the way, I also think it is not quite correct to say that "ISO8859-1
leaves undefined" the character codes below 32.  I always thought that
iso8859-1 character codes below 128 were equivalent to ASCII.

Regards,

Clemens


--
Clemens Radl
Universitaet Tuebingen
Abteilung fuer mittelalterliche Geschichte
http://www.uni-tuebingen.de/mittelalter/

Received on Sunday, 15 April 2001 14:02:20 UTC