Re: Infinite loop bug in the validator...

On Tue, 12 Oct 1999, Julien PERVILLE wrote:

> Try to validate
> www.premier-ministre.gouv.fr
> the validator starts an infinite loop
- -
>                                      Non SGML character number 0

The document at http://www.premier-ministre.gouv.fr/ as sent by the server
contains a large number on NUL characters (octet/byte where all bits are
zero). Such data is typically ignored by browsers but it a validator
reports it as a "character" which is not allowed in an HTML document.
(Naturally sending NULs is waste of bandwidth.)

Removing the NUL characters helps, though the validation report is still
hard to read, since the lines in the document are _very_ long.

And it's not an infinite loop, just rather long. Each occurrence of the
NUL character is reported separately. 

Perhaps the validator should be prepared to such problems, as a practical
measure against "authoring tools" which produce long strings of NULs.
An ad hoc test perhaps? After getting a NUL, skip all consecutive NULs
after it, just counting them and then reporting something clever, and
giving a polite hint to get a decent authoring tool. :-)

-- 
Yucca, http://www.hut.fi/u/jkorpela/ or http://yucca.hut.fi/yucca.html

Received on Tuesday, 12 October 1999 05:31:54 UTC