RE: error when clean html with tidy (fwd)

Chunbo Shao <cxs0187@omega.uta.edu> wrote:
	Attached files are some html pages from some university.  I didn't
	make the pages.  I just use Tidy to parse them.
	...	
	The file "42-upenn.edu" gave me the error "Error:  <a> missing
	'>' for end of tag".
	...
	I cannot see any clue to figure out why the error happens.

I've only checked one of these files, the 42-upenn.edu one.

HTML Tidy says the error is on line 195.  So look at line 195, which
I have split inside the second </A> to fit in a reasonable width.
                                             v
    <LI><A href="http://www.arstechnica.com/"</A>Ars Technica</A
    > <FONT Size=-2>Ars Technica</FONT><br>

Blow me down.  HTML Tidy told you there is a missing '>'
because there *IS* a missing '>'.  How about that?
The letter "v" points to the place where the '>' should have been.

Not only that, the <A> is ended twice.  It looks as though the first
'</A>' on line 195 should have been '>'.
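
For anyone who wants to find this kind of slip without reading the line
by eye, here is a minimal sketch in Python.  It is NOT how HTML Tidy
works internally; the function name find_unclosed_tags is made up for
illustration, and it assumes plain tags only (no comments, scripts or
CDATA).  It reports every column where a new '<' turns up before the
previous tag has received its '>'.

```python
def find_unclosed_tags(line):
    """Return 1-based columns where a '<' appears inside a tag that
    has not yet been closed with '>'.  Hypothetical helper, not part
    of HTML Tidy."""
    problems = []
    in_tag = False   # True between a '<' and its matching '>'
    quote = None     # the quote character we are inside, if any
    for col, ch in enumerate(line, start=1):
        if not in_tag:
            if ch == "<":
                in_tag = True
        elif quote:
            # Inside a quoted attribute value: only the same quote ends it.
            if ch == quote:
                quote = None
        elif ch in "\"'":
            quote = ch
        elif ch == ">":
            in_tag = False
        elif ch == "<":
            # A new tag starts here, so the previous one never got its '>'.
            problems.append(col)
    return problems


# Line 195 as quoted above, rejoined and without the leading indent.
broken = ('<LI><A href="http://www.arstechnica.com/"</A>Ars Technica</A>'
          ' <FONT Size=-2>Ars Technica</FONT><br>')

# The same line with the first '</A>' replaced by '>'.
fixed = ('<LI><A href="http://www.arstechnica.com/">Ars Technica</A>'
         ' <FONT Size=-2>Ars Technica</FONT><br>')

print(find_unclosed_tags(broken))  # prints [42]
print(find_unclosed_tags(fixed))   # prints []
```

Column 42 of the unindented line is the same spot the "v" marks in the
indented excerpt above.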

Note for the discerning:  the appearance of '<FONT SIZE=-2>' is an
extremely reliable indicator of a page written by the HTML-clueless,
and so it proves.  This is a page that badly needs the attentions of Tidy.

Received on Sunday, 26 August 2001 21:45:55 UTC