W3C home > Mailing lists > Public > html-tidy@w3.org > October to December 2002

conspicuous example of newer tidy not handling errors as well

From: Lucas W. Fletcher <lucas@dealersinnotions.com>
Date: Mon, 25 Nov 2002 23:10:30 -0800
Message-ID: <001601c2951a$e9ac80c0$0200a8c0@uncon>
To: <html-tidy@w3.org>
Hi,

I'm using tidy to screen-scrape the http://dmoz.org site (transformed to xml and parsed as xslt) and have recently moved from using the old COM wrapper (André Blavier's) to the new one. I've noticed that in at least one case the old version interprets a bad closing table tag more like a browser than does the new one. If you go to any listings page and view the source you'll notice that the unordered list of sites is not in a table. Also note that there is a bad table closing tag near the top of the page. The old tidy "correctly" closed the table, while the new one closes the table at the end of the body. I've noticed other cases where tidy waits until the end of the body to close a tag that could have been closed earlier.

Just thought this might be of some interest...

Lucas Fletcher
lucas@dealersinnotions.com
http://dealersinnotions.com
Received on Tuesday, 26 November 2002 02:44:11 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 5 February 2014 23:39:48 UTC