W3C home > Mailing lists > Public > www-validator@w3.org > April 2004

Re: Validation & errors

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Sun, 25 Apr 2004 00:42:32 +0200
To: Nick Kew <nick@webthing.com>
Cc: www-validator@w3.org
Message-ID: <409cec3f.162046770@smtp.bjoern.hoehrmann.de>

* Nick Kew wrote:
>> You don't need to be, you could use an output filter for your webserver
>> that passes invalid documents to HTML Tidy first and thus correct most
>> error automatically...
>
>Indeed you could.  Except that Tidy has neither a SAX (or comparable
>linear) parse mode nor a parseChunk API, and would therefore be
>seriously inefficient in this context.  libxml2 does the job a whole
>lot more efficiently.

Well, you can't really compare these two tools; AFAIK, libxml2 helps to
parse tag soup to some extend, but lacks most of Tidy's functionality
which requires a DOM and could thus not be realized with a streaming
API. And performance does not matter much as you could cache Tidy's
output, unless you have truely dynamic content that changes for each
request (personalized services, for example).
Received on Saturday, 24 April 2004 18:42:58 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 25 April 2012 12:14:13 GMT