- From: Bjoern Hoehrmann <derhoermi@gmx.net>
- Date: Sun, 25 Apr 2004 00:42:32 +0200
- To: Nick Kew <nick@webthing.com>
- Cc: www-validator@w3.org
* Nick Kew wrote:
>> You don't need to be, you could use an output filter for your webserver
>> that passes invalid documents to HTML Tidy first and thus correct most
>> error automatically...
>
>Indeed you could. Except that Tidy has neither a SAX (or comparable
>linear) parse mode nor a parseChunk API, and would therefore be
>seriously inefficient in this context. libxml2 does the job a whole
>lot more efficiently.

Well, you can't really compare these two tools. AFAIK, libxml2 helps to
parse tag soup to some extent, but it lacks most of Tidy's functionality,
which requires a DOM and thus could not be implemented with a streaming
API. And performance does not matter much, since you could cache Tidy's
output, unless you have truly dynamic content that changes for each
request (personalized services, for example).
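
The caching argument can be sketched roughly as follows. This is only an illustration under stated assumptions: `CachedCleaner` and `fake_tidy` are hypothetical names, and `fake_tidy` is a stand-in for whatever actually invokes HTML Tidy, not Tidy's API. The idea is that the expensive clean-up runs once per distinct document, and repeated requests for the same content are served from the cache.

```python
import hashlib
from typing import Callable, Dict


class CachedCleaner:
    """Cache the output of an expensive HTML clean-up step, keyed by input hash."""

    def __init__(self, clean: Callable[[str], str]) -> None:
        # `clean` is a hypothetical hook, e.g. a wrapper that runs HTML Tidy.
        self._clean = clean
        self._cache: Dict[str, str] = {}
        self.misses = 0  # how many times the clean-up step actually ran

    def __call__(self, html: str) -> str:
        key = hashlib.sha256(html.encode("utf-8")).hexdigest()
        if key not in self._cache:
            # Only unseen content pays the cost of the clean-up step.
            self.misses += 1
            self._cache[key] = self._clean(html)
        return self._cache[key]


def fake_tidy(html: str) -> str:
    # Stand-in for the real clean-up step; it only closes a dangling <p>.
    return html if html.endswith("</p>") else html + "</p>"


cleaner = CachedCleaner(fake_tidy)
cleaner("<p>hello")  # first request: clean-up runs
cleaner("<p>hello")  # second request: served from the cache
```

With content that differs on every request, every lookup is a miss and the cache buys nothing, which is the exception noted above.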
Received on Saturday, 24 April 2004 18:42:58 UTC