W3C home > Mailing lists > Public > html-tidy@w3.org > April to June 2002

Re: web site indexer?

From: Dave Raggett <dsr@w3.org>
Date: Fri, 19 Apr 2002 16:58:39 +0100 (BST)
To: David Davis <ddavis4@columbus.rr.com>
cc: html-tidy@w3.org
Message-ID: <Pine.LNX.4.44.0204191655460.1474-100000@hazel>
You could try first tidying the documents and then piping them 
through a second filter to pick out the links. It would also be
simple to extend HTML Tidy to pickout the links, but you would
have to define an output format. Note that it would be tricky
to pick out links dynamically defined through scripting.

I have copied this response to the html-tidy list in case
others would find such a change helpful.

-- 
 Dave Raggett <dsr@w3.org> or <dave.raggett@openwave.com>
 W3C lead for voice/multimodal. http://www.w3.org/People/Raggett 
 tel/fax: +44 1225 866240 (or 867351) +44 771 213 7629 (GSM)
Received on Friday, 19 April 2002 11:58:48 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:52 GMT