
Re: web site indexer?

From: Thaddeus L. Olczyk <olczyk@interaccess.com>
Date: Fri, 19 Apr 2002 18:06:48 +0000 (GMT)
To: Dave Raggett <dsr@w3.org>
Cc: David Davis <ddavis4@columbus.rr.com>, html-tidy@w3.org
Message-id: <3cc05b7f.191651750@smtp.interaccess.com>
On Fri, 19 Apr 2002 16:58:39 +0100 (BST), Dave Raggett <dsr@w3.org> wrote:

>You could try first tidying the documents and then piping them 
>through a second filter to pick out the links. It would also be
>simple to extend HTML Tidy to pick out the links, but you would
>have to define an output format. Note that it would be tricky
>to pick out links dynamically defined through scripting.
Pass the text through Tidy with the -asxhtml flag.
Then take the output and pass it through expat.
It's fairly easy to get the links that way.
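
A rough sketch of that two-step pipeline, written in Python because its
standard library ships an expat binding (xml.parsers.expat). The function
names extract_links and tidy_to_xhtml are made up for illustration, and the
sketch assumes a tidy executable is on the PATH and that only href attributes
on <a> elements count as links.

```python
import subprocess
import xml.parsers.expat


def extract_links(xhtml):
    """Return the href values of all <a> elements, in document order."""
    links = []

    def start_element(name, attrs):
        # XHTML element and attribute names are lowercase, so a plain
        # comparison against "a" and "href" is enough here.
        if name == "a" and "href" in attrs:
            links.append(attrs["href"])

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = start_element
    parser.Parse(xhtml, True)  # True marks this as the final chunk
    return links


def tidy_to_xhtml(path):
    """Run the tidy command-line tool on a file and return its output bytes.

    Assumption: an executable named "tidy" is installed and on the PATH.
    Tidy exits with a non-zero status when it merely reports warnings,
    so the return code is deliberately not treated as an error here.
    """
    result = subprocess.run(
        ["tidy", "-asxhtml", "-quiet", path],
        capture_output=True,
    )
    return result.stdout
```

With both pieces in place, extract_links(tidy_to_xhtml("page.html")) would
list the links of a local file, where page.html is a placeholder name. Links
that scripts create at run time will not show up, as noted in the quoted
message above.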

>I have copied this response to the html-tidy list in case
>others would find such a change helpful.
!@#$. You could have included the original post so we know what
you are talking about.
Received on Friday, 19 April 2002 13:06:58 UTC
