- From: Andy Quick <ac.quick@sympatico.ca>
- Date: Fri, 24 Mar 2000 11:47:56 -0600
- To: <html-tidy@w3.org>
You could use the DOM interface of Java Tidy and traverse
the parse tree, collecting the text of each TEXT_NODE, or use
getElementsByTagName("p") if you only want the paragraphs.
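
A rough sketch of the traversal, using only the standard org.w3c.dom
interfaces. The class and method names (TextExtractor, collectText,
extractText, paragraphTexts, parseWellFormed) are made up for
illustration. To keep the example self-contained, the Document here
comes from the JDK's own XML parser fed a well-formed snippet; with
Java Tidy you would instead pass in the Document that its DOM parse
returns, which is an assumption about your setup rather than something
shown below.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

public class TextExtractor {
    // Depth-first walk: append the value of every TEXT_NODE under `node`.
    static void collectText(Node node, StringBuilder out) {
        if (node.getNodeType() == Node.TEXT_NODE) {
            out.append(node.getNodeValue());
        }
        for (Node child = node.getFirstChild(); child != null;
                child = child.getNextSibling()) {
            collectText(child, out);
        }
    }

    // All text in the document, in document order, with markup dropped.
    static String extractText(Document doc) {
        StringBuilder out = new StringBuilder();
        collectText(doc.getDocumentElement(), out);
        return out.toString();
    }

    // Alternative: the text of each <p> element, one string per paragraph.
    static List<String> paragraphTexts(Document doc) {
        List<String> result = new ArrayList<>();
        NodeList paragraphs = doc.getElementsByTagName("p");
        for (int i = 0; i < paragraphs.getLength(); i++) {
            StringBuilder sb = new StringBuilder();
            collectText(paragraphs.item(i), sb);
            result.add(sb.toString());
        }
        return result;
    }

    // Stand-in for the Tidy step (assumption): parse already well-formed
    // markup with the JDK parser so the example runs without extra jars.
    static Document parseWellFormed(String xhtml) throws Exception {
        return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new ByteArrayInputStream(
                        xhtml.getBytes(StandardCharsets.UTF_8)));
    }

    public static void main(String[] args) throws Exception {
        Document doc = parseWellFormed(
                "<html><body><p>Hello, <b>world</b>!</p></body></html>");
        System.out.println(extractText(doc));
    }
}
```

The walk checks getNodeType() and reads getNodeValue() by hand rather
than relying on newer DOM convenience methods, since those may not be
available on every DOM implementation.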
Andy Quick
----- Original Message -----
From: Spencer Marks <smarks@digisolutions.com>
To: <html-tidy@w3.org>
Sent: March 18, 2000 1:41 PM
Subject: HTML 2 Text
>
> Hi, I was wondering if there's a way to use Tidy to remove all HTML
> from a page and just get the text.
>
Received on Friday, 24 March 2000 14:13:28 UTC