- From: Andy Quick <ac.quick@sympatico.ca>
- Date: Fri, 24 Mar 2000 11:47:56 -0600
- To: <html-tidy@w3.org>
You could use the DOM interface of Java Tidy and traverse the parse tree for TEXT_NODE nodes, or use getElementsByTagName("p") to pick out the paragraph elements.

Andy Quick

----- Original Message -----
From: Spencer Marks <smarks@digisolutions.com>
To: <html-tidy@w3.org>
Sent: March 18, 2000 1:41 PM
Subject: HTML 2 Text

> Hi, I was wondering if there's a way to use Tidy to remove all HTML
> from a page and just get the text.
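
The DOM traversal suggested in the reply can be sketched roughly as below. This is only an illustration under some assumptions: the class and method names (TextExtractor, collectText, extractText, parseXhtml, textOf) are made up for the example, and to keep it runnable with just the JDK the Document is built by the standard XML parser from a well-formed XHTML string. With Java Tidy the Document would presumably come from its DOM parsing call instead (Tidy.parseDOM, as far as I know), and the same walk should apply, since it only relies on basic org.w3c.dom.Node methods.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;

public class TextExtractor {

    // Recursively append the value of every TEXT_NODE found under 'node'.
    static void collectText(Node node, StringBuilder out) {
        if (node.getNodeType() == Node.TEXT_NODE) {
            out.append(node.getNodeValue());
        }
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            collectText(child, out);
        }
    }

    // Gather all text in the document and collapse runs of whitespace.
    static String extractText(Document doc) {
        StringBuilder out = new StringBuilder();
        collectText(doc.getDocumentElement(), out);
        return out.toString().replaceAll("\\s+", " ").trim();
    }

    // Stand-in for the Tidy step (assumption): parse well-formed XHTML with
    // the JDK's XML parser. With Java Tidy, the Document would be obtained
    // from Tidy's DOM interface instead.
    static Document parseXhtml(String xhtml) throws Exception {
        return DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(xhtml.getBytes(StandardCharsets.UTF_8)));
    }

    // Convenience wrapper: markup in, plain text out.
    static String textOf(String xhtml) {
        try {
            return extractText(parseXhtml(xhtml));
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        String page = "<html><body><p>Hello <b>world</b>.</p> <p>Bye.</p></body></html>";
        System.out.println(textOf(page)); // prints "Hello world. Bye."
    }
}
```

Walking the text nodes keeps all text on the page, including anything in the head such as the title. The getElementsByTagName("p") route would limit the result to paragraph content, at the cost of missing text in headings, lists, and tables.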
Received on Friday, 24 March 2000 14:13:28 UTC