- From: Dave Raggett <dsr@w3.org>
- Date: Fri, 24 Mar 2000 11:47:56 -0600
- To: Spencer Marks <smarks@digisolutions.com>
- Cc: html-tidy@w3.org
On 18 Mar 2000, Spencer Marks wrote:

> Hi, I was wondering if there's a way to use Tidy to remove all
> HTML from a page and just get the text.
>
> In other words, I'd like to use Tidy as an HTML to text conversion
> utility that I can call programmatically.
>
> Actually, I am planning on using JTidy so that I can do this
> conversion as part of an application I am working on.

This feature is supported by W3C's open source line mode browser.
However, you could adapt Tidy to do this via a new routine for
pretty printing the parse tree.

Regards,

-- Dave Raggett <dsr@w3.org> http://www.w3.org/People/Raggett
tel/fax: +44 122 578 3011 (or 2521) +44 385 320 444 (mobile)
World Wide Web Consortium (on assignment from HP Labs)
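As a rough illustration of what such a text-only printing routine might look like on the Java side, here is a minimal sketch that walks a standard `org.w3c.dom` tree and keeps only the text nodes. The class and method names (`TextOnlyPrinter`, `printText`, `toText`, `htmlToText`) are hypothetical and not part of Tidy or JTidy. The `htmlToText` helper uses the JDK's XML parser purely so the example runs on its own, which means it only accepts well-formed markup; with JTidy one would presumably pass in the `Document` returned by its DOM parsing entry point (`Tidy.parseDOM`, as far as I know) instead.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: a "pretty printer" that emits text content only.
public class TextOnlyPrinter {

    // Recursively visit the tree, appending text nodes and ignoring markup.
    static void printText(Node node, StringBuilder out) {
        short type = node.getNodeType();
        if (type == Node.TEXT_NODE || type == Node.CDATA_SECTION_NODE) {
            // Collapse runs of whitespace, as a browser would for normal flow text.
            String text = node.getNodeValue().replaceAll("\\s+", " ").trim();
            if (!text.isEmpty()) {
                // Simplification: every text node is separated by one space,
                // so "Hel<b>lo</b>" would come out as "Hel lo".
                if (out.length() > 0) {
                    out.append(' ');
                }
                out.append(text);
            }
            return;
        }
        if (type == Node.ELEMENT_NODE) {
            String name = node.getNodeName().toLowerCase();
            // Script and style contents are not readable page text.
            if (name.equals("script") || name.equals("style")) {
                return;
            }
        }
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            printText(children.item(i), out);
        }
    }

    // Return the text content of the subtree rooted at the given node.
    static String toText(Node root) {
        StringBuilder out = new StringBuilder();
        printText(root, out);
        return out.toString();
    }

    // Demo-only entry point: parses WELL-FORMED markup with the JDK XML parser.
    // Assumption: real-world HTML would be cleaned up by Tidy/JTidy first.
    static String htmlToText(String wellFormedMarkup) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(wellFormedMarkup)));
            return toText(doc);
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse markup", e);
        }
    }

    public static void main(String[] args) {
        String page = "<html><body><h1>Title</h1><p>Some <b>bold</b> text.</p></body></html>";
        System.out.println(htmlToText(page));
    }
}
```

A fuller version would probably also insert line breaks after block-level elements such as `p`, `h1` and `li`, which is where most of the real work in a text converter lies.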
Received on Friday, 24 March 2000 13:13:35 UTC