
HTML to TEXT with tidy ??

From: Max Hadersbeck <max@cis.uni-muenchen.de>
Date: Fri, 19 May 2000 16:43:37 +0200
Message-ID: <39255319.194BE701@cis.uni-muenchen.de>
To: html-tidy@w3.org, max <max@cis.uni-muenchen.de>
Dear users of tidy!

In the mail archive I found that there was already a discussion
about HTML2TXT.
We are interested in a C routine that does this job.
Are there any recent developments?
With tidy it should be "easy": "just" a new version of pprint.c would be
necessary.
Has any work been done on this, or should I write the routine myself?
We are an institute in Munich, Germany, doing research on information
processing, and we need a C program to
eliminate HTML tags.
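
To make the request concrete, here is a minimal sketch of the kind of
tag-eliminating routine meant. It is only an illustration under stated
assumptions: the name strip_tags and its signature are hypothetical, it is
not part of tidy or its pprint.c, and it works on raw characters rather
than on tidy's parse tree. It does not decode entities such as &amp; and
does not treat comments, SCRIPT or STYLE content specially.

```c
#include <stddef.h>

/* Hypothetical helper, not part of tidy: copies `in` to `out` with
 * everything between '<' and '>' removed. Writes at most out_size - 1
 * characters plus a terminating NUL and returns the number of
 * characters written (not counting the NUL). */
size_t strip_tags(const char *in, char *out, size_t out_size)
{
    size_t n = 0;
    int in_tag = 0;     /* nonzero while between '<' and '>' */
    char quote = '\0';  /* active quote character inside a tag, or NUL */

    if (out_size == 0)
        return 0;

    for (; *in != '\0'; ++in) {
        char c = *in;

        if (in_tag) {
            if (quote != '\0') {
                /* Inside a quoted attribute value: only the matching
                 * quote ends it, so a '>' here does not close the tag. */
                if (c == quote)
                    quote = '\0';
            } else if (c == '"' || c == '\'') {
                quote = c;
            } else if (c == '>') {
                in_tag = 0;
            }
        } else if (c == '<') {
            in_tag = 1;
        } else if (n + 1 < out_size) {
            /* Plain text: copy it while there is room left. */
            out[n++] = c;
        }
    }

    out[n] = '\0';
    return n;
}
```

Called on the string "<p>Hello <b>world</b>!</p>" with a large enough
buffer, this leaves "Hello world!" in the buffer. A version built on tidy
would presumably walk the parsed document instead, which is why a new
pprint.c looks like the natural place for it.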

Thanks in advance

Max Hadersbeck

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Dr. Maximilian Hadersbeck
Centrum fuer Informations- und Sprachverarbeitung
Ludwig Maximilians Universitaet
Oettingerstr. 67
80538 Muenchen
(089) 21782717, FAX (089) 21782701
max@cis.uni-muenchen.de
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Received on Friday, 19 May 2000 10:40:19 GMT
