W3C home > Mailing lists > Public > html-tidy@w3.org > January to March 2003

Need to Strip all HTML tages from a renderd web Page

From: Jamie Eagan <jamieeagan@agora-inc.com>
Date: Wed, 5 Feb 2003 15:43:01 -0500
Message-ID: <9C5B4C2D9DC7D211BB0F00A0C9DD461809306151@MERCURY>
To: "'html-tidy@w3.org'" <html-tidy@w3.org>

> Is anyone aware of a utility to remove the content from a web page. We are
> converting a large amount of content from an existing web site to a CM
> system.  In the past my company has always done this manually by copying
> the site content from a rendered page and copying to a txt editor like
> Notepad (thereby stripping all the HTML) and then copying into the CM
> editor.  We have the ability to load the information into the app if the
> content is loaded as text.  Is anyone aware of a tool that can spider
> through a site and create multipletext files....
Received on Wednesday, 5 February 2003 15:44:02 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:38:53 UTC