
RE: to XML, not XHTML

From: Jelks Cabaniss <jelks@jelks.nu>
Date: Wed, 29 Aug 2001 02:26:46 -0400
To: <html-tidy@w3.org>
Message-ID: <001f01c13053$946aca90$6501a8c0@alex1.va.home.com>

Matt G wrote:

> Is there a way to force Tidy to ignore "HTML good/bad-ness" 
> and only convert badly formed HTML into well-formed XML 
> (which should be much more efficient)? Or is there another 
> utility (COM interface preferred, command-line okay, no GUI 
> allowed) that will do this?
> 
> I don't care about producing good HTML/XHTML, all I need is 
> to produce something I can shove into an XML parser and use 
> XPath/XSLT to extract data. It will be used by automation 
> scripts and robots.

XHTML *is* well-formed XML.
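
To make that concrete, here is a minimal sketch of feeding XHTML to an ordinary XML parser and pulling data out with an XPath-style query. It assumes Python's standard xml.etree.ElementTree as the parser, which is my choice for illustration and not something mentioned in this thread. The XHTML string is a hand-written stand-in for Tidy output, and extract_items is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Hand-written stand-in for Tidy's XHTML output (an assumption for
# illustration; not produced by actually running Tidy).
XHTML = """<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Prices</title></head>
  <body>
    <p>First<br /></p>
    <ul>
      <li class="item">Widget</li>
      <li class="item">Gadget</li>
    </ul>
  </body>
</html>"""

# XHTML elements live in this namespace, so queries must be qualified with it.
NS = {"x": "http://www.w3.org/1999/xhtml"}


def extract_items(xhtml_text):
    """Parse XHTML with a plain XML parser; return the text of each li.item."""
    root = ET.fromstring(xhtml_text)
    return [li.text for li in root.findall(".//x:li[@class='item']", NS)]


if __name__ == "__main__":
    print(extract_items(XHTML))  # prints ['Widget', 'Gadget']
```

One caveat: XML predefines only five named entities, so a parser that does not read the XHTML DTD will reject references such as &nbsp;. Emitting numeric character references instead avoids that. I believe Tidy's numeric-entities option does this, but check the documentation for your build.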

As to a Tidy COM interface, see

	http://perso.wanadoo.fr/ablavier/TidyCOM/


/Jelks
Received on Wednesday, 29 August 2001 02:27:07 GMT
