RE: to XML, not XHTML

Matt G wrote:

> Is their a way to force Tidy to ignore "HTML good/bad-ness" 
> and only convert badly formed HTML into well-formed XML 
> (which should be much more efficient). Or is there another 
> utility (COM interface preferred, command-line okay, no GUI 
> allowed) that will do this?
> 
> I don't care about producing good HTML/XHTML, all I need is 
> to produce something I can shove into an XML parser and use 
> XPath/XSLT to extract data. It will be used by automation 
> scripts and robots.

XHTML *is* well-formed XML.

As to a Tidy COM interface, see

	http://perso.wanadoo.fr/ablavier/TidyCOM/


/Jelks

Received on Wednesday, 29 August 2001 02:27:07 UTC