- From: Jelks Cabaniss <jelks@jelks.nu>
- Date: Wed, 29 Aug 2001 02:26:46 -0400
- To: <html-tidy@w3.org>
Matt G wrote:

> Is there a way to force Tidy to ignore "HTML good/bad-ness"
> and only convert badly formed HTML into well-formed XML
> (which should be much more efficient). Or is there another
> utility (COM interface preferred, command-line okay, no GUI
> allowed) that will do this?
>
> I don't care about producing good HTML/XHTML, all I need is
> to produce something I can shove into an XML parser and use
> XPath/XSLT to extract data. It will be used by automation
> scripts and robots.

XHTML *is* well-formed XML.

As to a Tidy COM interface, see http://perso.wanadoo.fr/ablavier/TidyCOM/

/Jelks
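To illustrate the point that XHTML can be fed straight to an XML parser, here is a minimal Python sketch. The sample document, the `extract_links` helper, and the `x` namespace prefix are assumptions made up for this illustration, not actual Tidy output. It uses only the standard-library `xml.etree.ElementTree` module and its limited XPath support.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: the kind of XHTML a tidying step might produce.
XHTML = """<?xml version="1.0"?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Sample</title></head>
  <body>
    <p>See <a href="http://example.org/a">first</a> and
       <a href="http://example.org/b">second</a>.<br /></p>
  </body>
</html>"""

# XHTML elements live in this namespace, so path queries must name it.
NS = {"x": "http://www.w3.org/1999/xhtml"}


def extract_links(xhtml_text):
    """Parse XHTML as plain XML and return (href, link text) pairs."""
    root = ET.fromstring(xhtml_text)  # raises ParseError if not well-formed
    return [(a.get("href"), a.text) for a in root.findall(".//x:a", NS)]


if __name__ == "__main__":
    for href, text in extract_links(XHTML):
        print(href, text)
```

One caveat with this approach: a non-validating parser such as this one does not know HTML's named entities (for example `&nbsp;`), so the input should use numeric character references or have its entities resolved beforehand.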
Received on Wednesday, 29 August 2001 02:27:07 UTC