- From: P. T. Rourke <ptrourke@mediaone.net>
- Date: Thu, 20 Jan 2000 13:13:34 -0500
- To: "Peter Levine" <plevine@intraware.com>, <html-tidy@w3.org>
It is, after all, called "HTML-TIDY." My guess is that the program assumes when you ask for XML that you want a *web page* in XML rather than a db for example; and most web browsers expect the <html>, <head>, <title> and <body> tags (and their closing equivalents), so it puts them in. I could be wrong, of course . . . I'd use TIDY as my FIRST cleanup step. PTRourke > Hi, > > When I set output-xml: yes why does the output include <html>, <head>, > <title> and <body> tags when my original file doesn't include these > tags? > > I'm using tidy as a last cleanup step after stripping those tags from an > HTML file. The idea is to get my 'almost' XML' file cleaned up by tidy > before presenting it to an XML parser. > > TIA, > Pete > > Peter Levine > Senior Software Engineer > plevine@intraware.com http://www.intraware.com > phone: (925) 253-6658 fax: (925) 253-4599 > > Intraware...Control Your Technology > >
Received on Thursday, 20 January 2000 13:14:03 UTC