Re: extra tags in output

It is, after all, called "HTML-TIDY." My guess is that the program assumes
when you ask for XML that you want a *web page* in XML rather than a db for
example; and most web browsers expect the <html>, <head>, <title> and <body>
tags (and their closing equivalents), so it puts them in.

I could be wrong, of course . . .

I'd use TIDY as my FIRST cleanup step.

PTRourke

> Hi,
>
> When I set output-xml: yes why does the output include <html>, <head>,
> <title> and <body> tags when my original file doesn't include these
> tags?
>
> I'm using tidy as a last cleanup step after stripping those tags from an
> HTML file. The idea is to get my 'almost' XML' file cleaned up by tidy
> before presenting it to an  XML parser.
>
> TIA,
> Pete
>
> Peter Levine
> Senior Software Engineer
> plevine@intraware.com   http://www.intraware.com
> phone: (925) 253-6658   fax: (925) 253-4599
>
> Intraware...Control Your Technology
>
>

Received on Thursday, 20 January 2000 13:14:03 UTC