RE: Tidy-ing multiple XML files: A good idea?

Hi Roger,

I think you'll be OK.  Tidy is _not_ appropriate if your XML is whitepace
sensitive.  But, if you just want pretty printing, which necessarily adds
newlines and other whitespace, Tidy should do what you want.

Question: which version are you using?  Would you consider trying out the
current beta (see http://tidy.sourceforge.net)?  We'd appreciate hearing
about any problems you encounter.

take it easy,
Charlie



-----Original Message-----
From: Harkavy, Roger [mailto:rharkavy@erisco.com]
Sent: Thursday, November 01, 2001 2:33 PM
To: 'html-tidy@w3.org'
Subject: Tidy-ing multiple XML files: A good idea?


I'm working on a project that will require referencing and displaying
roughly 75 XML files in a document. All of these files are stored in the
same folder, and some of them are pretty messy with too many/not enough line
breaks, no indentation, etc. In order to make the XML presentable for
publishing, I have been experimenting with Tidy to make the files look a
little nicer.

I'm using the following command line (without a config.txt) and I like what
I'm getting: tidy -xml -i -m -wrap 72 *.xml

However, I recently re-read the Tidy home page's notes on "Limited support
for XML". I love using Tidy, but am I better off choosing another tool for
this job? As far as I know, our files don't use the two features the page
says Tidy has trouble with (CDATA sections or DTD subsets). However, it is
critical that the XML information we present is accurate and I'm hoping that
Tidy isn't messing anything up.

Thanks in advance for any info you can give me.

Received on Thursday, 1 November 2001 15:18:53 UTC