W3C home > Mailing lists > Public > html-tidy@w3.org > July to September 1999

Problem with Clean on Word 97 file

From: Jim Mundy <Mundy.Jim@metnet.navy.mil>
Date: Fri, 17 Sep 1999 10:37:58 -0700
Message-ID: <000901bf0133$62ae1940$1d665098@metcast.ifnoc>
To: <html-tidy@w3.org>
Tried to run HTML-Tidy with the clean (-c) option on the attached page (test.htm), which was saved from Word 97 as an HTML document.  Also attached is the first part of the TidyOut.log file generated (enough to show the problem).  It's obvious that Tidy got stuck in a loop somewhere on line 17 of the input HTML file.  When I originally ran Tidy (from HomeSite 4.5) it eventually locked up my machine after the TidyOut.log file grew to over 500MB.

I think Tidy is great, and was looking forward to using it to clean up Word's poor excuse for HTML rather than doing it by hand.  This test, however, gives me pause.  I don't know whether this problem has been reported previously (I didn't see it in a brief perusal of the Release Notes), but I hope it can be fixed or worked around.

Thanks for listening.

Jim Mundy
Integrated Performance Decisions, Inc.
Monterey, CA
Mundy.Jim@metnet.navy.mil
(831) 656-4566



Received on Friday, 17 September 1999 13:38:20 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:42 GMT