TIDY : SEVERE BUG ? (was:Re: Howto deactivate Tidy Wrap Option ?)

(This has nothing to do with the wrap option...)

Hi Dave,

Perhaps you'll hear from me more often. I'm just about to tidy up a >1000 pages
Website, some of which with a quite complex structure, which might push TIDY to
its limits. Maybe even beyond them... :-))

     (FYI, I use a Cyrix 586 133 MHz machine built out of
     spare parts, with 64M RAM, under Win95.A release and
     Debian 2.0 GNU/Linux 2.0.34 (but no RPM converter, and
     not yet time to compile the sources...)

It was the third page I was working on when happened the following :

calling TIDY to cleanup the page "frag.html" (I'll keep a copy for you on the
site at URL http://www.globenet.org/periph/agenda/frag-old.html , so you may try
out yourself. If the page doesn't exist, this means that today's update is still
to come, and the old file is still named "frag.html").

I use a "html.bat" file with just one line :

> tidy.exe -config html.ini -wrap 1024 %1 > html.htm

You notice that I redirect the screen output in order to create the cleaned-up
file, as (unlike the manual says) TIDY doesn't write to the disk, at least when
I use it in a DOS-window inside Win95. I didn't yet try it otherwise.

and the "html.ini" file looks like this :

> indent: auto
> doctype: strict
> char-encoding: ascii
> break-before-br: yes
> uppercase-tags: yes
> uppercase-attributes: yes
> clean: yes
> logical-emphasis: yes
> keep-time: yes
> error-file: error.txt

So, I started with :

<keyboard>html frag~1.htm</keyboard>

waited one or 2 seconds, as usual, but then... (thought, maybe it takes somewhat
longer, as this page is the biggest I tidied up, 'til now)
after some minutes, I <CTRL-BREAK> the program. Then, I found a zero-length
"html.htm" file and an "error.txt" file which at first sight had 365k in size,
but NO ! It were 365MEGAS ! In fact, if I'd waited a bit more, maybe the program
had eaten up all my disk space...
Next time, I'll let it write the output to a diskette :-)

CU soon

Norbert

---------------
P.S. The error file starts like this :

> 
> Tidy (vers 27th September 1999) Parsing "frag~1.htm"
> line 8 column 5 - Warning: <script> lacks "type" attribute
> line 15 column 5 - Warning: <table> lacks "summary" attribute
> line 20 column 17 - Warning: <table> lacks "summary" attribute
> line 23 column 29 - Warning: <img> lacks "alt" attribute
> line 27 column 29 - Warning: <img> lacks "alt" attribute
> line 31 column 29 - Warning: <img> lacks "alt" attribute
> line 42 column 5 - Warning: <table> lacks "summary" attribute
> line 56 column 45 - Warning: discarding unexpected plain text
> line 57 column 45 - Warning: discarding unexpected plain text
> line 58 column 45 - Warning: discarding unexpected plain text
> line 59 column 45 - Warning: discarding unexpected plain text
> line 60 column 45 - Warning: discarding unexpected plain text
> line 61 column 45 - Warning: discarding unexpected plain text
> line 62 column 45 - Warning: discarding unexpected plain text
> line 63 column 45 - Warning: discarding unexpected plain text
> line 64 column 45 - Warning: discarding unexpected plain text
> line 65 column 45 - Warning: discarding unexpected plain text
> line 66 column 45 - Warning: discarding unexpected plain text
> line 67 column 45 - Warning: discarding unexpected plain text
> line 68 column 45 - Warning: discarding unexpected plain text
> line 73 column 102 - Warning: <img> proprietary attribute value "absmiddle"
> line 84 column 51 - Warning: discarding unexpected plain text
> line 85 column 55 - Warning: discarding unexpected plain text
> line 86 column 55 - Warning: discarding unexpected plain text
> line 87 column 55 - Warning: discarding unexpected plain text
> line 88 column 55 - Warning: discarding unexpected plain text
> line 89 column 55 - Warning: discarding unexpected plain text
> line 99 column 104 - Warning: discarding unexpected </a>
> line 103 column 9 - Warning: plain text isn't allowed in <dir> elements
> line 103 column 10 - Warning: missing </blockquote> before <p>
> line 106 column 5 - Warning: discarding unexpected </dir>
> line 145 column 97 - Warning: unescaped & or unknown entity "&nbsp:"
> line 164 column 9 - Warning: missing </tt> before <dir>
> line 164 column 9 - Warning: trimming empty <tt>
> line 164 column 14 - Warning: <u> isn't allowed in <dir> elements
> line 164 column 14 - Warning: inserting implicit <tt>
> line 165 column 10 - Warning: missing </tt> before <p>
> line 165 column 12 - Warning: inserting implicit <tt>
> line 165 column 126 - Warning: inserting implicit <br>
> line 166 column 10 - Warning: missing </tt> before <p>
> line 166 column 12 - Warning: inserting implicit <tt>
> line 168 column 9 - Warning: inserting implicit <br>
> line 169 column 10 - Warning: missing </tt> before <p>
> line 169 column 12 - Warning: inserting implicit <tt>
> line 172 column 30 - Warning: replacing unexpected </span> by </tt>
> line 175 column 11 - Warning: discarding unexpected </tt>
> line 179 column 9 - Warning: plain text isn't allowed in <dir> elements
> line 179 column 10 - Warning: missing </blockquote> before <p>
> line 197 column 5 - Warning: discarding unexpected </dir>
> line 198 column 27 - Error: <img> missing '>' for end of tag
> line 200 column 5 - Warning: inserting implicit <br>
> line 201 column 6 - Warning: missing </a> before <p>
> line 201 column 27 - Error: <img> missing '>' for end of tag
> line 202 column 580 - Warning: inserting implicit <br>
> line 203 column 6 - Warning: missing </a> before <dir>
> line 204 column 9 - Warning: plain text isn't allowed in <dir> elements
> line 204 column 10 - Warning: missing </blockquote> before <p>
> line 208 column 5 - Warning: discarding unexpected </dir>
> line 209 column 27 - Error: <img> missing '>' for end of tag
> line 211 column 5 - Warning: inserting implicit <br>
> line 212 column 6 - Warning: missing </a> before <dir>
> line 212 column 10 - Warning: plain text isn't allowed in <dir> elements
> line 218 column 27 - Error: <img> missing '>' for end of tag
> line 219 column 351 - Warning: replacing unexpected </b> by </font>
> line 219 column 437 - Warning: discarding unexpected </font>
> line 220 column 5 - Warning: inserting implicit <br>
> line 221 column 6 - Warning: missing </a> before <dir>
> line 222 column 9 - Warning: plain text isn't allowed in <dir> elements
> line 222 column 10 - Warning: missing </blockquote> before <p>
> line 223 column 5 - Warning: discarding unexpected </dir>
> line 226 column 10 - Warning: plain text isn't allowed in <dir> elements
> line 228 column 10 - Warning: missing </blockquote> before <p>
> line 233 column 5 - Warning: discarding unexpected </dir>
> line 240 column 9 - Warning: <p> isn't allowed in <dir> elements
> line 262 column 10 - Warning: plain text isn't allowed in <dir> elements
> line 289 column 10 - Warning: discarding unexpected </p>
> line 289 column 10 - Warning: discarding unexpected </p>
...

(last line repeated indefinitely...)



-- 

   *   *
    \ /
 *-- * --*
    / \    This was TMTM (The Megabyte Text Magician)
   /   *   ------------------------------------------
  /                  norbert@globenet.org 
 /              http://www.globenet.org/periph
/
+-----------------------------------------------------------+
|           stop the execution of Mumia Abu-Jamal           |
|    if you agree copy these 3 sentences in your own sig    |
| more info: http://www.xs4all.nl/~tank/spg-l/sigaction.htm |
|              42 DAYS LEFT TO SAVE MUMIA !!!               |
+-----------------------------------------------------------+

Received on Wednesday, 20 October 1999 08:50:25 UTC