Sorry, forgot the sample files. When converting paragraphs w/ the class="MsoListBullet" attribute to <ul> <li></li> ... </ul>, Tidy always leaves the second item out of the list and it appears as the following paragraph. A sample input file (word2k_list.html) and output file (work2k_tidy.html) are attached. The fix is to the CleanWord2000() routine in clean.c: C:\tidy\TidyLib>diff -w -c1 c:\temp\clean.c .\clean.c *** c:\temp\clean.c Tue Dec 05 18:57:59 2000 --- .\clean.c Tue Dec 05 19:21:45 2000 *************** *** 1805,1807 **** InsertNodeAtEnd(list, node); ! node = list->next; } --- 1805,1807 ---- InsertNodeAtEnd(list, node); ! node = list; } The effect of the change is to avoid Purging the attributes of the second item of the list before the class=MsoListBullet test can be applied. This change takes effect at the bottom of the containing loop.
This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 7 December 2009 10:37:49 GMT