Inserted nodes are being created with incorrect node->end values in certain cases. The following java example program (provided by dglo@users.sourceforge.net) illustrates the problem: import java.io.ByteArrayInputStream; import org.w3c.tidy.Tidy; public class NodeBug { public static final void main(String[] args) { String badHTML = "<html><font><center></center></p>\n\n</html>"; Tidy tidy = new Tidy(); tidy.parseDOM(new ByteArrayInputStream(badHTML.getBytes()), null).cloneNode(true); } } Similar results are obtained with the parse() method. I don't have a c compiler so I can't reliably produce c code here which causes the same problem but it should look approximately the same. The problem occurs, I believe, at istack.c line 242 in method InsertedToken. The line node->end = lexer->txtstart; should be changed to read node->end = lexer->txtend; GaryReceived on Wednesday, 20 December 2000 04:17:47 GMT
This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 18:21:27 GMT