- From: Gary L Peskin <garyp@firstech.com>
- Date: Wed, 20 Dec 2000 01:17:18 -0800
- To: Html-Tidy <html-tidy@w3.org>
Inserted nodes are being created with incorrect node->end values in certain cases. The following java example program (provided by dglo@users.sourceforge.net) illustrates the problem: import java.io.ByteArrayInputStream; import org.w3c.tidy.Tidy; public class NodeBug { public static final void main(String[] args) { String badHTML = "<html><font><center></center></p>\n\n</html>"; Tidy tidy = new Tidy(); tidy.parseDOM(new ByteArrayInputStream(badHTML.getBytes()), null).cloneNode(true); } } Similar results are obtained with the parse() method. I don't have a c compiler so I can't reliably produce c code here which causes the same problem but it should look approximately the same. The problem occurs, I believe, at istack.c line 242 in method InsertedToken. The line node->end = lexer->txtstart; should be changed to read node->end = lexer->txtend; Gary
Received on Wednesday, 20 December 2000 04:17:47 UTC