- From: Gary L Peskin <garyp@firstech.com>
- Date: Fri, 4 Aug 2000 19:28:30 -0400 (EDT)
- To: Html-Tidy <html-tidy@w3.org>
- Message-ID: <398B51A7.C244D01B@firstech.com>
I've attached a particularly ugly piece of HTML code that I received. You'll notice on line 59, there is a td element with a BACKGROUND= element where the quotes seem to be hosed up. When Tidy is parsing this, it thinks BACKGROUND is the attribute and SRC= is the value of the attribute. So far, so good. The next attribute is images/clearPixel.gif" and there is no value since this is not followed by an equal sign. This causes Tidy to vomit while trying to reference a null pointer due to a null value being passed into Report.attrError when reporting a BAD_ATTRIBUTE_VALUE. My recommended fix for Java Tidy is to change Lexer.parseAttrs at line 2,612 of Lexer.java (8 july 2000 edition) from: Report.attrError(this, this.token, value, Report.BAD_ATTRIBUTE_VALUE); to: if (value == null) Report.attrError(this, this.token, attribute, Report.MISSING_ATTR_VALUE); else Report.attrError(this, this.token, value, Report.BAD_ATTRIBUTE_VALUE); Gary
Attachments
- text/html attachment: Ugly1.html
Received on Friday, 4 August 2000 19:37:17 UTC