W3C home > Mailing lists > Public > public-html@w3.org > April 2009

Re: Reparsing of comments, escaped text spans, and --!>

From: James Graham <jgraham@opera.com>
Date: Wed, 29 Apr 2009 13:22:54 +0200
Message-ID: <49F8388E.3040408@opera.com>
To: Simon Pieters <simonp@opera.com>
CC: Ian Hickson <ian@hixie.ch>, "public-html@w3.org" <public-html@w3.org>
Simon Pieters wrote:
> On Fri, 23 May 2008 05:49:19 +0200, Ian Hickson <ian@hixie.ch> wrote:
> 
>>> Philip` found 128 pages with open "<!--" out of ~130K pages, listed in
>>> http://philip.html5.org/data/pages-with-unclosed-comments.txt . I looked
>>> through the first 82 pages.
> 
>>> Also found during this research was that a lot of pages use --!> and
>>> expect it to close the comment. --!> closes comments in WebKit and
>>> Gecko. We'll probably make --!> close comments given this data.
>>
>> IE doesn't close on --!>, so I haven't changed this.
> 
> But IE reparses, so the pages mostly work in IE thanks to the reparse.
> 
> We are still finding pages that break because they expect --!> to close 
> the comment. Please change HTML5 here to make more Web pages work.
> 

I agree. I looked at some pages that currently trigger reparsing in IE 
(the list of urls was compiled by Philip and is available at [1]). 
Assuming my analysis was correct, 129 of (the unscripted, live versions 
of) those pages have an unterminated comment in the current parsing 
algorithm, of which 20 are fixed by making --!> end comments (matches 
Gecko, Webkit, IE, Opera quirks) and 23 are fixed by making --\s*> 
terminate comments (matches Gecko, IE, Opera quirks). I think HTML 5 
should make both of those cases work to prevent significant 
compatibility regressions.

[1] http://philip.html5.org/data/dotbot-20090424.txt
Received on Wednesday, 29 April 2009 11:26:58 UTC

This archive was generated by hypermail 2.3.1 : Monday, 29 September 2014 09:39:03 UTC