Re: several messages

On 2 Jun 2009, at 11:22, Simon Pieters wrote:

>>> > Supporting both '&' and ';' seems like a exercise in bug creation.
>>> > Parsing URIs is hard enough to do right as it is without making  
>>> things
>>> > even more complicated and adding even more edge cases.
>>>
>>> But that's exactly what you are doing, except here it applies to  
>>> parsing
>>> href attributes, not URIs.
>>
>> No, no change to the parsing rules was involved here.
>
> Writing HTML documents seems to make this valid:
>
>   <a href="&copy=">
>
> and claims that the attribute value contains just text and no  
> character references (since character references end with ";").
>
> Yet, Parsing HTML documents interprets the above the same as <a  
> href="©=">, as far as I can tell.


Indeed. I've incorrectly remembered that semicolon in attributes is  
required, while in reality it's more complicated than that. Here's  
previous discussion and a test case:

http://lists.whatwg.org/htdig.cgi/whatwg-whatwg.org/2007-June/011931.html

-- 
regards, Kornel

Received on Tuesday, 2 June 2009 11:22:18 UTC