[whatwg] Stripping newlines from URI attributes

On Wed, Jul 29, 2009 at 5:49 PM, Kartikaya
Gupta<lists.whatwg at stakface.com> wrote:
> It seems that most browsers do some sort of newline and tab removal from URI attributes. For example, if you have
>
> <img src="foo
> bar.jpg">
>
> browsers will still render the image called "foobar.jpg" despite the CRLF pair in the middle of the src attribute. The behavior actually seems a bit more complex; quote from one of my co-workers who investigated this:
>
>
> This behavior doesn't seem to be specced anywhere as far as I can tell. Assuming the WEBADDRESSES spec referred to in HTML5 is the one at http://www.w3.org/html/wg/href/draft.html that only says to trim leading/trailing whitespace and url-encode the rest. This doesn't seem to match existing behavior, so it should probably be updated.

How weird. Frankly how insane. While I can believe that some browsers
act like this, I would be quite surprised to find that they were
compatible with each other. Indeed your tests seem to show they
aren't.

This is an area where we should not attempt (and probably simply
cannot) maintain compatibility with existing browsers. They're just
too broken.

-- 
Elliotte Rusty Harold
elharo at ibiblio.org

Received on Thursday, 30 July 2009 06:37:08 UTC