Re: URL parsing from Julian Reschke on 2010-04-28 (public-html@w3.org from April 2010)

From: Julian Reschke <julian.reschke@gmx.de>
Date: Wed, 28 Apr 2010 16:59:25 +0200
To: Adam Barth <w3c@adambarth.com>
CC: HTML WG <public-html@w3.org>, Larry Masinter <LMM@acm.org>
Message-ID: <4BD84D4D.6080708@gmx.de>

On 23.04.2010 00:03, Adam Barth wrote:
> I haven't been paying that close attention to all the machinations
> around URL parsing in this working group, but I've been looking into
> URL parsing a bit recently.  In case it's useful to this working group
> (or the IETF's URL working group), I've attached some raw data on how
> various browsers parse URLs.  These tests are from this test suite:
>
> http://trac.webkit.org/browser/trunk/LayoutTests/fast/url
>
> which is adapted from these unit tests:
>
> http://code.google.com/p/google-url/source/browse/trunk/src/url_canon_unittest.cc
>
> I might send a summary of my findings after I analyze the data.
>
> Enjoy!
> Adam

Hi Adam,

Very interesting.

Here's a question, picking a random test case (scheme name normalization):

   PASS canonicalize('HTTP://example.com/') is 'http://example.com/'

Could you explain, based on the HTML5 spec (if in doubt, an earlier
version that doesn't yet rely on the IRI spec), why the scheme name is
expected to be lowercased?
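
To make concrete what that test case asserts, here is a minimal sketch in
Python. The function name canonicalize_scheme is hypothetical and is not
the canonicalize() used by the WebKit or google-url tests. As an
assumption, it takes the scheme grammar from RFC 3986 (Section 3.1), not
from the HTML5 spec, and it touches nothing but the scheme:

```python
def canonicalize_scheme(url: str) -> str:
    """Hypothetical helper: lowercase the scheme, leave everything else as-is."""
    scheme, sep, rest = url.partition(":")
    # No colon, or nothing before it: there is no scheme to normalize.
    if not sep or not scheme:
        return url
    # Assumed grammar (RFC 3986, Section 3.1):
    #   scheme = ALPHA *( ALPHA / DIGIT / "+" / "-" / "." )
    if not (scheme.isascii() and scheme[0].isalpha()):
        return url
    if not all(c.isalnum() or c in "+-." for c in scheme):
        return url
    return scheme.lower() + sep + rest


print(canonicalize_scheme("HTTP://example.com/"))
```

With this sketch the host and path keep their case, so it only covers the
one normalization step the question is about.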

Best regards, Julian

Received on Wednesday, 28 April 2010 15:00:10 UTC