Re: Error handling in URIs from Julian Reschke on 2008-06-24 (uri@w3.org from June 2008)

From: Julian Reschke <julian.reschke@gmx.de>
Date: Tue, 24 Jun 2008 16:18:27 +0200
To: Anne van Kesteren <annevk@opera.com>
CC: Ian Hickson <ian@hixie.ch>, uri@w3.org
Message-ID: <48610233.3090502@gmx.de>

Anne van Kesteren wrote:
>> There is nothing that needs to be fixed in the URI spec.
>>
>> If other specs accept invalid URIs, then those specs also need to 
>> define how to do that.
> 
> So why do we get LEIRIs in iri-bis?

First of all, IRI != URI. URI is a full, stable standard. IRI currently 
is a proposed standard.

I'm not convinced that a definition of LEIRIs *belongs* into the IRI 
spec; it probably should be handled separately.

>>>  I think the problem is that currently no specification says how to 
>>> construct a URI from a bunch of Unicode characters while taking into 
>>> account that the path component always needs to be in UTF-8 and the 
>>> query component in the document encoding.
>>
>> But again, that's not a problem with URI or IRI, right?
> 
> I'm not quite sure what you mean with again. Also, the URI/IRI spec 
> seems like the logical place to define this. I suppose that's the reason 
> the IRI spec is being updated to handle LEIRIs.

The fact that HTML expects different character encodings for different 
parts of an IRI is not a problem of IRI, but of HTML. It's HTML that 
defines how HTML form parameters get encoded. I fail to see how either 
URI or IRI can fix this -- it's outside their scope.

BR, Julian

Received on Tuesday, 24 June 2008 14:19:14 UTC