Re: Confusing use of "URI" to refer to IRIs, and IRI handling in the DOM

On Sun, 29 Jun 2008, Julian Reschke wrote:
> Ian Hickson wrote:
> > On Sun, 29 Jun 2008, Julian Reschke wrote:
> > > 3. The distinction between HTML5-URL and RFC3987-IRI *is* important,
> > > because
> > > 
> > > - it affects the way how identifiers can be delimited; HTML5-URLs 
> > > can contain spaces
> > 
> > No, they're not allowed to contain spaces.
> 
> *Valid* URLs aren't, but the spec spends a considerable amount of space
> dealing with invalid ones.

Well if you're willing to consider invalid ones, what about invalid URIs? 
They can contain spaces too. What's the distinction between an invalid URL 
and an invalid URI?


> > > - mapping of non-ASCII characters in query parts differs from 
> > > RFC3987-IRI.
> > 
> > Only in non-conforming documents.
> 
> (In which case documents with valid IRIs get non-conforming when using 
> the wrong document encoding...)

Right, otherwise documents with valid IRIs but non-UTF-8 encodings 
wouldn't be treated as per the IRI spec, which is bad (presumably) and 
shouldn't be encouraged, and should be brought to the author's attention.

-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'

Received on Sunday, 29 June 2008 09:40:11 UTC