Re: URIEquivalence-15

As I've said before, I think the only practical URI comparison
algorithm is "lexicographic identity". But every time I've said that,
I've had this nagging concern about how to deal with characters that
might or might not be escaped.

There's an erratum to XML[1] that tries to tackle a related issue, but
the IRI draft[2] seems to provide a more complete algorithm for
"normalizing" the possibly escaped characters in a URI. (I haven't
read the draft with great care, so I could be overlooking something.)

I'm now inclined to say that the right way to compare URIs is to
convert them to IRIs and then test the results for lexicographic
identity.
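
To make that concrete, here is a rough sketch of what I have in mind.
It is not the algorithm from the IRI draft, just my simplified reading
of the idea: unescape %XX sequences that stand for "safe" ASCII or for
valid UTF-8, leave everything else exactly as written, and then compare
the strings. The names uri_to_iri and same_uri are made up for
illustration, and the UNRESERVED set is an assumption borrowed from the
later RFC 3986.

```python
import re

# Assumption: the RFC 3986 "unreserved" set, which postdates this thread.
UNRESERVED = frozenset(
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    "abcdefghijklmnopqrstuvwxyz"
    "0123456789-._~"
)

# One or more consecutive %XX escapes.
ESCAPE_RUN = re.compile(r"(?:%[0-9A-Fa-f]{2})+")


def _unescape_run(match):
    """Unescape what is safe in a run of %XX escapes; keep the rest verbatim."""
    text = match.group(0)
    raw = bytes.fromhex(text.replace("%", ""))
    out = []
    i = 0
    while i < len(raw):
        if raw[i] < 0x80:
            ch = chr(raw[i])
            # Reserved or unsafe ASCII stays escaped, exactly as written.
            out.append(ch if ch in UNRESERVED else text[3 * i:3 * i + 3])
            i += 1
            continue
        # Non-ASCII: accept the shortest valid UTF-8 sequence starting here.
        for n in (2, 3, 4):
            try:
                ch = raw[i:i + n].decode("utf-8")
            except UnicodeDecodeError:
                continue
            out.append(ch)
            i += n
            break
        else:
            # Not valid UTF-8, so leave this octet escaped.
            out.append(text[3 * i:3 * i + 3])
            i += 1
    return "".join(out)


def uri_to_iri(uri):
    """Hypothetical helper: simplified URI-to-IRI conversion (a sketch only)."""
    return ESCAPE_RUN.sub(_unescape_run, uri)


def same_uri(a, b):
    """Compare two URIs by the identity of their converted forms."""
    return uri_to_iri(a) == uri_to_iri(b)
```

With this sketch, "http://example.org/%7Euser" and
"http://example.org/~user" compare as the same, while
"http://example.org/a%2Fb" and "http://example.org/a/b" do not, because
an escaped reserved character is left alone. It deliberately does no
case folding, no Unicode normalization, and none of the draft's other
checks.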

                                        Be seeing you,
                                          norm

[1] http://www.w3.org/XML/xml-V10-2e-errata#E4
[2] http://www.ietf.org/internet-drafts/draft-duerst-iri-00.txt

-- 
Norman.Walsh@Sun.COM   | It is not failure of others to appreciate your
XML Standards Engineer | abilities that should trouble you, but rather
XML Technology Center  | your failure to appreciate theirs.--Confucius
Sun Microsystems, Inc. | 

Received on Monday, 29 April 2002 10:01:32 UTC