Re:IRI/URI Canonicalization does not address IRIs with IDNs

Eric said:
[[
  I've never
implemented Unicode normalization, but I expect it's not trivial.
]]

You can use a third party library, e.g. IBM's icu library.

It is then quite straightforward.

The icu4j library is fairly large, and I believe larger than needed for 
this problem (since it solves most other I18N problems too). But, I 
don't think this is really a problem in practice.

I would urge the group to specify the right thing, whatever that is, and 
not be too concerned about the detail here.

I believe that fairly soon most Web addresses will have IDNs.

Jeremy

Received on Thursday, 14 February 2008 17:25:27 UTC