Re: IRI/URI Canonicalization does not address IRIs with IDNs

Java 1.6 also includes normalisation classes - see:

http://java.sun.com/javase/6/docs/api/java/text/Normalizer.html



Jeremy Carroll wrote:
>
>
> Eric said:
> [[
>  I've never
> implemented Unicode normalization, but I expect it's not trivial.
> ]]
>
> You can use a third party library, e.g. IBM's icu library.
>
> It is then quite straightforward.
>
> The icu4j library is fairly large, and I believe larger than needed 
> for this problem (since it solves most other I18N problems too). But, 
> I don't think this is really a problem in practice.
>
> I would urge the group to specify the right thing, whatever that is, 
> and not be too concerned about the detail here.
>
> I believe that fairly soon most Web addresses will have IDNs.
>
> Jeremy
>
>
----
David Clarke
Dragon Thoughts Ltd
http://www.dragonthouhts.com

Received on Friday, 15 February 2008 08:29:20 UTC