Re: URL-encode international characters in Java?

You can encode Big-5 and other double byte script characters in UTF16. I 
have seen IE5 is encoding the URLs with "%u" prefix for UTF16. But in 
case of UTF8 we don't have any standard prefix for representing that yet.

-Vinod

>Hi all,
> 
>Is there a standard way to URL-encode non-English characters in Java? For
>example, I know that '?' is URL-encoded as '%3F', but I don't know how or if
>Big-5 characters can be URL-encoded. I've experimented a bit, and found that
>IE will encode things differently based on the charset of the HTML doc which
>contains the form.
> 
>Ideally, I'd like to use functionality available in Java Servlets, or
>another Java code library, but any solutions would be much appreciated. I've
>looked at Java's java.net.URLEncoder class, but it's encode() method won't
>do it, as documented in the JDC's bug database (
>http://developer.java.sun.com/developer/bugParade/bugs/4257115.html
><http://developer.java.sun.com/developer/bugParade/bugs/4257115.html> ).
> 
>Is the only known solution to write my own encoder? If so, where can I find
>a list of the character's that *don't* need to be encoded? Is it just
>[A-Za-z0-9_]?
> 
>Thanks,
>Lenny Turetsky
>

Received on Thursday, 6 July 2000 16:49:17 UTC