- From: Vinod Balakrishnan <vinod@filemaker.com>
- Date: Thu, 6 Jul 2000 13:52:20 -0700
- To: "Lenny Turetsky" <LTuretsky@salesforce.com>, "'www-international@w3c.org'" <www-international@w3c.org>, "'servlet-interest@java.sun.com'" <servlet-interest@java.sun.com>
You can encode Big-5 and other double byte script characters in UTF16. I have seen IE5 is encoding the URLs with "%u" prefix for UTF16. But in case of UTF8 we don't have any standard prefix for representing that yet. -Vinod >Hi all, > >Is there a standard way to URL-encode non-English characters in Java? For >example, I know that '?' is URL-encoded as '%3F', but I don't know how or if >Big-5 characters can be URL-encoded. I've experimented a bit, and found that >IE will encode things differently based on the charset of the HTML doc which >contains the form. > >Ideally, I'd like to use functionality available in Java Servlets, or >another Java code library, but any solutions would be much appreciated. I've >looked at Java's java.net.URLEncoder class, but it's encode() method won't >do it, as documented in the JDC's bug database ( >http://developer.java.sun.com/developer/bugParade/bugs/4257115.html ><http://developer.java.sun.com/developer/bugParade/bugs/4257115.html> ). > >Is the only known solution to write my own encoder? If so, where can I find >a list of the character's that *don't* need to be encoded? Is it just >[A-Za-z0-9_]? > >Thanks, >Lenny Turetsky >
Received on Thursday, 6 July 2000 16:49:17 UTC