W3C home > Mailing lists > Public > www-international@w3.org > April to June 2004

EUC-JP encoding - Bug in IE6?

From: Daniel Goldschmidt <pepper@012.net.il>
Date: Sat, 3 Apr 2004 14:56:18 +0200
To: <www-international@w3.org>
Message-Id: <20040403125441.TLGZ19265.fep11@Blue>
Hi,

I found the following behavior with IE6, while using EUC-JP encoding:

 

I'm having a EUC-JP encoded page with a form, submitted using "GET" method.

 

For example, entering as input the character Unicode U9f3b, the browser
(IE6) encodes it to EUC-JP as C9A1 (%C9%A1in the URL). The server (JBoss)
transcodes it back to u9f3b as expected.

 

Entering as input the character Unicode u9b75, the browser (IE6) encodes it
to EUC-JP as FCE4 (%FC%E4 in the URL). Hmpff.. Here I have problem: The
server transcodes it to uFFFD ("REPLACEMENT CHARACTER"). I checked it
manually and got the same resultes: Java (JDK 1.4.2) does no recognize this
character, but IE6 does. I wrote manually a XML file encoded in EUC-JP with
those characters: IE6 transcoded it to u9b75 and Java transcoded it to
uFFFD.

 

I was thinking that maybe this character in not in EUC-JP encoding, and for
some reason IE6 recognized it. Any idea why IE6 behaves differently from
Java?

 

Thanks,

Daniel

 

 

 

 

 

Daniel Goldschmidt

 

Software Internationalization Solutions

M: +972-54-242580

 <http://www.i18nworld.com/> www.i18nworld.com

"Sure UNIX is user-friendly, It's just picky about who its friends are!"

 
Received on Saturday, 3 April 2004 07:57:04 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:03 GMT