W3C home > Mailing lists > Public > www-international@w3.org > October to December 2007

Re: Effect of IE7 "Send UTF-8 URLs" option

From: Najib Tounsi <ntounsi@emi.ac.ma>
Date: Tue, 23 Oct 2007 00:09:20 +0000
Message-ID: <471D3BB0.9050908@emi.ac.ma>
To: kuro@sonic.net
CC: www-international@w3.org

KUROSAKA Teruhiko wrote:
> Hello,
> Does anyone know what "Send UTF-8 URLs" option in Internet Explorer 7 does?

>From my experience, I suspect that this option instructs IE7 to send the
URL (IRI here) in native script encoded UTF-8. That means that IE7 will
not replace the utf-8 bytes of non-ASCII characters by escape sequences.
Note that the IRI should work, even if it's non-ascii characters are
represented with escape sequences, these are legal. But not all browsers
are smart enough to do that. (I think Opera is OK).

Hope this helps.


> I tried posting a form that includes non-ASCII characters in a form
> variable's value at http://test/XXYY.cgi where XX and YY are Japanese
> characters.  The form itself is written in EUC-JP encoding.
> I was expecting this option would affect how one of both of the URL path
> (XXYY.cgi) and the search-strings (?key=value) is turned into the
> hexadeciaml encoding.
> But my experiment result is that regardless of the option setting, IE 7
> interprets the URL path as UTF-8 and the search-strings as EUC-JP (the
> form's encoding), and generrates the %hh%hh... hex escapes.
> (Note: There is a recnet IE bug that relates to UTF-8 option:
> http://support.microsoft.com/default.aspx/kb/935729
> but what I see is different because I didn't use frames.)
> I am puzzled.

Najib TOUNSI (mailto:tounsi @ w3.org)
Bureau W3C au Maroc (http://www.w3c.org.ma/)
Ecole Mohammadia d'Ingenieurs, BP 765 Agdal-RABAT Maroc (Morocco)
Phone : +212 (0) 37 68 71 50 (P1711)  Fax : +212 (0) 37 77 88 53
Mobile: +212 (0) 61 22 00 30 
Received on Tuesday, 23 October 2007 00:09:33 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 21 September 2016 22:37:28 UTC