- From: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>
- Date: Fri, 22 Jul 2011 02:57:55 +0200
- To: Martin J. Dürst <duerst@it.aoyama.ac.jp>, "public-iri@w3.org" <public-iri@w3.org>
Leif Halvard Silli, Thu, 21 Jul 2011 21:05:16 +0200:
> For testing of page internal (that is #fragment links), you could
> create an ISO-8859-1 encoded page which contains links to directly
> typed fragments whose first letter begins with a non-ASCII letter from
> the ISO-8859-1 charset. And then you can test how that same page works
> if served/interpreted as another legacy, 8-bit encoding, such s KOI8-R
> etc. This test should compare wheter, for instance, in a ISO-8859-1
> page, href="#Dürst" would hit both id="Dürst" and id="Dürst".
Made some unpolished tests (with far to many links inside ...), based
on some old tests I had laying around:
http://malform.no/testing/html5/urls/
Results:
# With regard to hover display and display on the URL bar, then those
tests show, for fragment URIs:
* that directly typed characters in a URL get semantic display
in all the browsers tested (Firefox, Opera, Chrome/Safari/iCab, IE8)
* that UTF-8 based percentage encoded URLs are given semantic display
# in Firefox, Safari/iCab and Opera
# but not in IE8 or Chrome.
* that not-UTF-8 based percent encoded URLs get semantic display
# in Firefox
# not in any other browser. Caveat Operea. See note.
NOTE: For legacy encoded pages, then Opera makes a difference between
href="#D%FCrst" and href="D%FCrst": the fragment variant does not get
"semantic display" in Opera whereas the non-fragment URL does.
But there is a catch to what Opera (and Firefox too) do for
externally linking URLs: in the Windows-1251 encoded test page, the
'%FC' is turned into the cyrillic soft-sign letter.
# With regard to whether the URL works when activated, then those tests
show, for fragment URLs:
* that directly typed letters always works
* that UTF-8 based, percent encoded URLs
# never work, regardless of page encoding, for Opera and IE8
# should work always, for Firefox and Safari/iCab/Chrome
regardless of character or page encoding.
* that not-UTF-8 based percent encoded URLs are interpreted
"semantically"
# for in IE8: never and not in any encoding
# for Firefox, Safari and iCab: with non-UTF-8 encodings only
# for Chrome and Opera: in any encoding, but seemingly only as long
as the character belongs to the Latin-1 character set.
--
Leif H Silli
Received on Friday, 22 July 2011 00:58:39 UTC