[Bug 6746] case-insensitivity of other than a-z and A-Z, e.g., diacritics

http://www.w3.org/Bugs/Public/show_bug.cgi?id=6746

--- Comment #5 from Boris Zbarsky <bzbarsky@mit.edu>  2009-03-30 01:15:01 ---
The thing is that "treat case-insensitively" is not well defined in Unicode.
For example, in English 'i' and 'I' are equal in case-insensitive comparisons.
In Turkish, they are not: 'i' uppercases to dotted 'İ' (U+0130) and 'I'
lowercases to dotless 'ı' (U+0131).  See
http://en.wikipedia.org/wiki/Turkish_dotted_and_dotless_I

In particular, while <SCRIPT> should be treated as a script element, <SCRİPT>
(spelled with U+0130, capital I with dot above) should not, even though the
latter is the Turkish uppercasing of <script>.

So the issue is that you can't even talk about "case-insensitive" matching
without first deciding "which language?" once non-ASCII characters are
involved.
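
To make the script example concrete, here is a minimal Python sketch of
what an ASCII-only case-insensitive comparison could look like.  The
function names (ascii_lower, ascii_case_insensitive_equal, turkish_upper)
are made up for illustration and are not taken from any spec or library.
turkish_upper is a hand-rolled approximation of Turkish uppercasing, since
Python's built-in str.upper() is not locale-aware.

```python
def ascii_lower(s: str) -> str:
    """Lowercase only A-Z; every other code point is left untouched."""
    return "".join(chr(ord(c) + 32) if "A" <= c <= "Z" else c for c in s)


def ascii_case_insensitive_equal(a: str, b: str) -> bool:
    """Compare two strings, folding case for ASCII letters only."""
    return ascii_lower(a) == ascii_lower(b)


def turkish_upper(s: str) -> str:
    """Hypothetical helper approximating Turkish uppercasing.

    Maps 'i' to U+0130 (dotted capital I) and U+0131 (dotless i) to 'I'
    before applying the default, language-neutral uppercasing.
    """
    return s.replace("i", "\u0130").replace("\u0131", "I").upper()


turkish_tag = turkish_upper("script")  # "SCR\u0130PT"

# The plain ASCII uppercase form matches; the Turkish form does not.
print(ascii_case_insensitive_equal("SCRIPT", "script"))
print(ascii_case_insensitive_equal(turkish_tag, "script"))
```

Restricting the fold to A-Z sidesteps the "which language?" question
entirely, at the cost of treating every non-ASCII letter as case-sensitive.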



Received on Monday, 30 March 2009 01:15:11 UTC