W3C home > Mailing lists > Public > public-rdf-wg@w3.org > February 2013

Case of language tags

From: Andy Seaborne <andy.seaborne@epimorphics.com>
Date: Thu, 28 Feb 2013 17:26:34 +0000
Message-ID: <512F934A.1080405@epimorphics.com>
To: RDF-WG <public-rdf-wg@w3.org>

Section 3.3: (of the editors draft):

"""
a non-empty language tag as defined by [BCP47]. The language tag must be 
well-formed according to section 2.2.9 of [BCP47], and must be 
normalized to lowercase.
"""

but "lowercase" is locale sensitive.

What is lower case "I"?  It's not always "i".

It isn't in Turkish where there are different dotted and dotless I-like 
letters.

Upper case "I" (U+0049); lower case "ı" (U+0131)
!=
Upper case "İ" (U+0130); lower case "i" (U+0049)

http://www.i18nguy.com/unicode/turkish.png

The ideal solution is to say that the language tag is to be treated as 
case insensitively.

	Andy

(this email is in UTF-8)
Received on Thursday, 28 February 2013 17:27:06 GMT

This archive was generated by hypermail 2.3.1 : Tuesday, 26 March 2013 16:25:54 GMT