Re: Updated article: Two-letter or three-letter language codes

On Sunday, September 24, 2006, 4:44:03 PM, Stephen wrote:

SD> At 2006.09.24-11:56(+0900), Martin Duerst wrote:
>>Hello Mark,

>>Many thanks for this interesting data. In my mailer, the line endings
>>didn't work very well, so I refomatted it (at the same time, my
>>mailer messed up the non-ASCII stuff, sorry).
>>The only three-letter code I can see is pdf, for which we can blame
>>Steve's company :-).

>>On the other hand, in the list, there are a few items (such as en-us,
>>pt-br) that look perfectly fine. What was wrong with them?

SD> Capitalization.

Capitalization is, IIRC, irrelevant.

However "en-us." and "en-us " (note trailing space and trailing period) are not correct. en-US or En-Us or any other variant therof are fine.


>> >(the second one has a space at the end. this also excludes x-....
>> >where the ... is a subtag longer than 8 -- that has a pretty high
>> >frequency)

>> >Rank   Frequency       tag

>> >102    0.015999%       en-us.
>> >122    0.010068%       en-us



-- 
 Chris Lilley                    mailto:chris@w3.org
 Interaction Domain Leader
 Co-Chair, W3C SVG Working Group
 W3C Graphics Activity Lead
 Co-Chair, W3C Hypertext CG

Received on Sunday, 24 September 2006 14:57:09 UTC