W3C home > Mailing lists > Public > www-international@w3.org > July to September 2009

Language tag lookup tool: bigger and better

From: Richard Ishida <ishida@w3.org>
Date: Thu, 13 Aug 2009 17:28:23 +0100
To: <www-international@w3.org>
Message-ID: <043801ca1c33$144f1f80$3ced5e80$@org>
http://rishida.net/utils/subtags/

The IANA Subtag Registry has been recently updated to contain 220 extlang subtags and the ISO 639-3 language subtags, taking the total number of subtags to almost 8,000.

I have produced a new version of my lookup tool to help with language tagging. In addition to helping you find subtags and lookup the meaning of subtags, it now helps check the well-formedness of a language tag.

The tool provides access to all currently defined subtags, including the new extlang subtags.

Parsing language tags. In addition to trying to make the user interface more friendly, I also added the ability to parse hyphenated tags and discover their structure and check for errors. I’m not claiming with this release that the new parser field tests all the corner cases, but it should provide reports for most of the typical errors.

It reports errors for the following:

- subtags that are not in the registry (by type)
- incorrectly ordered subtags
- duplicate variant tags and multiple tags of other types
- overlong private use subtags

Try this example.
http://rishida.net/utils/subtags/index.php?searchtype=1&searchtext=de-419-DE-alt&submit=Parse

It doesn’t yet handle extensions, but then there aren’t any valid ones to handle yet anyway.

I hope that’s useful.

============
Richard Ishida
Internationalization Lead
W3C (World Wide Web Consortium)

http://www.w3.org/International/
http://rishida.net/
Received on Thursday, 13 August 2009 16:28:36 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 13 August 2009 16:28:37 GMT