- From: Gannon Dick <gannon_dick@yahoo.com>
- Date: Tue, 21 Feb 2012 15:12:09 -0800 (PST)
- To: "eGov IG \(Public\)" <public-egov-ig@w3.org>
- Cc: "tj@iist.unu.edu" <tj@iist.unu.edu>
- Message-ID: <1329865929.37242.YahooMailNeo@web112618.mail.gq1.yahoo.com>
I'm not a big fan of memorizing codes, and I think that programming "to accept" is a bad idea too. I've made a list of the existing language codes(ISO 639-2, 639-5) wrapped up in a name (urn:lang:xxx:) wrapper [1]. The names, like identifiers, have no linguistic content or semantic meaning. The list is nothing special per se, although it would be helpful as a reference for someone encoding data sets. However, the list was extracted from a relational database "view" which contained all possible three letter codes. There are some duplicates in the official documents (e.g. France) [2]. The database method de-dupes properly (at the expense of 18,000 triples versus a little over 1,000 including user defined "local use" codes). The advantage is that you do get a definitive answer about a "possible" code versus the perpetuation of gibberish. It would be a nice "spell check" addition to drupal, although custom drupal modules are a bit beyond my abilities at the moment. --Gannon [1] http://www.rustprivacy.org/2012/urn-lang/ [2] http://www.loc.gov/standards/iso639-2/php/code_list.php ________________________________ From: Phil Archer <phila@w3.org> To: eGov IG (Public) <public-egov-ig@w3.org> Sent: Tuesday, February 21, 2012 9:50 AM Subject: [Minutes] 2012-02-21 The minutes of today's meeting, during which we heard a very interesting presentation [1] from from Tomasz Janowski [2] are on the wiki at [3]. [1] http://www.w3.org/egov/IG/slides/2012-02-21.pdf [2] http://egov.iist.unu.edu/cegov/PEOPLE/STAFF/Janowski-Tomasz [3] http://www.w3.org/egov/IG/meeting/2012-02-21 -- Phil Archer W3C eGovernment http://www.w3.org/egov/ http://philarcher.org +44 (0)7887 767755 @philarcher1
Received on Tuesday, 21 February 2012 23:12:41 UTC