Re: Linked SDMX Data

From: Gannon Dick <gannon_dick@yahoo.com>
Date: Mon, 11 Aug 2014 13:58:26 -0700
Message-ID: <1407790706.51169.YahooMailBasic@web122901.mail.ne1.yahoo.com>
To: public-lod@w3.org, Sarven Capadisli <info@csarven.ca>
Cc: KevinFord <kefo@loc.gov>, "public-locadd@w3.org" <public-locadd@w3.org>, "public-egov-ig@w3.org" <public-egov-ig@w3.org>, "public-opengov@w3.org" <public-opengov@w3.org>
Sorry for the x-post

Hi Sarven,

I noticed you used GeoNames for the "Australian Bureau of Statistics Linked Data" hack mentioned below.  GeoNames does much useful work ... but everyone in the Linked Data business could use a little help.

Domains - in theory, the countries of the world are a group of (federalized data set of ...) (groups of) Court Houses, Jurisdictions, keyed with two and three letter acronyms (ISO 3166).  This set for all practical purposes is a Unicode Code Page, but instead of (16x16)=256 members there are (26x26)=676 pairs of Latin Alphabet Capital Letters.  Statistical metrics at the domain level are manipulated with Linear Algebra and Linear Programming.  Diacritics (Côte d'Ivoire) or alternate forms (Ivory Coast) do nothing semantically useful; the acronym is the leveler.
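
As a rough illustration of the size of that code space, the short Python sketch below enumerates every pair of Latin capital letters. The function name two_letter_codes is made up for this example and does not come from the files discussed in this message.

```python
from itertools import product
from string import ascii_uppercase


def two_letter_codes():
    """Return every pair of Latin capital letters, from AA through ZZ."""
    return ["".join(pair) for pair in product(ascii_uppercase, repeat=2)]


codes = two_letter_codes()
print(len(codes))            # 26 * 26 = 676
print(codes[0], codes[-1])   # AA ZZ
```

Only some of these 676 slots are assigned by ISO 3166; the rest of the code page stays empty.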

So, I rewrote the GeoNames table (http://www.geonames.org/countries/) to be:
1) Unicode compliant for XML (HTML entities are HEX escaped)
2) The Geo's, Country Profiles, whatever are local links.  I left those as is and included/matched the MARC System / US Library of Congress Linked Data Service URIs (http://id.loc.gov/vocabulary/countries.html).
3) Finally, I used an SQL RDB to do an Outer Join on the Code Set - all 676 possibilities.  Adding a three character code "synonym" does not increase the code page size.  It is then possible to split this "registry" into lists of codes that are (a) Present, (b) Missing and (c) "Slack" (in the Linear Programming usage).
4) Put the files in FODS (Flat XML Open Document Spreadsheet) format so that European Civil Servants cannot whine about data quality (got your back, DERI, you too ABS).
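
The outer join in step 3 can be sketched roughly as follows, using Python's built-in sqlite3 module. This is only an illustration under stated assumptions: the table names (code_space, registry), the function name split_registry and the three sample rows are all hypothetical, and the actual database and schema behind the files are not described in this message. The "Slack" list is left out because its exact definition is not given here.

```python
import sqlite3
from itertools import product
from string import ascii_uppercase

# Hypothetical sample rows (alpha-2, alpha-3, name); a real run would
# load the full country table instead.
SAMPLE_ROWS = [
    ("AU", "AUS", "Australia"),
    ("CI", "CIV", "Côte d'Ivoire"),
    ("US", "USA", "United States"),
]


def split_registry(rows):
    """Outer-join all 676 two-letter codes against the registry rows.

    Returns (joined, present, missing): the full joined table, the codes
    that have a registry entry, and the codes that have none.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE code_space (alpha2 TEXT PRIMARY KEY)")
    conn.execute(
        "CREATE TABLE registry (alpha2 TEXT PRIMARY KEY, alpha3 TEXT, name TEXT)"
    )
    conn.executemany(
        "INSERT INTO code_space VALUES (?)",
        [("".join(pair),) for pair in product(ascii_uppercase, repeat=2)],
    )
    conn.executemany("INSERT INTO registry VALUES (?, ?, ?)", rows)

    # Every code appears exactly once; unmatched codes carry NULLs.
    # The alpha-3 "synonym" is just another column, so it adds no rows.
    joined = conn.execute(
        "SELECT c.alpha2, r.alpha3, r.name, r.alpha2 IS NOT NULL "
        "FROM code_space AS c "
        "LEFT OUTER JOIN registry AS r ON r.alpha2 = c.alpha2 "
        "ORDER BY c.alpha2"
    ).fetchall()
    conn.close()

    present = [row[0] for row in joined if row[3]]
    missing = [row[0] for row in joined if not row[3]]
    return joined, present, missing


joined, present, missing = split_registry(SAMPLE_ROWS)
print(len(joined), len(present), len(missing))
```

The joined table always has 676 rows no matter how many registry entries exist, which is what makes the Present and Missing lists complementary.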

http://www.rustprivacy.org/2014/balance/gts/geonames_domains.zip

Unfortunately, when RDF Lists of Place Names are filtered through previously written applications, the result is often unhelpful additions.  These steps should ameliorate the problem significantly.

--Gannon

--------------------------------------------
On Mon, 8/11/14, Sarven Capadisli <info@csarven.ca> wrote:

 Subject: Re: Linked SDMX Data
 To: public-lod@w3.org
 Date: Monday, August 11, 2014, 7:20 AM
 
 On 2014-08-05 12:08, Sarven Capadisli wrote:
 > On 2014-04-23 15:31, Sarven Capadisli wrote:
 >> On 2014-04-22 14:18, Sarven Capadisli wrote:
 >>> On 2013-08-08 15:17, Sarven Capadisli wrote:
 >>>> On 03/08/2013 01:04 PM, Sarven Capadisli wrote:
 >>>>> On 02/15/2013 02:42 PM, Sarven Capadisli wrote:
 >>>>>> Ahoy hoy,
 >>>>>>
 >>>>>> OECD Linked Data:
 >>>>>> http://oecd.270a.info/
 >>>>>>
 >>>>>> BFS Linked Data:
 >>>>>> http://bfs.270a.info/
 >>>>>>
 >>>>>> FAO Linked Data:
 >>>>>> http://fao.270a.info/
 >>>>>>
 >>>>>> Linked SDMX Data:
 >>>>>> http://csarven.ca/linked-sdmx-data
 >>>>>
 >>>>> ECB Linked Data:
 >>>>> http://ecb.270a.info/
 >>>>
 >>>> IMF Linked Data:
 >>>> http://imf.270a.info/
 >>>
 >>> UIS Linked Data:
 >>> http://uis.270a.info/
 >>
 >> FRB Linked Data:
 >> http://frb.270a.info/
 >
 > BIS Linked Data:
 > http://bis.270a.info/
 
 ABS Linked Data:
 http://abs.270a.info/
 
 -Sarven
 http://csarven.ca/#i
 
Received on Monday, 11 August 2014 20:58:55 UTC
