- From: Martin Hepp (UniBW) <martin.hepp@ebusiness-unibw.org>
- Date: Mon, 09 Nov 2009 23:01:45 +0100
- To: Aldo Bucchi <aldo.bucchi@gmail.com>
- CC: Linked Data community <public-lod@w3.org>
- Message-ID: <4AF89149.7090806@ebusiness-unibw.org>
Hi Aldo, Note that there are multiple branches of the ISO 3166 familiy of codes. See pages 23 and 24 of the GoodRelations Technical Report (http://www.heppnetz.de/projects/goodrelations/GoodRelations-TR-final.pdf) for a more detailed discussion. I am still not aware of any authoritative URI schema for ISO 3166, which is why GoodRelations uses string literals for that code. The key ISO page http://www.iso.org/iso/country_codes.htm does also not refer to any established http or URN URI schema for the ISO 3166 family of codes. I assume that dbPedia URIs may be well suited, but they are not as authoritative. If they have ISO 3166 codes attached via properties, entity consolidation on that basis may be relatively simple. Below, please find an excerpt from the discussion re identifiers for countries in the GoodRelations Technical Report: Country or Region ... GoodRelations could reuse several approaches for ontologies of regions and places for specifying Countries and Regions. However, we suggest a more pragmatic approach of reusing the ISO Standard 3166, in particular ISO 3166-1 (ISO, 2006) and ISO 3166-2 (ISO, 1998). The first defines 2- or 3-letter identifiers for existing countries and a few independent geopolitical entities. ISO 3166-1 alpha-2 defines 2-letter codes for most countries. There exist alternative standards with 3-letter codes and a numerical representation. For the following reasons, we suggest using the 2-letter codes: First, they are well established and people are likely more familiar with them (they are also used for most top-level domains). Second, and more important, the 2-letter variant is the basis for ISO 3166-2, which breaks down the countries from ISO 3166-1 into administrative subdivisions (ISO, 1998). The code elements used in ISO 3166-2 consist of “the alpha-2 code element from ISO 3166-1 followed by a separator and a further string of up to three alphanumeric characters e. g.” (from: http://www.iso.org/iso/en/prods- services/iso3166ma/04background-on-iso-3166/iso3166-2.html). This allows using simple string operations on the respective ISO 3166 codes in order to handle administrative subdivisions. For example, if a certain Offering is said to be valid for Canada (ISO 3166-1 two-letter code “CA”), then one can infer that any longer search string specifying an administrative subdivision of Canada (e.g. British Columbia, ISO 3166-2 “CA-BC”) is also an eligible region. Examples: Canada (CA), Austria (AT), Canada: British Columbia (CA-BC), Italy (IT), Italy: Province of Milano (IT-MI) Note: More complex modeling of Countries and Regions may be useful in some scenarions, and GoodRelations can be imported and extended if necessary. However, most offerings on the Web contain statements on the level of countries only, for which ISO 3166-1 is sufficient and very common. Martin Aldo Bucchi wrote: > Hi, > > I found a dataset that represents countries as two letter country > codes: DK, FI, NO, SE, UK. > I would like to turn these into URIs of the actual countries they represent. > > ( I have no idea on whether this follows an ISO standard or is just > some private key in this system ). > > Any ideas on a set of candidata URIs? I would like to run a complete > coverage test and take care I don't introduce distortion ( that is > pretty easy by doing some heuristic tests against labels, etc ). > > There are some border cases that suggest this isn't ISO3166-1, but I > am not sure yet. ( and if it were, which widely used URIs are based on > this standard? ). > > Thanks! > A > > -- -------------------------------------------------------------- martin hepp e-business & web science research group universitaet der bundeswehr muenchen e-mail: hepp@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! ================================================================= Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: "Semantic Web-based E-Commerce: The GoodRelations Ontology" http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_IEEE_CEC%2709
Received on Monday, 9 November 2009 22:03:42 UTC