- From: Vassilios Peristeras <vassilios44@gmail.com>
- Date: Fri, 1 Mar 2013 21:22:53 +0100
- To: public-egov-ig@w3.org
Hello Peter, >I would be grateful for any pointers if there is work going on to > harmonize classification of datasets on a global or European level. In my opinion, the most appropriate vehicle to "harmonize classification of datasets" would be DCAT. This does not mean that the practical problem you pose here has been solved. The vocs/thesaurus already mentioned by Bastian and others are all good possible candidates to be taken into account: - DCAT is a vocabulary for describing data-catalogs AND datasets. DCAT provides a good description of a dataset and can help cross-querying (e.g. based on licensing type, spatial coverage, etc). The DCAT Theme/Category property is a placeholder where you could use controlled vocabularies to better describe your datasets content. The question remains: what to use there. An agreement is indeed needed. - ADMS is a vocabulary to describe other vocabularies and schemata (simplified definition). As such it could also be used for describing special types of government datasets (e.g. all the codelists, XML schemata, and metadata standards a country has adopted). Very useful this may be for cross-querying catalogs for this type of datasets, but it cannot be used as a global solution to the problem you refer here. - EuroVoc is a multilingual, multidisciplinary thesaurus covering the activities of the EU, the European Parliament in particular. However, practically its coverage is broader and can be used for describing all governmental functions. It is really multi-lingual, quite detailed and mature work. - NACE is the "Statistical Classification of Economic Activities in the European Community". No need for explanation. - COFOG is a Classification of the Functions of Government. Again no need for explanation. Some questions, with possible answers: A) Do we want to come up with single-dimensional classification or we would like to develop a multi-facet one for describing datasets? I assume a multi-facet one would be more useful. B) Is DCAT enough for identifying these facets? I think yes. C) Do we need to agree on common vocabularies for the DCAT property values: definitely yes D) Have we done anything for this? Not yet, but there is discussion already... D) Are Eurovoc, NACE, COFOG good candidates to be used for this purpose? Most, possibly yes Regards, Vassilios Peristeras ________________________________ From: Bastiaan Deblieck [mailto:bastiaan.deblieck@tenforce.com] Sent: Friday, March 01, 2013 3:14 PM To: Charles RUELLE; Peter Krantz; euopendata@lists.okfn.org; public-egov-ig Subject: Re: [euopendata] Classification of open datasets... Hello, Please allow me to give the TenForce view on this situation. At TenForce we are collaborating very closely with the EC and with other open data initiatives throughout Europe. As a commercial company we follow research activities and apply their results in our projects. With regards to this discussion we are strong supporters of: - EUROVOC: http://eurovoc.europa.eu/drupal/ - ADMS: http://en.wikipedia.org/wiki/Asset_Description_Metadata_Schema_(ADMS) - DCAT: http://www.w3.org/2011/gld/wiki/Data_Catalog_Vocabulary We are convinced that these are excellent vocabularies to facilitate multilingual data exchange and linking. We have been using and will be using these "tools" in our projects for government and industry. From this experience we know that EUROVOC is/will be key in anything the EC does in the area of open data. The Open Data Portal of the EU uses DCAT/DCT and is aligned in general terms to be compatible with ADMS, cf. http://open-data.europa.eu/open-data/linked-data Future activities in the area of linked open data on a European scale will almost certainly involve these vocabularies. Contracts like this http://epsiplatform.eu/content/ec-publishes-open-data-tender have been attributed and are moving forward. Best Regards, Bastiaan Deblieck On Fri, Mar 1, 2013 at 1:17 PM, Charles RUELLE <charles.ruelle@gmail.com> wrote: Hi, This topic is very interesting. In France, for data.gouv.fr (the french open data portal) we currently use Eurovoc to describe our datasets. Do you know who is using Eurovoc ? What are others classifications that are used ? Best regards, Charles RUELLE @charlesruelle CTO of Etalab - French Prime Minister's task force for Open Government and Open Data Le 1 mars 2013 à 10:32, Peter Krantz <peter@peterkrantz.se> a écrit : > Hi! > > Many countries are developing national portals with metadata about > open datasets from the public sector. To make datasets easier to find > and to lower the threshold for pan-european (or global) re-use it > would be great if classification of datasets followed a shared > taxonomy. > > There are many candidates that could be used, e.g. Eurovoc [1], NACE > [2]. I would be grateful for any pointers if there is work going on to > harmonize classification of datasets on a global or European level. > > Regards, > > Peter Krantz > http://www.peterkrantz.com > @peterkz_swe > > [1]: http://eurovoc.europa.eu/ - availabble as LOD > [2]: http://ec.europa.eu/competition/mergers/cases/index/nace_all.html > > _______________________________________________ > euopendata mailing list > euopendata@lists.okfn.org > http://lists.okfn.org/mailman/listinfo/euopendata > Unsubscribe: http://lists.okfn.org/mailman/options/euopendata
Received on Friday, 1 March 2013 20:23:20 UTC