W3C home > Mailing lists > Public > public-egov-ig@w3.org > March 2013

Fwd: FW: [euopendata] Classification of open datasets...

From: Vassilios Peristeras <vassilios44@gmail.com>
Date: Fri, 1 Mar 2013 21:22:53 +0100
Message-ID: <CAAUyFzuG8kkubS+knNsgaNsAZU4kb7mqkqkqEPaXYj2EcnKsCQ@mail.gmail.com>
To: public-egov-ig@w3.org
Hello Peter,

>I would be grateful for any pointers if there is work going on to
> harmonize classification of datasets on a global or European level.

In my opinion, the most appropriate vehicle to "harmonize
classification of datasets" would be DCAT.

This does not mean that the practical problem you pose here has been solved.

The vocs/thesaurus already mentioned by Bastian and others are all
good possible candidates to be taken into account:

- DCAT is a vocabulary for describing data-catalogs AND datasets. DCAT
provides a good description of a dataset and can help cross-querying
(e.g. based on licensing type, spatial coverage, etc). The DCAT
Theme/Category property is a placeholder where you could use
controlled vocabularies to better describe your datasets content. The
question remains: what to use there. An agreement is indeed needed.

- ADMS is a vocabulary to describe other vocabularies and schemata
(simplified definition). As such it could also be used for describing
special types of government datasets (e.g. all the codelists, XML
schemata, and metadata standards a country has adopted). Very useful
this may be for cross-querying catalogs for this type of datasets, but
it cannot be used as a global solution to the problem you refer here.

- EuroVoc is a multilingual, multidisciplinary thesaurus covering the
activities of the EU, the European Parliament in particular. However,
practically its coverage is broader and can be used for describing all
governmental functions. It is really multi-lingual, quite detailed and
mature work.

- NACE is the "Statistical Classification of Economic Activities in
the European Community". No need for explanation.

- COFOG is a Classification of the Functions of Government. Again no
need for explanation.

Some questions, with possible answers:
A) Do we want to come up with single-dimensional classification or we
would like to develop a multi-facet one for describing datasets? I
assume a multi-facet one would be more useful.
B) Is DCAT enough for identifying these facets? I think yes.
C)  Do we need to agree on common vocabularies for the DCAT property
values: definitely yes
D) Have we done anything for this? Not yet, but there is discussion already...
D) Are Eurovoc, NACE, COFOG good candidates to be used for this
purpose? Most, possibly yes

Vassilios Peristeras


From: Bastiaan Deblieck [mailto:bastiaan.deblieck@tenforce.com]
Sent: Friday, March 01, 2013 3:14 PM
To: Charles RUELLE; Peter Krantz; euopendata@lists.okfn.org; public-egov-ig
Subject: Re: [euopendata] Classification of open datasets...


Please allow me to give the TenForce view on this situation. At
TenForce we are collaborating very closely with the EC and with other
open data initiatives throughout Europe. As a commercial company we
follow research activities and apply their results in our projects.
With regards to this discussion we are strong supporters of:

- EUROVOC: http://eurovoc.europa.eu/drupal/

- ADMS: http://en.wikipedia.org/wiki/Asset_Description_Metadata_Schema_(ADMS)

- DCAT: http://www.w3.org/2011/gld/wiki/Data_Catalog_Vocabulary

We are convinced that these are excellent vocabularies to facilitate
multilingual data exchange and linking. We have been using and will be
using these "tools" in our projects for government and industry. From
this experience we know that EUROVOC is/will be key in anything the EC
does in the area of open data. The Open Data Portal of the EU uses
DCAT/DCT and is aligned in general terms to be compatible with ADMS,
cf. http://open-data.europa.eu/open-data/linked-data

Future activities in the area of linked open data on a European scale
will almost certainly involve these vocabularies. Contracts like this
http://epsiplatform.eu/content/ec-publishes-open-data-tender have been
attributed and are moving forward.

Best Regards,

Bastiaan Deblieck

On Fri, Mar 1, 2013 at 1:17 PM, Charles RUELLE <charles.ruelle@gmail.com> wrote:


This topic is very interesting.
In France, for data.gouv.fr (the french open data portal) we currently
use Eurovoc to describe our datasets.
Do you know who is using Eurovoc ? What are others classifications
that are used ?

Best regards,

Charles RUELLE
CTO of Etalab - French Prime Minister's task force for Open Government
and Open Data

Le 1 mars 2013 à 10:32, Peter Krantz <peter@peterkrantz.se> a écrit :

> Hi!
> Many countries are developing national portals with metadata about
> open datasets from the public sector. To make datasets easier to find
> and to lower the threshold for pan-european (or global) re-use it
> would be great if classification of datasets followed a shared
> taxonomy.
> There are many candidates that could be used, e.g. Eurovoc [1], NACE
> [2]. I would be grateful for any pointers if there is work going on to
> harmonize classification of datasets on a global or European level.
> Regards,
> Peter Krantz
> http://www.peterkrantz.com
> @peterkz_swe
> [1]: http://eurovoc.europa.eu/ - availabble as LOD
> [2]: http://ec.europa.eu/competition/mergers/cases/index/nace_all.html
> _______________________________________________
> euopendata mailing list
> euopendata@lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/euopendata
> Unsubscribe: http://lists.okfn.org/mailman/options/euopendata
Received on Friday, 1 March 2013 20:23:20 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:00:50 UTC