W3C home > Mailing lists > Public > semantic-web@w3.org > June 2017

using DCAT for scraped data

From: Cristiano Longo <longo@dmi.unict.it>
Date: Sat, 3 Jun 2017 16:32:30 +0200
To: semantic-web@w3.org, Alessio Cimarelli <alessio.cimarelli@gmail.com>
Message-ID: <b9e3e716-1f43-df83-dc80-d414d9a3867f@dmi.unict.it>
Dear All,

I'm writing from an Hackaton at the Open Data Fest 2017 
(opendatafest.it)  in Sicily. We are building an ontology of Albo POP 
(http://albopop.it) using DCAT and its specialization DCAT_ap_it. 
Roughly speaking, an Albo POP is an automated tool which provides an RSS 
feed a set of notices and advices from a Public Administration (usually 
a municipality) by scraping the notices from the web site of the Public 
Administration itself.

We model using dcat the RSS feed we provide as a distribution, but we 
would like to make explicit that the data come from the public 
administration. We adopted the followings:

a) put the notices web page of the public administration by using the 
source property of the dublin core terms  vocabulary, attached to the 
datase;

b) as rights Holder we specify the municipality and

c) as publisher we indicate the developer who created the scaper which 
converts the notices page to RSS.

An example is attached to this mail.

We would like to know if this approach may be considered acceptable. Any 
suggestion is welcome.

Thanks in advance,

Cristiano Longo





Received on Saturday, 3 June 2017 14:33:07 UTC

This archive was generated by hypermail 2.4.0 : Thursday, 24 March 2022 20:41:56 UTC