DQV, ISO 19115/19157 and GeoDCAT-AP

Dear DWBP WG,

I have a few of questions / comments on DQV, specifically related to
its use for geospatial metadata (for this reason, I'm cc'ing also the
SDW WG).

1. Data quality is one of the "elements" included in the current
standards used for geospatial metadata, namely, ISO 19115:2003 and ISO
19157:2013 (see [1, 2] for an overview). I wonder whether you would
consider (at least, partial) compatibility with the approaches defined
in those ISO standards as a possible requirement. More in general, my
question is about the design principles behind DQV. I wonder whether
you see it as a "core" vocabulary, meant to define the main data
quality concepts, common across domains, that can be possibly extended
to address domain-specific requirements.

2. Specifically with respect to "conformance" with quality standards /
benchmarks: ISO also includes it, with the notion of "conformance
result". However, as far as I can see, in DQV you support only one
specific case via dct:conformsTo, i.e., when data are "conformant",
whereas ISO allows also to state that data are NOT conformant.
Moreover, INSPIRE "extends" ISO by allowing to state that conformance
has not been evaluated (see [3]).

3. Following from point (2): GeoDCAT-AP [4] addresses this issue by
modelling conformance results with PROV (using nonetheless also
dct:conformsTo when data are conformant with the referred
specification). For the modelling "pattern" used, feedback has been
asked from the PROV WG (see the relevant mail thread [5]). Actually,
the original proposal was to use EARL - I contributed that approach in
an earlier mail [6]. However, the GeoDCAT-AP WG eventually decided to
switch to PROV, as a more sustainable solution. You can find an
explanation in the description of the relevant issue:

https://joinup.ec.europa.eu/node/142454

Here's an example:

<a:Dataset> prov:wasUsedBy [
 a prov:Activity;
# Conformity degree
  prov:generated [
   dct:type <http://inspire.ec.europa.eu/codelist/DegreeOfConformity/conformant>
;
  prov:qualifiedAssociation [
   prov:hadPlan [
     a prov:Plan;
     prov:wasDerivedFrom [
# Specification
        a prov:Entity, dct:Standard;
       dct:title "COMMISSION REGULATION (EC) No 976/2009 of 19 October
2009 implementing Directive 2007/2/EC of the European Parliament and
of the Council as regards the Network Services"@en
       dct:issued "2009-10-20"^^xsd:date
     ]
   ];
 ];
] .

and the same by using dct:conformsTo:

<a:Dataset> dct:conformsTo [
# Specification
  a dct:Standard;
  dct:title "COMMISSION REGULATION (EC) No 976/2009 of 19 October 2009
implementing Directive 2007/2/EC of the European Parliament and of the
Council as regards the Network Services"@en
  dct:issued "2009-10-20"^^xsd:date ] .

Of course, the PROV-based approach allows to attach additional
information about the provenance of quality metadata - i.e., when the
test has been carried out, by whom, with which methodology and/or test
tool, etc.

Any feedback you would provide on this approach would be very much
relevant for the final specification of GeoDCAT-AP.

Thanks in advance!

Andrea

----
[1]https://geo-ide.noaa.gov/wiki/index.php?title=ISO_Data_Quality
[2]https://geo-ide.noaa.gov/wiki/index.php?title=File:DQ_DataQuality.png
[3]http://eur-lex.europa.eu/eli/reg/com/2008/1205#d1e2036-14-1
[4]https://joinup.ec.europa.eu/node/139283/
[5]https://lists.w3.org/Archives/Public/public-prov-comments/2015May/0001.html
[6]https://lists.w3.org/Archives/Public/public-dwbp-wg/2014Oct/0142.html

-- 
Andrea Perego, Ph.D.
Scientific / Technical Project Officer
European Commission DG JRC
Institute for Environment & Sustainability
Unit H06 - Digital Earth & Reference Data
Via E. Fermi, 2749 - TP 262
21027 Ispra VA, Italy

https://ec.europa.eu/jrc/

----
The views expressed are purely those of the writer and may
not in any circumstances be regarded as stating an official
position of the European Commission.

Received on Sunday, 30 August 2015 22:51:42 UTC