- From: Riccardo Albertoni via GitHub <sysbot+gh@w3.org>
- Date: Wed, 06 Jun 2018 14:42:26 +0000
- To: public-dxwg-wg@w3.org
> The examples you have developed all appear to attach quality information to DCAT resources with information that is external to the DCAT resource itself - with the URI for the dataset description as the object of an axiom. So this information would not actually be in the dcat:Catalog.
Dear @dr-shorthair,
I am not sure to fully understand your remark, which seems to propose an additional requirement related to the DCAT self-containment, not explicit in the issue #57. Anyway, I sense that it implies more than one desiderata, which I am listing below
1. to have dcat element not only as an object of axioms;
2. to have quality statement collected into a single container X;
3. To have X expressable as a native DCAT element.
As to 1, the dcat:Dataset/Distribution are connect to Measurements or Annotations through the properties `dqv:hasQualityMeasurement ` and `dqv:hasQualityAnnotation`.
As to 2, it is possible to collect all the kind of quality information into :myQualityMetadata. :myQualityMetadata is an instance of dqv:QualityMetadata and collects all into the same graph, or in the same Turtle.
We can related dcat:dataset/distribution to myQualityMetadata saying
`dcat:busStopInGenoa dqv:hasQualityMetadata :myQualityMetadata`
For example, assuming :myQualityMetadata is serialized in TRix, we can write the following
```
:myQualityMetadata a dqv:QualityMetadata.
GRAPH :myQualityMetadata {
:busStopInGenoa a dcat:Dataset ;
dqv:hasQualityAnnotation :qualityNote .
:qualityNote
a dqv:UserQualityFeedback ;
oa:hasTarget :busStopInGenoa ;
oa:hasBody :textBody ;
oa:motivatedBy dqv:qualityAssessment ;
prov:wasAttributedTo :consumer1 ;
prov:generatedAtTime "2018-05-27T02:52:02Z"^^xsd:dateTime ;
dqv:inDimension ldqd:completeness
.
:textBody a oa:TextualBody ;
rdf:value "Incomplete dataset: it contains only 20500 out of 30000 existing bus stops" ;
dc:language "en" ;
dc:format "text/plain"
.
:busStopInGenoa
dqv:hasQualityMeasurement :myMeasurement .
:myMeasurement
a dqv:QualityMeasurement ;
dqv:computedOn :busStopInGenoa ;
dqv:isMeasurementOf :completenessWRTExpectedNumberOfEntities ;
dqv:value "0.6833333"^^xsd:decimal ;
prov:wasAttributedTo :myQualityChecker ;
prov:generatedAtTime "2018-05-27T02:52:02Z"^^xsd:dateTime ;
prov:wasGeneratedBy :myQualityChecking
.
:completenessWRTExpectedNumberOfEntities
a dqv:Metric ;
skos:definition "it returns the degree of completeness as ratio between the actual number of entities included in the dataset and the declared expected number of entities."@en ;
dqv:expectedDataType xsd:decimal ;
dqv:inDimension ldqd:completeness .
}
```
As to 3, I do not see any reason why we cannot define a dcat:Distribution or a dcat:Dataset for cataloguing the data quality information serialized in:myQualityMetadata.
We have already dqv:hasQualityMetadata conneting dcat:Dataset/Distribution to dqv:QualityMetadata
Another issue is whether or not we want more explicit ways to say this dcat:Dataset Y contains the quality data of dcat:Dataset X or to say Y has been derived by X in a quality assessment activity.
I tend to consider this a separate issue which might be influenced by the solutions chosen in to solve the Qualified forms [RQF] #79, Provenance information [RPIF] #76, and the dcat core element restructuration.
--
GitHub Notification of comment by riccardoAlbertoni
Please view or discuss this issue at https://github.com/w3c/dxwg/pull/245#issuecomment-395093599 using your GitHub account
Received on Wednesday, 6 June 2018 14:42:29 UTC