W3C home > Mailing lists > Public > public-dxwg-wg@w3.org > March 2019

Re: [dxwg] Summary statistics [RSS] (#84)

From: Riccardo Albertoni via GitHub <sysbot+gh@w3.org>
Date: Sun, 03 Mar 2019 22:32:16 +0000
To: public-dxwg-wg@w3.org
Message-ID: <issue_comment.created-469071937-1551652335-sysbot+gh@w3.org>
>Are you aware of a 'standard' way to provide basic dataset statistics using DQV or any other RDF vocabulary? e.g. minimum/maximum(/average) values for specified dimensions? I'm not seeing anything obvious in DQV or QB :-( I guess it might be a dqv:Metric but I wonder if you could provide guidance on how this might look?

 I am not aware of anything except  the examples mentioned by @agbeltran for the statistics oriented to RDF datasets, perhaps @makxdekkers knows more ?!?.

Anyway, I guess there is more than one way to do it.  For example, using   RDF data cube you can define your own qb:DataStructureDefinition.  

if you want to describe statistic of datasets  such as Average,  Max, Min for the "fields" in the dataset, you might define a qb:DataStructureDefinition  whose dimensions/components include

- the considered dataset
- the considered field
- the considered operator ( i.e. Average, Max, Min.. etc)
- the actual measures

If you provide statistics as quality indicators you can think of using DQV qualityMeasurement,  for example defining a new dqv:dimensioni for each pair of  field and operator.

GitHub Notification of comment by riccardoAlbertoni
Please view or discuss this issue at https://github.com/w3c/dxwg/issues/84#issuecomment-469071937 using your GitHub account
Received on Sunday, 3 March 2019 22:32:17 UTC

This archive was generated by hypermail 2.3.1 : Monday, 29 April 2019 13:45:08 UTC