Re: [dxwg] Summary statistics [RSS] (#84)

>Are you aware of a 'standard' way to provide basic dataset statistics using DQV or any other RDF vocabulary? e.g. minimum/maximum(/average) values for specified dimensions? I'm not seeing anything obvious in DQV or QB :-( I guess it might be a dqv:Metric but I wonder if you could provide guidance on how this might look?

I am not aware of anything beyond the examples mentioned by @agbeltran, which cover statistics oriented to RDF datasets. Perhaps @makxdekkers knows more?

Anyway, I guess there is more than one way to do it. For example, using the RDF Data Cube vocabulary (QB) you can define your own qb:DataStructureDefinition.

If you want to describe dataset statistics such as Average, Max, and Min for the "fields" in a dataset, you might define a qb:DataStructureDefinition whose dimensions/components include:

- the considered dataset
- the considered field
- the considered operator (e.g. Average, Max, Min)
- the actual measures
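
A minimal sketch of the four components listed above, written as plain Python tuples so it runs without any RDF library. The `qb:` terms are from the RDF Data Cube vocabulary; everything in the `http://example.org/stats#` namespace (`describedDataset`, `field`, `operator`, `value`, `summaryStats`) is a hypothetical name invented for illustration, and component specifications are given IRIs here only to keep the code simple.

```python
QB = "http://purl.org/linked-data/cube#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
XSD_DOUBLE = "http://www.w3.org/2001/XMLSchema#double"
EX = "http://example.org/stats#"  # hypothetical namespace, not a real vocabulary


def build_structure():
    """Return triples for a DSD with three dimensions and one measure."""
    dsd = EX + "summaryStatsStructure"
    triples = [(dsd, RDF_TYPE, QB + "DataStructureDefinition")]
    components = [
        ("describedDataset", "dimension"),  # the considered dataset
        ("field", "dimension"),             # the considered field
        ("operator", "dimension"),          # Average, Max, Min, ...
        ("value", "measure"),               # the actual measure
    ]
    for name, role in components:
        prop = EX + name
        spec = EX + name + "Component"
        kind = "DimensionProperty" if role == "dimension" else "MeasureProperty"
        triples.append((prop, RDF_TYPE, QB + kind))
        triples.append((dsd, QB + "component", spec))
        triples.append((spec, QB + role, prop))
    return triples


def build_observation(obs_id, dataset_iri, field, operator, value):
    """Return triples for one statistic, e.g. the Max of one field."""
    obs = EX + obs_id
    return [
        (obs, RDF_TYPE, QB + "Observation"),
        (obs, QB + "dataSet", EX + "summaryStats"),
        (obs, EX + "describedDataset", dataset_iri),
        (obs, EX + "field", EX + field),
        (obs, EX + "operator", EX + operator),
        (obs, EX + "value", float(value)),
    ]


def to_ntriples(triples):
    """Serialise triples as N-Triples; floats become xsd:double literals."""
    lines = []
    for s, p, o in triples:
        obj = f'"{o}"^^<{XSD_DOUBLE}>' if isinstance(o, float) else f"<{o}>"
        lines.append(f"<{s}> <{p}> {obj} .")
    return "\n".join(lines)


if __name__ == "__main__":
    data = build_structure() + build_observation(
        "obs1", "http://example.org/dataset/1", "temperature", "Max", 42.5
    )
    print(to_ntriples(data))
```

With this layout, each statistic is one qb:Observation, so adding a new operator or field only means adding observations, not changing the structure.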

If you provide the statistics as quality indicators, you can think of using dqv:QualityMeasurement, for example defining a new dqv:Dimension for each pair of field and operator.
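
A rough sketch of the DQV option, again as plain Python tuples. The `dqv:` terms are from the Data Quality Vocabulary; the `http://example.org/stats#` namespace and the naming scheme for dimensions, metrics, and measurements are hypothetical. Since DQV links a measurement to a dqv:Metric (which in turn sits in a dqv:Dimension), this sketch assumes one metric is minted alongside each per-pair dimension; that is one possible reading of the suggestion, not the only one.

```python
from itertools import product

DQV = "http://www.w3.org/ns/dqv#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/stats#"  # hypothetical namespace, not a real vocabulary


def mint_metrics(fields, operators):
    """Mint one dimension and one metric per (field, operator) pair."""
    triples = []
    for field, op in product(fields, operators):
        dimension = f"{EX}{field}_{op}_dimension"
        metric = f"{EX}{field}_{op}_metric"
        triples += [
            (dimension, RDF_TYPE, DQV + "Dimension"),
            (metric, RDF_TYPE, DQV + "Metric"),
            (metric, DQV + "inDimension", dimension),
        ]
    return triples


def measurement(dataset_iri, field, op, value):
    """Return triples for one dqv:QualityMeasurement computed on a dataset."""
    node = f"{EX}{field}_{op}_measurement"
    return [
        (node, RDF_TYPE, DQV + "QualityMeasurement"),
        (node, DQV + "computedOn", dataset_iri),
        (node, DQV + "isMeasurementOf", f"{EX}{field}_{op}_metric"),
        (node, DQV + "value", float(value)),
    ]


if __name__ == "__main__":
    triples = mint_metrics(["temperature"], ["Min", "Max", "Average"])
    triples += measurement("http://example.org/dataset/1", "temperature", "Max", 42.5)
    for triple in triples:
        print(triple)
```

Compared with the Data Cube layout, this needs a new dimension and metric for every field/operator pair, which is simple for a handful of statistics but grows quickly with the number of fields.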




-- 
GitHub Notification of comment by riccardoAlbertoni
Please view or discuss this issue at https://github.com/w3c/dxwg/issues/84#issuecomment-469071937 using your GitHub account

Received on Sunday, 3 March 2019 22:32:17 UTC