Re: [dxwg] How to specify the number of records in a dataset (#1571)

The number of records gives information about *content*. It is useful to judge and compare both different datasets of same type (same method to cound records) and change of a dataset over time. See http://nomisma.org/datasets for an example of a list of datasets with number of records each. This example happens to use `dcterms:hasPart` with a blank node and `void:entities`to give the number, e.g.:

~~~ttl
<http://numismatics.org/pco/>
        rdf:type             void:Dataset ;
        dcterms:hasPart      [ rdf:type       dcmitype:Collection ;
                               dcterms:type   nmo:TypeSeriesItem ;
                               void:entities  3650
                             ] ;
        dcterms:hasPart      [ rdf:type       dcmitype:Collection ;
                               dcterms:type   nmo:Monogram ;
                               void:entities  309
                             ] .
~~~

I am not sure whether this is best practice and applicable to other kinds of datasets, for instance number of files.

By the way DataCite has a [free text property](https://support.datacite.org/docs/datacite-metadata-schema-v44-recommended-and-optional-properties#13-size) that maps to `dcterms:extent`. According to my understanding of http://dx.doi.org/10.6084/m9.figshare.2075356, the example above would be:

~~~turtle
<http://numismatics.org/pco/>
  rdf:type void:Dataset ;
  dcterms:extent [
    rdf:type dcterms:SizeOrDuration ;
    rdf:value "3650 type series items"
  ] ;
  dcterms:extent [
    rdf:type dcterms:SizeOrDuration ;
    rdf:value "309 monorams"
  ] .
~~~

or (what I would prefer)

~~~turtle
<http://numismatics.org/pco/>
  rdf:type void:Dataset ;
  dcterms:extent "3650 type series items";
  dcterms:extent "309 monograms" .
~~~

I also found the [Ontology of units of Measure](https://github.com/HajoRijgersberg/OM) to support this:

~~~turtle
<http://numismatics.org/pco/>
  rdf:type void:Dataset ;
  dcterms:extent [
    rdf:type om:Measure ;
    om:hasNumericalValue 3650
  ] ;
  dcterms:extent [
    rdf:type om:Measure ;
    om:hasNumericalValue 309 
   ]
~~~

Last but not least Wikidata uses [P4876](https://www.wikidata.org/wiki/Property:P4876) to specify the number of records, see [this list of databases with their number of records](https://w.wiki/6zHN).

-- 
GitHub Notification of comment by nichtich
Please view or discuss this issue at https://github.com/w3c/dxwg/issues/1571#issuecomment-1625986016 using your GitHub account


-- 
Sent via github-notify-ml as configured in https://github.com/w3c/github-notify-ml-config

Received on Friday, 7 July 2023 19:44:32 UTC