W3C home > Mailing lists > Public > public-dxwg-wg@w3.org > July 2018

Re: [dxwg] Best practice for a loosely-structured catalog

From: Alejandra Gonzalez-Beltran via GitHub <sysbot+gh@w3.org>
Date: Thu, 12 Jul 2018 08:27:34 +0000
To: public-dxwg-wg@w3.org
Message-ID: <issue_comment.created-404432434-1531384053-sysbot+gh@w3.org>
Thanks @dr-shorthair. Here an example where distributions where used for a case of multiple files, as there was no other way of representing this.

The example, as provided by the catalogue, is actually in schema.org, but pretty much there is a 1-to-1 mapping.

```
[] a schema:Dataset ;
    schema:creator [ a schema:Organization ;
            schema:name "Ofsted" ] ;
    schema:dateModified "2016-12-12T14:16:44.522Z"^^schema:Date ;
    schema:description "The outstanding providers list includes early years registered providers, maintained schools, independent schools, colleges and providers of work-based learning, adult education and children?s social care.  Two datasets are included: the first lists of all those providers who met the outstanding provider criteria in the most recent year for which data is available; the second is a list of all providers who have met the applicable criteria in any year since 1993. In the second list the year(s) in which that provider was included are also shown." ;
    schema:distribution [ a schema:DataDownload ;
            schema:contentUrl <https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/481154/Outstanding_Providers_List_1993-2014.csv> ;
            schema:fileFormat <file:///Users/agbeltran/work-data-commons/dats-tools/dats/CSV> ;
            schema:name "Outstanding Providers list 1993-2014" ],
        [ a schema:DataDownload ;
            schema:contentUrl <https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/480700/Outstanding_providers_list_2014-15.csv> ;
            schema:fileFormat <file:///Users/agbeltran/work-data-commons/dats-tools/dats/CSV> ;
            schema:name "Outstanding Providers list 2014-2015" ],
        [ a schema:DataDownload ;
            schema:contentUrl <https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/571915/Outstanding_Providers_List_2015-16.ods> ;
            schema:fileFormat <file:///Users/agbeltran/work-data-commons/dats-tools/dats/ODS> ;
            schema:name "Outstanding Providers list 2015-2016" ] ;
    schema:includedInDataCatalog [ a schema:DataCatalog ;
            schema:url <https://data.gov.uk/> ] ;
    schema:keywords "Education" ;
    schema:license [ a schema:CreativeWork ;
            schema:name "Open Government Licence" ;
            schema:url <http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/> ] ;
    schema:name "Outstanding providers list" ;
    schema:url <https://data.gov.uk/dataset/63f9c959-00b6-4c51-b165-47f387ff7881/outstanding-providers-list> .
```
 
 Here goes an attempt to use ```dcterms:relation``` instead:
 
```
[] a dcat:Dataset ;
    dcat:publisher [ a foaf:Organization ;
            rdfs:label "Ofsted" ] ;
    dct:modified "2016-12-12T14:16:44.522Z"^^schema:Date ;
    dct:description "The outstanding providers list includes early years registered providers, maintained schools, independent schools, colleges and providers of work-based learning, adult education and children?s social care.  Two datasets are included: the first lists of all those providers who met the outstanding provider criteria in the most recent year for which data is available; the second is a list of all providers who have met the applicable criteria in any year since 1993. In the second list the year(s) in which that provider was included are also shown." ;
    dcterms:relation [            
            dcat:downloadURL <https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/481154/Outstanding_Providers_List_1993-2014.csv> ;
            dcat:mediaType "text/csv" ;
            dct:title "Outstanding Providers list 1993-2014" ],
        [ 
            dcat:downloadURL <https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/480700/Outstanding_providers_list_2014-15.csv> ;
            dcat:mediaType "text/csv" ;
            dct:title "Outstanding Providers list 2014-2015" ],
        [
            dcat:downloadURL <https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/571915/Outstanding_Providers_List_2015-16.ods> ;
            dct:format <ODS> ;
            dct:title "Outstanding Providers list 2015-2016" ] ;   
    dcat:keywords "Education" ;
    dcterms:license [  
            dct:title "Open Government Licence" ;
            schema:url <http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/> ] ;
    dct:title "Outstanding providers list" ;
    dct:identifier <https://data.gov.uk/dataset/63f9c959-00b6-4c51-b165-47f387ff7881/outstanding-providers-list> .
``` 


So, my questions/comments would be:
- using ```dcterms:relation``` in this way to point to multiple files that are not really distributions is simple and useful way to cover the use case, which wasn't cover in DCAT before
- I'm using [```dcat:downloadURL```](https://w3c.github.io/dxwg/dcat/#Property:distribution_downloadurl) above, but this is wrong as it has domain ```dcat:Distribution``` - what property to use instead? ```dcat:accessURL``` is also for distributions.  
- supporting the use of ```dcterms:relation``` in this way, it is quite likely that developers would choose this simple representation even when the use of ```dcat:distribution``` would be appropriate; so, do we need to encourage the use of the richer semantics representation as per @dr-shorthair examples (through guidance documentation in the spec, a primer, examples, etc) and what would be the consequences of people using the simple representation instead? 
 

-- 
GitHub Notification of comment by agbeltran
Please view or discuss this issue at https://github.com/w3c/dxwg/issues/253#issuecomment-404432434 using your GitHub account
Received on Thursday, 12 July 2018 08:27:40 UTC

This archive was generated by hypermail 2.3.1 : Monday, 25 March 2019 10:33:24 UTC