Re: Question for DCAT "experts"

Hi there,

a colleague of mine, Marvin Frommhold, is researching versioning in the
context of RDF and Linked Data. He contributes the following points:

The following two documents provide a basic introduction to versioning
of datasets:

  * Papakonstantinou, Vassilis et al. “Versioning for Linked Data:
    Archiving Systems and Benchmarks.” BLINK@ISWC, 2016.
    <http://ceur-ws.org/Vol-1700/paper-05.pdf>
      o Section 2 of this paper provides an introduction to different
        archiving strategies.
  * Gray, Alasdair J. G. et al. “Dataset Descriptions: HCLS Community
    Profile.” W3C Interest Group Note, May 2015.
    <https://www.w3.org/TR/hcls-dataset/>
      o A W3C Interest Group Note that, among other things, discusses
        requirements for dataset versioning.
      o "The Data Catalog Vocabulary (DCAT) [DCAT
        <https://www.w3.org/TR/hcls-dataset/#DCAT>] is used to describe
        datasets in catalogs, but does not deal with the issue of
        dataset evolution and versioning."

He agrees that change sets are related to versioning in that a version
can be described as a set of changes. Fully realized, this allows very
granular tracking of dataset evolution. Makx's point is important here:
these changes are granular descriptions of the evolving content of a
dataset, whereas DCAT so far does little to describe the data itself. If
DCAT started to describe the content and structure of the data, this
would be a considerable expansion of its scope.
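
To make the "version as a set of changes" idea concrete, here is a
minimal sketch in Python. It is only an illustration under my own
assumptions: triples are modelled as plain string tuples, and the names
ChangeSet, apply_change_set and diff are hypothetical, not taken from
DCAT or from any of the papers above.

```python
from dataclasses import dataclass
from typing import FrozenSet, Tuple

# Assumption: a triple is a (subject, predicate, object) tuple of strings.
Triple = Tuple[str, str, str]


@dataclass(frozen=True)
class ChangeSet:
    """Hypothetical change set: triples added and removed between versions."""
    added: FrozenSet[Triple] = frozenset()
    removed: FrozenSet[Triple] = frozenset()


def apply_change_set(version: FrozenSet[Triple], change: ChangeSet) -> FrozenSet[Triple]:
    """Derive the next version by removing, then adding, the listed triples."""
    return (version - change.removed) | change.added


def diff(old: FrozenSet[Triple], new: FrozenSet[Triple]) -> ChangeSet:
    """Compute the change set that turns `old` into `new`."""
    return ChangeSet(added=new - old, removed=old - new)


# Illustrative data only; the identifiers are made up.
v1 = frozenset({
    ("ex:record1", "ex:title", "Old title"),
    ("ex:record2", "ex:title", "Another title"),
})
change = ChangeSet(
    added=frozenset({("ex:record1", "ex:title", "New title")}),
    removed=frozenset({("ex:record1", "ex:title", "Old title")}),
)
v2 = apply_change_set(v1, change)
```

A catalogue entry that records only a modification date sees v1 and v2
as the same "blob"; the change set is what carries the information
about what happened inside it.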

The question of whether a set of changes constitutes a new dataset, or
whether a whole database is a dataset, is complicated to me, because I
understand instances of dcat:Dataset as conceptual descriptions of
datasets, largely independent of the structure of the underlying data.
In that sense, a database, or a web service considered independently of
any particular query, can also be a dataset. Limiting the data retrieved
from it by some API call or SQL query could then create a new dataset
fully contained in the first one.
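
As a rough illustration of the containment idea, and again only as a
sketch under my own assumptions: the whole database is modelled as a
set of records, and a query is modelled as a filter over it. The record
layout and the function name restrict are hypothetical.

```python
from typing import Callable, FrozenSet, Tuple

# Assumption: a record is a (title, year) pair; real records would be richer.
Record = Tuple[str, int]


def restrict(dataset: FrozenSet[Record],
             query: Callable[[Record], bool]) -> FrozenSet[Record]:
    """Return the records selected by `query`, as a dataset in its own right."""
    return frozenset(record for record in dataset if query(record))


# Illustrative data only.
database = frozenset({("Title A", 2015), ("Title B", 2017)})

# Stands in for an API call or SQL query that limits the retrieved data.
recent = restrict(database, lambda record: record[1] >= 2017)
```

Here both database and recent could be described as a dcat:Dataset, and
recent is by construction a subset of database.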

cheers,
Martin

Am 22/06/17 um 11:00 schrieb Makx Dekkers:
> Yes, I agree it is. Updating 'in place' is a case where the publisher decides that a change does not create a new Dataset. 
>
> I find Karen's suggestion to treat a 'database' as a 'dataset' interesting -- I have always thought of a database as closer to a dcat:Catalog.
>
> Makx.
>
>
> -----Original Message-----
> From: Peter.Winstanley@gov.scot [mailto:Peter.Winstanley@gov.scot] 
> Sent: 22 June 2017 10:52
> To: mail@makxdekkers.com; public-dxwg-wg@w3.org
> Subject: RE: Question for DCAT "experts"
>
> isn't a change set (like a diff) just a special case of versioning?
>
> -----Original Message-----
> From: Makx Dekkers [mailto:mail@makxdekkers.com]
> Sent: 22 June 2017 09:47
> To: public-dxwg-wg@w3.org
> Subject: RE: Question for DCAT "experts"
>
> As far as I remember from the initial work on DCAT, a Dataset is considered to be a kind of blob. Nothing is said about what goes on 'inside' a Dataset. The only thing you see on the outside is the modification date but you don't know what has changed inside. 
> Makx
>
> -----Original Message-----
> From: Karen Coyle [mailto:kcoyle@kcoyle.net]
> Sent: 21 June 2017 17:31
> To: public-dxwg-wg@w3.org
> Subject: Question for DCAT "experts"
>
> Many of you know DCAT quite well, and I'm new to it, so I'm taking the lazy way and directing this as a question to you.
>
> I see in DCAT that there are properties that define frequency and update dates. The update date is
>
> "Most recent date on which the dataset was changed, updated or modified."
>
> The library world has a number of databases that are updated "in place".
> For anyone receiving updates, the updates do not include the entire file, only those records added, changed, or deleted since some set time.
>
> Is this covered by DCAT? If not, I will add a use case and we can discuss.
>
> Thanks,
> kc
> --
> Karen Coyle
> kcoyle@kcoyle.net http://kcoyle.net
> m: 1-510-435-8234 (Signal)
> skype: kcoylenet/+1-510-984-3600

-- 
Martin Brümmer
Linked Data Consultant

phone +49 341 26508028
martin.bruemmer@eccenca.com

Postal address:
eccenca GmbH | Hainstraße 8 | 04109  Leipzig | Germany

Board of Directors: Hans-Chr. Brockmann
Domicile and Court of Registry: Leipzig
Commercial Register No.: 29201
VAT registration No.: DE 289172708


Received on Thursday, 22 June 2017 09:35:14 UTC