- From: Jaroslav Pullmann <jaroslav.pullmann@fit.fraunhofer.de>
- Date: Mon, 10 Jul 2017 15:43:20 +0200
- To: public-dxwg-wg@w3.org
Dear all,

in order to support today's discussion I created an overview of related standards and their concept of "Dataset" [1]. It is not meant to be exhaustive, just providing a starting point.

Best regards
Jaroslav

[1] https://drive.google.com/file/d/0BwQz5mWMUijvLU05cGx1ZHFocDQ

On 07.07.2017 14:38, Makx Dekkers wrote:
> Jaroslav,
>
> Is it really necessary to assume a particular purpose for a Dataset? It seems to me that requiring a Dataset "to serve a particular, even hypothetical information need and any kind of digital processing and not simply the task of distribution (software)" creates a restriction that might render existing implementations non-conformant. For example, I know of work where Software Distributions are modelled as dcat:Distributions under a Software Solution modelled as a dcat:Dataset.
>
> As to the CKAN definition of Dataset, I think that every CKAN Dataset is a dcat:Dataset but the inverse would not necessarily be true.
>
> Makx.
>
> -----Original Message-----
> From: Jaroslav Pullmann [mailto:jaroslav.pullmann@fit.fraunhofer.de]
> Sent: 07 July 2017 11:19
> To: public-dxwg-wg@w3.org
> Subject: Re: Agenda July 10, 2017 / Summary of Dataset related issues
>
> Dear all,
>
> my initial assumption that DCAT is about registries of (structured) bulk data was indeed challenged
> by the mentioned use cases, since they consider any type of digital artifact including service endpoints.
> As already discussed in the context of ID6, Web Services are an example of a dynamic Distribution which exposes
> a particular representation of the rather abstract Dataset (~ Information resource), so I'd exclude
> them from the Dataset layer.
>
> I'd intuitively expect the content of a Dataset to serve a particular, even hypothetical information
> need and any kind of digital processing and not simply the task of distribution (software).
>
> Here the working definition of a "dataset" in CKAN seems relevant [1]:
>
> "For CKAN purposes, data is published in units called datasets. A dataset is a parcel of data -
> for example, it could be the crime statistics for a region, the spending figures for a government
> department, or temperature readings from various weather stations."
>
> In the next telcon I'll try to summarize the issues, but we may ask Andrea to comment on his detailed UC ID20.
>
> Best regards
> Jaroslav
>
> [1] http://docs.ckan.org/en/latest/user-guide.html
>
> On 07.07.2017 08:46, Makx Dekkers wrote:
>>
>> Karen,
>>
>> Not sure how to introduce the related use cases.
>>
>> As far as I see, the issue seems to be that people may apply, may want to apply, and are applying, DCAT to various types of data collections.
>>
>> Questions to consider could be:
>>
>> 1. is there a need to be able to identify the type of data(set) that is being described?
>> 2. if so, how can the type of data be identified -- i.e. in which property and using which (kinds of) controlled vocabularies?
>>
>> Maybe Jaroslav has other ideas?
>>
>> Makx.
>>
>> -----Original Message-----
>> From: Karen Coyle [mailto:kcoyle@kcoyle.net]
>> Sent: 06 July 2017 18:24
>> To: public-dxwg-wg@w3.org
>> Subject: Re: Agenda July 10, 2017
>>
>> Thanks, Makx. It would be helpful if you or Jaroslav (or both!) could introduce this set of use cases that you've suggested, with what you see as a best approach.
>>
>> kc
>>
>> On 7/6/17 9:05 AM, Makx Dekkers wrote:
>>> Karen, all,
>>>
>>> If I may, I would suggest not to start the discussion trying to define what a Dataset is. It seems to me that it would be more practical to look at what Simon mentions at the end of his problem statement "a way for a DCAT description to indicate the 'type' of dataset involved".
>>>
>>> Makx.
>>>
>>> -----Original Message-----
>>> From: Karen Coyle [mailto:kcoyle@kcoyle.net]
>>> Sent: 06 July 2017 17:28
>>> To: public-dxwg-wg@w3.org
>>> Subject: Agenda July 10, 2017
>>>
>>> https://www.w3.org/2017/dxwg/wiki/Meetings:Telecon2017.07.10
>>>
>>> This agenda follows Jaroslav's and Makx's suggestions by initiating the discussion of "what is a dataset?"
>>>
>>> kc
>>> --
>>> Karen Coyle
>>> kcoyle@kcoyle.net http://kcoyle.net
>>> m: 1-510-435-8234 (Signal)
>>> skype: kcoylenet/+1-510-984-3600
>>>
>>
>> --
>> Karen Coyle
>> kcoyle@kcoyle.net http://kcoyle.net
>> m: 1-510-435-8234 (Signal)
>> skype: kcoylenet/+1-510-984-3600
>>
>
> --
> Jaroslav Pullmann
> Fraunhofer Institute for Applied Information Technology FIT User-Centered Ubiquitous Computing Schloss Birlinghoven | D-53757 Sankt Augustin | Germany
> Phone: +49-2241-143620 | Fax: +49-2241-142146
>

--
Jaroslav Pullmann
Fraunhofer Institute for Applied Information Technology FIT User-Centered Ubiquitous Computing Schloss Birlinghoven | D-53757 Sankt Augustin | Germany
Phone: +49-2241-143620 | Fax: +49-2241-142146
Received on Monday, 10 July 2017 13:44:07 UTC