Re: Agenda July 10, 2017 / Summary of Dataset related issues / Wiki DCAT in Context

  Dear Karen, dear all

   based on the diagram I created a Wiki page [1] that puts DCAT in context
  of standards and related work (e.g. CKAN). While still preliminary it might
  help to work out the (missing) characteristics of central DCAT concepts.
  Please feel free to edit and augment the page!

   Best regards
  Jaroslav

  [1] https://www.w3.org/2017/dxwg/wiki/DCAT_Context



On 10.07.2017 23:19, Karen Coyle wrote:
> Jaro - Thanks for doing this analysis. However, we need it on our wiki
> in a form that others can contribute. I don't see anything here that
> couldn't be text or a table - can you create a wiki page with the
> information?
> 
> Thanks,
> kc
> 
> On 7/10/17 6:43 AM, Jaroslav Pullmann wrote:
>>
>>   Dear all,
>>
>>    in order to support today's discussion I created an overview
>>   of related standards and their concept of "Dataset" [1].
>>   It is not meant to be exhaustive, just providing a starting point..
>>
>>     Best regards
>>    Jaroslav
>>
>>   [1] https://drive.google.com/file/d/0BwQz5mWMUijvLU05cGx1ZHFocDQ
>>
>>
>>
>> On 07.07.2017 14:38, Makx Dekkers wrote:
>>> Jaroslav,
>>>
>>> It is really necessary to assume a particular purpose for a Dataset?
>>> It seems to me that requiring a Dataset "to serve a particular, even
>>> hypothetical information need and any kind of digital processing and
>>> not simply the task of distribution (software)" creates a restriction
>>> that might render existing implementation non-conformant. For example,
>>> I know if work where Software Distributions are modelled as
>>> dcat:Distributions under a Software Solution modelled as a dcat:Dataset.
>>>
>>> As to the CKAN definition of Dataset, I think that every CKAN Dataset
>>> is a dcat:Dataset but the inverse would not necessarily be true.
>>>
>>> Makx.
>>>
>>>
>>>
>>> -----Original Message-----
>>> From: Jaroslav Pullmann [mailto:jaroslav.pullmann@fit.fraunhofer.de]
>>> Sent: 07 July 2017 11:19
>>> To: public-dxwg-wg@w3.org
>>> Subject: Re: Agenda July 10, 2017 / Summary of Dataset related issues
>>>
>>>
>>>      Dear all,
>>>
>>>        my initial assumption DCAT is about registries of (structured)
>>> bulk data was in deed challenged
>>>      by the mentioned use cases, since they consider any type of
>>> digital artifact including service endpoints.
>>>      As already discussed in context of ID6, Web Services are a sample
>>> of dynamic Distribution which exposes
>>>      a particular representation of the rather abstract Dataset (~
>>> Information resource), so I'd exclude
>>>      them from the Dataset layer.
>>>          I'd intuitively expect the content of a Dataset to serve a
>>> particular, even hypothetical information
>>>      need and any kind of digital processing and not simply the task of
>>> distribution (software).
>>>
>>>      Here the working definition of a "dataset" in CKAN seems relevant
>>> [1]:
>>>          "For CKAN purposes, data is published in units called
>>> datasets. A dataset is a parcel of data -
>>>       for example, it could be the crime statistics for a region, the
>>> spending figures for a government
>>>       department, or temperature readings from various weather stations."
>>>
>>>      In next telcon I'll try to summarize the issues, but we may ask
>>> Andrea to comment on his detailed UC ID20.
>>>
>>>      Best regards
>>>     Jaroslav
>>>      
>>>     [1] http://docs.ckan.org/en/latest/user-guide.html
>>>
>>>         
>>>
>>> On 07.07.2017 08:46, Makx Dekkers wrote:
>>>>
>>>> Karen,
>>>>
>>>> Not sure how to introduce the related use cases.
>>>>
>>>> As far as I see, the issue seems to be that people may apply, may
>>>> want to apply, and are applying, DCAT to various types of data
>>>> collections.
>>>>
>>>> Questions to consider could be:
>>>>
>>>> 1. is there a need to be able to identify the type of data(set) that
>>>> is being described?
>>>> 2. if so, how can the type of data be identified -- i.e. in which
>>>> property and using which (kinds of) controlled vocabularies?
>>>>
>>>> Maybe Jaroslav has other ideas?
>>>>
>>>> Makx.
>>>>
>>>>
>>>>
>>>> -----Original Message-----
>>>> From: Karen Coyle [mailto:kcoyle@kcoyle.net]
>>>> Sent: 06 July 2017 18:24
>>>> To: public-dxwg-wg@w3.org
>>>> Subject: Re: Agenda July 10, 2017
>>>>
>>>> Thanks, Makx. It would be helpful if you or Jaroslav (or both!) could
>>>> introduce this set of use cases that you've suggested, with what you
>>>> see as a best approach.
>>>>
>>>> kc
>>>>
>>>> On 7/6/17 9:05 AM, Makx Dekkers wrote:
>>>>> Karen, all,
>>>>>
>>>>> If I may, I would suggest not to start the discussion trying to
>>>>> define what a Dataset is. It seems to me that it would be more
>>>>> practical to look at what Simon mentions at the end of his problem
>>>>> statement "a way for a DCAT description to indicate the 'type' of
>>>>> dataset involved".
>>>>>
>>>>> Makx.
>>>>>
>>>>>
>>>>> -----Original Message-----
>>>>> From: Karen Coyle [mailto:kcoyle@kcoyle.net]
>>>>> Sent: 06 July 2017 17:28
>>>>> To: public-dxwg-wg@w3.org
>>>>> Subject: Agenda July 10, 2017
>>>>>
>>>>> https://www.w3.org/2017/dxwg/wiki/Meetings:Telecon2017.07.10
>>>>>
>>>>> This agenda follows Jaroslav's and Makx's suggestions by initiating
>>>>> the discussion of "what is a dataset?"
>>>>>
>>>>> kc
>>>>> -- 
>>>>> Karen Coyle
>>>>> kcoyle@kcoyle.net http://kcoyle.net
>>>>> m: 1-510-435-8234 (Signal)
>>>>> skype: kcoylenet/+1-510-984-3600
>>>>>
>>>>>
>>>>>
>>>>
>>>> -- 
>>>> Karen Coyle
>>>> kcoyle@kcoyle.net http://kcoyle.net
>>>> m: 1-510-435-8234 (Signal)
>>>> skype: kcoylenet/+1-510-984-3600
>>>>
>>>>
>>>
>>> -- 
>>> Jaroslav Pullmann
>>> Fraunhofer Institute for Applied Information Technology FIT
>>> User-Centered Ubiquitous Computing Schloss Birlinghoven | D-53757
>>> Sankt Augustin | Germany
>>> Phone: +49-2241-143620 | Fax: +49-2241-142146
>>>
>>>
>>
> 

-- 
Jaroslav Pullmann
Fraunhofer Institute for Applied Information Technology FIT
User-Centered Ubiquitous Computing
Schloss Birlinghoven | D-53757 Sankt Augustin | Germany
Phone: +49-2241-143620 | Fax: +49-2241-142146

Received on Tuesday, 11 July 2017 10:18:08 UTC