Re: Next version of the LOD cloud diagram. Please provide input, so that your dataset is included.

On Sat, Sep 4, 2010 at 3:43 PM, Chris Bizer <chris@bizer.de> wrote:
> Hi Alan,
>
>> I think you should consider having some better quality control
>
> and
>
>> Yes, unfortunate. A similar audit should be done for the sets
>> that are named on the LOD (also "open") cloud.
>
> LOD is an open community effort to which everybody can contribute.
>
> So rather than to criticize the work that other people do on collecting
> meta-information about the datasets in the LOD cloud, you are more than
> welcome to quality-control 20 billion triples.

I have just spent some time evaluating one source and reported to you
the result. Perhaps you might act on this investment in time and thank
me for doing so. You might find that the result was myself and more
people doing such quality control.

-Alan

>
> Best,
>
> Chris
>
>
> -----Ursprüngliche Nachricht-----
> Von: public-lod-request@w3.org [mailto:public-lod-request@w3.org] Im Auftrag
> von Alan Ruttenberg
> Gesendet: Samstag, 4. September 2010 18:47
> An: Anja Jentzsch
> Cc: public-lod@w3.org; Leigh Dodds; Chris Bizer; Jonathan Gray
> Betreff: Re: Next version of the LOD cloud diagram. Please provide input, so
> that your dataset is included.
>
> On Sat, Sep 4, 2010 at 8:35 AM, Anja Jentzsch <anja@anjeve.de> wrote:
>> Hi Alan,
>>
>> CKAN is a repository for all kinds of datasets. Even if datasets are not
> open or only for non-commercial use, they can be listed and information on
> licensing can be noted (Other - Closed, e.g.). This is still a valuable
> information.
>
> Hello Anja,
>
> My comment was not a commentary on CKAN, it was a comment on specific
> data set and it's relation to the LOD cloud - please have a closer
> read.
>
> However, now that you mention it, the opening line on the CKAN website
> says: "CKAN is a registry of open data and content packages." The
> words "open data and content" are linked to
> http://www.opendefinition.org/ which explains what open means (it does
> not mean closed).
>
> So one of two things should be fixed with CKAN - either the statement
> on the front page should be changed to make it clear that it also
> registers closed data, or the closed data entries should be expunged.
>
>> If no license is specified or we did not find the license information,
> CKAN lists the datasets as "not open".
>
> Same comment re: having CKAN present a consistent view of what it does.
>
>> Leigh Dodds had a closer look at the licenses of the LOD datasets some
> time ago [1]. It is sad but true that only about 23% of all datasets come
> along with a clearly defined license.
>
> Yes, unfortunate. A similar audit should be done for the sets that are
> named on the LOD (also "open") cloud.
>
>> Hopefully data publishers will more clearly state the licenses along with
> their datasets to encourage people to use their data.
>
> Here we agree, and part of my work is doing exactly that.
>
> Regards,
> Alan
>
>>
>> Cheers,
>> Anja
>>
>> [1]
> http://iswc2009.semanticweb.org/wiki/index.php/ISWC_2009_Tutorials/Legal_and
> _Social_Frameworks_for_Sharing_Data_on_the_Web#Slides
>>
>> On 03.09.2010 20:43, Alan Ruttenberg wrote:
>>> I think you should consider having some better quality control and
>>> standards around this, as I feel it is somewhat misleading. For
>>> example (and this is one of several), consider CAS which is named in
>>> the diagram. I don't consider the contents of that set to include any
>>> data. Here is an example:
>>>
>>> http://cu.bio2rdf.org/cas:921-60-8
>>>
>>> Subject
>>> http://bio2rdf.org/cas:921-60-8
>>>
>>> Predicate     Object
>>> http://bio2rdf.org/bio2rdf_resource:url
> http://bio2rdf.org/html/cas:921-60-8
>>> (Non-RDF URI)
>>> http://www.w3.org/2002/07/owl#sameAs  http://cas.bio2rdf.org/cas:921-60-8
>>> (External link)
>>>
>>> This is content free.
>>>
>>> In addition, the documentation of that set says it is not open:
>>> http://ckan.net/package/bio2rdf-cas
>>>
>>> Although this URI might be used to link somehow, in my opinion it is
>>> misleading to call this collection a linked open *data* set. Further,
>>> including it will do damage to LOD reputation if anyone actually looks
>>> past that diagram to see what is really there.
>>>
>>> Sincerely,
>>>
>>> Alan Ruttenberg
>>>
>>>
>>> On Fri, Sep 3, 2010 at 2:00 PM, Jonathan Gray<jonathan.gray@okfn.org>
>  wrote:
>>>> FYI, we blogged this here:
>>>>
>>>>
>  http://blog.okfn.org/2010/09/03/next-version-of-the-linked-open-data-cloud-
> based-on-ckan/
>>>>
>>>> All are, of course, most welcone to join ckan-discuss list if there
>>>> are any specific suggestions for features we should add:
>>>>
>>>>  http://lists.okfn.org/mailman/listinfo/ckan-discuss
>>>>
>>>> We will be continuing to develop CKAN's support for LOD/semantic web
>>>> technologies over the coming months (and years)! ;-)
>>>>
>>>> On Fri, Sep 3, 2010 at 5:03 PM, Leigh Dodds<leigh.dodds@talis.com>
>  wrote:
>>>>> Hi Chris, Anja
>>>>>
>>>>> On 3 September 2010 15:17, Chris Bizer<chris@bizer.de>  wrote:
>>>>>> In theory, the list is automatically updated with data from CKAN.
>>>>>>
>>>>>> But as the CKAN server is overloaded today, the list is currently
> corrupted
>>>>>> and only shows a fraction of the datasets.
>>>>>>
>>>>>> We hope that the issue is solved in the next hours!
>>>>>
>>>>> Thanks for the confirmation!
>>>>>
>>>>> Cheers,
>>>>>
>>>>> L.
>>>>>
>>>>> --
>>>>> Leigh Dodds
>>>>> Programme Manager, Talis Platform
>>>>> Talis
>>>>> leigh.dodds@talis.com
>>>>> http://www.talis.com
>>>>>
>>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> Jonathan Gray
>>>>
>>>> Community Coordinator
>>>> The Open Knowledge Foundation
>>>> http://blog.okfn.org
>>>>
>>>> http://twitter.com/jwyg
>>>> http://identi.ca/jwyg
>>>>
>>>>
>>
>>
>
>

Received on Saturday, 4 September 2010 20:15:16 UTC