- From: Boris Villazon-Terrazas <bvillazon@isoco.com>
- Date: Thu, 24 Jul 2014 22:28:01 +0200
- To: Christian Bizer <chris@bizer.de>, public-lod@w3.org
- Message-ID: <53D16C51.3020700@isoco.com>
Thanks Chris, Max and Heiko for your hard work! We will try to do our best to include more Spanish and Latin American datasets Best Boris On 24/07/2014 14:18, Christian Bizer wrote: > > Hi all, > > Max Schmachtenberg, Heiko Paulheim and I have crawled of the Web of > Linked Data and have drawn an updated LOD Cloud diagram based on the > results of the crawl. > > This diagram showing all linked datasets that our crawler managed to > discover in April 2014 is found here: > > http://data.dws.informatik.uni-mannheim.de/lodcloud/2014/ISWC-RDB/LODCloudDiagram.png > > We also analyzed the compliance of the different datasets with the > Linked Data best practices and a paper presenting the results of the > analysis is found below. The paper will appear at ISWC 2014 in the > Replication, Benchmark, Data and Software Track. > > http://dws.informatik.uni-mannheim.de/fileadmin/lehrstuehle/ki/pub/SchmachtenbergBizerPaulheim-AdoptionOfLinkedDataBestPractices.pdf > > The raw data used for our analysis is found on this page: > > http://data.dws.informatik.uni-mannheim.de/lodcloud/2014/ISWC-RDB/ > > Our crawler did discover 77 dataset that do not allow crawling via > their robots.txt files and these datasets were not included into our > analysis and are also not included in the current version of the LOD > Cloud diagram. > > A list of these datasets is found at > http://data.dws.informatik.uni-mannheim.de/lodcloud/2014/ISWC-RDB/tables/notCrawlableDatasets.tsv > > In order to give a comprehensive overview of all Linked Data sets that > are currently online, we would like to draw another version of the LOD > Cloud diagram including the datasets that our crawler has missed as > well as the datasets that do not allow crawling. > > Thus, if you publish or know about linked datasets that are not in the > diagram or in the list of not crawlable datasets yet, please: > > 1.Enter them into the datahub.io data catalog until August 8^th . > > 2.Tag them in the catalog with the tag 'lod' > (http://datahub.io/dataset?tags=lod) > > 3.Send an email to Max and Chris pointing us at the entry in the catalog. > > We will include all datasets into the updated version of the cloud > diagram, that fulfill the following requirements: > > 1.Data items are accessible via dereferencable URIs. > > 2.The dataset sets at least 50 RDF links pointing at other datasets or > at least one other dataset is setting 50 RDF links pointing at your > dataset. > > Instructions on how to describe your dataset in the catalog are found > here: > > https://www.w3.org/wiki/TaskForces/CommunityProjects/LinkingOpenData/DataSets/CKANmetainformation > > Please make sure that you include information about the RDF links > pointing from your dataset into other datasets (field links: ) as well > as a tag indicating the topical category of your dataset, so that we > know how to include it into the diagram. > > Please also include an example URI from your dataset into the catalog. > > We will start to review the new datasets and to draw the updated > version of the LOD cloud diagram after August 8^th . > > So please point us at datasets to be included before this date. > > Cheers, > > Max, Heiko, and Chris > > -- > > Prof. Dr. Christian Bizer > > Data and Web Science Research Group > > Universität Mannheim, Germany > chris@informatik.uni-mannheim.de > > www.bizer.de >
Received on Thursday, 24 July 2014 20:29:04 UTC