Re: ANN: WebDataCommons.org - Offering 3.2 billion quads current RDFa, Microdata and Miroformat data extracted from 65.4 million websites

On 4/17/12 3:29 PM, Martin Hepp wrote:
> That would be a nice first step. And then stopping to claim that the stats show the actual status of the "data web" ;-)

Narrative should be more about: showcasing the least amount of 
structured data on the burgeoning Web of Linked Data :-)

Kingsley
>
>
> On Apr 17, 2012, at 9:23 PM, Dan Brickley wrote:
>
>> How about adding a disclaimer line to the webdatacommons.org site like
>>
>> "Note that the many database-backed sites contain a huge long tail of
>> rarely-visited, rarely-linked pages (e.g. product catalogues), but
>> which increasingly contain useful structured data. It is best not to
>> assume that this collection contains a complete, deep crawl of every
>> site it touches."
>>
>> Dan
> --------------------------------------------------------
> martin hepp
> e-business&  web science research group
> universitaet der bundeswehr muenchen
>
> e-mail:  hepp@ebusiness-unibw.org
> phone:   +49-(0)89-6004-4217
> fax:     +49-(0)89-6004-4620
> www:     http://www.unibw.de/ebusiness/ (group)
>           http://www.heppnetz.de/ (personal)
> skype:   mfhepp
> twitter: mfhepp
>
> Check out GoodRelations for E-Commerce on the Web of Linked Data!
> =================================================================
> * Project Main Page: http://purl.org/goodrelations/
>
>
>
>
>


-- 

Regards,

Kingsley Idehen	
Founder&  CEO
OpenLink Software
Company Web: http://www.openlinksw.com
Personal Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca handle: @kidehen
Google+ Profile: https://plus.google.com/112399767740508618350/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen

Received on Tuesday, 17 April 2012 20:13:39 UTC