Re: ANN: - Offering 3.2 billion quads current RDFa, Microdata and Miroformat data extracted from 65.4 million websites

That would be a nice first step. And then stopping to claim that the stats show the actual status of the "data web" ;-)

On Apr 17, 2012, at 9:23 PM, Dan Brickley wrote:

> How about adding a disclaimer line to the site like
> "Note that the many database-backed sites contain a huge long tail of
> rarely-visited, rarely-linked pages (e.g. product catalogues), but
> which increasingly contain useful structured data. It is best not to
> assume that this collection contains a complete, deep crawl of every
> site it touches."
> Dan

martin hepp
e-business & web science research group
universitaet der bundeswehr muenchen

phone:   +49-(0)89-6004-4217
fax:     +49-(0)89-6004-4620
www: (group) (personal)
skype:   mfhepp 
twitter: mfhepp

Check out GoodRelations for E-Commerce on the Web of Linked Data!
* Project Main Page:

Received on Tuesday, 17 April 2012 19:29:33 UTC