W3C home > Mailing lists > Public > public-lod@w3.org > April 2008

Re: 13.1 billion triples

From: Chris Bizer <chris@bizer.de>
Date: Fri, 4 Apr 2008 16:28:28 +0200
Message-ID: <001a01c89660$26877ee0$418d2da0@wrz03715>
To: "Yves Raimond" <yves.raimond@gmail.com>, <public-lod@w3.org>


Hi Yves,

nice coup :-)

And great new data sources which provide an important enrichment to 
the LOD cloud!

Richard: Updating the cloud diagram seam to develop into day-to-day 
business :-)

Hmm, what do we write as as triple estimate on the LOD project page 
now?

Currently we are having an estimate of 2 billion INTERLINKED triples.

The question is now, how do we count sparcely connected data sources 
like the MySpace or AudioScrobbler wrappers which could potentially 
provide wast amounts of RDF but where most of it can currently not be 
found by RDF crawlers and browsers as it is not interlinked from other 
sources?

The same question applies to our RDF Book Mashup that wraps the Amazon 
book database.

I guess an OKish heuristic could be: Count all triples that descibe 
resources that have at least one RDF link pointing at them.

Yves: Any idea how your figures change when this rule is applied?

A nice weekend to everybody.

Cheers,

Chris





--
Prof. Dr. Chris Bizer
Freie Universitšt Berlin
Phone: +49 30 838 54057
Mail: chris@bizer.de
Web: www.bizer.de

----- Original Message ----- 
From: "Yves Raimond" <yves.raimond@gmail.com>
To: <public-lod@w3.org>
Sent: Friday, April 04, 2008 3:52 PM
Subject: 13.1 billion triples


>
> Hello!
>
> Well, I guess everything is in the title :-) All the services 
> running
> on dbtune.org now provide access to approximately 13.1 billion
> triples.
> Here is the breakdown of such an estimation (really rough, though):
>
> http://blog.dbtune.org/post/2008/04/02/DBTune-is-providing-131-billion-triples
>
> I wish the data sources could provide the linked data themselves, 
> now
> - my poor little server begins to be a bit overwhelmed :)
>
> Cheers,
> y
> 
Received on Friday, 4 April 2008 14:34:24 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:16 UTC