W3C home > Mailing lists > Public > public-lod@w3.org > November 2008

Re: Size matters -- How big is the danged thing

From: Yves Raimond <yves.raimond@gmail.com>
Date: Thu, 20 Nov 2008 15:27:16 +0000
Message-ID: <82593ac00811200727s28564ea2ye29a4ea0f1df9d27@mail.gmail.com>
To: "Giovanni Tummarello" <giovanni.tummarello@deri.org>
Cc: "Jim Hendler" <hendler@cs.rpi.edu>, public-lod@w3.org

On Thu, Nov 20, 2008 at 2:36 PM, Yves Raimond <yves.raimond@gmail.com> wrote:
> On Thu, Nov 20, 2008 at 1:26 PM, Giovanni Tummarello
> <giovanni.tummarello@deri.org> wrote:
>>>
>>> dbtune.org provides at least 14 billion triples (see
>>> http://blog.dbtune.org/post/2008/04/02/DBTune-is-providing-131-billion-triples
>>> + the Musicbrainz D2R server at http://dbtune.org/musicbrainz/, so I
>>> guess you'd need a pretty big phone to aggregate all that :-)
>>
>> .. thus the problem with wrappers should they be counted in ?
>>
>
> Indeed. But after all, even a database exposed via Virtuoso or D2R can
> actually be considered as a wrapper. It's easy enough to estimate the
> number of triples a wrapper provides, by analysing the source data, so
> why not counting them?
>
>>> outdated... For example, at http://www.bbc.co.uk/programmes, we
>>> publish at least 10 billion triples. I guess the number of triples at
>>> http://www.bbc.co.uk/music/beta must be quite large as well.
>>
>> that's like 15 times wikipedia,? how's that composed?
>>
>

Arg, sorry, I mistyped - that's 10 *million*, not 10 billions (and I
consistently mistyped on twitter as well, sorry about that).
But dbtune is really around 15 *billion* (including wrappers).

Sorry for the confusion!

y

> http://www.bbc.co.uk/programmes/
>
> Lots of information about all BBC programmes: brands, series,
> episodes, versions, broadcasts, etc...
>
> Cheers!
> y
>
>> Giovanni
>>
>
Received on Thursday, 20 November 2008 15:27:51 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:20:43 UTC