Re: [semanticweb] ANN: DBpedia 3.5 released

Hi,

2010/4/14 Kingsley Idehen <kidehen@openlinksw.com>:
> When we refer to an "option" we are talking about a mirror rather than
> an alternative place where DBpedia data sets have been loaded.

I deliberately didn't use the word "mirror" as that sets expectations
around offering same features, using same technology, etc. So I meant
what I said: there are other SPARQL endpoints that provide live,
public access to the dbpedia data.

> As for usage levels, the issues have very little to do we sane SPARQL
> query and everything to do with crawlers that actually attempt to
> perform wholesale imports of the entire data set (many attempt this as
> we can seen from the HTTP logs and the payload sizes). In addition,
> remember, we are severing up actual RDF based descriptor resources, and
> these too are crawled wholesale with the intent of populating other data
> spaces (these are also crawled aggressively via LOD and non LOD crawlers).
>
> We are not just providing a SPARQL endpoint, we are also serving RDF
> descriptor resources in a variety of representation formats. And as I've
> stated above, the dominant use pattern is crawling the RDF descriptor
> resources, which (without protection) simply obliterates "across the
> wire bandwidth" as is the case with any document server on a public
> network such as the World Wide Web.

Yes I'm aware of what dbpedia is, and also the challenges of running a
live operational service :)

I was just curious about usage volumes. We all talk about how central
dbpedia is in the LOD cloud picture, and wondered if there was any
publicly accessible metrics to help add some detail that.

Cheers,

L.

-- 
Leigh Dodds
Programme Manager, Talis Platform
Talis
leigh.dodds@talis.com
http://www.talis.com

Received on Wednesday, 14 April 2010 14:18:08 UTC