Re: The Power of Virtuoso Sponger Technology from Olaf Hartig on 2009-10-18 (public-lod@w3.org from October 2009)

From: Olaf Hartig <hartig@informatik.hu-berlin.de>
Date: Sun, 18 Oct 2009 15:57:24 +0200
To: "public-lod@w3.org" <public-lod@w3.org>
Message-Id: <200910181557.27150.hartig@informatik.hu-berlin.de>

Hey,

On Sunday 18 October 2009 09:37:14 Martin Hepp (UniBW) wrote:
> [...]
> So it will boil down to technology that combines (1) crawling and
> caching rather stable data sets with (2) distributing queries and parts
> of queries among the right SPARQL endpoints (whatever actual DB
> technology they expose).
>
> You can keep a text index of the whole Web, if crawling cycles in the
> order of magnitude of weeks are fine. For structured, linked data that
> exposes dynamic database content, "dumb" crawling and caching will not
> scale.

Interesting discussion!

An alternative approach to query federation is the link traversal based query 
execution as implemented in the SemWeb Client Lib. The main idea of this 
approach is to look-up URIs during the query execution itself. With this 
approach you don't rely on the existence of SPARQL endpoints and -even more 
important- you don't have to know all the sources that contribute to the query 
result in advance. Plus, the results are based on the most up-to-date data you 
can get.

Greetings,
Olaf

Received on Sunday, 18 October 2009 13:57:52 UTC