W3C home > Mailing lists > Public > public-lod@w3.org > October 2009

Re: The Power of Virtuoso Sponger Technology

From: Giovanni Tummarello <g.tummarello@gmail.com>
Date: Sun, 18 Oct 2009 15:01:41 +0100
Message-ID: <210271540910180701k46d921f8k134ab5727bad20d7@mail.gmail.com>
To: Olaf Hartig <hartig@informatik.hu-berlin.de>
Cc: "public-lod@w3.org" <public-lod@w3.org>
I'd say, if i understand well

that that works only for queries where you need the extra dereferenced
data just "additionally" e.g. to add a label to your result se
if you need the remote, on the fly reference data to e.g. sort by
price you'd have to fetch all from the remote site ..


On Sun, Oct 18, 2009 at 2:57 PM, Olaf Hartig
<hartig@informatik.hu-berlin.de> wrote:
> Hey,
> On Sunday 18 October 2009 09:37:14 Martin Hepp (UniBW) wrote:
>> [...]
>> So it will boil down to technology that combines (1) crawling and
>> caching rather stable data sets with (2) distributing queries and parts
>> of queries among the right SPARQL endpoints (whatever actual DB
>> technology they expose).
>> You can keep a text index of the whole Web, if crawling cycles in the
>> order of magnitude of weeks are fine. For structured, linked data that
>> exposes dynamic database content, "dumb" crawling and caching will not
>> scale.
> Interesting discussion!
> An alternative approach to query federation is the link traversal based query
> execution as implemented in the SemWeb Client Lib. The main idea of this
> approach is to look-up URIs during the query execution itself. With this
> approach you don't rely on the existence of SPARQL endpoints and -even more
> important- you don't have to know all the sources that contribute to the query
> result in advance. Plus, the results are based on the most up-to-date data you
> can get.
> Greetings,
> Olaf
Received on Sunday, 18 October 2009 14:02:35 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:20:53 UTC