- From: Olaf Hartig <hartig@informatik.hu-berlin.de>
- Date: Sun, 18 Oct 2009 15:57:24 +0200
- To: "public-lod@w3.org" <public-lod@w3.org>
Hey, On Sunday 18 October 2009 09:37:14 Martin Hepp (UniBW) wrote: > [...] > So it will boil down to technology that combines (1) crawling and > caching rather stable data sets with (2) distributing queries and parts > of queries among the right SPARQL endpoints (whatever actual DB > technology they expose). > > You can keep a text index of the whole Web, if crawling cycles in the > order of magnitude of weeks are fine. For structured, linked data that > exposes dynamic database content, "dumb" crawling and caching will not > scale. Interesting discussion! An alternative approach to query federation is the link traversal based query execution as implemented in the SemWeb Client Lib. The main idea of this approach is to look-up URIs during the query execution itself. With this approach you don't rely on the existence of SPARQL endpoints and -even more important- you don't have to know all the sources that contribute to the query result in advance. Plus, the results are based on the most up-to-date data you can get. Greetings, Olaf
Received on Sunday, 18 October 2009 13:57:52 UTC