- From: Seaborne, Andy <andy.seaborne@hp.com>
- Date: Thu, 05 Apr 2007 11:45:23 +0100
- To: Lee Feigenbaum <feigenbl@us.ibm.com>
- CC: public-rdf-dawg@w3.org
Lee Feigenbaum wrote: > I'd like to know if anyone is motivated by Chimezie's comment suggesting > that a FROM * and a FROM NAMED * be added to SPARQL to "provide an > unambiguous way to specify a dataset which corresponds to all the known > named graphs." > > I'm wary of adding this for a couple of reasons: > > 1/ I can't imagine how such a construct would be defined such that it was > any different from the implementation-defined state which currently exists > when FROM and FROM NAMED are omitted. (And, therefore, the construct > doesn't seem to add anything new or newly interoperable to the > specification.) > > 2/ Existing implementations solve this problem within the current bounds > of SPARQL (see the IRC chat log cited for two examples) > > > If you have a strong feeling one way or the other, please let it be known > so that I can gauge whether the group has consensus (and either reply to > Chimezie or slot this item on our teleconference agenda for next week). > > Lee I agree with Lee on both points (1) and (2) above. I don't think it is a simple matter of changing FROM NAMED to include *. Using * does not give consistency. Indeed, of the ways to address this, such an indeterminate construct seems to be the wrong approach. The next issue will be "what does FROM NAMED * resolve to?" (i.e. SELECT DISTINCT ?g { GRAPH ?g {?s ?p ?o } }) The same query sent to a different places will give different answers - that's a feature of the service asked and dataset offered. The idea of a "known universe" only exists as a fixed concept in some situations. The number of graphs has to be fixed in time, else * means different things are different times without indication As a web language the "known universe" is not a meaningful concept. The fact that the application query has to name the named graphs is important. Otherwise, in the extreme, "FROM NAMED *" means all reachable URLs. What we don't have is way of naming datasets - FROM/FROM NAMED provide a partial description of one. SPARQL is more graph centric than dataset centric and the implications of a theory of datasets goes way beyond DAWG. I don't completely understand the example of the XML Query because there it is the documents in the store - the difference is merely "implicit all" (SPARQL) and "explicit, indeterminate all" (using *). Note: The protocol is open to new parameters (HTTP) - so add domain specific indicators there. After all, we have had a serious attempt at removing FROM/FROM NAMED from the query language altogether but the API needs and scripting put it back in again. Andy > > > ----- Forwarded by Lee Feigenbaum/Cambridge/IBM on 04/05/2007 03:16 AM > ----- > > "Chimezie Ogbuji" <ogbujic@ccf.org> > Sent by: public-rdf-dawg-comments-request@w3.org > 04/04/2007 05:05 PM > Please respond to > ogbujic@ccf.org > > > To > public-rdf-dawg-comments@w3.org > cc > > Subject > No way to specify an RDF dataset of all the known named graphs > > > > > > > > This was discussed in #swig > (http://chatlogs.planetrdf.com/swig/2007-04-03.html#T20-38-01) > > SPARQL currently does not provide an unambiguous way to specify a > dataset which corresponds to all the known named graphs. The only way > this can be done is to leave out FROM <..> and FROM NAMED <..> > directives in the prolog (and from the protocol, for SPARQL services). > The corresponding dataset in this case depends on the host application - > not very consistent. The only other alternative is to explicitly > enumerate the known universe in the prolog: > > FROM NAMED G1 > FROM NAMED G2 > ... > FROM NAMED GN > > This is not practical for a dynamic dataset. > > There is plenty of value in querying against the known universe > consistently especially for applications which make use of a dataset as > a named graph partition that can grow indefinitely. Consider XPath > 2.0 / XQuery 1.0 which supports querying a collection of XML documents > without having to explicitly enumerate all the XML documents in the > collection. > > This is a very useful 'database-wide' query pattern which is well > supported in document-management languages but not supported in SPARQL > without assuming the implementation will consistently supply the dataset > corresponding to all the known named graphs in persistence in the > absence of any dataset directives in the prolog or at the protocol > level. > > Other than OWA or CWA issues, I don't see why an explicit syntax for > binding to such a dataset is not supported by SPARQL to provide a > consistent way for applications to dispatch these kinds of queries. > Such a syntax was suggested in the above conversation: > > FROM NAMED * > > [1] http://www.w3.org/TR/xquery-semantics/#sec_fn_doc_collection > -- Hewlett-Packard Limited Registered Office: Cain Road, Bracknell, Berks RG12 1HN Registered No: 690597 England
Received on Thursday, 5 April 2007 10:45:34 UTC