- From: Eric Prud'hommeaux <eric@w3.org>
- Date: Wed, 20 May 2009 11:46:55 -0400
- To: Boris Motik <boris.motik@comlab.ox.ac.uk>
- Cc: 'Alan Ruttenberg' <alanruttenberg@gmail.com>, "'Seaborne, Andy'" <andy.seaborne@hp.com>, public-rdf-text@w3.org, 'Sandro Hawke' <sandro@w3.org>, 'Axel Polleres' <axel.polleres@deri.org>
- Message-ID: <20090520154655.GC28987@w3.org>
On Wed, May 20, 2009 at 05:23:01PM +0200, Boris Motik wrote: > I honestly don't see how this comment is relevant to the present discussion. I think it brought us from "it's a SPARQL problem" to "it's a problem for any app which doesn't support certain entailments". > Each device that does something with RDF needs to decide what it wants to do > with it. Here are its options: > > 1. It can choose to use D-entailment. Well, if this is what the device wants to > do, then I'm afraid it needs to implement D-entailment. rdf:text is no different > from xsd:integer and xsd:decimal in that respect. > > 2. It can choose to use simple entailment. Well, if this is what the device > wants to do, then it should just do it. rdf:text is no different from > xsd:integer and xsd:decimal in that respect either. > > 3. It can choose to do whatever it wants. Well, the precise definitions of > rdf:text don't matter anyway then. > > Nothing in our specification requires an implementation to select 1, 2, or 3. > Hence, this issue is completely orthogonal to rdf:text. At issue here is whether you want the inferences allowed by the rdf:text specification to be available to to tools which do not implement rdf:text-entailment. I argue that this is an achievable goal as the code required to implement rdf:text entailment (either "bob"@en => "bob@en"^^rdf:text or "bob@en"^^rdf:text => "bob"@en can be pushed into the machinery which performs these inferences. You can still use existing machinery to infer literals of type rdf:text so long at as that graph never interacts with any other graph (it's not detectably RDF from the outside world). The likely implementation for e.g. OWLIM or Oracle's Jena Adaptor would be to trigger restrictions of type rdf:text on plain literals, and record rdf:text inferences as plain literals. > Regards, > > Boris > > > > -----Original Message----- > > From: Eric Prud'hommeaux [mailto:eric@w3.org] > > Sent: 20 May 2009 17:19 > > To: Boris Motik > > Cc: 'Alan Ruttenberg'; 'Seaborne, Andy'; public-rdf-text@w3.org; 'Sandro > > Hawke'; 'Axel Polleres' > > Subject: Re: A summary of the proposal for resolving the issues with rdf:text > > --> Could you please check it one more time? > > > > On Wed, May 20, 2009 at 03:39:29PM +0200, Boris Motik wrote: > > > Hello, > > > > > > This is a purely SPARQL problem: SPARQL should specify precisely what the > > > semantics of BGPs under the D-entailment regime is. > > > > Why would SPARQL be the only device ever be used to access an RDF graph? > > > > Do you favor that every device used to access and RDF graph implement > > all of D-entailment and rdf:text and anything like it that comes along? > > > > In that case, aren't you demanding any deployed code which may be used > > to access an RDF graph (Oracle, Jena, Sesame, Virtuoso, Allegrograph, > > ...), through query or graph API or parsed directly into an application > > data structure, be updated to support various entailments? > > > > > > > I am just going to briefly speculate as to how this might be done. I > > strongly > > > believe this should be done declaratively -- that is, without taking into > > > account implementations. Hence, one might use the following definition: > > > > > > Given an RDF graph G and a BBP Q, a substitution s for variables in Q is > > > an answer to G and Q iff G D-entails s(Q). > > > > > > Take the following example: > > > > > > G = { <a, b, "01"^^xsd:integer> } > > > Q = { <a, b, ?x> } > > > > > > Then, the following substitutions are answers to Q over G: > > > > > > s1 = { ?x --> "1"^^xsd:integer } > > > s2 = { ?x --> "01"^^xsd:integer } > > > s3 = { ?x --> "1"^^xsd:decimal } > > > s4 = { ?x --> "001.000"^^xsd:decimal } > > > etc. > > > > > > Clearly, such a definition is not practical, so the SPARQL WG should think > > of a > > > solution. One possible solution would be to say that each literal needs to > > be > > > normalized; in this case, one would return only s1 as a result. There are > > > clearly other possibilities as well, so I will stop speculating here. > > > > > > > > > This is a purely SPARQL problem, and not that of RDF, XML Schema, or > > rdf:text. > > > SPARQL can define answers to such queries however it wants; probably the > > only > > > constraint is that the answers should be sound w.r.t. D-entailment. > > Furthermore, > > > we should clearly separate concerns here. I see the stack of specifications > > as > > > follows: > > > > > > 1. RDF defines the notion of D-entailment from an RDF graph. For this, you > > need > > > to have a lexical space, a value space, and a lexical-to-value mapping for > > each > > > datatype you are using. > > > > > > 2. Various datatypes provide these, and thus define the D-consequences of an > > RDF > > > graph. As the above example shows, there can be many consequences, but > > that's > > > already a problem with basic XML Schema datatypes. > > > > > > 3. By relying on the definitions of D-entailment in RDF and the datatypes, > > > SPARQL has to find a way to make some sense of the examples such as the one > > > given above. This definition should probably be independent from the actual > > set > > > of datatypes (because people may and will add their own datatypes). > > > > > > > > > rdf:text resides at level 2 of this stack and is therefore completely > > > independent of the SPARQL questions. Furthermore, as the above example > > shows, > > > these questions exist even without rdf:text. Therefore, I believe the > > rdf:text > > > WG has done its job. > > > > > > Regards, > > > > > > Boris > > > > > > > -----Original Message----- > > > > From: Alan Ruttenberg [mailto:alanruttenberg@gmail.com] > > > > Sent: 20 May 2009 14:05 > > > > To: Boris Motik > > > > Cc: Eric Prud'hommeaux; Seaborne, Andy; public-rdf-text@w3.org; Sandro > > Hawke; > > > > Axel Polleres > > > > Subject: Re: A summary of the proposal for resolving the issues with > > rdf:text > > > > --> Could you please check it one more time? > > > > > > > > Hello Boris, > > > > > > > > In what forum do you suggest this be addressed? > > > > > > > > -Alan > > > > > > > > On Wed, May 20, 2009 at 7:38 AM, Boris Motik > > > > <boris.motik@comlab.ox.ac.uk> wrote: > > > > > Hello, > > > > > > > > > > I fully appreciate use case and I agree with your observation: this is > > > > something > > > > > that has to be addressed. I don't think, however, that solving this > > problem > > > > is > > > > > in the domain of rdf:text. The rdf:text specification merely defines yet > > > > another > > > > > datatype by specifying it in exactly the same way as this is done in XML > > > > Schema. > > > > > This datatype is just like any other XML Schema datatype; hence, the job > > > > from > > > > > rdf:text's point of view is done. > > > > > > > > > > Furthermore, the addition of rdf:text to the mix of the supported > > datatypes > > > > adds > > > > > no new conceptual problems to SPARQL: the situation with rdf:text is no > > > > > different than with, say, xsd:integer (there are other examples as > > well). > > > > For > > > > > example, assume that you have an RDF graph > > > > > > > > > > G = { <a, b, "1"^xsd:integer> } > > > > > > > > > > but you ask the query > > > > > > > > > > Q = { <a, b, "1.0"^^xsd:decimal> }. > > > > > > > > > > Clearly, G D-entails Q, so Q should be answered as TRUE in G. It is not > > the > > > > > business of XML Schema to specify how this is to be achieved: XML Schema > > > > merely > > > > > specifies what the correct answer to the above question is. It is a > > SPARQL > > > > > implementation such as OWLIM that should think of how to support such a > > > > > definition. > > > > > > > > > > I don't know whether a solution to the above problem (with xsd:integer > > and > > > > > xsd:decimal) exists. If not, I agree that one should be developed; > > however, > > > > we > > > > > would not go to the XML Schema WG and ask them to specify how should > > SPARQL > > > > > handle this case, would we? > > > > > > > > > > The problem with rdf:text is *precisely* the same as the one that I > > outlined > > > > > above. At an abstract level, it can be stated as "Several syntactic > > forms of > > > > > literals get mapped to the semantically identical data values". AS > > > > demonstrated > > > > > above, this problem exists without rdf:text, so I don't see how rdf:text > > > > brings > > > > > anything new into the whole picture. Thus, you can apply to the rdf:text > > > > case > > > > > exactly the same solution that you would apply to xsd:integer and > > > > xsd:decimal. > > > > > If such a solution doesn't exist yet, then the SPARQL WG should address > > > > these > > > > > issues, and it should do so in general for all datatypes (xsd:integer, > > > > > xsd:decimal, and so on), not just for rdf:text. > > > > > > > > > > To summarize, I think that the work from the point of view of the > > rdf:text > > > > WG is > > > > > *done* and that we should not do anything else in this forum. > > > > > > > > > > Regards, > > > > > > > > > > Boris > > > > > > > > > >> -----Original Message----- > > > > >> From: Eric Prud'hommeaux [mailto:eric@w3.org] > > > > >> Sent: 20 May 2009 13:18 > > > > >> To: Boris Motik > > > > >> Cc: 'Seaborne, Andy'; 'Alan Ruttenberg'; public-rdf-text@w3.org; > > 'Sandro > > > > >> Hawke'; 'Axel Polleres' > > > > >> Subject: Re: A summary of the proposal for resolving the issues with > > > > rdf:text > > > > >> --> Could you please check it one more time? > > > > >> > > > > >> On Wed, May 20, 2009 at 09:29:00AM +0200, Boris Motik wrote: > > > > >> > Hello, > > > > >> > > > > > >> > I don't see the benefit of option 1, as it makes things unnecessarily > > > > >> complex. > > > > >> > The fewer exceptions we have, the easier it will be to actually > > implement > > > > a > > > > >> > conformant system. The dichotomy between plain und typed literals is > > just > > > > an > > > > >> > example of an exception that just makes implementation difficult. > > Instead > > > > of > > > > >> > introducing more special cases, I think we should unify these > > whenever > > > > >> possible. > > > > >> > > > > > >> > Furthermore, I'm not sure whether sorting out things such as the ones > > > > >> pointed > > > > >> > out below is necessary to finalize the rdf:text specification. Please > > > > note > > > > >> that > > > > >> > rdf:text already has a well-defined lexical and value space, and this > > is > > > > >> *the > > > > >> > only* thing that we need to be able to plug rdf:text into the model > > > > theory > > > > >> of > > > > >> > RDF. That is, given RDF graphs G1 and G2 possibly containing rdf:text > > > > >> literals > > > > >> > and/or plain literals, using the definitions from the present > > rdf:text > > > > >> > specification one can unambiguously answer the question whether G1 D- > > > > entails > > > > >> G2. > > > > >> > For example, if G1 is > > > > >> > > > > > >> > <a, b, "abc@en"^^rdf:text> > > > > >> > > > > > >> > and G2 is > > > > >> > > > > > >> > <a, b, "abc"@en> > > > > >> > > > > > >> > then, according to the existing RDF model theory document, G1 D- > > entails > > > > G2 > > > > >> and > > > > >> > vice versa. I don't see what else is there for the rdf:text > > specification > > > > to > > > > >> do: > > > > >> > I really think that the specification is complete. If SPARQL or other > > > > >> > specifications want to apply rdf:text in a different way and create > > > > special > > > > >> > cases, they are free to do so; however, I don't think it is in scope > > of > > > > the > > > > >> > rdf:text specification to solve all such problems. > > > > >> > > > > >> (Hesitantly re-stating use case), consider the use case of the OWLIM > > > > >> plugin for Sesame. If OWLIM forward chains some triples into the > > > > >> Sesame repository with objects like "bob"@en, existing SPARQL queries > > > > >> on the existing Sesame engine will match them as expected. RIF rules > > > > >> can consume those triples and know that any rules applying to a domain > > > > >> of rdf:text apply. > > > > >> > > > > >> Constrast that with an OWLIM which emits triples with objects like > > > > >> "bob@en"^^rdf:text . These triples will not match conventional queries > > > > >> intended to discover e.g. all the folks named "Bob". The Sesame SPARQL > > > > >> implementation can be extended, but then we are in Pat's scenario of > > > > >> fixing RDF by visiting all the deployed code. > > > > >> > > > > >> I expect that any design of rdf:text would have it reacting to plain > > > > >> literals as if they had a datatype of rdf:text and the appropriate > > > > >> lexical transformation. I propose that the simplest complete design is > > > > >> one where the inference of rdf:text objects results in their > > > > >> expression as plain literals, avoiding a dualism between > > > > >> "bob@en"^^rdf:text and "bob"@en which would lose interroperability > > > > >> with existing queries, graph APIs, XPaths operating on SPARQL Results, > > > > >> non-OWL inferencing systems, ... > > > > >> > > > > >> > > > > >> > Regards, > > > > >> > > > > > >> > Boris > > > > >> > > > > > >> > > -----Original Message----- > > > > >> > > From: public-rdf-text-request@w3.org [mailto:public-rdf-text- > > > > >> request@w3.org] > > > > >> > > On Behalf Of Eric Prud'hommeaux > > > > >> > > Sent: 20 May 2009 03:18 > > > > >> > > To: Seaborne, Andy > > > > >> > > Cc: Alan Ruttenberg; public-rdf-text@w3.org; Boris Motik; Sandro > > Hawke; > > > > >> Axel > > > > >> > > Polleres > > > > >> > > Subject: Re: A summary of the proposal for resolving the issues > > with > > > > >> rdf:text > > > > >> > > --> Could you please check it one more time? > > > > >> > > > > > > >> > > On Tue, May 19, 2009 at 03:57:11PM +0000, Seaborne, Andy wrote: > > > > >> > > > Apologies: > > > > >> > > > > > > > >> > > > > On Fri, May 15, 2009 at 11:50 AM, Seaborne, Andy > > > > >> <andy.seaborne@hp.com> > > > > >> > > wrote: > > > > >> > > > >> Monday PM end before 18:00 (GMT+1) > > > > >> > > > >> Thursday PM. > > > > >> > > > >> Tuesday @17:00 (GMT+1) for a short call; end before 17:30. > > > > >> > > > > > > > >> > > > I can't make the slot. > > > > >> > > > > > > > >> > > > Input: please consider interoperability of data between OWL and > > RDF. > > > > >> Option > > > > >> > > 1 is better for that than option 2 as Eric points out. > > > > >> > > > > > > > >> > > > This is also the least change to LC and IMHO is not a substantive > > > > change > > > > >> (it > > > > >> > > follows on from the current graph exchange intent) to add the text > > > > needed > > > > >> for > > > > >> > > SPARQL. Roughly: the scoping graph of an rdf-text aware D- > > entailment > > > > for > > > > >> BGP > > > > >> > > matching includes the RDF forms and does not include ^^rdf:text. > > (Non- > > > > >> aware > > > > >> > > entailment regimes would merely treat as a datatype form.) > > > > >> > > > > > > >> > > does anyone oppose option 1 (plain literals are considered to > > satisfy > > > > >> > > entailments constrained to type rdf:text and entailments of type > > > > rdf:text > > > > >> are > > > > >> > > expressed as plain literals in the RDF graph)? (i'm wondering if we > > can > > > > >> work > > > > >> > > this out before we work out scheduling this phone call.) > > > > >> > > > > > > >> > > > > > > >> > > > Andy > > > > >> > > > > > > > >> > > > > -----Original Message----- > > > > >> > > > > From: Alan Ruttenberg [mailto:alanruttenberg@gmail.com] > > > > >> > > > > Sent: 19 May 2009 16:01 > > > > >> > > > > To: Axel Polleres > > > > >> > > > > Cc: Seaborne, Andy; public-rdf-text@w3.org; Boris Motik; Sandro > > > > Hawke; > > > > >> > > > > eric@w3.orf > > > > >> > > > > Subject: Re: A summary of the proposal for resolving the issues > > > > with > > > > >> > > > > rdf:text --> Could you please check it one more time? > > > > >> > > > > > > > > >> > > > > On Mon, May 18, 2009 at 10:03 AM, Axel Polleres > > > > >> <axel.polleres@deri.org> > > > > >> > > > > wrote: > > > > >> > > > > > Alan, since you were calling for the TC, is that fixed now? > > > > >> > > > > > Otherwise, I am afraid it is not possible before Friday. > > > > >> > > > > > > > > >> > > > > Yes, let's have whoever can make it meet at 5:30 BST = 12:30 > > Boston > > > > >> > > > > time. > > > > >> > > > > Zakim, meet on irc #rdftext for the code. I will send a code > > > > earlier > > > > >> if > > > > >> > > > > I can. > > > > >> > > > > > > > > >> > > > > -Alan > > > > >> > > > > > > >> > > > > >> > > > > >> office: +1.617.258.5741 32-G528, MIT, Cambridge, MA 02144 USA > > > > >> mobile: +1.617.599.3509 > > > > >> > > > > >> (eric@w3.org) > > > > >> Feel free to forward this message to any list for any purpose other > > than > > > > >> email address distribution. > > > > > > > > > > > > -- -eric office: +1.617.258.5741 32-G528, MIT, Cambridge, MA 02144 USA mobile: +1.617.599.3509 (eric@w3.org) Feel free to forward this message to any list for any purpose other than email address distribution.
Received on Wednesday, 20 May 2009 15:47:08 UTC