- From: Arthur Ryman <ryman@ca.ibm.com>
- Date: Wed, 6 Aug 2014 08:15:01 -0400
- To: Dimitris Kontokostas <kontokostas@informatik.uni-leipzig.de>
- Cc: Holger Knublauch <holger@topquadrant.com>, "public-rdf-sha." <public-rdf-shapes@w3.org>
Dimitris, You wrote: "@Arthur, Eric, Jose It would be great if you could shed some light in the actual differences between ShEx and Shapes (besides the syntax and the property group extensions by ShEx)" Shapes is a high-level RDF vocabulary that let's you assert many commonly occurring constraints such as occurrence and range. It is not a programming language. It is not extensible. It does not have a formal semantics. It has no extension mechansim. It is aimed at developers who understand similar constraints in OO, database, etc. It is also intended to be a source of metadata for a variety of other tools such as query builders, form builders, and documentation generators. ShEx is a programming language that can express the constraints defined by Shapes, and many others since you can write down arbitrarily complex ShEx expressions. It is complementary to Shapes in that it let's you express more constraints. The price you pay for the increased expressiveness is that you need to understand a new programming language. BTW, there was some question of what is meant by a high-level RDF vocabulary. This is of course subjective, however Shapes, RDFS and OWL are high-level since they introduce terms for high-level concepts rather then define them using rules that use terms for lower level concepts. For example, the SPIN vocabulary for expressing SPARQL as RDF is a low-level vocabulary. Regards, ___________________________________________________________________________ Arthur Ryman, PhD Chief Data Officer, Rational Chief Architect, Portfolio & Strategy Management Distinguished Engineer | Master Inventor | Academy of Technology Toronto Lab | +1-905-413-3077 (office) | +1-416-939-5063 (mobile) From: Dimitris Kontokostas <kontokostas@informatik.uni-leipzig.de> To: Holger Knublauch <holger@topquadrant.com>, Cc: "public-rdf-sha." <public-rdf-shapes@w3.org> Date: 08/03/2014 04:32 AM Subject: Re: Proposed change to the charter, section 4. Deliverables, Recommendation Track On Sun, Aug 3, 2014 at 9:21 AM, Holger Knublauch <holger@topquadrant.com> wrote: A pragmatic proposal: I do believe there is consensus that this WG can potentially create some useful and relevant output that could lead to broader use cases for semantic web technology as a whole. There are several proposals on the table that are potentially complementary to each other. Assuming there are enough people who actually sign up for the work, why not produce multiple deliverables that cover more use cases? 1) Shapes + SPIN, with an explicit mandate to have semantics that are executable by SPARQL engines, from day one. The alternative would be ShEx and this could be figured out in the beginning of the WG. This deliverable would be like an extension to SPARQL. plus 2) OWL closed world semantics, so that existing OWL ontologies can be reused. This deliverable would basically be an "appendix" to the OWL 2 spec. This would allow the interest groups to stay on their home turf without blocking each other, because blocking each other would be the worst outcome of all. I see no technical difficulties with such as stack, because these technologies are complementary to each other: we use OWL closed world + SPIN all the time, and it works well in practice. In fact I believe both specifications have a good and stable starting point so that we could proceed with the process very swiftly. >From my POV it would be more interesting to do a comparison in action by implementing PROV Constraints (http://www.w3.org/TR/prov-constraints/) in all proposed solutions (ShEx, SPIN, Shapes, ICV, OWL). I think PROV is a nice use case because it has a moderate complexity and can reveal weaknesses / strengths in terms of expressivity, readability & compactness. A plus would be to capture the constraints of the PROV ontology in CWA. Paul Groth already made an attempt to port these constraints in SPIN [1] but I am not aware if these are complete or optimized. In addition this would be a great contribution to the PROV community. @Arthur, Eric, Jose It would be great if you could shed some light in the actual differences between ShEx and Shapes (besides the syntax and the property group extensions by ShEx) Best, Dimitris [1] https://github.com/pgroth/prov-constraints-validator-spin/blob/master/python/prov-constraints.py Peter, would this be an acceptable direction for you? Thanks, Holger On 8/3/14, 9:14 AM, Peter F. Patel-Schneider wrote: I have indicated that I am satisfied with the current mention of possible solutions at the end of Section 3. The work currently mentioned there is targeted towards RDF validation. I am not in favour of including work that is not targeted towards RDF validation. If SPARQL fits into this category then I am not in favour of including SPARQL in this part of the charter at all. I have never indicated that all possible solutions have to be included in the charter. Suggesting this is just inflammatory. I have never indicated that no possible solutions can be included in the charter. Suggesting this is also inflammatory. I am in favour of including multiple possible solutions, and in favour of including the ones that I think have sufficient gravitas. Why do you think that SPARQL should be given the prominent place in the charter that it has in your suggested wording for deliverables? peter On 08/02/2014 03:37 PM, Irene Polikoff wrote: Sort of. I believe you are interpreting "a blank sheet" in a very literal sense of the words which is clearly not what I meant. Other than that, I wanted to clarify if you are saying: 1) It is OK to state the possible solution in the charter as long as it is not mentioned in the deliverables section or 2) It is OK to state the possible solution in the charter as long as it is not SPARQL or 3) The charter should not mention any possible solutions anywhere or 4) The charter should equally mention all possible solutions that may exist in the world irrespective of their maturity and the number of such possible alternatives or 5) some combination of the above or something else altogether Irene -----Original Message----- From: Peter F. Patel-Schneider [mailto:pfpschneider@gmail.com] Sent: Saturday, August 02, 2014 6:11 PM To: Irene Polikoff; 'Arthur Ryman'; public-rdf-shapes@w3.org Subject: Re: Proposed change to the charter, section 4. Deliverables, Recommendation Track You indeed have misunderstood my position. I don't think that I have previously indicated that starting with a blank sheet is suboptimal, although I do agree that it is generally not a good idea. No charter draft that I have seen does start with a blank sheet, so this is somewhat of a moot point. There are several implemented and deployed systems that claim to provide constraints for RDF. Not all of them are given meaning in terms of SPARQL. There is new work that claims to provide constraints for RDF. It is not based on SPARQL at all. I have seen no work showing that SPARQL is adequate as an underpinning of all these systems. Under these circumstances I feel that it is completely wrong to have SPARQL mentioned at all in the deliverables of the working group. It this clear enough for you? peter On 08/02/2014 02:54 PM, Irene Polikoff wrote: Peter, I thought you agreed that starting with a blank sheet has proven to be suboptimal to ensuring success of a working group and its timely progress. And that while you were not against SPARQL per se, you were concerned with mandating its use in case the working group finds that it does not satisfy requirements. Since my proposed wording addresses both of the issues, I must have misunderstood your position. Irene -----Original Message----- From: Peter F. Patel-Schneider [mailto:pfpschneider@gmail.com] Sent: Saturday, August 02, 2014 5:09 PM To: Irene Polikoff; 'Arthur Ryman'; public-rdf-shapes@w3.org Subject: Re: Proposed change to the charter, section 4. Deliverables, Recommendation Track Again, your proposed wording goes far beyond simply mentioning a possible solution. Your proposed wording says that a particular approach is to be followed unless it is shown to be inadequate. I do not support this proposed wording. I do support wording like what is currently at the end of Section 3. peter On 08/02/2014 01:21 PM, Irene Polikoff wrote: Because it identifies a possible solution and says that if it proves to be inadequate, the working group will choose another approach. Thus, the prime candidate is identified, but the group is not mandated to use it. -----Original Message----- From: Peter F. Patel-Schneider [mailto:pfpschneider@gmail.com] Sent: Saturday, August 02, 2014 4:18 PM To: Irene Polikoff; 'Arthur Ryman'; public-rdf-shapes@w3.org Subject: Re: Proposed change to the charter, section 4. Deliverables, Recommendation Track Why would you think that this would be acceptable to me? This goes well beyond pointing at possible solutions or mentioning possible starting points. peter On 08/02/2014 01:13 PM, Irene Polikoff wrote: Then, Arthur's strawman with the modification below should be acceptable: The WG MUST produce: 1. A high-level RDF vocabulary that expresses commonly occurring constraints. 2. The semantics of the high-level constraints expressed in terms of SPARQL or, if SPARQL proves to be unsuitable for the use cases determined by the group, in an alternative language. 3. An RDF extension mechanism for expressing additional constraints, expressed in SPARQL. The WG MAY produce: 1. A new compact, human readable syntax for expressing constraints with a corresponding semantics expressed in SPARQL or, if SPARQL proves to be unsuitable for the use cases determined by the group, in an alternative language. 2. A specification of how constraint validation interacts with inference. 3. A specification for graph normalization. [1] http://www.w3.org/2014/data-shapes/charter Irene -----Original Message----- From: Peter F. Patel-Schneider [mailto:pfpschneider@gmail.com] Sent: Saturday, August 02, 2014 1:17 AM To: Irene Polikoff; 'Arthur Ryman'; public-rdf-shapes@w3.org Subject: Re: Proposed change to the charter, section 4. Deliverables, Recommendation Track I don't see much wrong in pointing at potential solutions, and even less mentioning possible starting points, but that's not what the proposed change to the charter is. Instead the proposed change mandates SPARQL as the semantics of constraints. I think that this is in advance of any worked-out proposal for actually using SPARQL for this purpose. If it turns out that SPARQL can be used to specify the semantics of constraints, then it may be reasonable to use SPARQL for this purpose. However, mandating SPARQL's use even before its suitability has been demonstrated doesn't seem to me to be a good idea. peter On 08/01/2014 06:54 PM, Irene Polikoff wrote: Not necessarily. As Jeremy Carroll said in one of the previous e-mails "W3C experience indicates that a WG with a blank sheet at the beginning often delivers the wrong stuff, late - and the broad ship that sails ends up as a narrow clique on arrival". It is better to set a charter that identifies the prime candidate. As the work proceeds, the group may decide that the candidate doesn?t meet key requirements, but having initial stake in the ground is important to the effectiveness of the group, its ability to deliver on time and quality of its deliverables. Irene -----Original Message----- From: Peter F. Patel-Schneider [mailto:pfpschneider@gmail.com] Sent: Friday, August 01, 2014 5:53 PM To: Arthur Ryman; public-rdf-shapes@w3.org Subject: Re: Proposed change to the charter, section 4. Deliverables, Recommendation Track All these arguments may be fine, but isn't that up to the working group to decide? peter On 08/01/2014 02:42 PM, Arthur Ryman wrote: Peter, Thx for the response. I'm glad that we agree on making the compact, human-friendly syntax optional. Here is my rationale for advocating for the use of SPARQL. 1. We need to be precise about the meaning of constraints. Therefore, we need to select some formalism for expressing the semantics. 2. The candidates for expressing semantics include: 2.1 Natural language 2.2 OWL ICV 2.3 SPARQL 2.4 Z 2.5 some other existing formal language 2.6 a new specification language that the wg invents Pros and Cons 2.1 Natural language is imprecise and non-executable 2.2 OWL ICV is well-defined and executable, but not a W3C standard and is not expressive enough as a general constraint language 2.3 SPARQL is a W3C standard, is very expressive, and is executable 2.4 Z is very expressive but not well known and is non-executable 2.5 There are many other formal specification languages. Does anyone want to advocate for one? 2.6 A new formalism - not the core focus of the workgroup. 3. Some one will need to build a reference implementation. A SPARQL implementation will be easy to build if the semantics of the constraints are in SPARQL. 4. We want the specification to be implemented and adopted. SPARQL is a known quantity and many implementations exist. Regards, __________________________________________________________________ _ _ _ _ _____ Arthur Ryman, PhD Chief Data Officer, Rational Chief Architect, Portfolio & Strategy Management Distinguished Engineer | Master Inventor | Academy of Technology Toronto Lab | +1-905-413-3077 (office) | +1-416-939-5063 (mobile) From: "Peter F. Patel-Schneider" <pfpschneider@gmail.com> To: Arthur Ryman/Toronto/IBM@IBMCA, public-rdf-shapes@w3.org, Date: 08/01/2014 05:22 PM Subject: Re: Proposed change to the charter, section 4. Deliverables, Recommendation Track I do agree that the emphasis in the charter on creating a human-readable syntax is misplaced and that this deliverable should be made optional. The proposal here, however, does very much more than fixing this problem, and I view most of the other changes in the proposal as undesirable. Why should the group be required to specify semantics in terms of SPARQL? There hasn't even been anything to show that SPARQL is adequate for this purpose. Why should the group be required to specify two ways of expressing constraints, and one of them be SPARQL itself? There hasn't been anything to show that this division is needed. Why should a human-readable syntax for constraints be new? There hasn't been anything to show that existing syntaxes are unsuitable for this purpose. I don't think that it is appropriate to tie the hands of the working group in any of these ways. I do agree that the group should be required to define the meaning of constraints. This was a peculiar lack in the deliverables. So my proposal would be to change the recommendation track deliverables to something like: 1. A syntax and semantics for shapes specifying how to construct shape expressions and how shape expressions are evaluated against RDF graphs. 2. An RDF vocabulary for expressing these shapes in RDF triples, so they can be stored, queried, analyzed, and manipulated with normal RDF tools. 3. OPTIONAL A specification of how shape verification interacts with inference. 4. OPTIONAL A compact, human-readable syntax for expressing shapes. I would prefer the third deliverable to be required, but I'm not going to complain if it is optional. Although there are three syntaxes mentioned in these deliverables, in keeping with the usual RDF situation there is nothing saying that all of these three need to be different or even that they are not all the same. (Well, nothing beyond the implausibility of using RDF triples as the basis of a compact, human-readable syntax.) peter On 08/01/2014 12:44 PM, Arthur Ryman wrote: The output of the wg is defined by its deliverables. Here is the current text [1] Recommendation Track: 1. Compact, human readable syntax for expressing constraints on RDF graph patterns (aka shapes), suitable for the use cases determined by the group. This syntax might be a variation of an existing standard, such as templates for SPARQL, or something new, such as ShExC. 2. An RDF vocabulary, such as Resource Shapes 2.0, for expressing these shapes in RDF triples, so they can be stored, queried, analyzed, and manipulated with normal RDF tools. The WG MAY produce a Recommendation for graph normalization. This text is not acceptable to IBM because of the primary emphasis it places on defining a possibly new compact, human readable syntax. I believe this concern has been expressed repeatedly by many people on the mailing list. Many people have indicated a strong preference for building on existing standards. However, we have not seen any corresponding modification of the charter. I'd therefore like to propose a strawman change to this section of the charter and invite comment. Here is the proposed new text: The WG MUST produce: 1. A high-level RDF vocabulary that expresses commonly occurring constraints. 2. The semantics of the high-level constraints expressed in terms of SPARQL. 3. An RDF extension mechanism for expressing additional constraints, expressed in SPARQL. The WG MAY produce: 1. A new compact, human readable syntax for expressing constraints with a corresponding semantics expressed in SPARQL. 2. A specification for graph normalization. [1] http://www.w3.org/2014/data-shapes/charter Regards, __________________________________________________________________ _ _ _ _ _____ Arthur Ryman, PhD Chief Data Officer, Rational Chief Architect, Portfolio & Strategy Management Distinguished Engineer | Master Inventor | Academy of Technology Toronto Lab | +1-905-413-3077 (office) | +1-416-939-5063 (mobile) -- Dimitris Kontokostas Department of Computer Science, University of Leipzig Research Group: http://aksw.org Homepage:http://aksw.org/DimitrisKontokostas
Received on Wednesday, 6 August 2014 12:15:34 UTC