- From: Richard Cyganiak <richard@cyganiak.de>
- Date: Wed, 29 Sep 2010 14:42:09 +0100
- To: public-rdf-dawg-comments@w3.org
This is a comment on the latest SPARQL 1.1 Protocol draft [1]. It concerns an issue that was already present in the SPARQL 1.0 version of the Protocol. I write this as the maintainer of two popular open source SPARQL protocol clients: Snorql (in use at DBpedia [2], DBLP-in-RDF [3], data.semanticweb.org [4], and elsewhere) and Pubby [5] (in use at GeoLinkedData [6], for the TCM dataset [7] and elsewhere). I would like to highlight a limitation of the SPARQL protocol that causes quite a bit of pain for users of my tools and increased support costs. I seek the help of the WG in addressing these issues. Background: The SPARQL ecosystem has matured significantly in the last few years, and the work of this WG will bring SPARQL another major step forward. In the early days, a lot of SPARQL requests were sent simply by a human entering a SPARQL query into a query form, and the results were inspected manually. Software clients were built ad-hoc and for a specific endpoint. Now we increasingly have software clients that are generic and should work with any conformant SPARQL service. This generally works, except in one area: error messages. Services often have to reject SPARQL queries. That is a fact of life. It can be because of query authoring errors, because of resource limitations on the service side, because of dialects and unsupported extensions, and for many other reasons. In such cases, most services deliver more or less helpful error messages as part of the response. Thus, if a user or software developer interacts directly with the endpoint, then they typically see the error messages and can use them to resolve the issue. But users and developers increasingly interact with SPARQL services indirectly, through generic software libraries or generic SPARQL query clients that have been built without a specific vendor's service implementation in mind. These generic clients should pass error messages from the server onwards to the user. But doing that in a reliable way is not possible with the current state of the SPARQL protocol. The consequence is that my tools often can only tell the user that “The SPARQL service reported an error”. That makes it very hard for users to resolve the issue, or for me the developer to help them when I get support requests. According to the spec, there appear to be at least three different formats for indicating an error: 1. XML. I don't know anything about WSDL, but my reading of 2.1.1.3 is that I could indicate an error like so (but can I really, and should I use some XML namespace, and what would the media type be?): <malformed-query> <fault-details>Parse error at [5:17], unexpected ' '</fault- details> </malformed-query> 2. Plain text. According to the example in 2.2.1.3.9, I could just send plain text: Parse error at [5:17], unexpected ' ' 3. HTML. According to the example in 2.2.1.3.10, I could just send an HTML page containing the message: <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd "> <html> <head><title>SPARQL Processing Service: Query Request Refused</ title></head> <body> <h1>D'oh!</h1> <p>Parse error at [5:17], unexpected ' '</p> </body> </html> There is no guidance for service implementers to choose one of these options. There is no guidance for client implementers on how to express a preference. There is no guidance for client implementers on how to recognize the format of a response. In summary, the specification makes no effort towards establishing an interoperable means of delivering error messages to the client. In fact, in Section 2.2.1 “HTTP Bindings for SPARQL Query”, I find: > The fault serialization of queryHttpGet and queryHttpPost is also > intentionally under constrained. This intentional failure should be re-thought. My proposal would be: 1. To state in the HTTP binding that clients SHOULD use the XML fault message format when reporting faults. 2. To provide a human-readable account of the XML format for fault messages (It is currently only specified as snippets of XML Schema and WSDL. What's the XML namespace, if any? What's the root element? What's the media type?). This could be done in the Protocol spec, or possibly in the SPARQL Query Results XML Format spec. 2. To provide a full example of the XML fault message format among the HTTP binding examples. 3. To remove the HTML example in 2.2.1.3.10. An error message embedded in an HTML page is not machine-readable, therefore the use of HTML as a way of reporting errors in a machine-oriented protocol is likely to be bad practice in most cases and should not be encouraged in the specification. Best, Richard [1] http://www.w3.org/TR/2010/WD-sparql11-protocol-20100126/ [2] http://dbpedia.org/snorql/ [3] http://data.semanticweb.org/snorql/ [4] http://dblp.l3s.de/d2r/snorql/ [5] http://www4.wiwiss.fu-berlin.de/pubby/ [6] http://geo.linkeddata.es/ [7] http://www.open-biomed.org.uk/rdf-tcm/
Received on Wednesday, 29 September 2010 13:42:47 UTC