W3C home > Mailing lists > Public > semantic-web@w3.org > December 2010

Re: Best practice for exposing proprietary databases or services as SPARQL endpoints

From: Martin Hepp <martin.hepp@ebusiness-unibw.org>
Date: Wed, 15 Dec 2010 02:21:01 +0100
Cc: semantic-web@w3.org
Message-Id: <42D0945A-0F50-4AB3-969A-223AB804FB15@ebusiness-unibw.org>
To: public-lod@w3.org
Dear all:
Are there really no experiences beyond academic research regarding  
this task? I had assumed it was a pretty standard requirement...

Best

Martin

On 11.12.2010, at 09:33, Martin Hepp wrote:

> Dear all:
>
> There are many different ways of exposing existing relational  
> databases as SPARQL, e.g. as summarized by [1], namely Virtuoso's  
> RDF Views, D2RQ, and Triplify.
>
> I am looking for best practices / recommendations for the following  
> scenario:
>
> 1. There is a large and highly dynamic product or services database;  
> part of the data (e.g. prices) may even come from external Web  
> services (think of airfare, hotel prices).
> 2. I want to make this accessible as a SPARQL endpoint using  
> GoodRelations and FOAF.
> 3. The mapping from the original data structures to the proper RDF  
> must be hand-crafted anyway, so automation of this process is not  
> important
> 4. Creating RDF dumps is not feasible due to
>
> - the dynamics of the data
> - the combinatorial complexity (not all combinations may be  
> materialized in the database; think of product variants).
>
> Key requirements for me are:
>
> 1. Maturity of the software (alpha / beta releases are no option)
> 2. Scalability - the SPARQL endpoint must handle tens of thousands  
> of request per hour
> 3. Resource management for the endpoint - it must be possible to  
> protect the SPARQL endpoint from costly queries and return just a  
> subset or refuse a query
> 4. Resource management for the underlying RDBMS or Web services - it  
> must be possible to protect the original RDBMS and involved Web  
> services from excessive traffic (both willful ("Semantic DDoS") and  
> unintentional (PhD students' Python scrips gone wild).
>
> What would you recommend? My main point is really: Which tools /  
> architecture would you recommend if failure is not an option?
>
> Thanks for any opinions!
>
>
> Best
>
> Martin
>
> [1] A Survey of Current Approaches for  Mapping of Relational  
> Databases to RDF (PDF), Satya S. Sahoo, Wolfgang Halb, Sebastian  
> Hellmann, Kingsley Idehen, Ted Thibodeau Jr, Sören Auer, Juan  
> Sequeda, Ahmed Ezzat, 2009-01-31.
> http://www.w3.org/2005/Incubator/rdb2rdf/RDB2RDF_SurveyReport.pdf
>
> --------------------------------------------------------
> martin hepp
> e-business & web science research group
> universitaet der bundeswehr muenchen
>
> e-mail:  hepp@ebusiness-unibw.org
> phone:   +49-(0)89-6004-4217
> fax:     +49-(0)89-6004-4620
> www:     http://www.unibw.de/ebusiness/ (group)
>         http://www.heppnetz.de/ (personal)
> skype:   mfhepp
> twitter: mfhepp
>
> Check out GoodRelations for E-Commerce on the Web of Linked Data!
> =================================================================
> * Project Main Page: http://purl.org/goodrelations/
> * Quickstart Guide for Developers: http://bit.ly/quickstart4gr
> * Vocabulary Reference: http://purl.org/goodrelations/v1
> * Developer's Wiki: http://www.ebusiness-unibw.org/wiki/GoodRelations
> * Examples: http://bit.ly/cookbook4gr
> * Presentations: http://bit.ly/grtalks
> * Videos: http://bit.ly/grvideos
>
>
Received on Wednesday, 15 December 2010 01:23:38 GMT

This archive was generated by hypermail 2.3.1 : Tuesday, 26 March 2013 21:45:40 GMT