Best practice for exposing proprietary databases or services as SPARQL endpoints

From: Martin Hepp <mfhepp@gmail.com>
Date: Sat, 11 Dec 2010 09:33:15 +0100
Message-Id: <C16C6576-4B4A-4153-A9E2-B73B4EC97CDC@gmail.com>
To: public-lod@w3.org
Dear all:

There are many different ways of exposing existing relational
databases as SPARQL endpoints. Those summarized in [1] include
Virtuoso's RDF Views, D2RQ, and Triplify.

I am looking for best practices / recommendations for the following  
scenario:

1. There is a large and highly dynamic product or services database;  
part of the data (e.g. prices) may even come from external Web  
services (think of airfare, hotel prices).
2. I want to make this accessible as a SPARQL endpoint using  
GoodRelations and FOAF.
3. The mapping from the original data structures to proper RDF
must be hand-crafted anyway, so automating this step is not
important.
4. Creating RDF dumps is not feasible due to

- the dynamics of the data
- the combinatorial complexity (not all combinations may be  
materialized in the database; think of product variants).
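To make the scenario concrete, here is a minimal sketch of what a hand-crafted, on-demand mapping could look like: one database row plus a live price lookup is turned into GoodRelations triples only when it is requested, so nothing has to be dumped in advance. The row layout (`sku`, `name`), the `fetch_price` callback, and the base URI `http://example.org/offer/` are assumptions made for illustration, not part of any existing system.

```python
from urllib.parse import quote

GR = "http://purl.org/goodrelations/v1#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
XSD_FLOAT = "http://www.w3.org/2001/XMLSchema#float"
BASE = "http://example.org/offer/"  # hypothetical base URI


def literal(value, datatype=None):
    """Serialize a value as an N-Triples literal, escaping special characters."""
    escaped = (
        str(value)
        .replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )
    if datatype:
        return f'"{escaped}"^^<{datatype}>'
    return f'"{escaped}"'


def offer_triples(row, fetch_price):
    """Yield N-Triples lines for one offer.

    `row` is a dict with the assumed keys 'sku' and 'name'.
    `fetch_price` is an assumed callback returning (amount, currency);
    it stands in for the external Web service that supplies the price.
    """
    sku = quote(str(row["sku"]), safe="")
    offer = f"<{BASE}{sku}>"
    price = f"<{BASE}{sku}#price>"
    amount, currency = fetch_price(row["sku"])
    yield f"{offer} <{RDF_TYPE}> <{GR}Offering> ."
    yield f"{offer} <{GR}name> {literal(row['name'])} ."
    yield f"{offer} <{GR}hasPriceSpecification> {price} ."
    yield f"{price} <{RDF_TYPE}> <{GR}UnitPriceSpecification> ."
    yield f"{price} <{GR}hasCurrency> {literal(currency)} ."
    yield f"{price} <{GR}hasCurrencyValue> {literal(amount, XSD_FLOAT)} ."


if __name__ == "__main__":
    demo_row = {"sku": "A1", "name": "Economy fare MUC-LHR"}
    for line in offer_triples(demo_row, lambda sku: (99.5, "EUR")):
        print(line)
```

Because the triples are generated per request, variants that are never materialized in the database cost nothing until somebody asks for them.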

Key requirements for me are:

1. Maturity of the software (alpha / beta releases are no option)
2. Scalability - the SPARQL endpoint must handle tens of thousands of
requests per hour
3. Resource management for the endpoint - it must be possible to  
protect the SPARQL endpoint from costly queries and return just a  
subset or refuse a query
4. Resource management for the underlying RDBMS or Web services - it
must be possible to protect the original RDBMS and involved Web
services from excessive traffic, both willful ("Semantic DDoS") and
unintentional (PhD students' Python scripts gone wild).
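Requirements 3 and 4 could be sketched roughly as follows: a per-client token bucket refuses requests once a client exceeds its budget, and the result iterator is truncated so that a query returns at most a fixed subset. This is only an illustration under stated assumptions. `GuardedEndpoint`, `QueryRefused`, the `run_query` callback, and all numeric limits are hypothetical names and values, and the sketch does not estimate query cost before execution.

```python
import itertools
import time


class TokenBucket:
    """Classic token bucket: refills at `rate_per_sec`, holds at most `capacity`."""

    def __init__(self, rate_per_sec, capacity, clock=time.monotonic):
        self.rate = rate_per_sec
        self.capacity = capacity
        self.tokens = float(capacity)
        self.clock = clock
        self.last = clock()

    def allow(self, cost=1.0):
        now = self.clock()
        # Refill in proportion to elapsed time, never above capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


class QueryRefused(Exception):
    """Raised when a client has exhausted its request budget."""


class GuardedEndpoint:
    """Wraps an assumed `run_query(query) -> iterable of rows` callback."""

    def __init__(self, run_query, rate_per_sec=5.0, burst=10,
                 max_rows=1000, clock=time.monotonic):
        self.run_query = run_query
        self.rate = rate_per_sec
        self.burst = burst
        self.max_rows = max_rows
        self.clock = clock
        self.buckets = {}  # client id -> TokenBucket (unbounded in this sketch)

    def execute(self, client_id, query):
        bucket = self.buckets.get(client_id)
        if bucket is None:
            bucket = TokenBucket(self.rate, self.burst, self.clock)
            self.buckets[client_id] = bucket
        if not bucket.allow():
            raise QueryRefused(f"rate limit exceeded for {client_id}")
        # Return only a subset: stop consuming the backend after max_rows.
        return list(itertools.islice(self.run_query(query), self.max_rows))
```

The same bucket class could sit in front of the RDBMS or the external Web services, so that the backend sees a bounded request rate no matter what arrives at the endpoint.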

What would you recommend? My main point is really: Which tools /  
architecture would you recommend if failure is not an option?

Thanks for any opinions!


Best

Martin

[1] A Survey of Current Approaches for Mapping of Relational
Databases to RDF (PDF), Satya S. Sahoo, Wolfgang Halb, Sebastian
Hellmann, Kingsley Idehen, Ted Thibodeau Jr, Sören Auer, Juan Sequeda,
Ahmed Ezzat, 2009-01-31.
http://www.w3.org/2005/Incubator/rdb2rdf/RDB2RDF_SurveyReport.pdf

--------------------------------------------------------
martin hepp
e-business & web science research group
universitaet der bundeswehr muenchen

e-mail:  hepp@ebusiness-unibw.org
phone:   +49-(0)89-6004-4217
fax:     +49-(0)89-6004-4620
www:     http://www.unibw.de/ebusiness/ (group)
          http://www.heppnetz.de/ (personal)
skype:   mfhepp
twitter: mfhepp

Check out GoodRelations for E-Commerce on the Web of Linked Data!
=================================================================
* Project Main Page: http://purl.org/goodrelations/
* Quickstart Guide for Developers: http://bit.ly/quickstart4gr
* Vocabulary Reference: http://purl.org/goodrelations/v1
* Developer's Wiki: http://www.ebusiness-unibw.org/wiki/GoodRelations
* Examples: http://bit.ly/cookbook4gr
* Presentations: http://bit.ly/grtalks
* Videos: http://bit.ly/grvideos
Received on Saturday, 11 December 2010 08:33:46 UTC
