Update on Intro to the Use Case from Juan Sequeda on 2010-04-13 (public-rdb2rdf-wg@w3.org from April 2010)

From: Juan Sequeda <juanfederico@gmail.com>
Date: Tue, 13 Apr 2010 11:14:51 -0500
To: public-rdb2rdf-wg@w3.org
Message-ID: <w2sf914914c1004130914g9163ccbw9dc09a133eb8ad01@mail.gmail.com>

Note: this should be on the wiki, but Dan doesn't have write access to the
wiki, yet

Introduction

In addition to the chartered requirements set out for the R2RML
language [*RDB2RDF,
Working Group Charter, Requirements, **
http://www.w3.org/2009/08/rdb2rdf-charter.html#scope*<http://www.w3.org/2009/08/rdb2rdf-charter.html#scope>
*], *have identified a number of real world scenarios and specific
application requirements. We provide a taxonomy of the application specific
requirements and a set of use cases that provide coverage over the feature
set. Use cases span those derived from real-world scenarios, hypothetical
scenarios, and small examples intended to illustrate specific issues.

Application Specific Requirements

Integrating relational database content with the Semantic Web is not a
stand-alone process, but a process of integrating that content with other
content from the Semantic Web. For simplicity we abstract and enumerate
such integration pair wise, where the first web site is a web site
containing relational data to be mapped and the second web site may be any

(please vote)

1. Semantic Web site
2. Source of RDF or OWL

Actual applications may have traditional names. For example, ETL-based
datawarehouse data federation and web-site mashup. In the abstract we
distinguish three refinements of mapping relationally stored data to RDF.

1. Structured

Consider only highly structured database content, and treat string/text
fields as atomic data types of secondary interest.

1. Structured + Semistructured

Text fields in the database are elemental.

1. Structured + Microparsed Tagged Text

Text fields in the database are parsed and tagged per an existing ontology.

Each of these may be combined with a Semantic Web site of one of the above
forms, arbitrary RDF/OWL, that combination may comprised data integration or
mash-up, where mash-up means there are no joins. (predicates on data from 2
or more sources).

Different applications have different roles for an ontology. These range
from none. That is table and column names and other parts of SQL-DDL are
exploited as parts of RDF triples and combined with the underliying data.
In such cases an RDFS or OWL file may be synthesized from the SQL DDL. The
resulting file is called the putative ontology.

An application may require the relational content be mapped to an existing
domain ontology. Thus, an existing ontology serves as the basis of
federation. Some applications, in the fashion of datawarehouses, require
that an ontology be created specific to the goal of federating several
existing databases, in the classic mediator-based architecture [cite?]

1. Expressivity
1. Node Label Generation

Graph node names are synthesized from a function of database attributes

1. Datatype expression
1. Simple
2. Relational data (cells) are mapped to rdf datatypes per SQL XSD
mapping.
3. Micorparsing

Relational data are parsed and mapped to rdf graphs.

Juan Sequeda
+1-575-SEQ-UEDA
www.juansequeda.com

Received on Tuesday, 13 April 2010 16:15:24 UTC