Re: R2RML Views for unique blank nodes from ashok malhotra on 2012-05-08 (public-rdb2rdf-wg@w3.org from May 2012)

From: ashok malhotra <ashok.malhotra@oracle.com>
Date: Tue, 08 May 2012 12:02:39 -0700
To: public-rdb2rdf-wg@w3.org
Message-ID: <4FA96DCF.7040700@oracle.com>

Hi David:
Here is the issue as I understand it.
If you start with the Relational table
[Alice, 10]
[Alice, 10]

And use naive R2RML on it, you would get
   _:1<IOU#BORROWER>  "Alice".
   _:1<IOU#AMOUNT>  10.
   _:2<IOU#BORROWER>  "Alice".
   _:2<IOU#AMOUNT>  10.

Since Blank node identifiers are generated from column values, there is no way to generate
different identifiers for _:1 and _:2.  Generating the same identifier for them would lose cardinality
which can be a problem in some cases.

I don't see how to use views to solve this problem.  The solutions I see are
1. If you are using Oracle or Postgres use RowID or RowNum to generate blank node identifiers.
2. Use a join to add a column to the table that contains integers in ascending order.
  (Were you thinking of something like this for your view solution?)
3. Use some other mechanism to generate bnode identifiers, perhaps a function.
4. Add a facility to R2RML that would add a column like RowID

It is this last solution that folks are considering.
All the best, Ashok

On 5/8/2012 11:14 AM, David McNeil wrote:
> One of the key issues from today's working group discussion was whether the Direct Mapping of a table with duplicate rows could be represented in R2RML. The observation was made that it cannot be represented in R2RML. Is this still the case if R2RML Views are considered? It seems to me that in many cases a Direct Mapping of a table with duplicate rows could be represented in R2RML with an R2RML View. I think that is a reasonable way to handle the issue in the case of a custom mapping, perhaps it could be used for the Direct Mapping as well? I suppose this might not be acceptable to some if it relied on vendor specific SQL.
>
> -David

Received on Tuesday, 8 May 2012 19:01:46 UTC