W3C home > Mailing lists > Public > semantic-web@w3.org > July 2012

Re: Direct mapping for spreadsheets

From: Gary Trakhman <gary.trakhman@gmail.com>
Date: Fri, 27 Jul 2012 16:49:35 -0400
Message-ID: <CAJvqBXjvbccA6rDVQn6Xw03hmmPHAjd8Tn0opFr76kBcPv8mgw@mail.gmail.com>
To: David Booth <david@dbooth.org>
Cc: Michael Hausenblas <michael.hausenblas@deri.org>, Martynas Jusevičius <martynas@graphity.org>, Semantic Web <semantic-web@w3.org>
Revelytix has a product (Spyder) that among other things can perform
mappings via R2RML on CSV and excel, exposing your data as a sparql
endpoint.
http://www.revelytix.com/content/spyder

On Fri, Jul 27, 2012 at 4:06 PM, David Booth <david@dbooth.org> wrote:
> Cambridge Semantics has done fantastic work on integrating spreadsheets
> with RDF:
> http://www.cambridgesemantics.com/products/anzo-express
> They've been doing it for years.
>
> David
>
> On Fri, 2012-07-27 at 16:36 +0100, Michael Hausenblas wrote:
>> > I was looking into RDFizing of spreadsheets (GRDDL on Microsoft's
>> > SpreadsheetML, more specifically).
>>
>> IIRC, Dave Reynolds has something called D2RML (see [1], slide 26 etc)  ....
>>
>> Cheers,
>>          Michael
>>
>> [1] http://www.slideshare.net/der42/industrialized-linked-data
>>
>> --
>> Dr. Michael Hausenblas, Research Fellow
>> DERI - Digital Enterprise Research Institute
>> NUIG - National University of Ireland, Galway
>> Ireland, Europe
>> Tel.: +353 91 495730
>> WebID: http://sw-app.org/mic.xhtml#i
>>
>> On 27 Jul 2012, at 16:25, Martynas Jusevičius wrote:
>>
>> > Hey all,
>> >
>> > I was looking into RDFizing of spreadsheets (GRDDL on Microsoft's
>> > SpreadsheetML, more specifically).
>> >
>> > I know there are multiple tools, products, and mappings (such as
>> > XLWrap, TopBraid Composer, Google Refine etc).
>> > However I need a generic mapping, and none of them seem to do the job.
>> > I just need to lift the spreadsheet data to the RDF level, and from
>> > there I will be able to map it to a higher-level vocabulary using
>> > SPARQL CONSTRUCT queries (with or without user assistance).
>> >
>> > The closest thing to what I'm thinking about is the RDB to RDF direct
>> > mapping [1].
>> > Obviously spreadsheets do not have primary keys, column names and
>> > datatypes. They seem however to be a more general case than the
>> > relational, and they still have table (worksheet) names as well
>> > row/column indices.
>> >
>> > Trying to follow the R2R mapping, I came up with this basic example:
>> >
>> >  @base <http://foo.example/spreadsheet.xlsx> .
>> >
>> >  <#sheet1/1> <#sheet1/A> "content of A1" .
>> >  <#sheet1/1> <#sheet1/B> "content of B1" .
>> >  <#sheet1/2> <#sheet1/A> "content of A2" .
>> >  <#sheet1/2> <#sheet1/B> "content of B2" .
>> >
>> > This has an issue with addressing resources within packages, which has
>> > been widely discussed [2] but not solved, AFAIK.
>> >
>> > Has something like this been already attempted? I don't want to
>> > reinvent the wheel.
>> >
>> > [1] http://www.w3.org/TR/rdb-direct-mapping/
>> > [2] http://lists.w3.org/Archives/Public/www-tag/2008Oct/0126.html
>> >
>> > Martynas
>> > graphity.org
>> >
>>
>>
>>
>>
>
> --
> David Booth, Ph.D.
> http://dbooth.org/
>
> Opinions expressed herein are those of the author and do not necessarily
> reflect those of his employer.
>
>
Received on Saturday, 28 July 2012 18:21:40 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 1 March 2016 07:42:35 UTC