
Re: CSV2RDF and R2RML

From: Dan Brickley <danbri@google.com>
Date: Wed, 19 Feb 2014 17:02:36 +0000
Message-ID: <CAK-qy=7_wpD_iuy_BrAUjP_Z5G7BfMF4jZX5aJ0+=X9VBg6BmQ@mail.gmail.com>
To: Gregg Kellogg <gregg@greggkellogg.net>
Cc: Alfredo Serafini <seralf@gmail.com>, James McKinney <james@opennorth.ca>, Ivan Herman <ivan@w3.org>, Andy Seaborne <andy@apache.org>, "public-csv-wg@w3.org" <public-csv-wg@w3.org>
On 19 February 2014 16:50, Gregg Kellogg <gregg@greggkellogg.net> wrote:
> On Feb 19, 2014, at 8:23 AM, Alfredo Serafini <seralf@gmail.com> wrote:
>
> Hi
> this is a really interesting topic! really good ideas :-)
>
> I suggest leaving the multiple-tables problem outside the general
> discussion and thinking only about multiple sheets, since with multiple
> tables in the same sheet it's really difficult to imagine how they can be
> mapped automatically. It seems to me a task involving some NLP as well as
> parsing from unstructured (or badly structured, or badly formatted, etc.)
> word files.
>
>
> I could see how, using my CSV-LD proposal, we could identify the break in
> tables and associate a new context with the next set. The idea would be to
> use an empty line (no columns, just a line separator) to essentially
> terminate processing of the previous table and start processing anew as if
> this were the first line of a new CSV.

Do we have a CSV with multiple tables in the github repo or wiki
yet? I don't see anything in https://github.com/w3c/csvw but we haven't
settled on a file tree layout for such things yet.

Can someone propose a concrete example CSV file associated with one or
more of our use cases in https://www.w3.org/2013/csvw/wiki/Use_Cases ?
(or propose a new use case)

I'd love to see whether CSV-LD addresses the use case. But let's also
get the use case on record. Jeremy mentioned in today's call that
people are discussing requirements for which we haven't documented
use cases yet. If we don't write these down, people won't know what
problems we think we've solved.

For example, in our Use_Cases list Jeni offered a "Publication of
Statistics" use case in which
http://www.ons.gov.uk/ons/rel/lms/labour-market-statistics/january-2014/table-a01.xls
is one of several Excel-formatted examples. Opening that in
LibreOffice I see multiple tabs/sheets (with confusing names). For now
let's assume these get unpacked in one of three ways: (a) one CSV file
per sheet, (b) one very wide CSV with blank columns separating the
tables, or (c) one very long CSV with blank lines separating the
tables. Gregg, is that the kind of example you think CSV-LD would
address? Care to have a go at working through the details, starting at
the use case end?
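
To make option (c) and Gregg's blank-line idea concrete, here is a
minimal sketch in Python. It is not taken from the CSV-LD proposal; the
function name split_tables and the sample data are made up for
illustration. It assumes a table boundary is a completely empty line
(no columns, just a line separator), so a line of bare commas still
counts as a data row.

```python
import csv
import io


def split_tables(text):
    """Split CSV text into tables wherever a completely empty line occurs.

    Hypothetical helper for illustration only. Each table is returned as a
    list of rows, and each row as a list of strings.
    """
    tables = []
    current = []
    for row in csv.reader(io.StringIO(text)):
        if not row:
            # csv.reader yields an empty list for an empty line:
            # close the current table and start a new one.
            if current:
                tables.append(current)
                current = []
        else:
            current.append(row)
    if current:
        tables.append(current)
    return tables


# Made-up sample: two tables in one "long" CSV, separated by a blank line.
SAMPLE = "name,age\nalice,30\n\ncity,country\nParis,France\n"

for number, table in enumerate(split_tables(SAMPLE), start=1):
    header, *rows = table  # treat the first row of each table as its header
    print(number, header, rows)
```

Each chunk could then be processed as if it were a standalone CSV with
its own header row and its own context. Whether real-world files use
truly empty lines or lines of empty cells is exactly the kind of thing
a concrete use case would settle.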

cheers,

Dan
Received on Wednesday, 19 February 2014 17:03:05 UTC
