- From: Jeni Tennison <jeni@jenitennison.com>
- Date: Wed, 31 Dec 2014 18:31:27 +0000
- To: public-csv-wg@w3.org, Ed Summers <ehs@pobox.com>
Ed,
IIRC, we decided not to support things like ‘empty cell means take the value from the row above’ in v1.0, but there is a notation for character-separated values within cells, so that if you have:
title,subject
Moby Dick,Whaling
The Prince and the Pauper,”Royalty,Poverty”
you can state that the comma within the subject column is used as a separator between values.
Jeni
-----Original Message-----
From: Ed Summers <ehs@pobox.com>
Reply: Ed Summers <ehs@pobox.com>>
Date: 31 December 2014 at 12:33:40
To: public-csv-wg@w3.org <public-csv-wg@w3.org>>
Subject: columns with multiple values
> I was wondering if I could get some guidance on whether CSVW supports CSV files where a
> column contains multiple values.
>
> Some of you are probably familiar with the data curation tool OpenRefine [1]. While it
> is primarily oriented around tabular data, you can also load JSON and XML into it. Since
> these formats allow for one-to-many relationships, multiple values get split across
> multiple rows.
>
> So for example if I load this JSON into OpenRefine:
>
> [
> {
> "title": "Moby Dick”,
> "subject": [
> "Whaling"
> ]
> },
> {
> "title": "The Prince and the Pauper”,
> "subject": [
> "Royalty",
> "Poverty"
> ]
> }
> ]
>
> it exports as CSV that looks like this:
>
> Moby Dick,Whaling
> The Prince and the Pauper,Royalty
> ,Poverty
>
> I see there has been some discussion on the Wiki of duplicating rows [2], but I wasn’t sure
> if this use case was addressed by the model and/or vocabulary at all.
>
> Thanks for any guidance you can provide,
>
> //Ed
>
> [1] http://openrefine.org/
> [2] http://www.w3.org/2013/csvw/wiki/CSV-LD#Representing_Multiple_Values
>
--
Jeni Tennison
http://www.jenitennison.com/
Received on Wednesday, 31 December 2014 18:31:54 UTC