Re: columns with multiple values

Ed,

IIRC, we decided not to support things like ‘empty cell means take the value from the row above’ in v1.0, but there is a notation for character-separated values within cells, so that if you have:

title,subject
Moby Dick,Whaling
The Prince and the Pauper,"Royalty,Poverty"

you can state that the comma within the subject column is used as a separator between values.
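
To make the effect concrete, here is a rough Python sketch of how a consumer might apply such a per-column separator once it has been declared. The SEPARATORS mapping and the read_rows helper are hypothetical names made up for this illustration, not part of any CSVW draft or tool; only the standard csv module is used.

```python
import csv
import io

# The example table from above; the subject cell is quoted so that the
# comma inside it is not treated as a column delimiter.
CSV_TEXT = '''title,subject
Moby Dick,Whaling
The Prince and the Pauper,"Royalty,Poverty"
'''

# Hypothetical stand-in for the metadata: column name -> in-cell separator.
SEPARATORS = {"subject": ","}


def read_rows(text, separators):
    """Parse CSV text, splitting cells of columns that declare a separator."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        row = {}
        for name, cell in record.items():
            sep = separators.get(name)
            if sep is None:
                row[name] = cell  # ordinary single-valued column
            else:
                # Multi-valued column: an empty cell yields an empty list.
                row[name] = cell.split(sep) if cell else []
        rows.append(row)
    return rows


rows = read_rows(CSV_TEXT, SEPARATORS)
for row in rows:
    print(row["title"], row["subject"])
```

Under these assumptions the subject of the second row comes out as a list of two values, while the title column is left as a plain string.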

Jeni

-----Original Message-----
From: Ed Summers <ehs@pobox.com>
Reply: Ed Summers <ehs@pobox.com>
Date: 31 December 2014 at 12:33:40
To: public-csv-wg@w3.org <public-csv-wg@w3.org>
Subject: columns with multiple values

> I was wondering if I could get some guidance on whether CSVW supports CSV files where a  
> column contains multiple values.
>  
> Some of you are probably familiar with the data curation tool OpenRefine [1]. While it  
> is primarily oriented around tabular data, you can also load JSON and XML into it. Since  
> these formats allow for one-to-many relationships, multiple values get split across  
> multiple rows.
>  
> So for example if I load this JSON into OpenRefine:
>  
> [
> {
> "title": "Moby Dick",
> "subject": [
> "Whaling"
> ]
> },
> {
> "title": "The Prince and the Pauper",
> "subject": [
> "Royalty",
> "Poverty"
> ]
> }
> ]
>  
> it exports as CSV that looks like this:
>  
> Moby Dick,Whaling
> The Prince and the Pauper,Royalty
> ,Poverty
>  
> I see there has been some discussion on the Wiki of duplicating rows [2], but I wasn’t sure  
> if this use case was addressed by the model and/or vocabulary at all.
>  
> Thanks for any guidance you can provide,
>  
> //Ed
>  
> [1] http://openrefine.org/
> [2] http://www.w3.org/2013/csvw/wiki/CSV-LD#Representing_Multiple_Values  
>  

--  
Jeni Tennison
http://www.jenitennison.com/

Received on Wednesday, 31 December 2014 18:31:54 UTC