- From: Ed Summers <ehs@pobox.com>
- Date: Wed, 31 Dec 2014 06:33:40 -0600
- To: public-csv-wg@w3.org
- Message-Id: <45A64CD6-5120-4EE8-BB17-6475888AA8B9@pobox.com>
I was wondering if I could get some guidance on whether CSVW supports CSV files where a column contains multiple values.
Some of you are probably familiar with the data curation tool OpenRefine [1]. While it is primarily oriented around tabular data, you can also load JSON and XML into it. Since these formats allow for one-to-many relationships, multiple values get split across multiple rows.
So for example if I load this JSON into OpenRefine:
[
{
"title": "Moby Dick",
"subject": [
"Whaling"
]
},
{
"title": "The Prince and the Pauper",
"subject": [
"Royalty",
"Poverty"
]
}
]
it exports as CSV that looks like this:
Moby Dick,Whaling
The Prince and the Pauper,Royalty
,Poverty
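To make the row layout concrete, here is a minimal Python sketch of the flattening and its inverse. It is only an illustration of the pattern in the example above, not OpenRefine's actual implementation; the function names (flatten, regroup, to_csv) and the rule "a blank first cell continues the previous record" are my own assumptions.

```python
import csv
import io
import json

# The same two records as in the JSON example above.
DATA = json.loads("""
[
  {"title": "Moby Dick", "subject": ["Whaling"]},
  {"title": "The Prince and the Pauper", "subject": ["Royalty", "Poverty"]}
]
""")


def flatten(records):
    """Emit one row per subject; rows after the first leave the title blank."""
    rows = []
    for record in records:
        # A record with no subjects still produces one row (assumption).
        subjects = record.get("subject") or [""]
        for i, subject in enumerate(subjects):
            rows.append([record["title"] if i == 0 else "", subject])
    return rows


def regroup(rows):
    """Inverse of flatten: a blank title means 'same record as the row above'."""
    records = []
    for title, subject in rows:
        if title or not records:
            records.append({"title": title, "subject": []})
        if subject:
            records[-1]["subject"].append(subject)
    return records


def to_csv(rows):
    """Serialize rows to CSV text with plain newline line endings."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    print(to_csv(flatten(DATA)), end="")
```

Note that regrouping only works if row order is preserved and a real title is never empty, which is part of why I am unsure how a row-oriented model would describe this.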
I see there has been some discussion on the Wiki of duplicating rows [2], but I wasn’t sure if this use case was addressed by the model and/or vocabulary at all.
Thanks for any guidance you can provide,
//Ed
[1] http://openrefine.org/
[2] http://www.w3.org/2013/csvw/wiki/CSV-LD#Representing_Multiple_Values
Received on Wednesday, 31 December 2014 12:34:16 UTC