Re: Use Case: Multiple Data Sets in Single File

This use case sounds very similar to
https://www.w3.org/2013/csvw/wiki/Use_Cases#Relational_Data_and_Row-Formats.

If I understand correctly, instead of using column identifiers to
indicate a different table format, you concatenate tables into the
same CSV file with a couple of line breaks between the tables. I guess
the semantic model is the same but with different relationships?

On 20 February 2014 11:39, Craig Russell <craig@craig-russell.co.uk> wrote:
> Hello All,
>
> I often come across CSV files which include multiple data sets in a single
> document. While a spreadsheet permits multiple worksheets, CSV does not. It
> might be interesting for the group to consider this scenario also.
>
> For example:
>
> Google Analytics allows a CSV export of report data. Some reports contain
> multiple sets of data e.g. Visits over Time and Visits by source. The
> downloaded CSV file may include multiple data sets, which are separated by a
> couple of line breaks (example attached). There is, at present, no clear
> machine readable way of differentiating these two data sets within a single
> file. This requires some manual reformatting before further processing is
> possible.
>
> In the example provided, note also the use of commented lines (at the top of
> the file) and the total/average figures at the bottom of each data set.
> These must also be considered when reformatting the file for processing, as
> must the uses of the '%' and '£' symbols and the multiple date formats.
>
> NB: I'm happy for this example data set to be posted on the public Wiki if
> required.
>
> Cheers
>
> -Craig
>
> --
> Craig Russell
> e: craig@craig-russell.co.uk
> w: craig-russell.co.uk
> t: @craig552uk



-- 
Adam Retter

eXist Developer
{ United Kingdom }
adam@exist-db.org
irc://irc.freenode.net/existdb

Received on Sunday, 23 February 2014 01:40:40 UTC