Use Case: Multiple Data Sets in Single File

Hello All,

I often come across CSV files which include multiple data sets in a single
document. While a spreadsheet permits multiple worksheets, CSV does not. It
might be interesting for the group to consider this scenario also.

For example:

Google Analytics allows a CSV export of report data. Some reports contain
multiple sets of data e.g. Visits over Time and Visits by source. The
downloaded CSV file may include multiple data sets, which are separated by
a couple of line breaks (example attached). There is, at present, no clear
machine readable way of differentiating these two data sets within a single
file. This requires some manual reformatting before further processing is
possible.

In the example provided, note also the use of commented lines (at the top
of the file) and the total/average figures at the bottom of each data set.
These must also be considered when reformatting the file for processing, as
must the uses of the '%' and '£' symbols and the multiple date formats..

NB: I'm happy for this example data set to be posted on the public Wiki if
required.

Cheers

-Craig

-- 
Craig Russell
e: craig@craig-russell.co.uk
w: craig-russell.co.uk
t: @craig552uk

Received on Thursday, 20 February 2014 16:34:05 UTC