- From: Craig Russell <craig@craig-russell.co.uk>
- Date: Thu, 20 Feb 2014 10:39:44 +0000
- To: public-csv-wg@w3.org
- Message-ID: <CA+2XN_3myGxzM7j_WXJQxc=JCJ_Ma5yCSKxOs-mcNq2s1PiNVA@mail.gmail.com>
Hello All, I often come across CSV files which include multiple data sets in a single document. While a spreadsheet permits multiple worksheets, CSV does not. It might be interesting for the group to consider this scenario also. For example: Google Analytics allows a CSV export of report data. Some reports contain multiple sets of data e.g. Visits over Time and Visits by source. The downloaded CSV file may include multiple data sets, which are separated by a couple of line breaks (example attached). There is, at present, no clear machine readable way of differentiating these two data sets within a single file. This requires some manual reformatting before further processing is possible. In the example provided, note also the use of commented lines (at the top of the file) and the total/average figures at the bottom of each data set. These must also be considered when reformatting the file for processing, as must the uses of the '%' and '£' symbols and the multiple date formats.. NB: I'm happy for this example data set to be posted on the public Wiki if required. Cheers -Craig -- Craig Russell e: craig@craig-russell.co.uk w: craig-russell.co.uk t: @craig552uk
Attachments
- text/csv attachment: Analytics_www.craig-russell.co.uk_All_Traffic_20140120-20140219.csv
Received on Thursday, 20 February 2014 16:34:05 UTC