
Re: CSV+ file lines with differing number of columns

From: Tim Finin <finin@cs.umbc.edu>
Date: Thu, 20 Feb 2014 08:33:56 -0500
Message-ID: <53060444.9070207@cs.umbc.edu>
To: public-csv-wg@w3.org

On 2/19/14, 12:21 PM, Tandy, Jeremy wrote:
 > #1: this is a good example to include as a use case. I think there's
 > enough text here already ... it would be great if you could move
 > this across to the wiki...

I'll do this.

 > #2: your example ...
 >     :e4 type         PER
 >     :e4 mention      "Bart" D00124 283-286
 >     :e4 per:siblings :e7    D00124 283-286 173-179 274-281
 >     :e4 per:age      "10"   D00124 180-181 173-179 182-191 0.9
 > ... seems to be quite regular; from your description, the column
 > headings might be:
 >
 > subject,predicate,object,document-id,string-offset-1,string-offset-2,string-offset-3,confidence
 >
 > thus, in comma delimited form, your variable-length rows become:
 > :e4,type,PER,,,,,
 > :e4,mention,"Bart",D00124,283-286,,,
 > :e4,per:siblings,:e7,D00124,283-286,173-179,274-281,
 > :e4,per:age,"10",D00124,180-181,173-179,182-191,0.9

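In general, the optional certainty measure (your "confidence" column)
can come at the end of any of the rows, not only the longest one.
There was also an option for relations that allowed people to include
two provenance items, each of the form (doc,offset,offset,offset).
Using a 'universal relation' format with a fixed number of fields
would then require twelve columns: three for the triple, four for each
of the two provenance items, and one for the certainty measure.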
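
To make the mapping concrete, here is a rough Python sketch of how the
variable-length rows could be padded out to the eight-column form
suggested above. It is only an illustration under some assumptions
that are mine, not part of the format: fields are whitespace-separated,
a trailing field counts as the certainty measure only if it looks like
a decimal number (e.g. 0.9), and each row carries at most one
provenance item. The names TOKEN, CONFIDENCE and to_fixed_width are
made up for this example.

```python
import re

# Hypothetical tokenizer: a double-quoted string or a run of non-space characters.
TOKEN = re.compile(r'"[^"]*"|\S+')
# Assumption: the certainty measure is written with a decimal point, e.g. 0.9.
CONFIDENCE = re.compile(r"^\d*\.\d+$")

PROVENANCE_SLOTS = 4  # document-id plus up to three string offsets


def to_fixed_width(line):
    """Pad one variable-length row to eight fields (single provenance item)."""
    tokens = TOKEN.findall(line)
    if len(tokens) < 3:
        raise ValueError("need at least subject, predicate and object")
    triple, rest = tokens[:3], tokens[3:]

    # The certainty measure may end any row, so peel it off before padding.
    confidence = ""
    if rest and CONFIDENCE.match(rest[-1]):
        confidence = rest.pop()

    if len(rest) > PROVENANCE_SLOTS:
        raise ValueError("more than one provenance item; needs the wider layout")

    provenance = rest + [""] * (PROVENANCE_SLOTS - len(rest))
    return triple + provenance + [confidence]


rows = [
    ':e4 type         PER',
    ':e4 mention      "Bart" D00124 283-286',
    ':e4 per:age      "10"   D00124 180-181 173-179 182-191 0.9',
    ':e4 type         PER 0.9',
]
for row in rows:
    print(",".join(to_fixed_width(row)))
```

The last row shows why simple right-padding is not enough: the
certainty measure has to be moved to the final column rather than left
in the fourth. Supporting two provenance items would mean widening the
provenance part to eight slots, which gives the twelve columns
mentioned above.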
Received on Thursday, 20 February 2014 13:34:28 UTC
