Re: [Turtle] Misc initial thoughts

On 2011-03-02, at 17:31, Nathan wrote:

> Richard Cyganiak wrote:
>> Andy,
>> On 28 Feb 2011, at 20:36, Andy Seaborne wrote:
>>> A data-format N-triples / N-Quads would be a subset of Turtle, with the same IRI resolution rules and same syntax for IRI tokens.  And in UTF-8.
>>> 
>>> As these formats are used as dump formats, pinning down details would be a help to data publishers and consumers.
>>> 
>>> A MIME type which is not text/plain would be helpful.
>> I think having a proper spec for this “N-Triples done right” is a great plan, including support for quads, IRI resolution, UTF-8, and proper media type.
>> However I wouldn't necessarily see it as a subset of Turtle. I'd prefer for Turtle to remain as it is defined now, as a triples-only format without multigraphs/quads.
> 
> quads != triples surely, perhaps there needs to be two then, N-Triples and N-Quads.

Agreed.

I don't like the idea of having formats in the wild that might reasonably be expected to contain either triples or quads, with no easy way to tell which before you start parsing. Additionally, ease of parsing is the main selling point of N-T/Q, and not knowing whether there will be 3 or 4 columns makes it trickier.
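
To illustrate the column-count problem, here is a rough Python sketch of what a consumer of a combined format would have to do for every line. It is only an illustration under assumptions: the term regex is a simplification rather than the real grammar, and the function name count_terms is made up for this example.

```python
import re

# Simplified term pattern (an assumption, not the full N-Triples grammar):
# an IRI, a blank node, or a literal with optional language tag or datatype.
TERM = re.compile(
    r'<[^>]*>'
    r'|_:\S+'
    r'|"(?:[^"\\]|\\.)*"(?:@[A-Za-z0-9-]+|\^\^<[^>]*>)?'
)


def count_terms(line):
    """Return how many terms precede the final '.', or None for blank/comment lines."""
    line = line.strip()
    if not line or line.startswith('#'):
        return None
    if not line.endswith('.'):
        raise ValueError("statement does not end with '.'")
    body = line[:-1]
    pos = 0
    count = 0
    while True:
        # Skip the whitespace between terms.
        while pos < len(body) and body[pos] in ' \t':
            pos += 1
        if pos == len(body):
            break
        match = TERM.match(body, pos)
        if match is None:
            raise ValueError("unrecognised term at offset %d" % pos)
        count += 1
        pos = match.end()
    return count


# 3 terms means a triple, 4 means a quad; the reader only finds out per line.
print(count_terms('<http://example.org/s> <http://example.org/p> "hi there"@en .'))
print(count_terms('<http://example.org/s> <http://example.org/p> <http://example.org/o> <http://example.org/g> .'))
```

With two separate formats and media types, a parser could fix the expected count up front and reject anything else, instead of branching on every line.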

FWIW, N-Quads seems quite popular for data.gov.uk data, and N-Triples/N-Quads are good lowest-common-denominator formats for bulk dumps and the like.

I would like to see an update to make UTF-8 legal though, and some clarity on BASE URI resolution.

- Steve

-- 
Steve Harris, CTO, Garlik Limited
1-3 Halford Road, Richmond, TW10 6AW, UK
+44 20 8439 8203  http://www.garlik.com/
Registered in England and Wales 535 7233 VAT # 849 0517 11
Registered office: Thames House, Portsmouth Road, Esher, Surrey, KT10 9AD

Received on Thursday, 3 March 2011 07:21:47 UTC