
Record separators for JSON & Turtle?

From: Paul Houle <ontology2@gmail.com>
Date: Fri, 27 Feb 2015 09:39:59 -0500
Message-ID: <CAE__kdQ3UQfgeqVYoxYt9NH+G-A4KWLgu4MuhwXL+HdjbFBM4g@mail.gmail.com>
To: "semantic-web@w3.org" <semantic-web@w3.org>, Linked Data community <public-lod@w3.org>
I noticed this on HN this morning:

https://www.tbray.org/ongoing/When/201x/2015/02/26/JSON-Text-Sequences

and was thinking this could be an answer to the scalability problems we're
encountering with large (billion-fact) Turtle files, since (1) current
Turtle parsers can't restart after a failure, and (2) it might not even be
possible to restart after a failure in a 100% correct way if multi-line
quoted strings are allowed.

It seems like embedding restart markers after the period that ends each
statement, using a character that isn't allowed inside quotes, would be an
effective answer for this.
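
To make the idea concrete, here is a rough sketch in Python. It borrows the
ASCII record separator (0x1E) that JSON text sequences use and assumes a
convention that does not exist in Turtle today: the writer puts the separator
after each statement and escapes any raw 0x1E inside a literal as \u001E, so
the marker can never appear inside quotes. The function names are made up for
illustration.

```python
RS = "\x1e"  # ASCII record separator, the marker used by JSON text sequences


def write_records(statements):
    """Serialize statements with a restart marker after each one.

    Assumed convention (not part of Turtle): a raw RS inside a literal
    is rewritten as the escape \\u001E, so RS only ever marks a boundary.
    """
    out = []
    for stmt in statements:
        out.append(stmt.replace(RS, "\\u001E") + RS + "\n")
    return "".join(out)


def resume_after(text, offset):
    """Return the index just past the first marker at or after offset."""
    i = text.find(RS, offset)
    return len(text) if i == -1 else i + 1


def records_from(text, offset=0):
    """Yield complete statements, starting at the first boundary after offset."""
    start = 0 if offset == 0 else resume_after(text, offset)
    for chunk in text[start:].split(RS):
        chunk = chunk.strip()
        if chunk:
            yield chunk


if __name__ == "__main__":
    data = write_records([
        '<a> <b> """multi\nline . text""" .',
        "<c> <d> <e> .",
    ])
    # Pretend the parser died at offset 5, inside the first statement.
    for stmt in records_from(data, offset=5):
        print(stmt)
```

A reader that lands at an arbitrary offset only has to scan forward to the
next marker, with no need to work out whether it is inside a multi-line
string. A real restart would still need the @prefix and @base declarations
that were in effect at that point, which this sketch ignores.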



-- 
Paul Houle
Expert on Freebase, DBpedia, Hadoop and RDF
(607) 539 6254    paul.houle on Skype   ontology2@gmail.com
http://legalentityidentifier.info/lei/lookup
Received on Friday, 27 February 2015 14:48:45 UTC
