- From: Gregg Kellogg <gregg@greggkellogg.net>
- Date: Mon, 28 Apr 2014 10:43:54 -0700
- To: Joachim Baran <joachim.baran@gmail.com>
- Cc: Arto Bendiken <arto@bendiken.net>, W3C Ruby RDF mailing list <public-rdf-ruby@w3.org>
- Message-Id: <DAAFF4CB-24DA-4C5F-A157-ACA5907C4A13@greggkellogg.net>
On Apr 28, 2014, at 10:24 AM, Joachim Baran <joachim.baran@gmail.com> wrote: > Hello, > > Thank you for your suggestions! > > I am using RDF::Reader now, since it appeared the fastest option. > > It works well, but I had to substitute \' with just ' in my input. Not sure what the RDF standard says about this -- I could not quickly find a complete list of valid escape sequences. This is governed by the N-Quads spec [1]. Basically a single-quote should not have to be escaped, but literals are all surrounded by double quotes. CR/LF and '\' also need to be escaped. Gregg > Best wishes, > > Kim [1] http://www.w3.org/TR/n-quads/#sec-literals > On 28 April 2014 09:26, Arto Bendiken <arto@bendiken.net> wrote: > Hi Joachim, > > On Mon, Apr 28, 2014 at 6:12 PM, Joachim Baran <joachim.baran@gmail.com> wrote: > > Hello, > > > > I am trying to load an 800MB N-Quads file via: > > > > graph = RDF::Graph.load('myfile.nq', :format => format) > > > > That process has not finished yet and I am wondering if there are > > performance optimization parameters that I can provide to speed up the > > loading process. > > I'm afraid the RDF::Graph implementation wasn't really designed for > inputs of that size. > > Depending on what access patterns you need on the resulting in-memory > object, it'd likely be faster to convert the N-Quads file to a more > compact format and then simply use a file-backed RDF::Reader on top of > it. > > Both RDF::Reader and RDF::Graph mix in RDF::Enumerable, which is the > basis for most higher-level retrieval operations in RDF.rb, so you > should be able to perform quad-pattern matches and the like, with the > proviso that each one does turn into a rewind + full scan of the file. > > With the operating system caching the file contents in memory for you, > you'll be limited chiefly by the parser speed (hence the RDF::Raptor > gem might be the place to start). > > Kind regards, > Arto > > -- > Arto Bendiken | @bendiken | http://ar.to >
Received on Monday, 28 April 2014 17:44:25 UTC