W3C home > Mailing lists > Public > semantic-web@w3.org > February 2014

Re: [Virtuoso 7] Upload large RDF file into quad store

From: Max Schmachtenberg <max@informatik.uni-mannheim.de>
Date: Wed, 05 Feb 2014 16:34:10 +0100
Message-ID: <52F259F2.4000308@informatik.uni-mannheim.de>
To: semantic-web@w3.org
Hi,

you can use the Bulk Loader script from Virtuoso 
(http://virtuoso.openlinksw.com/dataspace/doc/dav/wiki/Main/VirtBulkRDFLoaderScript)

You can find an example on how to use it with a DBpedia dump as example 
here: 
http://joernhees.de/blog/2010/10/31/setting-up-a-local-dbpedia-mirror-with-virtuoso/

Regards
Max

On 02/05/2014 04:19 PM, Souza, Renan F. S. wrote:
> Dear all,
>
> I am using virtuoso-open-source-7.0.0.
> What is a good way of uploading a large file (say 5GB) into Virtuoso 7 
> database? It is a Turtle file format, around 50M triples.
>
> Solutions I tried:
> 1) Upload the entire file at once
> I tried more than once and all times it got stuck while having exactly 
> 39% uploaded using Conductor quad store uploader, then nothing seems 
> to change after that.
>
> 2) Split the file into may shorter files (~150 MB each) and upload 
> each of them separately.
> 2.1. I tried to upload them programmatically using Open RDF API for 
> Java, but it takes forever to upload a single file. I could not wait, 
> so I do not know exactly how long it takes.
> 2.2. I tried to upload them manually using Conductor quad store 
> uploader. These smaller files seem get uploaded very quickly in the 
> beginning, but it somehow stops working after having some of the data 
> uploaded, then I am unable to upload more files.
>
> I do not know what else to do.
>
> Have you ever had to do something like this? What did you do?
>
> Thanks a lot in advance.
>
> -
> Thank you!
> Regards,
>
> Souza, Renan F. S.
> Bachelor of Computer Science Student
> Federal University of Rio de Janeiro, Brazil
> Missouri State University, Springfield, MO
>
> +55-21-99257-3934
> Personal email: renan-francisco@hotmail.com 
> <mailto:renan-francisco@hotmail.com>


-- 
Max Schmachtenberg
Chair of Information Systems V
Web-based Systems Group
Universitšt Mannheim
B6, 26, Room C1.07
D-68159 Mannheim
Phone: +49 621 181 3705
Mail: max@informatik.uni-mannheim.de
Web: dws.informatik.uni-mannheim.de
Received on Wednesday, 5 February 2014 22:32:23 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 1 March 2016 07:42:48 UTC