W3C home > Mailing lists > Public > www-rdf-interest@w3.org > April 2003

[ANN] Java RDF crawler

From: Matt Biddulph <matt@picdiary.com>
Date: Mon, 21 Apr 2003 14:57:33 +0100
To: rdfweb-dev@vapours.rdfweb.org, www-rdf-interest@w3.org
Message-ID: <20030421135733.GA19024@picdiary.com>

Just released: version 0.1 of an RDF crawler (aka scutter) using Java
and Jena that spiders the web (following rdfs:seeAlso) gathering up RDF
data and storing it in any of Jena's backend stores (in-memory, Berkeley
DB, mysql, etc). It does multithreaded downloading, and retains
provenance information which it uses to maintain consistency over
multiple runs.

The code's only had me as a user so far; comments and corrections highly
appreciated.

More information at http://www.hackdiary.com/archives/000030.html

Cheers,
Matt.
Received on Monday, 21 April 2003 09:57:36 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 22:44:41 UTC