[ANN] Java RDF crawler

Just released: version 0.1 of an RDF crawler (aka scutter) using Java
and Jena that spiders the web (following rdfs:seeAlso) gathering up RDF
data and storing it in any of Jena's backend stores (in-memory, Berkeley
DB, mysql, etc). It does multithreaded downloading, and retains
provenance information which it uses to maintain consistency over
multiple runs.

The code's only had me as a user so far; comments and corrections highly
appreciated.

More information at http://www.hackdiary.com/archives/000030.html

Cheers,
Matt.

Received on Monday, 21 April 2003 09:57:36 UTC