- From: Matt Biddulph <matt@picdiary.com>
- Date: Mon, 21 Apr 2003 14:57:33 +0100
- To: rdfweb-dev@vapours.rdfweb.org, www-rdf-interest@w3.org
Just released: version 0.1 of an RDF crawler (aka scutter) using Java and Jena that spiders the web (following rdfs:seeAlso) gathering up RDF data and storing it in any of Jena's backend stores (in-memory, Berkeley DB, mysql, etc). It does multithreaded downloading, and retains provenance information which it uses to maintain consistency over multiple runs. The code's only had me as a user so far; comments and corrections highly appreciated. More information at http://www.hackdiary.com/archives/000030.html Cheers, Matt.
Received on Monday, 21 April 2003 09:57:36 UTC