- From: Andreas Bolka <andreas.bolka@gmx.net>
- Date: Fri, 21 Apr 2006 04:47:27 +0200
- To: semantic-web@w3.org
I'm happy to announce that we have released a first version of Wikipedia3 - Wikipedia in RDF today. Wikipedia3 is a conversion of the English Wikipedia into RDF. Currently we extract basic per-page metadata and structural information like link and category relationships. This is described using a custom ontology (http://www.systemone.at/2006/03/wikipedia) and enriched with elements from Dublin Core and SKOS. An example is worth a thousand words, please have a look at - http://labs.systemone.at/wikipedia3/samples.rdf (RDF/XML) - http://labs.systemone.at/wikipedia3/samples.ttl (Turtle) The available dataset is currently based on the Wikimedia dump of the English Wikipedia from 2006-03-26. It consists of roughly 47 million triples (47'054'407, to be precise) and comes in all major RDF serialization formats: RDF/XML, Turtle and N-Triples. Interested? For more details as well as links to the actual files have a look at http://labs.systemone.at/wikipedia3 . If you have questions, find a glitch in the dataset, have suggestions or ideas, or you want to let us know about things done using Wikipedia3 please do not hesistate to drop me an email. -- Best regards, Andreas Bolka System One, Vienna, Austria http://www.systemone.at/
Received on Friday, 21 April 2006 07:28:05 UTC