Re: Updated GeoSpecies Data Set

Peter DeVries wrote:
> I have fixed a number of issues and improved LOD linkages for the 
> GeoSpecies data set.
>
> You can read about it here:
>
> http://about.geospecies.org/
>
> You can browse it here:
>
> http://lod.geospecies.org/
>
> The RDF dump can be obtained here:
>
> Here is the new RDF dump
>
> http://lod.geospecies.org/geospecies.rdf.tar.gz   (1,320,182 Triples)
>
> The data set currently contains information and linked data for: 
> 15,803 Species, 1,217 Familes, 189 Orders. We have approximately 6,500 
> species observations, but are awaiting release on the majority of 
> those. The current data set includes 12 sample observation records 
> with geo <http://www.w3.org/2003/01/geo/> and geonames 
> <http://www.geonames.org/> links. There is also a growing number of 
> GeoSpecies annotated articles and presentations in the bibtex 
> <http://purl.org/net/nknouf/ns/bibtex#> and bibio 
> <http://bibliontology.com/> vocabularies. The knowledge base is 
> currently linked to DBpedia <http://dbpedia.org/About>, Freebase 
> <http://www.freebase.com/>, Bio2RDF <http://bio2rdf.org/>, Uniprot 
> <http://www.uniprot.org/>, uBio <http://www.ubio.org/> data sources, 
> and uses some of the umbel <http://umbel.org/> subject concepts. See 
> the projects <http://about.geospecies.org/projects/index.html> page 
> information on proper attribution. Until they have been fully 
> documented, the bulk of the observation records are not currently 
> available.
>
> I have attempted to link to dbpedia, bio2rdf, uniprot and freebase 
> when possible using skos:closeMatch. Of the 15,803 species, 5,577 are 
> linked to dbpedia and wikipedia, 8,896 are linked to bio2rdf and 
> uniprot. There are also foaf:isPrimaryTopicOf links to 8,900 
> Wikispecies pages. Similar linkages are made at the other taxonomic 
> levels of kingdom, phylum, class, order and family.
>
> Here the the page for the Silver-bordered Fritillary 
> Butterfly /Boloria selene/ Denis and Schiffermuller 1775
>
> http://lod.geospecies.org/ses/ICmLC.html
>
> The "entity" is 
>
> http://lod.geospecies.org/ses/ICmLC
>
> The RDF is 
>
> http://lod.geospecies.org/ses/ICmLC.rdf
>
> The levels above species and family are in XHTML with RDFa, but also 
> have a straight RDF representation.
>
> Order Carnivora
>
> http://lod.geospecies.org/orders/jtSaY.xhtml
>
> RDF version
>
> http://lod.geospecies.org/orders/jtSaY.rdf
>
> This is only a fraction of the world's species but it includes all the 
> world's Mammals, and North American Birds.
>
> I will be working to improve the data set's depth, breadth and 
> linkages overtime, and would appreciate any comments or suggestions :-)
>
> My long term plan is to also add biologically relevant assertions to 
> allow useful semantic queries about species.
>
> - Pete
>
> ---------------------------------------------------------------
> Pete DeVries <http://spiders.entomology.wisc.edu/pjd/index.html>
> Department of Entomology
> University of Wisconsin - Madison
> 445 Russell Laboratories
> 1630 Linden Drive
> Madison, WI 53706
> Email: pdevries@wisc.edu <mailto:pdevries@wisc.edu>
> GeoSpecies Knowledge Base <http://species.geospecies.org/>
> About the GeoSpecies Knowledge Base <http://about.geospecies.org/>
> ------------------------------------------------------------
Very nice!

btw - don't forget to add entries to the following pages:

1. 
http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/Statistics
2. http://esw.w3.org/topic/DataSetRDFDumps



-- 


Regards,

Kingsley Idehen	      Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO 
OpenLink Software     Web: http://www.openlinksw.com

Received on Friday, 18 September 2009 11:12:52 UTC