- From: Hammond, Tony <tony.hammond@springernature.com>
- Date: Tue, 14 Mar 2017 09:21:48 +0000
- To: public-lod@w3.org
(Corrected message - as plain text this time) We wanted to share some news from last week's press release [1] about the public datasets release from Springer Nature SciGraph [2]: "Springer Nature SciGraph collates information from across the research landscape, such as funders, research projects and grants, conferences, affiliations, and publications. Currently the knowledge graph contains 155 million facts (triples) about objects of interest to the scholarly domain. Additional data, such as citations, patents, clinical trials, and usage numbers will follow in stages, so that by the end of 2017 Springer Nature SciGraph will grow to more than 1 billion triples." Springer Nature SciGraph is a new Linked Open Data platform aggregating data sources from Springer Nature and key partners from the scholarly domain. The datasets are being distributed as RDF data (N-Triples format) in a set of bzip2 files. These datasets comprise both SciGraph ontology and instance data which includes the last five years of all Springer Nature journal articles (2012-2016). It totals some 155m triples (3.7 GB compressed, or 32 GB uncompressed). All the data is being released under a Creative Commons Attribution-NonCommercial (CC-BY-NC) 4.0 International License. These public datasets comprise an initial snapshot taken from our knowledge graph. Periodic releases will expand this data footprint both in terms of size and richness of interlinked entities. For now, these data points do not support dereference although we aim to add support for this key piece of linked data functionality in the near future. A further point to note is that we use our own SciGraph ontology to define types and relations. We will, however, begin to provide mappings from this vocabulary to other well-known vocabularies. In the current release we provide links to two external datasets: ANZSRC (Australian and New Zealand Standard Research Classification) Fields of Research codes [3], and GRID (Global Research Identifier Database) identifiers [4]. If you want to keep updated about SciGraph developments, we would encourage you to join our public Google group [5]. This is a public mailing list where our users can share questions, insights and feature requests about the SciGraph datasets. To engage with the Linked Open Data research community and to encourage reuse of these datasets we are aiming to organize a hackathon in London in Q2, 2017. More information on this will follow. By way of context, please note that this data release follows on from our earlier data publishing efforts as Nature in 2012 and 2015 [6,7] and Springer in 2015 [8]. It represents a significant consolidation and maturation of this earlier work. The SciGraph Team [1] http://www.springernature.com/gp/group/media/press-releases/springer-nature-scigraph--supporting-open-science-and-the-wider-understanding-of-research/12129614 [2] http://www.springernature.com/scigraph [3] https://vocabs.ands.org.au/anzsrc-for [4] https://grid.ac/ [5] https://groups.google.com/forum/#!forum/scigraph-public [6] https://lists.w3.org/Archives/Public/public-lod/2012Apr/0061.html [7] https://lists.w3.org/Archives/Public/public-lod/2015Apr/0005.html [8] https://lists.w3.org/Archives/Public/public-lod/2015Mar/0035.html
Received on Tuesday, 14 March 2017 09:22:23 UTC