W3C home > Mailing lists > Public > public-lod@w3.org > March 2017

Announcement: Springer Nature SciGraph - Public datasets release

From: Hammond, Tony <tony.hammond@springernature.com>
Date: Mon, 13 Mar 2017 11:54:17 +0000
Message-ID: <CADQetFYk7MjzZhg3KW6Q=QPN4bCxx=W-8_d-SMBy0r2CV0Xnjw@mail.gmail.com>
To: public-lod@w3.org
We wanted to share some news from last week's press release [1] about the
public datasets release from Springer Nature SciGraph [2]: "Springer Nature
SciGraph collates information from across the research landscape, such as
funders, research projects and grants, conferences, affiliations, and
publications. Currently the knowledge graph contains 155 million facts
(triples) about objects of interest to the scholarly domain. Additional
data, such as citations, patents, clinical trials, and usage numbers will
follow in stages, so that by the end of 2017 Springer Nature SciGraph will
grow to more than 1 billion triples." Springer Nature SciGraph is a new
Linked Open Data platform aggregating data sources from Springer Nature and
key partners from the scholarly domain. The datasets are being distributed
as RDF data (N-Triples format) in a set of bzip2 files. These datasets
comprise both SciGraph ontology and instance data which includes the last
five years of all Springer Nature journal articles (2012-2016). It totals
some 155m triples (3.7 GB compressed, or 32 GB uncompressed). All the data
is being released under a Creative Commons Attribution-NonCommercial
(CC-BY-NC) 4.0 International License. These public datasets comprise an
initial snapshot taken from our knowledge graph. Periodic releases will
expand this data footprint both in terms of size and richness of
interlinked entities. For now, these data points do not support dereference
although we aim to add support for this key piece of linked data
functionality in the near future. A further point to note is that we use
our own SciGraph ontology to define types and relations. We will, however,
begin to provide mappings from this vocabulary to other well-known
vocabularies. In the current release we provide links to two external
datasets: ANZSRC (Australian and New Zealand Standard Research
Classification) Fields of Research codes [3], and GRID (Global Research
Identifier Database) identifiers [4]. If you want to keep updated about
SciGraph developments, we would encourage you to join our public Google
group [5]. This is a public mailing list where our users can share
questions, insights and feature requests about the SciGraph datasets. To
engage with the Linked Open Data research community and to encourage reuse
of these datasets we are aiming to organize a hackathon in London in Q2,
2017. More information on this will follow. By way of context, please note
that this data release follows on from our earlier data publishing efforts
as Nature in 2012 and 2015 [6,7] and Springer in 2015 [8]. It represents a
significant consolidation and maturation of this earlier work. The SciGraph
Team [1]
http://www.springernature.com/gp/group/media/press-releases/springer-nature-scigraph--supporting-open-science-and-the-wider-understanding-of-research/12129614
[2] http://www.springernature.com/scigraph [3]
https://vocabs.ands.org.au/anzsrc-for [4] https://grid.ac/ [5]
https://groups.google.com/forum/#!forum/scigraph-public [6]
https://lists.w3.org/Archives/Public/public-lod/2012Apr/0061.html [7]
https://lists.w3.org/Archives/Public/public-lod/2015Apr/0005.html [8]
https://lists.w3.org/Archives/Public/public-lod/2015Mar/0035.html
Received on Monday, 13 March 2017 19:34:00 UTC

This archive was generated by hypermail 2.3.1 : Monday, 13 March 2017 19:34:01 UTC