W3C home > Mailing lists > Public > semantic-web@w3.org > March 2017

Announcement: Springer Nature SciGraph - Public datasets release

From: Hammond, Tony <tony.hammond@springernature.com>
Date: Tue, 14 Mar 2017 09:26:07 +0000
Message-ID: <CADQetFZFF1sXHiB_ZPdquZvh_FircYeVD03PHjRQCmq5MgQ+=g@mail.gmail.com>
To: semantic-web@w3.org
We wanted to share some news from last week's press release [1] about
the public datasets release from Springer Nature SciGraph [2]:

"Springer Nature SciGraph collates information from across the
research landscape, such as funders, research projects and grants,
conferences, affiliations, and publications. Currently the knowledge
graph contains 155 million facts (triples) about objects of interest
to the scholarly domain. Additional data, such as citations, patents,
clinical trials, and usage numbers will follow in stages, so that by
the end of 2017 Springer Nature SciGraph will grow to more than 1
billion triples."

Springer Nature SciGraph is a new Linked Open Data platform
aggregating data sources from Springer Nature and key partners from
the scholarly domain.

The datasets are being distributed as RDF data (N-Triples format) in a
set of bzip2 files. These datasets comprise both SciGraph ontology and
instance data which includes the last five years of all Springer
Nature journal articles (2012-2016). It totals some 155m triples (3.7
GB compressed, or 32 GB uncompressed). All the data is being released
under a Creative Commons Attribution-NonCommercial (CC-BY-NC) 4.0
International License.

These public datasets comprise an initial snapshot taken from our
knowledge graph. Periodic releases will expand this data footprint
both in terms of size and richness of interlinked entities. For now,
these data points do not support dereference although we aim to add
support for this key piece of linked data functionality in the near
future.

A further point to note is that we use our own SciGraph ontology to
define types and relations. We will, however, begin to provide
mappings from this vocabulary to other well-known vocabularies. In the
current release we provide links to two external datasets: ANZSRC
(Australian and New Zealand Standard Research Classification) Fields
of Research codes [3], and GRID (Global Research Identifier Database)
identifiers [4].

If you want to keep updated about SciGraph developments, we would
encourage you to join our public Google group [5]. This is a public
mailing list where our users can share questions, insights and feature
requests about the SciGraph datasets.

To engage with the Linked Open Data research community and to
encourage reuse of these datasets we are aiming to organize a
hackathon in London in Q2, 2017. More information on this will follow.

By way of context, please note that this data release follows on from
our earlier data publishing efforts as Nature in 2012 and 2015 [6,7]
and Springer in 2015 [8]. It represents a significant consolidation
and maturation of this earlier work.

The SciGraph Team


[1] http://www.springernature.com/gp/group/media/press-releases/springer-nature-scigraph--supporting-open-science-and-the-wider-understanding-of-research/12129614
[2] http://www.springernature.com/scigraph
[3] https://vocabs.ands.org.au/anzsrc-for
[4] https://grid.ac/
[5] https://groups.google.com/forum/#!forum/scigraph-public
[6] https://lists.w3.org/Archives/Public/public-lod/2012Apr/0061.html
[7] https://lists.w3.org/Archives/Public/public-lod/2015Apr/0005.html
[8] https://lists.w3.org/Archives/Public/public-lod/2015Mar/0035.html
Received on Tuesday, 14 March 2017 09:26:43 UTC

This archive was generated by hypermail 2.4.0 : Thursday, 24 March 2022 20:41:55 UTC