
ePrints as Linked Data

From: Hugh Glaser <hg@ecs.soton.ac.uk>
Date: Sun, 8 Jul 2012 19:31:14 +0000
To: "JISC-REPOSITORIES@JISCMAIL.AC.UK" <JISC-REPOSITORIES@JISCMAIL.AC.UK>
CC: "public-lod@w3.org Data" <public-lod@w3.org>
Message-ID: <39DBC639-11CE-4E29-BAC9-8FB45A3C6BA9@soton.ac.uk>
Hi.
The short story:
There is a Linked Data site at
http://foreign.rkbexplorer.com/
which has all the valid Linked Data currently being published by the ePrints archives listed in roar.eprints.org (or at least as much of it as I managed to get).
You can thus view ePrints things at http://www.dotac.info/explorer using the ePrints URI, e.g.
http://www.dotac.info/explorer/#person/http://oro.open.ac.uk/id/person/ext-em5
And you can get their Community of Practice at e.g.
http://www.rkbexplorer.com/network/?type=person-person&format=tsv&uri=http://oro.open.ac.uk/id/person/ext-em5

Longer story:
Quite a while ago I harvested eprints and OAI data into
http://eprints.rkbexplorer.com/ and http://oai.rkbexplorer.com/
But I wanted Linked Data, and because I had harvested via OAI-PMH, the URIs were not the ones from the ePrints archives, but were on the rkbexplorer domains.
It was also quite a challenge to use OAI-PMH, and to keep the data up to date.
Recently I decided to start again, using classic Linked Data URI resolution.
ePrints publishes sitemaps.xml and other ways of finding the URIs.
So I resolved all URIs and put them into the http://foreign.rkbexplorer.com/ store.
There are 106,230,728 triples there.
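For anyone wanting to try the same approach on their own archive, here is a minimal sketch of the two steps: pull the URIs out of a sitemap, then ask for each one as RDF. It is not the harvester I actually ran. The sample sitemap is made up, the function names are just for illustration, and I am assuming the archive honours an Accept header of application/rdf+xml.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Namespace used by the standard sitemap protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"

# Invented sample; a real archive's sitemap will list different URLs.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://oro.open.ac.uk/id/person/ext-em5</loc></url>
  <url><loc>http://oro.open.ac.uk/id/person/ext-ha2294</loc></url>
</urlset>"""


def uris_from_sitemap(xml_text):
    """Return every <loc> value found in a sitemap document."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc") if loc.text]


def rdf_request(uri):
    """Build (but do not send) a request asking for RDF/XML.

    Assumption: the server does content negotiation on this media type.
    """
    return urllib.request.Request(uri, headers={"Accept": "application/rdf+xml"})


uris = uris_from_sitemap(SAMPLE_SITEMAP)
requests_to_send = [rdf_request(u) for u in uris]
```

Sending each request with urllib.request.urlopen (politely, with a pause between calls) and loading the result into a store is left out here.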

There is a SPARQL endpoint, but please be gentle with it - it is running on an old machine with a lot of other services, so I will have to close it if it gets hammered. It is 3store, so it is SPARQL 1.0 at best.
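As an illustration of being gentle, something like the sketch below keeps a query small: a plain SPARQL 1.0 SELECT about one subject, with a LIMIT. The endpoint address is a placeholder (example.org), not the real one, and the function name is hypothetical; only the "query" parameter comes from the standard SPARQL protocol.

```python
from urllib.parse import urlencode


def small_select_url(endpoint, subject_uri, limit=25):
    """Build a GET URL for a small SPARQL 1.0 query about one subject.

    Only SPARQL 1.0 features are used (no aggregates or subqueries),
    and the LIMIT keeps the result set small.
    """
    query = "SELECT ?p ?o WHERE { <%s> ?p ?o } LIMIT %d" % (subject_uri, limit)
    return endpoint + "?" + urlencode({"query": query})


# Placeholder endpoint; substitute the real one.
url = small_select_url(
    "http://example.org/sparql/",
    "http://oro.open.ac.uk/id/person/ext-em5",
)
```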

Perhaps more interesting is that the ePrints URIs can be used to fire up http://www.dotac.info/explorer directly e.g.
http://www.dotac.info/explorer/#person/http://oro.open.ac.uk/id/person/ext-em5
http://www.dotac.info/explorer/#publication/http://data.open.ac.uk/oro/2956
So you could put a link there from your repository page for eprints.
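If you generate your repository pages from a script, building that link might look roughly like this. The function name is just for illustration, and "person" and "publication" are the only two kinds I have shown above, so the sketch refuses anything else.

```python
EXPLORER_BASE = "http://www.dotac.info/explorer/"

# Only the two kinds that appear in the examples above.
KNOWN_KINDS = ("person", "publication")


def explorer_link(kind, eprints_uri):
    """Return an explorer URL of the form <base>#<kind>/<eprints_uri>."""
    if kind not in KNOWN_KINDS:
        raise ValueError("unknown kind: %r" % (kind,))
    return EXPLORER_BASE + "#" + kind + "/" + eprints_uri


person_link = explorer_link("person", "http://oro.open.ac.uk/id/person/ext-em5")
publication_link = explorer_link("publication", "http://data.open.ac.uk/oro/2956")
```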

Another thing that might be interesting is to embed who people work with into their pages, or wherever you feel like.
This is done at ECS in Southampton:
http://www.ecs.soton.ac.uk/people/ha
So to get the same data for an ePrints URI do e.g. 
http://www.rkbexplorer.com/network/?type=person-person&format=tsv&uri=http://oro.open.ac.uk/id/person/ext-ha2294
Various formats and networks are available; see http://www.rkbexplorer.com/network/
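A small sketch of assembling such a network request in code, using only the three parameters that appear in the example (type, format, uri). The function name is hypothetical, and I have kept ":" and "/" unescaped so the result matches the URL as written above.

```python
from urllib.parse import urlencode

NETWORK_BASE = "http://www.rkbexplorer.com/network/"


def network_url(uri, network_type="person-person", fmt="tsv"):
    """Build a Community of Practice request for one ePrints URI."""
    params = {"type": network_type, "format": fmt, "uri": uri}
    # safe=":/" leaves the embedded URI readable, as in the examples.
    return NETWORK_BASE + "?" + urlencode(params, safe=":/")


cop_url = network_url("http://oro.open.ac.uk/id/person/ext-ha2294")
```

Other values for type and format should be taken from the network page itself rather than guessed.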

Finally, along with the RDF store, comes a CoReference Service (sameAs store).
This is a restricted sameAs store that primarily holds ePrints URIs and the things they link out to.
So
http://foreign.rkbexplorer.com/crs/export/?uri=http://oro.open.ac.uk/id/person/ext-em5
shows a link to the RAE2008 data (see next message).
http://foreign.rkbexplorer.com/crs/export/?uri=http://oro.open.ac.uk/id/person/ext-ha2294
has more extensive sameAs information.
Of course, all this sameAs data is in http://sameas.org as well, e.g.
http://sameas.org/?uri=http://oro.open.ac.uk/id/person/ext-em5
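To look up the same URI in both places from a script, the two lookup URLs can be put together as in the sketch below. The function name is just for illustration. I have not said anything above about what the responses look like, so the sketch stops at building the URLs.

```python
from urllib.parse import urlencode

CRS_EXPORT_BASE = "http://foreign.rkbexplorer.com/crs/export/"
SAMEAS_BASE = "http://sameas.org/"


def coreference_lookups(uri):
    """Return the CRS export URL and the sameas.org URL for one URI."""
    # safe=":/" keeps the embedded URI unescaped, as in the examples.
    query = urlencode({"uri": uri}, safe=":/")
    return {
        "crs": CRS_EXPORT_BASE + "?" + query,
        "sameas": SAMEAS_BASE + "?" + query,
    }


lookups = coreference_lookups("http://oro.open.ac.uk/id/person/ext-em5")
```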

By the way, the coreference data sometimes gets better as time goes by, especially when someone looks at it. So if it seems a bit sparse when you look, come back later and it may have improved.

Of course if lots of people use it, the poor little server will get slow, but I'll worry about that if it happens :-)

Please contact me if you want any help with anything.
Best
Hugh
--
Hugh Glaser
Chief Architect
Seme4 Limited
18 Soho Square
LONDON
W1D 3QL
Mobile: +44 7595334155
Main: +44 2070601590

hugh.glaser@seme4.com
www.seme4.com

Seme4 - the experts in semantic web and linked data applications

Received on Sunday, 8 July 2012 19:31:50 UTC
