Memento compliant DBpedia Archive with Triple Pattern Fragments support

Announcing the second generation, Memento compliant DBpedia Archive
with Triple Pattern Fragments support

Since 2010, the Research Library of the Los Alamos National Laboratory
has operated a DBpedia Archive that provided access via Subject URIs
and supported datetime negotiation as defined in the Memento protocol
[1]. This allowed access to prior versions of any DBpedia subject page
by accessing the current page, following a link to a TimeGate provided
by that page, and subsequently negotiating in the datetime dimension
with that TimeGate.

In a collaboration between the Los Alamos National Laboratory and
Ghent University, the DBpedia Archive has recently received a major
overhaul. DBpedia versions 2.0 through 2015 were converted to HDT
files [2]. A streamlined process [3] was devised to clean up source
files to be processed by the C++ HDT software [4].  The Java version
of the HDT software [5] was enhanced to support processing large
source files. The DBpedia HDT files were then exposed using the Linked
Data Fragment server software [6], which was augmented with native
support for the Memento protocol.

As a result, the new DBpedia Archive supports datetime negotiation for:
* Subject URIs - In this case, a resource at http://dbepdia.org
provides a link to a corresponding TimeGate at the DBpedia Archive.
For additional information, see [7].
* Triple Pattern Fragments - In this case, a resource at
http://fragments.dbpedia.org provides a link to a corresponding
TimeGate at the DBpedia Archive. For additional information, see [8].
The capability to perform datetime negotiation for Triple Pattern
Fragments is especially powerful because it enables solving temporal
SPARQL queries using a Memento-compliant Triple Pattern Fragment
client [9].

A slide deck [10] and a video recording of a presentation [11]
detailing the effort to create the new DBpedia Archive are available.

With greetings from the project team:

Los Alamos National Laboratory: Lyudmila Balakireva, Harihar Shankar,
Herbert Van de Sompel
Ghent University: Miel Vander Sande, Ruben Verborgh

[1] RFC 7089 https://tools.ietf.org/html/rfc7089
[2] HDT, Binary Representation of RDF http://www.rdfhdt.org/
[3] DBpedia cleanup scripts for HDT creation
https://bitbucket.org/hariharshankar/dbpedia_hdt/overview
[4] C++ HDT processing software https://github.com/LinkedDataFragments/hdt-cpp
[5] Java HDT processing software https://github.com/luda171/hdt-java
[6] Linked Data Fragment Server https://github.com/LinkedDataFragments/Server.js
[7] DBpedia Archive, Subject URI access
http://mementoweb.org/depot/native/dbpedia/
[8] DBpedia Archive, Triple Pattern Fragments access
http://mementoweb.org/depot/native/fragments/
[9] Triple Pattern Fragment Client
https://github.com/LinkedDataFragments/Client.js/
[10] http://www.slideshare.net/hvdsomp/dbpedia-archive-using-memento-triple-pattern-fragments-and-hdt
[11] https://vimeo.com/163210737

-- 
Herbert Van de Sompel
Digital Library Research & Prototyping
Los Alamos National Laboratory, Research Library
http://public.lanl.gov/herbertv/
http://orcid.org/0000-0002-0715-6126

==

Received on Wednesday, 1 June 2016 22:25:30 UTC