W3C home > Mailing lists > Public > public-lod@w3.org > June 2016

Memento compliant DBpedia Archive with Triple Pattern Fragments support

From: Herbert Van de Sompel <hvdsomp@gmail.com>
Date: Wed, 1 Jun 2016 16:25:02 -0600
Message-ID: <CAOywMHfD9QeT_mxoVQrmPqEznoZdzzqqczbJ+e9C-fzg2fk01Q@mail.gmail.com>
To: "<public-lod@w3.org>" <public-lod@w3.org>
Announcing the second generation, Memento compliant DBpedia Archive
with Triple Pattern Fragments support

Since 2010, the Research Library of the Los Alamos National Laboratory
has operated a DBpedia Archive that provided access via Subject URIs
and supported datetime negotiation as defined in the Memento protocol
[1]. This allowed access to prior versions of any DBpedia subject page
by accessing the current page, following a link to a TimeGate provided
by that page, and subsequently negotiating in the datetime dimension
with that TimeGate.

In a collaboration between the Los Alamos National Laboratory and
Ghent University, the DBpedia Archive has recently received a major
overhaul. DBpedia versions 2.0 through 2015 were converted to HDT
files [2]. A streamlined process [3] was devised to clean up source
files to be processed by the C++ HDT software [4].  The Java version
of the HDT software [5] was enhanced to support processing large
source files. The DBpedia HDT files were then exposed using the Linked
Data Fragment server software [6], which was augmented with native
support for the Memento protocol.

As a result, the new DBpedia Archive supports datetime negotiation for:
* Subject URIs - In this case, a resource at http://dbepdia.org
provides a link to a corresponding TimeGate at the DBpedia Archive.
For additional information, see [7].
* Triple Pattern Fragments - In this case, a resource at
http://fragments.dbpedia.org provides a link to a corresponding
TimeGate at the DBpedia Archive. For additional information, see [8].
The capability to perform datetime negotiation for Triple Pattern
Fragments is especially powerful because it enables solving temporal
SPARQL queries using a Memento-compliant Triple Pattern Fragment
client [9].

A slide deck [10] and a video recording of a presentation [11]
detailing the effort to create the new DBpedia Archive are available.

With greetings from the project team:

Los Alamos National Laboratory: Lyudmila Balakireva, Harihar Shankar,
Herbert Van de Sompel
Ghent University: Miel Vander Sande, Ruben Verborgh

[1] RFC 7089 https://tools.ietf.org/html/rfc7089
[2] HDT, Binary Representation of RDF http://www.rdfhdt.org/
[3] DBpedia cleanup scripts for HDT creation
https://bitbucket.org/hariharshankar/dbpedia_hdt/overview
[4] C++ HDT processing software https://github.com/LinkedDataFragments/hdt-cpp
[5] Java HDT processing software https://github.com/luda171/hdt-java
[6] Linked Data Fragment Server https://github.com/LinkedDataFragments/Server.js
[7] DBpedia Archive, Subject URI access
http://mementoweb.org/depot/native/dbpedia/
[8] DBpedia Archive, Triple Pattern Fragments access
http://mementoweb.org/depot/native/fragments/
[9] Triple Pattern Fragment Client
https://github.com/LinkedDataFragments/Client.js/
[10] http://www.slideshare.net/hvdsomp/dbpedia-archive-using-memento-triple-pattern-fragments-and-hdt
[11] https://vimeo.com/163210737

-- 
Herbert Van de Sompel
Digital Library Research & Prototyping
Los Alamos National Laboratory, Research Library
http://public.lanl.gov/herbertv/
http://orcid.org/0000-0002-0715-6126

==
Received on Wednesday, 1 June 2016 22:25:30 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 1 June 2016 22:25:30 UTC