W3C home > Mailing lists > Public > public-lod@w3.org > June 2013

[ANN] Public release of Glimmer RDF search engine and demo

From: Peter Mika <pmika@yahoo-inc.com>
Date: Thu, 20 Jun 2013 18:08:51 +0000
To: "public-vocabs@w3.org" <public-vocabs@w3.org>, "public-lod@w3.org" <public-lod@w3.org>, "semantic-web@w3c.org" <semantic-web@w3c.org>, "public-rdfa@w3.org" <public-rdfa@w3.org>
CC: Tim Potter <tep@yahoo-inc.com>, Sebastiano Vigna <vigna@di.unimi.it>, Paolo Boldi <boldi@di.unimi.it>
Message-ID: <CDE911D1.1F685%pmika@yahoo-inc.com>
(apologies for cross-posting)

All,

The Semantic Search research group at Yahoo! Labs is pleased to announce
the open-source code release and public demo of Glimmer, a search engine
for RDF data. 

Glimmer, the search engine [1] provides support for offline distributed
indexing of RDF data using Hadoop MapReduce. It also contains an online
ranking component using a state-of-the-art method based on BM25F,
previously published as [2]. Both of these components are built on top of
MG4J, a highly-scalable open-source search engine written entirely in Java
[3]. Glimmer is available on Github under an Apache 2.0 license.

The Glimmer demo [4] allows searching over 750m triples of data, the
subset of the Web Data Commons [5] collection that uses the schema.org
namespace. We choose to demonstrate Glimmer on this dataset because so far
it has been only accessible as a static download. We hope that providing
API access will make it easier to analyze the data (which previously
required AWS payment) and to develop innovative applications. We plan to
add more collections to the demo in the future. The demo makes it possible
to search the data by keywords or by selecting a class from the taxonomy
shown on the right. It is also possible to restrict matches to the values
of particular predicates and combine such matches using the standard
boolean operators. See [6] for more information.

We welcome bug reports as well as feature suggestions for future release
of Glimmer and the public demo [7].

Lastly, we would like to thank the help and support of Paolo Boldi and
Sebastiano Vigna (University of Milano) as well as Yahoo's Open Source
Working Group. 

Best regards,

The SemSearch group @ Yahoo! Labs


[1] https://github.com/yahoo/Glimmer

[2] Roi Blanco, Peter Mika, Sebastiano Vigna: Effective and Efficient
Entity Search in RDF Data. International Semantic Web Conference (ISWC)
2011, pages 83-97. http://www.dc.fi.udc.es/~roi/publications/iswc2011.pdf

[3] http://mg4j.dsi.unimi.it/

[4] http://glimmer.research.yahoo.com/

[5] http://www.webdatacommons.org/

[6] https://github.com/yahoo/Glimmer/wiki/Web-App-Help

[7] https://github.com/yahoo/Glimmer/issues
Received on Thursday, 20 June 2013 18:12:18 UTC

This archive was generated by hypermail 2.3.1 : Thursday, 20 June 2013 18:12:19 UTC