W3C home > Mailing lists > Public > semantic-web@w3.org > May 2017

[ANN] WebIsALOD - Large-scale Hypernymy Dataset Released

From: Heiko Paulheim <heiko@informatik.uni-mannheim.de>
Date: Thu, 18 May 2017 10:08:12 +0200
To: public-lod@w3.org, semantic-web@w3.org
Message-ID: <406ff5ae-3420-2000-478c-fdd475dae5cb@informatik.uni-mannheim.de>
Dear all,

the Data and Web Science group at University of Mannheim is happy to 
announce the first release of the WebIsA database [1] as a Linked Open 
Data endpoint. The dataset contains 11.7 million hypernym or subsumption 
relations ("is a") collected from the Web (e.g., "iPhone 4 is a 
smartphone"), using a set of Hearst-like patterns (see the paper [2] for 
details). We provide the data together with confidence scores, rich 
provenance information, as well as interlinks to DBpedia and YAGO. All 
in all, the dataset contains more than 470M triples.

The dataset is available at [3] as a Linked Data endpoint, a SPARQL 
endpoint, and downloadable dumps.

All the best,
Sven Hertling
Heiko Paulheim

[1] http://webdatacommons.org/isadb
[2] Julian Seitner, Christian Bizer, Kai Eckert, Stefano Faralli, Robert 
Meusel, Heiko Paulheim and Simone Paolo Ponzetto: A Large Database of 
Hypernymy Relations Extracted from the Web. In: LREC 2016.
[3] http://webisa.webdatacommons.org/

Prof. Dr. Heiko Paulheim
Data and Web Science Group
University of Mannheim
Phone: +49 621 181 2652
B6, 26, Room B1.16
D-68159 Mannheim

Mail: heiko@informatik.uni-mannheim.de
Web: www.heikopaulheim.com
Received on Thursday, 18 May 2017 08:08:46 UTC

This archive was generated by hypermail 2.3.1 : Thursday, 18 May 2017 08:08:48 UTC