W3C home > Mailing lists > Public > semantic-web@w3.org > August 2014

CfP: 1st Challenge on Linked Data for Information Extraction

From: Heiko Paulheim <heiko@informatik.uni-mannheim.de>
Date: Fri, 01 Aug 2014 16:54:34 +0200
Message-ID: <53DBAA2A.20609@informatik.uni-mannheim.de>
To: "public-lod@w3.org community" <public-lod@w3.org>, semantic-web@w3.org, semanticweb@yahoogroups.com, ML-news@googlegroups.com
*apologies for cross-posting*

1st Challenge on Linked Data for Information Extraction
    organized in connection with the LD4IE workshop
            at ISWC 2014, Riva del Garda, Italy


            Submissions due September 12, 2014

*The best solution is awarded a Springer book voucher worth 250!*

The Linked Data for Information Extraction challenge explores how 
structured data on web pages can be used to train information extraction 
systems extracting that information from other sources as well. It is 
based on a subset of the Web Data Commons Microformats dataset [1].

For the challenge, original annotated pages are provided, as well as the 
triples extracted from them. Based on that information, participants 
have to design an information extraction system for extracting that 
information from other web pages. In this year's challenge, we focus on 
hCard data [2], i.e., information about persons. The use case of such a 
system could be the assembly of a large database on person data.

The systems are evaluated on a test set of annotated web pages, from 
which all annotations have been removed. The participants have to 
extract triples from those pages and send in their resulting triple 
files. The submitted files are evaluated against the gold standard of 
the original triples, ranking the solutions by F-measure.

A short description of each solution is included in the LD4IE workshop 
proceedings, and presented at the workshop [3].

For more detail on the datasets, tasks, results/paper submission and 
evaluation, see

[1] http://webdatacommons.org/structureddata/index.html
[2] http://microformats.org/wiki/hcard
[3] http://oak.dcs.shef.ac.uk/ld4ie2014/


Heiko Paulheim, University of Mannheim, Germany
Robert Meusel, University of Mannheim, Germany

Dr. Heiko Paulheim
Research Group Data and Web Science
University of Mannheim
Phone: +49 621 181 2646
B6, 26, Room C1.08
D-68159 Mannheim

Mail: heiko@informatik.uni-mannheim.de
Web: www.heikopaulheim.com
Received on Friday, 1 August 2014 14:54:59 UTC

This archive was generated by hypermail 2.4.0 : Tuesday, 5 July 2022 08:45:38 UTC