W3C home > Mailing lists > Public > semantic-web@w3.org > April 2015

Babelfy v1.0: Multilingual Word Sense Disambiguation and Entity Linking Together!

From: Andrea Moro <andrea8moro@gmail.com>
Date: Mon, 20 Apr 2015 09:52:15 +0200
Message-ID: <CAAEd8zJKBpSRKxPTeQgSYt8LRLV7Pjx5Q7sOfVc9BGJicU+EyQ@mail.gmail.com>
To: undisclosed-recipients:;
====================================================================

Babelfy v1.0: Multilingual Word Sense Disambiguation and Entity Linking
together!

http://babelfy.org

====================================================================

As an output of the "MultiJEDI" Starting Grant <http://multijedi.org/>,
funded by the European Research Council

and headed by Prof. Roberto Navigli
<http://wwwusers.di.uniroma1.it/~navigli>, the Linguistic Computing
Laboratory <http://lcl.uniroma1.it/> of the Sapienza

University of Rome is proud to announce the release of Babelfy
<http://babelfy.org/> v1.0.

Babelfy [1,2] is a joint, unified approach to Word Sense Disambiguation and
Entity Linking

in language of choice. The system is based on a loose identification of
candidate meanings

coupled with a densest subgraph heuristic which selects high-coherence
semantic

interpretations. Its performance on standard word sense disambiguation and
entity linking

tasks is on a par with, or surpasses, those of language- and task-specific
state-of-the-art

systems.

Babelfy draws primarily on BabelNet (http://babelnet.org), a very large
encyclopedic

dictionary and semantic network. BabelNet 3.0 covers 271 languages and
provides both

lexicographic and encyclopedic knowledge for all the open-class parts of
speech, including

nearly 14 million concepts and named entities, thanks to the seamless
integration of WordNet,

Wikipedia, Wiktionary, OmegaWiki, Wikidata and the Open Multilingual
WordNet.

New features in Babelfy v1.0:

* 271 languages covered plus a novel language-agnostic setting!

* Available via easy-to-use Java and HTTP RESTful APIs.

* The input context can be either a text or a bag of words where you can mix
up languages!

* Plenty of tunable parameters for the disambiguation procedure such as
setting your own

threshold, enabling multiple scored annotations of the same fragment,
restricting the

annotations to WordNet, Wikipedia or BabelNet, input the offsets that you
want to be linked,

provide pre-annotated tokens as disambiguation context, disable/enable the
most common

sense heuristic, multi-word expressions and the densest subgraph heuristic.

* Three different scores are now output: the disambiguation score, a
coherence score and

a global relevance score.

* Disambiguation and entity linking is performed using BabelNet, thereby
implicitly annotating

according to several different inventories such as WordNet, Wikipedia,
Wiktionary, OmegaWiki,

etc.

[1] Andrea Moro, Alessandro Raganato, Roberto Navigli. Entity Linking meets
Word Sense <http://www.transacl.org/wp-content/uploads/2014/05/54.pdf>

Disambiguation: a Unified Approach
<http://www.transacl.org/wp-content/uploads/2014/05/54.pdf>. Transactions
of the Association for Computational

Linguistics (TACL), 2, pp. 231-244 (2014).

[2] Andrea Moro, Francesco Cecconi, Roberto Navigli. Multilingual Word
Sense Disambiguation

and Entity Linking for Everybody. Proc. of the 13th International Semantic
Web Conference,

Posters and Demonstrations (ISWC 2014), pp. 25-28, Riva del Garda, Italy,
19-23 October 2014.
Received on Monday, 20 April 2015 07:52:43 UTC

This archive was generated by hypermail 2.4.0 : Thursday, 24 March 2022 20:41:47 UTC