W3C home > Mailing lists > Public > public-lod@w3.org > February 2010

Re: DBpedia-based entity recognition service / tool?

From: Matthias Samwald <samwald@gmx.at>
Date: Tue, 2 Feb 2010 15:21:21 +0100
Message-ID: <A8CA93A4CDC44D3CA8E7CC91218BE964@ms>
To: "Davide Palmisano" <davide@asemantics.com>
Cc: <public-lod@w3.org>
Hi David,

Thanks for the hint. I remember trying LUPedia a few months ago -- now it 
has a defined API, which is a good addition. Unfortunately, the quality of 
results could be improved quite a bit.

Here is a scientific statement that I would like to see annotated:
"Albizia julibrissin has anxiolytic-like effects that are mediated by the 
changes of the serotonergic nervous system, especially 5-HT1A receptors."

LUPedia is unable to identify any entities in this string, although DBpedia 
would contain them.
http://dbpedia.org/resource/Albizia_julibrissin
http://dbpedia.org/resource/Anxiolytic
http://dbpedia.org/page/5-HT1A_receptor

et cetera.

It seems to recognize person names, as for the string "Michael Jackson", the 
following URIs are returned:

# http://dbpedia.org/resource/Parademon
# http://dbpedia.org/resource/Michael_Jackson

The first result is a bit puzzling (DBpedia tells me that 'In the DC 
Universe, Parademons are monstrous shock troops of Apokolips used by 
Darkseid to maintain the order of Apokolips.').

LUPedia does not seem to do any kind of stemming either, as submitting the 
string "Michael Jacksons" reduces the list of extracted URIs to:

# http://dbpedia.org/resource/Parademon

LUPedia in its current form will not perform too well in practical settings.

Cheers,
Matthias Samwald

--------------------------------------------------
From: "Davide Palmisano" <davide@asemantics.com>
Sent: Tuesday, February 02, 2010 2:27 PM
To: "Matthias Samwald" <samwald@gmx.at>
Cc: <public-lod@w3.org>
Subject: Re: DBpedia-based entity recognition service / tool?

> Hi Matthias,
>
> have you ever tried this http://lupedia.ontotext.com/ ? Perhaps it may 
> help.
>
> cheers,
>
> Davide
>
> On Tue, Feb 2, 2010 at 1:26 PM, Matthias Samwald <samwald@gmx.at> wrote:
>> Dear LOD community,
>>
>> I would be glad to hear your advice on how to best accomplish a simple 
>> task:
>> extracting DBpedia entities (identified with DBpedia URIs) from a string 
>> of
>> text. With good accuracy and recall, possibly with some options to
>> constraint the recognized entities to some subset of DBpedia, based on
>> categories. The tool or service should be performant enough to process 
>> large
>> numbers of strings in a reasonable amount of time.
>> Given the prolific creation of tiny tools and services in this community 
>> I
>> am puzzled about my inability to find anything that accomplishes this 
>> task.
>> Could you point me to something like that? Are there tools/services for
>> Wikipedia that I could use?
>> Zemanta seems to be too much geared towards 'enhanced blogging', while
>> OpenCalais does not return Wikipedia/DBpedia identifiers. Please correct 
>> me
>> if I am wrong.
>>
>> Cheers,
>> Matthias
>>
>> 
Received on Tuesday, 2 February 2010 14:21:51 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:25 UTC