W3C home > Mailing lists > Public > public-lod@w3.org > February 2010

Re: DBpedia-based entity recognition service / tool?

From: Kingsley Idehen <kidehen@openlinksw.com>
Date: Tue, 02 Feb 2010 11:43:55 -0500
Message-ID: <4B68564B.5030009@openlinksw.com>
To: Matthias Samwald <samwald@gmx.at>
CC: public-lod@w3.org
Matthias Samwald wrote:
> Hi David,
>
> Thanks for the hint. I remember trying LUPedia a few months ago -- now 
> it has a defined API, which is a good addition. Unfortunately, the 
> quality of results could be improved quite a bit.
>
> Here is a scientific statement that I would like to see annotated:
> "Albizia julibrissin has anxiolytic-like effects that are mediated by 
> the changes of the serotonergic nervous system, especially 5-HT1A 
> receptors."
Since MarkMail hasn't indexed this page, could you make an HTML page 
somewhere, with the excerpt above in a paragraph, then reply with the 
doc URL, so I can quickly test to see how close we can get to what you 
seek via our Sponger Middleware.


Kingsley
>
> LUPedia is unable to identify any entities in this string, although 
> DBpedia would contain them.
> http://dbpedia.org/resource/Albizia_julibrissin
> http://dbpedia.org/resource/Anxiolytic
> http://dbpedia.org/page/5-HT1A_receptor
>
> et cetera.
>
> It seems to recognize person names, as for the string "Michael 
> Jackson", the following URIs are returned:
>
> # http://dbpedia.org/resource/Parademon
> # http://dbpedia.org/resource/Michael_Jackson
>
> The first result is a bit puzzling (DBpedia tells me that 'In the DC 
> Universe, Parademons are monstrous shock troops of Apokolips used by 
> Darkseid to maintain the order of Apokolips.').
>
> LUPedia does not seem to do any kind of stemming either, as submitting 
> the string "Michael Jacksons" reduces the list of extracted URIs to:
>
> # http://dbpedia.org/resource/Parademon
>
> LUPedia in its current form will not perform too well in practical 
> settings.
>
> Cheers,
> Matthias Samwald
>
> --------------------------------------------------
> From: "Davide Palmisano" <davide@asemantics.com>
> Sent: Tuesday, February 02, 2010 2:27 PM
> To: "Matthias Samwald" <samwald@gmx.at>
> Cc: <public-lod@w3.org>
> Subject: Re: DBpedia-based entity recognition service / tool?
>
>> Hi Matthias,
>>
>> have you ever tried this http://lupedia.ontotext.com/ ? Perhaps it 
>> may help.
>>
>> cheers,
>>
>> Davide
>>
>> On Tue, Feb 2, 2010 at 1:26 PM, Matthias Samwald <samwald@gmx.at> wrote:
>>> Dear LOD community,
>>>
>>> I would be glad to hear your advice on how to best accomplish a 
>>> simple task:
>>> extracting DBpedia entities (identified with DBpedia URIs) from a 
>>> string of
>>> text. With good accuracy and recall, possibly with some options to
>>> constraint the recognized entities to some subset of DBpedia, based on
>>> categories. The tool or service should be performant enough to 
>>> process large
>>> numbers of strings in a reasonable amount of time.
>>> Given the prolific creation of tiny tools and services in this 
>>> community I
>>> am puzzled about my inability to find anything that accomplishes 
>>> this task.
>>> Could you point me to something like that? Are there tools/services for
>>> Wikipedia that I could use?
>>> Zemanta seems to be too much geared towards 'enhanced blogging', while
>>> OpenCalais does not return Wikipedia/DBpedia identifiers. Please 
>>> correct me
>>> if I am wrong.
>>>
>>> Cheers,
>>> Matthias
>>>
>>>
>
>


-- 

Regards,

Kingsley Idehen	      
President & CEO 
OpenLink Software     
Web: http://www.openlinksw.com
Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter: kidehen 
Received on Tuesday, 2 February 2010 16:44:23 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:25 UTC