W3C home > Mailing lists > Public > public-lod@w3.org > February 2010

RE: DBpedia-based entity recognition service / tool?

From: <Rafi.Shachar@reuters.com>
Date: Tue, 2 Feb 2010 20:30:43 +0000
To: samwald@gmx.at, public-lod@w3.org
Message-ID: <3A2D7967C26115498422B68C25E946828668F3@LONSMSXM03.emea.ime.reuters.com>
Matthias,

OpenCalais does have links to DBpedia URIs for large subset of entities.
The DBpedia URIs are not included in OpenCalais output but in the
LinkedData endpoint. For example,
http://d.opencalais.com/er/geo/city/ralg-geo1/f08025f6-8e95-c3ff-2909-0a
5219ed3bfa

The entities which have links to DBpedia are documented here:
http://www.opencalais.com/documentation/linked-data-entities

Rafi

-----Original Message-----
From: public-lod-request@w3.org [mailto:public-lod-request@w3.org] On
Behalf Of Matthias Samwald
Sent: Tuesday, February 02, 2010 2:26 PM
To: public-lod@w3.org
Subject: DBpedia-based entity recognition service / tool?

Dear LOD community,

I would be glad to hear your advice on how to best accomplish a simple
task: 
extracting DBpedia entities (identified with DBpedia URIs) from a string
of 
text. With good accuracy and recall, possibly with some options to 
constraint the recognized entities to some subset of DBpedia, based on 
categories. The tool or service should be performant enough to process
large 
numbers of strings in a reasonable amount of time.
Given the prolific creation of tiny tools and services in this community
I 
am puzzled about my inability to find anything that accomplishes this
task.
Could you point me to something like that? Are there tools/services for 
Wikipedia that I could use?
Zemanta seems to be too much geared towards 'enhanced blogging', while 
OpenCalais does not return Wikipedia/DBpedia identifiers. Please correct
me 
if I am wrong.

Cheers,
Matthias 




This email was sent to you by Thomson Reuters, the global news and information company.
Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Thomson Reuters. 
Received on Thursday, 4 February 2010 12:27:39 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:25 UTC