Re: DBpedia-based entity recognition service / tool?

From: Mike Bergman <mike@mkbergman.com>
Date: Thu, 04 Feb 2010 10:28:00 -0600
Message-ID: <4B6AF590.4000902@mkbergman.com>
To: Juan Sequeda <juanfederico@gmail.com>, public-lod@w3.org
CC: nathan@webr3.org
Hi All,

On 2/4/2010 9:32 AM, Juan Sequeda wrote:
> On Thu, Feb 4, 2010 at 5:10 AM, Nathan <nathan@webr3.org> wrote:
>
>     Juan Sequeda wrote:
>      > we followed several domain term extraction techniques.
>
>     any chance you could name drop / point to a few of the techniques - very
>     interested in this myself and in all honesty, no idea where to start
>     (other than a crude string split and check word combinations against a
>     dictionary - not very practical!)
>
>
> yes, that would be very naive :)
>
> Look into the Term Extraction [1] area of Information extraction. There
> are several techniques which can be combined including POS tagging,
> Phrase chunking, etc...

Only one or two of them use Wikipedia as a reference corpus, but you
may want to look at the 17 or so tools and services listed under the
"Vocabulary Prompting Tools" section of this recent post [2].
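
For illustration only, the naive baseline Nathan describes above (split
the text, then check word combinations against a dictionary) might be
sketched in Python roughly as follows. The function name, the tiny label
set, and the three-word limit are assumptions made up for this example;
none of it comes from the tools listed in [2].

```python
import re

def find_terms(text, dictionary, max_words=3):
    """Return dictionary terms found in text, preferring the longest match."""
    # Crude tokenization: lowercase, keep runs of letters and digits only.
    words = re.findall(r"[A-Za-z0-9]+", text.lower())
    found = []
    i = 0
    while i < len(words):
        # Try the longest word combination first, then shorter ones.
        for n in range(min(max_words, len(words) - i), 0, -1):
            candidate = " ".join(words[i:i + n])
            if candidate in dictionary:
                found.append(candidate)
                i += n  # skip past the matched words
                break
        else:
            i += 1  # no match starting at this word
    return found

# Hypothetical mini-dictionary; a real one might be built from DBpedia labels.
LABELS = {"linked data", "semantic web", "berlin", "tim berners lee"}

print(find_terms("Tim Berners-Lee spoke about Linked Data in Berlin.", LABELS))
```

This only matches exact surface forms and has no notion of ambiguity or
context, which is why techniques such as POS tagging and phrase chunking
are usually layered on top to propose better candidates.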

Thanks, Mike

[2] http://www.mkbergman.com/862/the-sweet-compendium-of-ontology-building-tools/

>
> [1] http://en.wikipedia.org/wiki/Terminology_extraction
>
>     Many Regards,
>
>     Nathan
>
>
Received on Thursday, 4 February 2010 16:29:18 UTC
