BibClassify SKOS-based auto-keyword assignment from Dan Brickley on 2009-03-12 (public-esw-thes@w3.org from March 2009)

From: Dan Brickley <danbri@danbri.org>
Date: Thu, 12 Mar 2009 15:18:25 +0100
To: SKOS <public-esw-thes@w3.org>
Message-ID: <49B919B1.5020508@danbri.org>

Hi folks

I just stumbled across this:

http://infoscience.epfl.ch/hacking/bibclassify/index.html
http://cdsweb.cern.ch/hacking/bibclassify/
http://cdsweb.cern.ch/hacking/bibclassify/extraction-algorithm.html

"Unlike other similar systems, BibClassify does not use any machine 
learning or AI methodologies - just plain phrase matching using regular 
expressions: it exploits the conformation and richness of the thesaurus 
to produce accurate results. It is then clear that BibClassify performs 
best on top of rich, well-structured, subject thesauri expressed in the 
RDF SKOS language. The simple text mode, described in 1.1, has been 
retained only for historical and demonstrative reasons. "


I haven't actually found or tried the code yet, but it looks very 
promising. Hope it works with huge thesauri too! If anyone tries it, 
please report back here...

BTW to several of you - I'm miles behind on replying to various SKOS 
(eg. skosdex/lucene and API) mails. Sorry about that. Late but not 
forgotten :)

cheers,

Dan

--
http://danbri.org/

Received on Thursday, 12 March 2009 14:19:07 UTC