- From: Dan Brickley <danbri@danbri.org>
- Date: Thu, 12 Mar 2009 15:18:25 +0100
- To: SKOS <public-esw-thes@w3.org>
Hi folks I just stumbled across this: http://infoscience.epfl.ch/hacking/bibclassify/index.html http://cdsweb.cern.ch/hacking/bibclassify/ http://cdsweb.cern.ch/hacking/bibclassify/extraction-algorithm.html "Unlike other similar systems, BibClassify does not use any machine learning or AI methodologies - just plain phrase matching using regular expressions: it exploits the conformation and richness of the thesaurus to produce accurate results. It is then clear that BibClassify performs best on top of rich, well-structured, subject thesauri expressed in the RDF SKOS language. The simple text mode, described in 1.1, has been retained only for historical and demonstrative reasons. " I haven't actually found or tried the code yet, but it looks very promising. Hope it works with huge thesauri too! If anyone tries it, please report back here... BTW to several of you - I'm miles behind on replying to various SKOS (eg. skosdex/lucene and API) mails. Sorry about that. Late but not forgotten :) cheers, Dan -- http://danbri.org/
Received on Thursday, 12 March 2009 14:19:07 UTC