General tuning for DBpedia Spotlight

Hi.
I am trying to use DBpedia Spotlight to find and link entities in arbitrary English texts.
Following the instructions, I found it very easy to download and install the whole shebang on my Mac laptop - thanks!
It does pretty well at finding entities, but it gets some strange things wrong for me: for example, it chooses people called Monday instead of the day of the week, or Municipalities of Germany for Municipalities.
That’s fine - I understand that there is always a precision/recall trade-off.
But I want to use it to mark up web pages, so having even a small number of strange links is not too good.

So my question is:
What are the parameters I should set to get a set of results with high precision (even if low recall) for arbitrary English text?
I assume that I need to set Confidence and Annotation Score, and probably some Types.
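
To make the question concrete, here is a minimal sketch of the kind of request I mean. It is in Python rather than PHP, but only the query string matters. It assumes a locally installed server at http://localhost:2222/rest/annotate (adjust host and port to your setup) and the REST query parameters `text`, `confidence`, `support` and `types`. The function name and the threshold values are illustrative placeholders, not recommended settings.

```python
from urllib.parse import urlencode

# Assumed endpoint of a locally installed server; adjust host and port.
SPOTLIGHT_ANNOTATE_URL = "http://localhost:2222/rest/annotate"


def build_annotate_url(text, confidence=0.5, support=20, types=None):
    """Return a GET URL asking Spotlight to annotate `text`.

    The default thresholds are placeholders to tune, not recommended values.
    """
    params = {"text": text, "confidence": confidence, "support": support}
    if types:
        # Restrict results to the given types, passed as a comma-separated list.
        params["types"] = ",".join(types)
    return SPOTLIGHT_ANNOTATE_URL + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_annotate_url("See you on Monday.",
                             confidence=0.6,
                             types=["DBpedia:Person", "DBpedia:Place"]))
```

Raising `confidence` and `support` should trade recall for precision, which is the direction I want; which values work well for general text is exactly what I am asking.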

Related to this, I am using the Lucene version. I see there is a Statistical version, but can’t work out what the difference might be. Should I be using that to get more precise results?

Sorry if this is somewhere in the docs, but I couldn’t find it easily.
My guess is that this is something that quite a few people have been through?

I am using it from php via http, if anyone can actually provide the code! :-)
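
For reference, the HTTP round trip I have in mind looks roughly like the sketch below (Python here, but the PHP would be equivalent). It assumes the same local endpoint as above, that the server returns JSON when sent an `Accept: application/json` header, and that each entry under `Resources` carries `@URI`, `@surfaceForm` and `@similarityScore` fields. The helper names and the 0.9 cut-off are hypothetical, and the sample response is made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def annotate(text, base_url="http://localhost:2222/rest/annotate", confidence=0.5):
    """Fetch annotations as JSON. base_url is an assumed local endpoint."""
    query = urlencode({"text": text, "confidence": confidence})
    request = Request(base_url + "?" + query,
                      headers={"Accept": "application/json"})
    with urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def keep_confident(result, min_similarity=0.9):
    """Client-side filter: drop resources scoring below min_similarity."""
    resources = result.get("Resources", [])
    return [r for r in resources
            if float(r.get("@similarityScore", 0)) >= min_similarity]


if __name__ == "__main__":
    # Made-up response in the assumed shape, so the filter runs offline.
    sample = {
        "Resources": [
            {"@URI": "http://dbpedia.org/resource/Monday",
             "@surfaceForm": "Monday", "@similarityScore": "0.97"},
            {"@URI": "http://dbpedia.org/resource/Municipalities_of_Germany",
             "@surfaceForm": "Municipalities", "@similarityScore": "0.41"},
        ]
    }
    for resource in keep_confident(sample):
        print(resource["@surfaceForm"], "->", resource["@URI"])
```

Filtering on the client like this is only a fallback; I would rather set the right parameters on the request itself.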

Best
Hugh

Received on Thursday, 16 January 2014 11:32:56 UTC