Re: Linked data sets for evaluating interlinking?

Dear Cristina,

There's a set of three thesauri that have been manually mapped by the MACS project:
- LCSH (at id.loc.gov)
- Rameau (at data.bnf.fr)
- SWD (see https://wiki.dnb.de/display/LDS/Dokumentation+des+Linked+Data+Services+der+DNB)
The mappings do not cover the vocabularies completely (hundreds of thousands of concepts), but they already allow interesting experiments; see, for example, what we did in [1].
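
As a rough sketch of how manually created mappings like these could serve as a gold standard, one can compare the links produced by a tool against the manual ones and compute precision, recall and F1. The function name `evaluate_links` and the example.org URIs below are made-up placeholders for illustration, not actual MACS data or any tool's API:

```python
def evaluate_links(candidate, gold):
    """Score candidate links against gold-standard links.

    Both arguments are sets of (source_uri, target_uri) pairs.
    Returns a (precision, recall, f1) tuple.
    """
    true_positives = len(candidate & gold)
    precision = true_positives / len(candidate) if candidate else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical gold-standard mappings (placeholder URIs, not real links).
gold = {
    ("http://example.org/lcsh/A", "http://example.org/rameau/1"),
    ("http://example.org/lcsh/B", "http://example.org/rameau/2"),
    ("http://example.org/lcsh/C", "http://example.org/rameau/3"),
}

# Hypothetical output of an interlinking tool: two correct links, one not in gold.
candidate = {
    ("http://example.org/lcsh/A", "http://example.org/rameau/1"),
    ("http://example.org/lcsh/B", "http://example.org/rameau/2"),
    ("http://example.org/lcsh/D", "http://example.org/rameau/4"),
}

precision, recall, f1 = evaluate_links(candidate, gold)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Because the manual mappings are incomplete, a candidate link that is missing from the gold set is not necessarily wrong, so precision computed this way is probably best read as a lower bound.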

Best,

Antoine

[1]  http://www.few.vu.nl/~aisaac/papers/telplus-ecdl09.pdf


> Hi,
>
> I am looking for pairs of linked data sets that can be used as a gold standard for evaluations. I need pairs of data sets that have been manually linked, or data sets that have been (semi-)automatically linked with interlinking tools and afterwards reviewed (to include the links that the tools did not identify). I have looked into the DataHub catalogue and queried VoID descriptions, but unfortunately the information about how the interlinking process was carried out is often missing.
>
> Apart from the data sets that have been used in the OAEI instance matching track, could anyone recommend (based on past experience) good data sets for evaluating data interlinking processes?
>
> Thanks in advance.
>
> Kind regards,
>
> Cristina
>
> --
> Cristina Sarasua
>
> Institute for Web Science and Technologies (WeST)
>
> Universität Koblenz-Landau
> Universitätsstraße 1
> 56070 Koblenz
> Germany
>
> e:csarasua@uni-koblenz.de
> p: +49 261 287 2772
> f: +49 261 287 100 2772
> w:http://west.uni-koblenz.de
>

Received on Monday, 26 August 2013 06:42:16 UTC