W3C home > Mailing lists > Public > public-lod@w3.org > June 2009

Re: vocabularies and data alignment

From: François Scharffe <francois.scharffe@inria.fr>
Date: Fri, 12 Jun 2009 15:07:11 +0200
Message-ID: <4A3252FF.80709@inria.fr>
To: Kingsley Idehen <kidehen@openlinksw.com>
CC: Hugh Glaser <hg@ecs.soton.ac.uk>, "public-lod@w3.org" <public-lod@w3.org>, Jerome Euzenat <Jerome.Euzenat@inrialpes.fr>
Kingsley Idehen wrote:
> Franois Scharffe wrote:
>> Hugh Glaser wrote:
>>> Hi,
>>> To put it in simple terms for me :-)
>>> Are you after the algorithms we use to identify when two instances 
>>> are the same?
>>> Best
>>> Hugh
>>
>> Yes !
>>
>> Franois
> 
> So if the answer is "Yes". Then do you mean things in the ABox and TBox? 
> Must be clear here as being too generic leads to confusion.

Link generators are working at the instance level (ABox), they generate 
links between instances. They need some input, a specification of what 
should be interlinked. We think this specification can be lifted to an 
alignment between vocabularies (TBoxes). Well we are not 100% sure this 
will work, that's why we would like to get such tools and their linkage 
specifications.
I can take an example, interlinking persons: one dataset is described 
with FOAF, the other with VCard.
?x foaf:name ?name.
?y vc:n [
	vc:family-name ?fn;
	vc:given-name ?gn.
	].
the linkage specification might be something like:
if compare(?name, concat(?gn," ",?fn)) > threshold
then output("?x owl:sameAs ?y")

In fact, this specification says
foaf:name <-> concat(vc:given-name," ",vc:family-name)
which is an alignment at the TBox level that can be lifted from the 
linkage specification.

I hope I was clear enough this time ;)


Cheers,
Franois
> 
> sameAs is not the best way to align things in the TBox.
> 
> Kingsley
>>
>>>
>>> On 11/06/2009 12:57, "Franois Scharffe" <francois.scharffe@inria.fr> 
>>> wrote:
>>>
>>> Dear LODers,
>>>
>>> There has been a couple of discussions already on this list on the need
>>> for a vocabulary to represent correspondences between terms of different
>>> vocabularies. We also saw recently various tools (e.g. Silk, ODDlinker)
>>> allowing to automatically interlink datasets given a specification of
>>> what should be linked.
>>>
>>> However, there is currently no common way to publish and share this
>>> information (i.e., not the links but the way to generate them, see [1]
>>> for precision).
>>>
>>> We are setting up an experiment [1] to see if it is possible to provide
>>> useful services from this data. But for that purpose we need your help.
>>>
>>> So this is a call for contribution: we are collecting any specification
>>> of link generator for the LOD graph.
>>>
>>> Of course, do not hesitate to comment on the idea or to tell us if you
>>> want to be involved.
>>>
>>> We promise a report on this by the end of summer (northern hemisphere 
>>> :).
>>>
>>> Cheers,
>>> Franois
>>>
>>> [1] http://melinda.inrialpes.fr
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>
> 
> 


Received on Friday, 12 June 2009 13:08:08 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:21 UTC