Re: Searching for references to a certain URI

Hugh, Kingsley, Paul,

Thanks to all of you! I somehow suspected that there might be no simple answer, but had a glimmer of hope …

Loading several hundred GB of the Web Data Commons crawl into some custom infrastructure is fine for a research project, but not an option for just looking up a few links to a selected target URI.

The scope of sameas.org is co-reference, which of course is reference, too (so sorry for not mentioning the obvious ;). But it naturally does not cover references like “:someBook dc:creator :somePerson”.
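To make that distinction concrete, here is a minimal Python sketch. It assumes the triples are already at hand as plain (subject, predicate, object) tuples; the function name `inbound_references` and all example.org/example.com URIs are hypothetical and only mirror the dc:creator example above.

```python
OWL_SAMEAS = "http://www.w3.org/2002/07/owl#sameAs"
DC_CREATOR = "http://purl.org/dc/elements/1.1/creator"


def inbound_references(triples, target):
    """Split triples whose object is `target` into co-reference and other links."""
    coref, other = [], []
    for s, p, o in triples:
        if o != target:
            continue  # not a reference to the target at all
        if p == OWL_SAMEAS:
            coref.append((s, p))   # the kind of link a co-reference service covers
        else:
            other.append((s, p))   # e.g. dc:creator, outside that scope
    return coref, other


# Hypothetical sample data, for illustration only.
TARGET = "http://example.org/somePerson"
triples = [
    ("http://example.org/someBook", DC_CREATOR, TARGET),
    ("http://example.com/person/42", OWL_SAMEAS, TARGET),
    ("http://example.org/otherBook", DC_CREATOR, "http://example.org/otherPerson"),
]

coref, other = inbound_references(triples, TARGET)
print("co-reference:", coref)
print("other references:", other)
```

Only the second list is what I am actually missing a lookup service for.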

URIBurner seems to bring up data mostly from a lookup of the original URI. It includes (via sioc:links_to) the link from the English Wikipedia page, yet misses the one from the German page. So it also seems to cover only part of the data hidden somewhere on the web.

Extending the search through the sameas.org equivalences could be a strategy, yet needs some caution, e.g., to exclude the educationist Horst Siebert (http://d-nb.info/gnd/120272814) from a search for the economist (http://d-nb.info/gnd/120273152).
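A rough sketch of that strategy in Python, assuming the equivalence links have already been fetched as URI pairs: the function `expand_equivalents`, the example.org URIs and the link structure are invented for illustration and say nothing about what sameas.org actually returns. Only the two GND URIs are taken from above.

```python
from collections import deque


def expand_equivalents(start, sameas_links, exclude=frozenset()):
    """Collect URIs reachable from `start` via sameAs links, never entering `exclude`."""
    neighbours = {}
    for a, b in sameas_links:
        # Treat the equivalence links as undirected.
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)
    seen = {start}
    queue = deque([start])
    while queue:
        uri = queue.popleft()
        for nxt in neighbours.get(uri, ()):
            if nxt in seen or nxt in exclude:
                continue  # excluded URIs are neither collected nor traversed
            seen.add(nxt)
            queue.append(nxt)
    return seen


ECONOMIST = "http://d-nb.info/gnd/120273152"
EDUCATIONIST = "http://d-nb.info/gnd/120272814"

# Hypothetical links, including one wrong equivalence that conflates the two persons.
links = [
    (ECONOMIST, "http://example.org/a/horst-siebert"),
    ("http://example.org/a/horst-siebert", EDUCATIONIST),
    (EDUCATIONIST, "http://example.org/b/siebert-educationist"),
]

targets = expand_equivalents(ECONOMIST, links, exclude={EDUCATIONIST})
print(sorted(targets))
```

Because the excluded URI is not traversed, anything reachable only through it is dropped as well. The resulting set would then be the list of target URIs to search references for.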

So a search engine for linked data URIs seems to be even more of a desideratum than five years ago (when Sindice/Sig.ma supposedly covered more of the then much smaller web of data).

Thanks again, and all the best - Joachim


From: Paul Houle [mailto:ontology2@gmail.com]
Sent: Thursday, 25 September 2014 20:37
To: Neubert Joachim
Cc: Linked Data community
Subject: Re: Searching for references to a certain URI

There is a hyperlink graph published here for the regular web

http://webdatacommons.org/hyperlinkgraph/index.html


it's a little big though.

On Thu, Sep 25, 2014 at 4:59 AM, Neubert Joachim <J.Neubert@zbw.eu> wrote:
What strategies do you use to find all references to a certain URI, e.g. http://d-nb.info/gnd/120273152, on the (semantic) web?

I used Sindice for this, but sadly the service has been discontinued, and its data is becoming more and more outdated. Google’s link: and info: prefixes don’t work, because highly relevant links on web pages (e.g. from https://en.wikipedia.org/wiki/Horst_Siebert) are excluded due to rel=nofollow, and pure RDF links (e.g. from DBpedia) don’t show up at all.

Cheers, Joachim



--
Paul Houle
Expert on Freebase, DBpedia, Hadoop and RDF
(607) 539 6254    paul.houle on Skype   ontology2@gmail.com
http://legalentityidentifier.info/lei/lookup

Received on Friday, 26 September 2014 13:52:37 UTC