- From: Kingsley Idehen <kidehen@openlinksw.com>
- Date: Sat, 27 Sep 2014 17:01:28 -0400
- To: public-lod@w3.org
- Message-ID: <542725A8.7070206@openlinksw.com>
On 9/26/14 9:52 AM, Neubert Joachim wrote:
> The uriburner seems to bring up data mostly from a lookup of the
> original uri. It includes (via scioc:links_to) the link from the
> English wikipedia page, yet misses that from the German one. So it
> also seems to cover only parts of the data hidden somewhere on the web.

It depends on what you are seeking; there's a little more to the URIBurner instance (and other Virtuoso instances, for that matter). For instance, subject to ACLs, a Virtuoso SPARQL endpoint will allow you to crawl the LOD cloud for additional relations in which your URI is either the subject or the object [1][2]. To do that, you simply need to invoke a SPARQL query in which the Virtuoso Web crawl pragmas are enabled.

The trouble is that there need to be Linked Data sources in the mix for the crawler to de-reference, which is problematic here:

    curl -IH "Accept: text/turtle" http://d-nb.info/gnd/120273152
    HTTP/1.1 303 See Other
    Date: Sat, 27 Sep 2014 19:52:17 GMT
    Server: Apache
    Location: http://d-nb.info/gnd/120273152/about/html

Note that the 303 redirect points to .../about/html even though text/turtle was requested.

Bottom line: you can incorporate crawling into SPARQL when using Virtuoso endpoints, but that doesn't negate the need for URIs that adhere to Linked Data principles with regard to the pathways available for crawling.

One thing I need to investigate further is why the owl:sameAs relation object, from the German DBpedia dataset, isn't being de-referenced as part of this SPARQL query solution processing pipeline.
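For anyone who wants to repeat the curl check above across a batch of URIs, here is a minimal Python sketch. It is only an illustration under stated assumptions, not what the Virtuoso crawler does internally: the function names and the set of media types treated as RDF are my own choices, and the check simply follows redirects and inspects the final Content-Type.

```python
from urllib.request import Request, urlopen

# Assumption: media types treated as "RDF" for this check; adjust as needed.
RDF_MEDIA_TYPES = {
    "text/turtle",
    "application/rdf+xml",
    "application/ld+json",
    "application/n-triples",
}


def is_rdf_media_type(content_type):
    """Return True if a Content-Type header value names an RDF serialization."""
    if not content_type:
        return False
    # Drop parameters such as "; charset=utf-8" before comparing.
    media_type = content_type.split(";", 1)[0].strip().lower()
    return media_type in RDF_MEDIA_TYPES


def dereference_yields_rdf(uri, accept="text/turtle", opener=urlopen):
    """Request `uri` with the given Accept header and report whether the
    final response (urlopen follows redirects) is served as RDF.

    `opener` is injectable so the logic can be exercised without network access.
    """
    request = Request(uri, headers={"Accept": accept})
    with opener(request) as response:
        return is_rdf_media_type(response.headers.get("Content-Type"))


if __name__ == "__main__":
    # URI taken from the curl example in this message; needs network access.
    print(dereference_yields_rdf("http://d-nb.info/gnd/120273152"))
```

A URI for which this returns False gives a crawler nothing to work with in RDF terms, whatever pragmas are enabled on the SPARQL side.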
Links:

[1] http://bit.ly/ZhXoBS -- SPARQL crawl example (scoped to relation predicate and objects)
[2] http://bit.ly/1pysvhu -- ditto, scoped to relation subject, predicate, and object

--
Regards,

Kingsley Idehen
Founder & CEO
OpenLink Software
Company Web: http://www.openlinksw.com
Personal Weblog 1: http://kidehen.blogspot.com
Personal Weblog 2: http://www.openlinksw.com/blog/~kidehen
Twitter Profile: https://twitter.com/kidehen
Google+ Profile: https://plus.google.com/+KingsleyIdehen/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen
Personal WebID: http://kingsley.idehen.net/dataspace/person/kidehen#this
Attachments
- application/pkcs7-signature attachment: S/MIME Cryptographic Signature
Received on Saturday, 27 September 2014 21:01:53 UTC