
Re: Discovery Protocol? (was: LODCloud SPARQL Endpoints Spreadsheet)

From: Kingsley Idehen <kidehen@openlinksw.com>
Date: Wed, 18 Sep 2019 11:37:03 -0400
To: public-lod@w3.org
Message-ID: <04471d44-4aa3-3bc7-4edd-e1c258067cf7@openlinksw.com>

On 9/18/19 3:45 AM, Jerven Bolleman wrote:
> Hi All,
>
> For UniProt we provide the VoID file in the place you would expect,
> "https://sparql.uniprot.org/.well-known/void". A few more endpoints
> do the same, e.g. http://sparql.rhea-db.org, even if the VoID there
> is more limited.
>
> The EBI RDF platform provides VoID files following the HCLS spec
> (https://www.w3.org/TR/hcls-dataset/) on their FTP site.
>
> It's a bit of a catch-22. Publicity systems don't look at them, so
> tools do not expect them to exist. For us, though, this drives some
> of our documentation pages and query optimizations (1), which makes
> it a valuable thing for us to expose even if others don't yet look.
>
> It is a real shame that many tools that look at SPARQL endpoints
> depend on crowd-sourced rather than automated records.
> Crowd sourcing doesn't scale for keeping such records up to date.
> E.g. for practical reasons our RDF is split into a few hundred files,
> and no one is going to add these to a form by hand every month.
>
> FYI, we are starting to use
> https://sparql.uniprot.org/.well-known/sparql-examples/ for users to
> find SPARQL example queries, as a first step towards
> making it easier to share queries. This is interesting because we
> store our example queries in our SPARQL endpoint, which means we use
> SPARQL to get SPARQL queries.
> This means we have just one piece of infrastructure in production:
> the SPARQL server itself.
>
> Regards,
> Jerven
>
> (1) Currently has a bug, which should be resolved in the next release
> later today.


Hi Jerven,

Hopefully, the crowd-sourcing effort will be temporary. Right now, it's
our best option for cleaning up the list of SPARQL endpoints associated
with efforts like the LODCloud.

I certainly agree with the goal of fixing discoverability using a
combination of "best practices" and/or an actual HTTP-based pattern.
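
As a rough illustration of what such an HTTP-based pattern could look
like, here is a minimal Python sketch built on the well-known VoID
location Jerven mentions above. The function names, the
"Accept: text/turtle" header, and the timeout are assumptions made for
illustration, not anything UniProt or the LODCloud tooling prescribes;
a server may ignore the header or not publish a VoID file at all.

```python
from typing import Optional
from urllib.parse import urlsplit, urlunsplit
from urllib.request import Request, urlopen

# Path taken from the URL quoted in this thread
# (https://sparql.uniprot.org/.well-known/void).
WELL_KNOWN_VOID_PATH = "/.well-known/void"


def well_known_void_url(endpoint_url: str) -> str:
    """Derive the well-known VoID URL for the host serving endpoint_url."""
    parts = urlsplit(endpoint_url)
    if not parts.scheme or not parts.netloc:
        raise ValueError(f"not an absolute URL: {endpoint_url!r}")
    # Keep scheme and host, drop the endpoint's own path, query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, WELL_KNOWN_VOID_PATH, "", ""))


def fetch_void(endpoint_url: str, timeout: float = 10.0) -> Optional[str]:
    """Hypothetical helper: return the VoID document as text, or None.

    Assumption: the server can return Turtle; adjust the Accept header
    for servers that only offer another RDF serialization.
    """
    request = Request(
        well_known_void_url(endpoint_url),
        headers={"Accept": "text/turtle"},
    )
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.read().decode("utf-8", errors="replace")
    except OSError:
        # Covers HTTP errors (e.g. 404), DNS failures and timeouts.
        return None
```

For example, well_known_void_url("https://sparql.uniprot.org/sparql")
gives https://sparql.uniprot.org/.well-known/void, and fetch_void
returns None when nothing is served there, so a crawler could fall back
to the crowd-sourced list in that case.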


Kingsley

>
> On Wed, Sep 18, 2019 at 9:20 AM Heiko Paulheim
> <heiko@informatik.uni-mannheim.de> wrote:
>
>     Hi Jens,
>
>     Yes, there are ways, e.g., using VoID descriptions. However, a
>     small study we conducted a few years ago revealed that, at least
>     back then, they were rarely used. [1]
>
>     I don't know if things have changed in the meantime, but I'd
>     rather doubt it.
>
>     Best,
>
>     Heiko
>
>     [1] http://www.heikopaulheim.com/docs/iswc2013_poster.pdf
>
>
>     On 18.09.2019 at 08:55, Jens Scheidtmann wrote:
>>
>>     Hi everyone,
>>
>>      
>>
>>     Is there a way to discover SPARQL endpoints? “Way” meaning
>>     ideally a protocol or at least a linked data way of publishing
>>     the existence of a SPARQL endpoint or some such.
>>
>>      
>>
>>     Jens
>>
>>      
>>
>>     *From:* Kingsley Idehen <kidehen@openlinksw.com>
>>     *Sent:* Wednesday, 18 September 2019 03:10
>>     *To:* public-lod@w3.org
>>     *Subject:* Re: LODCloud SPARQL Endpoints Spreadsheet
>>
>>      
>>
>>     On 9/17/19 7:57 PM, Michel Dumontier wrote:
>>
>>         Hi,
>>
>>         So, Bio2RDF's datasets are now just served from one SPARQL
>>         endpoint, and are loaded into different (versioned) graphs.
>>         How do you want to deal with this?
>>
>>         m.
>>
>>     Hi Michel,
>>
>>     Is there now a canonical endpoint that renders all the others
>>     redundant? Anyway, just add the endpoint in question or indicate
>>     the ones to be dropped.
>>
>>     Kingsley
>>
>>          
>>
>>         On Tue, Sep 17, 2019 at 3:15 PM Kingsley Idehen
>>         <kidehen@openlinksw.com> wrote:
>>
>>             Hi Everyone,
>>
>>             As part of the LODCloud effort (starting in 2007), a
>>             number of SPARQL Endpoints emerged around the initial
>>             SPARQL endpoint provided by DBpedia. Today, that cloud
>>             has grown into the largest Knowledge Graph on earth (by
>>             far!) and continues to drive new frontiers related to
>>             Artificial Intelligence and Machine Learning.
>>
>>             Now that it has established itself as the preeminent
>>             Knowledge Graph on earth, it is extremely important that
>>             we maintain an active list of SPARQL endpoints using
>>             practices that scale. Thus, we are providing a shared
>>             Google Spreadsheet for crowd-sourcing the maintenance of
>>             the list of SPARQL endpoints that make up this important
>>             Knowledge Graph.
>>
>>             Please contribute your SPARQL endpoint(s) to the
>>             spreadsheet.
>>
>>             *Links*
>>
>>             ·SPARQL Endpoint Google Spreadsheet 1
>>             <https://community.openlinksw.com/t/open-invitation-for-contributions-to-an-up-to-date-list-of-query-service-endpoints-that-underlie-the-massive-lodcloud-knowledgegraph/1202>
>>
>>             ·What is the LODCloud, and why is it important?
>>             <https://medium.com/virtuoso-blog/what-is-the-linked-open-data-cloud-and-why-is-it-important-1901a7cb7b1f>
>>
>>             -- 
>>
>>             Regards,
>>
>>              
>>
>>             Kingsley Idehen        
>>
>>             Founder & CEO 
>>
>>             OpenLink Software   
>>
>>             Home Page: http://www.openlinksw.com
>>
>>             Community Support: https://community.openlinksw.com
>>
>>             Weblogs (Blogs):
>>
>>             Company Blog: https://medium.com/openlink-software-blog
>>
>>             Virtuoso Blog: https://medium.com/virtuoso-blog
>>
>>             Data Access Drivers Blog: https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers
>>
>>              
>>
>>             Personal Weblogs (Blogs):
>>
>>             Medium Blog: https://medium.com/@kidehen
>>
>>             Legacy Blogs: http://www.openlinksw.com/blog/~kidehen/
>>
>>                           http://kidehen.blogspot.com
>>
>>              
>>
>>             Profile Pages:
>>
>>             Pinterest: https://www.pinterest.com/kidehen/
>>
>>             Quora: https://www.quora.com/profile/Kingsley-Uyi-Idehen
>>
>>             Twitter: https://twitter.com/kidehen
>>
>>             Google+: https://plus.google.com/+KingsleyIdehen/about
>>
>>             LinkedIn: http://www.linkedin.com/in/kidehen
>>
>>              
>>
>>             Web Identities (WebID):
>>
>>             Personal: http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
>>
>>                     : http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this
>>
>>              
>>
>>
>>          
>>
>>         -- 
>>
>>         Michel Dumontier
>>         Distinguished Professor of Data Science
>>
>>         Maastricht University
>>         http://dumontierlab.com
>>
>>      
>>
>
>     -- 
>     Prof. Dr. Heiko Paulheim
>     Data and Web Science Group
>     University of Mannheim
>     Phone: +49 621 181 2652
>     B6, 26, Room B0.22
>     D-68159 Mannheim
>
>     Mail: heiko@informatik.uni-mannheim.de
>     Web: http://www.heikopaulheim.com
>
>
>
> -- 
> Jerven Bolleman
> me@jerven.eu


-- 
Regards,

Kingsley Idehen	      
Founder & CEO 
OpenLink Software   


Received on Wednesday, 18 September 2019 15:37:51 UTC
