Re: LODCloud SPARQL Endpoints Spreadsheet

Thanks Kingsley, very helpful.
Comments and some answers inline.

> On 18 Sep 2019, at 23:50, Kingsley Idehen <kidehen@openlinksw.com> wrote:
> 
> On 9/18/19 4:53 PM, Hugh Glaser wrote:
>> Thanks Kingsley.
> Hi Hugh,
>> But I still don't see the relationship.
> Okay, let's leave this in the "horses for courses" bucket for now.
> 
> More than anything else, let's get a somewhat clean list of SPARQL
> endpoints that enable users and user-agents explore what's depicted in
> the LOD Cloud pictorial.
Sounds great.
Make the data more actionable.
> 
>> https://lod-cloud.net/lod-data.json
>> has all the data you want, I think.
> 
> It should be helpful after its been transformed into RDF :)
> 
>> Is it useful for people to spend their time filling in your sheet?
> 
> I don't know, but this ultimately a voluntary exercise etc..
Let's hope you make it so.
> 
> BTW -- what is the RDF triple/quad store behind your various endpoints?
Ah. Being vintage, most of them use 3store, which is also vintage.
(Built by Steve Harris based on a simple proof of concept that you could do an RDF store with RDQL on top of SQL!)
https://sourceforge.net/projects/threestore/
> 
>> To be honest, I have spent too much of my life filling in these things, and I groaned when I saw your posting.
>> I can think of quite a few identical initiatives that wasted my time.
> Understood, and that's partly why we at OpenLink Software have embarked
> upon this particular exercise i.e., attempt to fix what's making folks
> feel the way you describe.
Great.
So you may want to be more explicit about what I suspect is not stated.
Things like ("best efforts" obviously):
Your commitment to longevity.
Your commitment to ensure I can get my data back out in many forms.
Your commitment to try to seed other sources (such as LOD-PIC and that great Wikidata Thing I didn't know about - thanks Jeff).
> 
>> 
>> At the least, why don't you prime it with the https://lod-cloud.net/ data.
>> And even plan to push new data back into it?
> 
> Is there an actual data dump URL that I've overlooked?
You didn't notice that the first URI I used in my post was
https://lod-cloud.net/lod-data.json
:-)
It certainly isn't straightforward to find.
And isn't RDF, but I'm sure you could sponge or whatever the json.

I see it would be hard to push data back - you don't have the linkage data that the LOD-PIC requires.
But maybe someone could generate that data automagically from your source.
> 
>> At least if you want to motivate me to do it, perhaps you could tell us what you plan do do with the data?
> I plan to publish the data in RDF as just another contribution to the
> LODCloud i.e., a dataset about SPARQL endpoints that enable end-users
> (people) and user-agents (software) explore said cloud via queries etc..
Sounds good.
> 
>> And how it will be published as Linked Data, I hope.
> I don't know any other way.
I know - I thought I would feed you that line ;-)
> 
>> And what will the licence be?
>> On what conditions will others be able to use the data?
> It will be CC-BY SA or whatever is the most open license in use these days.
Could you consider less restrictive than CC-BY SA, such as Public Domain (I think).
I know we could get into a long discussion here, which no-one would enjoy (licences).
My comment is that Attribution is hard, or at least possibly awkward, especially on the LOD.
You need to engineer any app to keep track of all the sources it might have used, and then find a way of communicating that.
I regularly avoid datasets that require attribution because of that.
And the SA is hard to interpret, and almost not meaningful, if all I did was use a few triples from your dataset, but inferred added another one, and then used that RDF fragment.
> 
>> At the moment, I don't intend to provide data, FWIW.
> That's okay, most your endpoints actually work so I only need you to
> confirm what engine sites behind the SPARQL Query Service endpoint .
I'm feeling encouraged already.
See above, 3store.
I would need to look at the full list - I'm not sure how up to date it is.
There may be some more recent ones that use 4store, and sameAs.org used mySQL, but I think we may have changed to MariaDB.
> 
> Regards,
You too.
Hugh
> 
> Kingsley
> 
>> 
>> Best
>> Hugh
>> 
>>> On 18 Sep 2019, at 16:33, Kingsley Idehen <kidehen@openlinksw.com> wrote:
>>> 
>>> On 9/18/19 3:41 AM, Gray, Alasdair J G wrote:
>>>> Hi
>>>> 
>>>> What is the relationship between this effort and the lod cloud diagram?
>>>> https://lod-cloud.net/
>>>> 
>>>> Thanks
>>>> 
>>>> Alasdair
>>>> 
>>>> http://www.macs.hw.ac.uk/~ajg33
>>> 
>>> Hi Alasdair,
>>> 
>>> Ultimately, this effort should make the bubbles in the LODCloud easier to understand e.g., does a node indicate existing of a SPARQL endpoint or just a published dataset. 
>>> 
>>> In the current state, the LODCloud is a little unclear to many. As an insider, I know that the LOD Cloud comprises the following:
>>> 
>>> 1. Datasets published using Linked Data principles
>>> 
>>> 2. SPARQL endpoints associated with Datasets -- that may or may not be functional which opens up a can of worms that are typically quite negative
>>> 
>>> One issue that has lingered in the wells of confusion for years is the fact that SPARQL actually provides an effective tool for Linked Data Deployment, as exemplified by DBpedia. It scales better than trying to handle the name -> address indirection manually using a Web Server. A few years ago that claim would have been challenged by a lot of doubt, but 10+ years later we have concrete proof :) 
>>> 
>>> 
>>> 
>>> Kingsley 
>>> 
>>>> From: Kingsley Idehen <kidehen@openlinksw.com>
>>>> Sent: Wednesday, September 18, 2019 2:10:13 AM
>>>> To: public-lod@w3.org <public-lod@w3.org>
>>>> Subject: Re: LODCloud SPARQL Endpoints Spreadsheet
>>>> 
>>>> On 9/17/19 7:57 PM, Michel Dumontier wrote:
>>>>> Hi,
>>>>> So, Bio2RDF's datasets are now just served from one sparql endpoint, and are loaded into different (versioned) graphs. how do you want to deal with this?
>>>>> m.
>>>> Hi Michel,
>>>> Is there now a canonical endpoint that renders all the others redundant? Anyway, just add the endpoint in question or indicate the ones to be dropped. 
>>>> 
>>>> Kingsley 
>>>> 
>>>>> On Tue, Sep 17, 2019 at 3:15 PM Kingsley Idehen <kidehen@openlinksw.com> wrote:
>>>>> Hi Everyone,
>>>>> 
>>>>> As part of the LODCloud effort (starting in 2007), a number of SPARQL Endpoints emerged around the initial SPARQL endpoint provided by DBpedia. Today, that cloud has grown into the largest Knowledge Graph on earth (by far!) and continues to drive new frontiers related to Artificial Intelligence and Machine Learning.
>>>>> 
>>>>> Having established itself as the preeminent global Knowledge Graph on earth, it is extremely important that we maintain an active list of SPARQL endpoints using practices that scale. Thus, we are providing a shared Google Spreadsheet for crowd-sourcing the maintenance of SPARQL endpoints that make up this important Knowledge Graph.
>>>>> 
>>>>> Please contribute your SPARQL endpoint(s) to the spreadsheet. 
>>>>> 
>>>>> Links
>>>>> 
>>>>>  • SPARQL Endpoint Google Spreadsheet 1
>>>>>  • What is the LODCloud, and why is it important?
>>>>> -- 
>>>>> Regards,
>>>>> 
>>>>> Kingsley Idehen       
>>>>> Founder & CEO 
>>>>> OpenLink Software   
>>>>> Home Page: 
>>>>> http://www.openlinksw.com
>>>>> 
>>>>> Community Support: 
>>>>> https://community.openlinksw.com
>>>>> 
>>>>> Weblogs (Blogs):
>>>>> Company Blog: 
>>>>> https://medium.com/openlink-software-blog
>>>>> 
>>>>> Virtuoso Blog: 
>>>>> https://medium.com/virtuoso-blog
>>>>> 
>>>>> Data Access Drivers Blog: 
>>>>> https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers
>>>>> 
>>>>> 
>>>>> Personal Weblogs (Blogs):
>>>>> Medium Blog: 
>>>>> https://medium.com/@kidehen
>>>>> 
>>>>> Legacy Blogs: 
>>>>> http://www.openlinksw.com/blog/~kidehen/
>>>>> 
>>>>> 
>>>>> http://kidehen.blogspot.com
>>>>> 
>>>>> 
>>>>> Profile Pages:
>>>>> Pinterest: 
>>>>> https://www.pinterest.com/kidehen/
>>>>> 
>>>>> Quora: 
>>>>> https://www.quora.com/profile/Kingsley-Uyi-Idehen
>>>>> 
>>>>> Twitter: 
>>>>> https://twitter.com/kidehen
>>>>> 
>>>>> Google+: 
>>>>> https://plus.google.com/+KingsleyIdehen/about
>>>>> 
>>>>> LinkedIn: 
>>>>> http://www.linkedin.com/in/kidehen
>>>>> 
>>>>> 
>>>>> Web Identities (WebID):
>>>>> Personal: 
>>>>> http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
>>>>> 
>>>>>        : 
>>>>> http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this
>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>> -- 
>>>>> Michel Dumontier
>>>>> Distinguished Professor of Data Science
>>>>> Maastricht University
>>>>> http://dumontierlab.com
>>>> 
>>>> -- 
>>>> Regards,
>>>> 
>>>> Kingsley Idehen       
>>>> Founder & CEO 
>>>> OpenLink Software   
>>>> Home Page: 
>>>> http://www.openlinksw.com
>>>> 
>>>> Community Support: 
>>>> https://community.openlinksw.com
>>>> 
>>>> Weblogs (Blogs):
>>>> Company Blog: 
>>>> https://medium.com/openlink-software-blog
>>>> 
>>>> Virtuoso Blog: 
>>>> https://medium.com/virtuoso-blog
>>>> 
>>>> Data Access Drivers Blog: 
>>>> https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers
>>>> 
>>>> 
>>>> Personal Weblogs (Blogs):
>>>> Medium Blog: 
>>>> https://medium.com/@kidehen
>>>> 
>>>> Legacy Blogs: 
>>>> http://www.openlinksw.com/blog/~kidehen/
>>>> 
>>>> 
>>>> http://kidehen.blogspot.com
>>>> 
>>>> 
>>>> Profile Pages:
>>>> Pinterest: 
>>>> https://www.pinterest.com/kidehen/
>>>> 
>>>> Quora: 
>>>> https://www.quora.com/profile/Kingsley-Uyi-Idehen
>>>> 
>>>> Twitter: 
>>>> https://twitter.com/kidehen
>>>> 
>>>> Google+: 
>>>> https://plus.google.com/+KingsleyIdehen/about
>>>> 
>>>> LinkedIn: 
>>>> http://www.linkedin.com/in/kidehen
>>>> 
>>>> 
>>>> Web Identities (WebID):
>>>> Personal: 
>>>> http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
>>>> 
>>>>        : 
>>>> http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this
>>>> 
>>>> 
>>>> 
>>>> Heriot-Watt University is The Times & The Sunday Times International University of the Year 2018
>>>> Founded in 1821, Heriot-Watt is a leader in ideas and solutions. With campuses and students across the entire globe we span the world, delivering innovation and educational excellence in business, engineering, design and the physical, social and life sciences. This email is generated from the Heriot-Watt University Group, which includes:
>>>>  • Heriot-Watt University, a Scottish charity registered under number SC000278
>>>>  • Heriot- Watt Services Limited (Oriam), Scotland's national performance centre for sport. Heriot-Watt Services Limited is a private limited company registered is Scotland with registered number SC271030 and registered office at Research & Enterprise Services Heriot-Watt University, Riccarton, Edinburgh, EH14 4AS.
>>>> The contents (including any attachments) are confidential. If you are not the intended recipient of this e-mail, any disclosure, copying, distribution or use of its contents is strictly prohibited, and you should please notify the sender immediately and then delete it (including any attachments) from your system.
>>> 
>>> -- 
>>> Regards,
>>> 
>>> Kingsley Idehen       
>>> Founder & CEO 
>>> OpenLink Software   
>>> Home Page: 
>>> http://www.openlinksw.com
>>> 
>>> Community Support: 
>>> https://community.openlinksw.com
>>> 
>>> Weblogs (Blogs):
>>> Company Blog: 
>>> https://medium.com/openlink-software-blog
>>> 
>>> Virtuoso Blog: 
>>> https://medium.com/virtuoso-blog
>>> 
>>> Data Access Drivers Blog: 
>>> https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers
>>> 
>>> 
>>> Personal Weblogs (Blogs):
>>> Medium Blog: 
>>> https://medium.com/@kidehen
>>> 
>>> Legacy Blogs: 
>>> http://www.openlinksw.com/blog/~kidehen/
>>> 
>>> 
>>> http://kidehen.blogspot.com
>>> 
>>> 
>>> Profile Pages:
>>> Pinterest: 
>>> https://www.pinterest.com/kidehen/
>>> 
>>> Quora: 
>>> https://www.quora.com/profile/Kingsley-Uyi-Idehen
>>> 
>>> Twitter: 
>>> https://twitter.com/kidehen
>>> 
>>> Google+: 
>>> https://plus.google.com/+KingsleyIdehen/about
>>> 
>>> LinkedIn: 
>>> http://www.linkedin.com/in/kidehen
>>> 
>>> 
>>> Web Identities (WebID):
>>> Personal: 
>>> http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
>>> 
>>>        : 
>>> http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this
>>> 
>>> 
>>> 
> 
> -- 
> Regards,
> 
> Kingsley Idehen       
> Founder & CEO 
> OpenLink Software   
> Home Page: http://www.openlinksw.com
> Community Support: https://community.openlinksw.com
> Weblogs (Blogs):
> Company Blog: https://medium.com/openlink-software-blog
> Virtuoso Blog: https://medium.com/virtuoso-blog
> Data Access Drivers Blog: https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers
> 
> Personal Weblogs (Blogs):
> Medium Blog: https://medium.com/@kidehen
> Legacy Blogs: http://www.openlinksw.com/blog/~kidehen/
>              http://kidehen.blogspot.com
> 
> Profile Pages:
> Pinterest: https://www.pinterest.com/kidehen/
> Quora: https://www.quora.com/profile/Kingsley-Uyi-Idehen
> Twitter: https://twitter.com/kidehen
> Google+: https://plus.google.com/+KingsleyIdehen/about
> LinkedIn: http://www.linkedin.com/in/kidehen
> 
> Web Identities (WebID):
> Personal: http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
>        : http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this
> 
> 

-- 
Hugh
023 8061 5652

Received on Thursday, 19 September 2019 10:27:40 UTC