W3C home > Mailing lists > Public > public-lod@w3.org > March 2020

Re: COVID-19 Open Research Dataset

From: Kingsley Idehen <kidehen@openlinksw.com>
Date: Thu, 26 Mar 2020 11:23:26 -0400
To: public-lod@w3.org
Message-ID: <6f634d29-7adf-264a-af75-b0e034e814dd@openlinksw.com>
On 3/26/20 6:48 AM, Pieter Colpaert wrote:
> Hi all,
> We have already made an effort to convert the dataset to RDF (cfr mail
> below):
> At IDLab (Ghent University - IDLab), we have created a Knowledge Graph
> based on the 40000 scholarly articles described in a public dataset
> available on Kaggle. The Knowledge Graph contains all of the
> information that is present in the CSV & JSON: author information,
> content information and meta information. Moreover, it contains
> relations between the different entities, such as citations between
> papers and so on and extra information on different entities such as
> journals, research institutions and countries.
> A Comunica instance to perform queries can be found here:
> https://query-covid19.linkeddatafragments.org/
> The code that was used to map the structured data to RDF can be found
> here: http://www.github.com/GillesVandewiele/COVID-KG/
> The Knowledge Graph is available on Kaggle:
> https://www.kaggle.com/group16/covid19-literature-knowledge-graph
> A notebook on how to work with RDF data in Python:
> https://www.kaggle.com/group16/covid-19-knowledge-graph-starter
> A notebook on working with embeddings:
> https://www.kaggle.com/group16/covid-19-knowledge-graph-embeddings
> This could perhaps serve as a starting point for this work? If you
> require any additional information or wish to contribute, do not
> hesitate to get back in touch with Gilles Vandewiele (in CC) who is
> leading this effort!
> Kind regards,
> Pieter

Hi Pieter,


We've also loaded the dataset to our LOD Cloud Cache instance [1].


[1] https://twitter.com/kidehen/status/1243179530979860481 -- Twitter Thread

[2] https://tinyurl.com/vupbot8 -- simple exploration index page
generated from SPARQL query using SAMPLE()


> On 17/03/2020 01.55, Kingsley Idehen wrote:
>> All,
>> COVID-19 Open Research Dataset (CORD-19) has been opened up general
>> access. Naturally, this would be a great data source for Linked Data
>> transformation  and publication etc..
>> [1] https://pages.semanticscholar.org/coronavirus-research


Kingsley Idehen	      
Founder & CEO 
OpenLink Software   
Home Page: http://www.openlinksw.com
Community Support: https://community.openlinksw.com
Weblogs (Blogs):
Company Blog: https://medium.com/openlink-software-blog
Virtuoso Blog: https://medium.com/virtuoso-blog
Data Access Drivers Blog: https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers

Personal Weblogs (Blogs):
Medium Blog: https://medium.com/@kidehen
Legacy Blogs: http://www.openlinksw.com/blog/~kidehen/

Profile Pages:
Pinterest: https://www.pinterest.com/kidehen/
Quora: https://www.quora.com/profile/Kingsley-Uyi-Idehen
Twitter: https://twitter.com/kidehen
Google+: https://plus.google.com/+KingsleyIdehen/about
LinkedIn: http://www.linkedin.com/in/kidehen

Web Identities (WebID):
Personal: http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
        : http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this

Received on Thursday, 26 March 2020 15:23:46 UTC

This archive was generated by hypermail 2.4.0 : Thursday, 26 March 2020 15:23:47 UTC