
Re: COVID-19 Open Research Dataset

From: Pieter Colpaert <pieter.colpaert@ugent.be>
Date: Thu, 26 Mar 2020 11:48:47 +0100
To: Kingsley Idehen <kidehen@openlinksw.com>, public-lod@w3.org
Cc: "gilles.vandewiele@ugent.be" <gilles.vandewiele@ugent.be>
Message-ID: <b03c89c8-4a04-5bb3-b2ba-ca3a6989649f@ugent.be>
Hi all,

We have already made an effort to convert the dataset to RDF (cfr mail 

At IDLab (Ghent University - IDLab), we have created a Knowledge Graph 
based on the 40000 scholarly articles described in a public dataset 
available on Kaggle. The Knowledge Graph contains all of the information 
that is present in the CSV & JSON: author information, content 
information and meta information. Moreover, it contains relations 
between the different entities, such as citations between papers, as 
well as extra information on entities such as journals, research 
institutions and countries.
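
To give a feel for the kind of structure described above, here is a 
minimal sketch in plain Python (standard library only) of papers, 
citations and related entities stored as subject-predicate-object 
triples. The namespace http://example.org/covid-kg/ and every term in 
it (cites, publishedIn, hasAuthor, ...) are hypothetical and chosen for 
illustration; they are not the vocabulary actually used in the 
Knowledge Graph, and real work would more likely rely on a dedicated 
RDF library.

```python
# Hypothetical namespace and terms, for illustration only; the actual
# Knowledge Graph may use different URIs and vocabularies.
EX = "http://example.org/covid-kg/"


def iri(local_name):
    """Build a full IRI in the hypothetical namespace."""
    return EX + local_name


# Each triple is (subject, predicate, object), all given as IRIs.
triples = {
    (iri("paper1"), iri("cites"), iri("paper2")),
    (iri("paper1"), iri("cites"), iri("paper3")),
    (iri("paper2"), iri("cites"), iri("paper3")),
    (iri("paper1"), iri("publishedIn"), iri("journalA")),
    (iri("paper1"), iri("hasAuthor"), iri("author1")),
    (iri("author1"), iri("affiliatedWith"), iri("institutionX")),
    (iri("institutionX"), iri("locatedIn"), iri("countryY")),
}


def match(s=None, p=None, o=None):
    """Return the sorted triples matching a pattern; None is a wildcard."""
    return sorted(
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


def cited_by(paper):
    """Return the papers that cite the given paper."""
    return [s for s, _, _ in match(p=iri("cites"), o=paper)]


def to_ntriples(ts):
    """Serialize IRI-only triples as N-Triples lines."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in sorted(ts))


if __name__ == "__main__":
    # Which papers cite paper3?
    print(cited_by(iri("paper3")))
    # Everything stated about paper1, as N-Triples.
    print(to_ntriples(match(s=iri("paper1"))))
```

The wildcard pattern in match() mirrors, in a very reduced form, how a 
triple pattern in a query selects statements from a graph.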

A Comunica instance to perform queries can be found here: 
The code that was used to map the structured data to RDF can be found 
here: http://www.github.com/GillesVandewiele/COVID-KG/
The Knowledge Graph is available on Kaggle: 
A notebook on how to work with RDF data in Python: 
A notebook on working with embeddings: 

This could perhaps serve as a starting point for this work? If you 
require any additional information or wish to contribute, do not 
hesitate to get back in touch with Gilles Vandewiele (in CC) who is 
leading this effort!

Kind regards,


On 17/03/2020 01.55, Kingsley Idehen wrote:
> All,
> COVID-19 Open Research Dataset (CORD-19) has been opened up for general
> access. Naturally, this would be a great data source for Linked Data
> transformation, publication, etc.
> [1] https://pages.semanticscholar.org/coronavirus-research
Received on Thursday, 26 March 2020 10:49:04 UTC
