W3C home > Mailing lists > Public > public-lod@w3.org > March 2020

Re: COVID19 Cross-Domain Linked Dataset

From: Oscar Corcho <ocorcho@fi.upm.es>
Date: Fri, 27 Mar 2020 18:38:44 +0100
Cc: "public-lod@w3.org" <public-lod@w3.org>, "semantic-web@w3.org" <semantic-web@w3.org>, Kingsley Idehen <kidehen@openlinksw.com>
Message-Id: <36A9C8DE-4758-47E6-A2AE-496293C9840F@fi.upm.es>
To: janowicz@ucsb.edu
Thanks Kryzstof,

If anyone needs also a SOLR index of all the papers from the CORD-19/Kaggle dataset to build things on top of and prefers not to setup its own one, we have made this one available in one of our servers. If you are planning to make use of it, let us know, so that we can be alert in case that the service goes down (the amount of cyberattacks to our servers in Madrid has been growing constantly  during this last week, which goes on top of the set of humanitarian and health problems that we have here in Spain).

https://librairy.linkeddata.es/data/covid/select?q=*:* <https://librairy.linkeddata.es/data/covid/select?q=*:*>

This index contains not only the raw data available in CORD-19/Kaggle, but also the results of annotating the papers with ATC codes (drugs and their combinations that have been reported in the papers) and diagnoses and tests that have been obtained with CliNER. We will be adding more types of annotations.

We have also generated a probabilistic topic model with this corpus based on the ATC codes, in case that anyone wants to exploit it as well: http://librairy.linkeddata.es/covid19-model/ <http://librairy.linkeddata.es/covid19-model/>

We will also be providing RDF versions of some of this data that can be joined with the data made available by the Ghent folks and by you, if you wish.

Oscar and the rest of the team (Carlos Badenes, Pablo Calleja, Mariano Rico)

--
Oscar Corcho
Ontology Engineering Group
Departamento de Inteligencia Artificial
ETSI Informáticos
Universidad Politécnica de Madrid
Campus de Montegancedo, sn
28660 Boadilla del Monte, Madrid
Tel: +34 910672911




> El 27 mar 2020, a las 18:19, Krzysztof Janowicz <janowicz@ucsb.edu> escribió:
> 
> Dear all,
> 
> This is an update on our open knowledge graph about COVID-19.
> 
> In our knowledge graph, we have:
>   (1). COVID19 cases for countries and regions (data from Johns Hopkins University)
>   (2). Airlines that are suspended due to COVID19 (data are partially donated from Aviation Edge and partially from manual collection)
>   (3). Quarantine policies across the world (manually collected)
>   (4). Relief aids from NGO (manually collected)
>   (5). Regions on multiple levels
>   (6). Local event data such as cancellations (so far by example only)
>   (7). Supply chain disruptions (so far by example only)
> 
> We are  updating our KG by adding more resources. Examples include:
>  (1). Supply chains from wikidata
>  (2). Virus strain data (we are collaborating with UCSD on this part right now)
>  (3). Research articles from Kaggle (Thanks to Pieter, we will integrate your KG to ours)
> 
> This KG can be visualized at:
> http://covid.geog.ucsb.edu:7200/graphs-visualizations?config=48691118befc47e4a6a6aabde78cd73d
> 
> Example query: http://covid.geog.ucsb.edu:7200/sparql?savedQueryName=Max%20Confirmed%20Cases&owner=admin&execute
> 
> Query endpoint: http://covid.geog.ucsb.edu:7200/sparql and http://covid.geog.ucsb.edu:7200/repositories/COVID-19
> 
> Raw RDF: use the export at http://covid.geog.ucsb.edu:7200/graphs
> 
> Please keep in mind that we are constantly changing the data and even the underlying ontology; things will break at times.
> 
> We hope this can be another initiative for our community to contribute to such a global challenge. Any suggestions and/or interests of collaborations are *most* welcome.
> 
> Krzysztof
> 
> -- 
> Krzysztof Janowicz
> 
> Geography Department, University of California, Santa Barbara
> 4830 Ellison Hall, Santa Barbara, CA 93106-4060
> 
> Email: jano@geog.ucsb.edu
> Webpage: http://geog.ucsb.edu/~jano/
> Semantic Web Journal: http://www.semantic-web-journal.net
> 
> 


Received on Friday, 27 March 2020 17:39:14 UTC

This archive was generated by hypermail 2.4.0 : Friday, 27 March 2020 17:39:14 UTC