Call for Participation: 4th Summer Datathon on Linguistic Linked Open Data (SD-LLOD’22), May 29 - Jun 3, Madrid

Dear colleagues,

(with apologies for cross-posting)

The 4th Summer Datathon on Linguistic Linked Open Data (SD-LLOD-21) will be
held physically from May 29th to June 3rd 2022 at Residencia Lucas Olazábal
of Universidad Politécnica de Madrid, Cercedilla, Madrid. See

The SD-LLOD datathon has the main goal of giving people from industry and
academia practical  knowledge in the application of Linked Open Data
technology to Linguistics. The final aim is to allow participants to
migrate their own (or other’s) linguistic data and publish them as Linked
Data on the Web and/or develop applications on top of Linguistic Linked
Data. This datathon series is unique in its topic worldwide and continues
from the success of the previous editions in 2015 and 2017 in Cercedilla
(Spain), and in 2019 in Dagstuhl (Germany). This edition is supported by
COST (European Cooperation in Science and Technology) through NexusLinguarum,
the “European network for Web-centred linguistic data science” COST Action

During the datathon, participants will:

* Generate and publish their own Linguistic Linked Data from some existing
data sources.

* Apply Linked Data principles and semantic technologies (knowledge graphs,
RDF, SPARQL) to the field of language resources.

* Use the principal models used for representing Linguistic Linked Data, in
particular OntoLex lemon.

* Learn about Linked Data-based NLP workflows and applications.

* Learn about potential benefits and applications of Linguistic Linked Data
for specific use cases.

The program of the summer datathon will contain three types of sessions:

1. Seminars to show novel aspects and discuss selected topics.

2. Hands-on sessions to introduce the basic foundations of each topic,
methods, and technologies and where participants will perform different
tasks using the methods and technologies presented.

3. Datathon sessions, where participants will work, in groups of 4-5, on
miniprojects and where they will apply what they have learned, involving
the generation and/or use of Linguistic Linked Data.

Participants are invited to propose a “miniproject” related to the topics
of the datathon, which might include some datasets for their conversion
into linked data. In this edition, we particularly encourage miniprojects
that involve under-resourced languages. A selection of proposals will form
the basis for the miniprojects which the participants will work on during
the datathon sessions. Participants who do not propose a miniproject, or
whose miniproject is not selected, will be able to join another
miniproject. There will be an award for the best miniproject.



The datathon is a sponsored event, and it has no registration fee, but
participants are expected to cover the cost of their meals and
accommodation at the Residence. Details about the registration can be found
at the datathon website:

As part of the registration process, applicants are invited to submit a
short abstract of their ideas for the datathon (miniproject proposal, e.g.,
description of possible resources to be converted, linked or reused during
the datathon, ideas for use cases, etc.).

Registration will close on 18/04/2022. A minimum of fifteen travelling
grants will be provided by NexusLinguarum (covering accommodation, meals
and travel expenses, more details will appear in

COVID statement


The datathon is planned as a physical event. The local organisation is
committed to guaranteeing a safe event. Note that there might be some COVID
rules to comply with at the time of celebration of the event. These will be
announced in due course.

Important dates (tentative)


Registration opens: 14/02/2022

Registration closes:  18/04/2022

Notification: 22/04/2022

Payment closes: 13/05/2022

Datathon: 29/05/2022 to 03/06/2022



Jorge Gracia (University of Zaragoza, Spain)

Patricia Martín-Chozas (Universidad Politécnica de Madrid, Spain)

Anas Fahad Khan (Institute for Computational Linguistics «A.
Zampolli»/CLARIN-IT, Italy)

Christian Chiarcos (Goethe Universität Frankfurt, Germany)

Local organiser


Elena Montiel-Ponsoda (Universidad Politécnica de Madrid, Spain)

Confirmed tutors and lecturers


Sara Carvalho (University of Aveiro, Portugal)

Christian Chiarcos (Goethe Universität Frankfurt, Germany)

Thierry Declerck (DFKI, Germany)

Milan Dojchinovski (CTU in Prague, Czech Republic / DBpedia Association,
Manuel Fiorelli (University of Rome Tor Vergata, Italy)

Jorge Gracia (University of Zaragoza, Spain)

Dagmar Gromann (University of Vienna, Austria)

Max Ionov (Goethe Universität Frankfurt, Germany)

Anas Fahad Khan (Institute for Computational Linguistics «A.
Zampolli»/CLARIN-IT, Italy)

Francesco Mambrini (Università Cattolica del Sacro Cuore, Italy)

Patricia Martín-Chozas (Universidad Politécnica de Madrid, Spain)

Cécile Robin (NUI Galway, Ireland)

Gilles Sérasset (Université Grenoble Alpes, France)

Bernardo Stearns (NUI Galway, Ireland)

Armando Stellato (University of Rome Tor Vergata, Italy)

Manuel Vimercati (University of Milano-Bicocca, Italy)

Received on Tuesday, 22 February 2022 14:37:00 UTC