Multilingual Linked Open Data (MLOD) 2012
Data Post Proceedings

Special Issue of the Semantic Web Journal

http://www.semantic-web-journal.net/content/suggestions-special-issues

Workshop homepage: http://sabre2012.infai.org/mlode

Image of the Linguistic LOD Cloud: http://linguistics.okfn.org/resources/llod/

Scope of the call:

Researchers in NLP and Linguistics are currently discovering Semantic Web technologies and employing them to answer novel research questions. Through the use of linked data, there is the potential to solve many issues currently faced by the language resources community. In particular, there is significant evidence that RDF allows better data integration than existing formats,[1] in part through a rich ecosystem of tools provided by the Semantic Web, such as query[2] and federation.[3] In addition, the Semantic Web has already been used by several authors[4] [5] to define data categories and enable better resource interoperability. The utility of this method of publishing language resources has lead to the interest of a significant sub-community in linguistics.[6]

The focus of this special issue is language resources including language data such as written or spoken corpora and lexica, multimodal resources, grammars, terminology or domain specific databases and dictionaries, ontologies, multimedia databases, etc. [7] In addition, we require that these resources are published as linked data using appropriate technologies such as RDF and OWL. Furthermore, we will also accept submission of resources used to assist and augment language processing applications, even if the nature of the resource is not deeply entrenched in Linguistics, but only as long as the usefulness is well motivated (DBpedia redirects and disambiguation pages are one example[8] ). Overall, our vision is to boost a decentralized, collaborative, interlinked and interoperable Web of Data[9] in the area of multilingual language resources.

Topics of the call

  1. The current state of the Linguistic Linked Open Data (LLOD) Cloud and inclusion criteria
  2. Validation procedures for linguistic linked data sets
  3. Best practices of multilingual linked open data
  4. Legal issues relating to the licensing of linguistic linked data
  5. Semantic annotation
  6. Language resources, newly published as linked data
  7. Ontologies related to multilinguality and language technology.

Submission instructions

This special issue will especially welcome Full papers, Linked Dataset Descriptions and  Descriptions of Ontologies as specified here: http://www.semantic-web-journal.net/authors Please write us an email first before preparing submissions for other submission types.

Full papers

Full papers contain original research results. These submissions will be reviewed along the usual dimensions for research contributions which include originality, significance of the results, and quality of writing. Any submission that is an extension of previous work must clearly declare the reference and justify the scientific progress of the extension.

Linked data set and ontology descriptions

We invite submissions of data sets and a 4-6 page written description of the data.  This special issue follows the MLODE 2012 Workshop[10] in Leipzig and in addition to submissions by participants of the workshop we also strongly encourage novel submissions. Note that the workshop focuses on the quality and availability of the data, while this issue focuses on the data *and* the written article.

We require that:

  1. The data is clearly licensed, standardized licenses are recommended.  
  2. Submitted data must be available free for academic usage. Open licences are welcome.
  3. The data is openly accessible and follows web standards, especially RDF.

Technical instructions for contributing data sets:

Please send us the data in advance for validation. We have a strict “data first” policy. The data you send us will be evaluated and we will provide guidance and support, if necessary.  After the data review you will be allowed to submit your paper. Full details of requirements are available at http://wiki.okfn.org/Working_Groups/Linguistics/How_to_contribute

Deadlines

As a result of the MLODE workshop, we will provide an updated image of the LLOD cloud:

http://linguistics.okfn.org/resources/llod/

If you want to be included in the updated image, please submit your data as soon as possible to the MLODE workshop. Submission is possible here: http://code.google.com/p/mlode/

Preliminary (optional) data submission deadline: one week before MLODE (Sept 23rd)

Submission: 23 Nov

Notifications: 22 Feb

Resubmission: 22 Mar

2nd Notification: 26 Apr

Camera ready: 17 May

Guest editors

Sebastian Hellmann (AKSW, University of Leipzig)

Steven Moran (LMU Munich)

Martin Brümmer (AKSW, University of Leipzig)

John McCrae (CITEC, University of Bielefeld)


[1] C. Chiarcos, S. Nordhoff, and S. Hellmann, Linked Data in Linguistics, pages 161–179, Heidelberg, 2012. Springer, http://ldl2012.lod2.eu/program/proceedings

[2] E. Prud’Hommeaux and A. Seaborne. SPARQL query language for RDF. W3C recommendation, 2008.

[3] B. Quilitz and U. Leser. Querying distributed RDF data sources with SPARQL. In The Semantic Web: Research and Applications, pages 524–538. Springer, Berlin, Heidelberg, 2008.

[4] M. Windhouwer and S. E. Wright. Linking to linguistic data categories in ISOcat. In C. Chiarcos, S. Nordhoff, and S. Hellmann, editors, Linked Data in Linguistics, pages 99–107. Springer, Heidelberg, 2012.

[5] S. Farrar and D. T. Langendoen. An OWL-DL implementation of GOLD: An ontology for the Semantic Web. In A. Witt and D. Metzing, editors, Linguistic Modeling of Information and Markup Languages. Springer, Dordrecht, 2010.

[6] C. Chiarcos, S. Hellmann, et al. The Open Linguistics Working Group. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC-2012), Istanbul, Turkey, May 2012a.

[7] Definition loosely based on http://www.springer.com/education+%26+language/linguistics/journal/10579

[8] Pablo N. Mendes, Max Jakob and Christian Bizer. DBpedia for NLP: A Multilingual Cross-domain Knowledge Base. Proceedings of the International Conference on Language Resources and Evaluation, LREC 2012

[9] Sören Auer und Sebastian Hellmann. The Web of Data: Decentralized, collaborative, interlinked and interoperable In: LREC 2012, http://www.lrec-conf.org/proceedings/lrec2012/keynotes/LREC%202012.Keynote%20Speech%201.Soeren%20Auer.pdf

[10] http://sabre2012.infai.org/mlode