W3C home > Mailing lists > Public > public-egov-ig@w3.org > September 2011

Re: Multi-lingual labels for org ontology

From: Elena Montiel Ponsoda <elemontiel@gmail.com>
Date: Sat, 03 Sep 2011 10:48:21 +0200
Message-ID: <4E61E9D5.2020304@gmail.com>
To: Hugh Glaser <hg@ecs.soton.ac.uk>
CC: Bernadette Hyland <bhyland@3roundstones.com>, Asunción Gómez Pérez <asun@fi.upm.es>, Elena Montiel <emontiel@fi.upm.es>, Bart van Leeuwen <Bart_van_Leeuwen@netage.nl>, W3C public GLD WG WG <public-gld-wg@w3.org>, W3C eGov IC mailing list <public-egov-ig@w3.org>
Hi Hugh,

Now I think I understand what you mean...

I would complete the sentences below as follows:

"We publish ontology O with Spanish labels/descriptions in natural language".
"We have translated the German labels/associated descriptions in natural language of ontology O into English and Swedish."
"Ontology O only has partialially been described in Korean - can someone complete it for us please?"
or

"Ontology O only has some labels in Korean - can someone complete it for us please?"


In our research work we started by directly translating the labels that 
were associated to ontology elements (classes, datatype and object 
properties) in the form of rdfs:labels or as local names in URIs. That 
is why we mostly talked about "label translation" or "the translation of 
ontology labels".

Later on we started working on more complex and powerful models of 
linguistic description that kept the linguistic or lexical layer 
independent from the ontology, or to put it in other words, that 
extracted the natural language descriptions from the ontology (in the 
form of rdfs:labels or local names in URIs) and stored them in external 
models, and that allowed us to add many more linguistic and 
terminological descriptions to those labels. Some of this models are: 
LIR [1], LexInfo [2]...

Currently, in the context of the Monnet project, in which our main 
objective is the localization of ontologies, we have developed "/lemon/: 
lexicon model for ontologies" [3] (http://lexinfo.net/). The idea is 
that we get an ontology as input in which descriptions associated to 
ontology elements are expresed in the form of URI's local names or as 
rdfs:labels or as skos:pref lables, etc.. We extract that information 
and store it in our lemon model. Then, we add further linguistic 
descriptions to those "labels or descriptions": linguistic properties 
(part-of-speech, gender, number...), lexical and terminological 
variation, decompositions of phrase structures (representation of 
multiword expressions), syntactic frames and their mappings to the 
logical predicates in the ontology, and morphological decomposition of 
lexical forms. This is what we call "ontology lexicalization".

Afther that, what we translate are the so called LexicalEntries in our 
lemon model from English to German, for instance. Thanks to the 
linguistic descriptions associated to those lexical entries, and the 
contextual and semantic information provided by the ontology element (to 
which the LexicalEntry is referring or pointing to), we are capable of 
providing more adequate translations of that lexical layer.

For all these reasons I may be in favor of using a more general term 
such as "descriptions", since what we are translating are not always 
rdfs:labels per se, but LexicalEntries or Lexicalizations, etc., 
depending on the linguistic model we are using.

I hope this helps!!
Elena Montiel

[1] Montiel-Ponsoda, Elena, Guadalupe Aguado de Cea, Asunción 
Gómez-Pérez, and Wim Peters. (2010). Enriching Ontologies with 
Multilingual Information. Journal of Natural Language Engineering, 2010, 
DOI:10.1017/S1351324910000082

[2] Cimiano, Philip, Paul Buitelaar, John McCrae, and Michael Sintek. 
(2010). LexInfo: A Declarative Model for the Lexicon-Ontology Interface. 
Journal of Web Semantics, 2010 .

[3] McCrae, John, Guadalupe Aguado de Cea, Paul Buitelaar, Philipp 
Cimiano, Thierry Declerck, Asunción Gómez- Pérez, Jorge Gracia, Laura 
Hollink, Elena Montiel-Ponsoda, Dennis Spohr, and Tobias Wunner. (2011). 
Interchanging Lexical Resources in the Semantic Web. Language Resources 
and Evaluation (in press).



El 02/09/2011 21:30, Hugh Glaser escribió:
> Thank you all.
> It is great to be updated on all the important activities.
> We need lots of great examples of how the use of URIs in RDF enables multilingualism in powerful ways (as we  - at least I - often claim).
> And these efforts are doing it.
>
> However, I'm afraid I still don't have an answer to my question.
> Naming things is surprisingly important, and I think having a name for the set of labels etc in a specific language really concentrates the mind.
> Perhaps it is as simple as a "una localización", "un localisation" or whatever.
> Or maybe it is something with less of a geographic overtone, such as a "binding"?
>
> What I want to be able to say is thing like:
> "We publish ontology O with a Spanish xxx".
> "We have translated the German xxx of ontology O into English and Swedish."
> "Ontology O only has a partial Korean xxx - can someone complete it for us please?"
>
> Cheers
> Hugh
>
> On 1 Sep 2011, at 19:48, Bernadette Hyland wrote:
>
>> Prof. Asunción, Elena, all,
>> Thanks for your detailed replies to our request for guidance on multilingual semantic web activities and process.  Obviously there is a lot of interest&  activity (not surprisingly!)
>>
>> I looked into the process on 'how do we submit findings or recommendations (intentional use of lower case "r", not a W3C Recommendation) to a working group that is tasked with producing Recommendations' which was raised by Bart van Leeuwen in today's GLD WG telecon.
>>
>> The Member Submission Process is detailed here [1].  Bart, I encourage you to review it and forward this information to your Dutch colleague.  Presumably it has to be translated into a language supported by the W3C such that the working group members can consider it.
>>
>> I further investigated and found the Internationalization activities of the W3C are detailed here [2], articles, best practices&  tutorials here [3].
>>
>> Thanks again for your pointers&  background Asun and Elena.
>>
>> Cheers,
>>
>> Bernadette Hyland
>> co-chair W3C Government Linked Data Working Group
>> Charter: http://www.w3.org/2011/gld/
>>
>> [1] http://www.w3.org/2005/10/Process-20051014/submission#Submission
>> [2] http://www.w3.org/International/
>> [3] http://www.w3.org/International/articlelist
>>
>>
>>
>> On Sep 1, 2011, at 2:03 PM, Elena Montiel Ponsoda wrote:
>>
>>> Dear Bernardette, all,
>>>
>>> If you are interested in these issues, I would like to point you to some initiatives and research works we are currently involved in.
>>>
>>> This year we organize the second edition of the Multilingual Semantic Web Workshop (http://msw2.deri.ie/) at the ISWC in Bonn, Germany.
>>> We have received many interesting submissions and hope to have a successful workshop! :)
>>>
>>> With our colleagues of the Monnet project we just launched the OntoLex W3C Community Group to deal with the interface between linguistic information and ontologies (http://www.w3.org/community/ontolex/). The representation of multilingual information in ontologies and linked data, as well as the linking of data sets in different natural languages are key issues that we would like to discuss in the framework of this Group. We would be happy if you would join.
>>>
>>> Finally, let me point you to a paper we will be presenting at the DC-2011 (Dublin Core Metadata Initiative Conference) entitled Style Guidelines for Naming and Labeling Ontologies in the Multilingual Web (http://dcevents.dublincore.org/index.php/IntConf/index/pages/view/program-2011).
>>> There we discuss the pros and cons of using URIs local names vs labels to describe resources, and propose some guidelines for the Multilingual Web based on our experiences in the translation of the FOAF ontology and the IFLA ontologies into Spanish.
>>> Please, let me know if you would like to have more information on any of these issues.
>>> Best Regards,
>>> Elena Montiel
>>>
>>>
>>> El 01/09/2011 19:23, Asunción Gómez Pérez escribió:
>>>> Dear Bernadette, all,
>>>>
>>>> The need of deliverying content to users in their own language is something that we realised one year ago,  when we started to present the linkeddata ideas to some spanish companies and agencies.
>>>>
>>>> During the last year, we built several applications in different domains and we faced the problem of reusing  ontology terms  that are in other languages and the generation of RDF  data in spanish (including patterns for generating URIs). As example of  the applications, we have:
>>>> 	• GeoLinkedData is in the geographical domain. We have the ontologies in Spanish and English, and data are in Spanish.   Data comes  from the Spanish Geographical Institute.  Information about the ontologies and the application can be seen at   http://geo.linkeddata.es.
>>>> 	• In the domain of Libraries, we already migrated resources from the Spanish National Library related with Miguel de Cervantes' book. In that case, we provided the spanish translation to the  IFLA ontologies. Data are in spanish and are linked with VIAF. At this moment we are migrating the whole Spanish library dataset. You can see a prototype at  http://cultura.linkeddata.es/visualizer/  and  http://cultura.linkeddata.es/sparql.
>>>> 	• In the domain of metereological information, we did a first prototype migrating sensor data provided by the Spanish Metereological Agency named AEMET. In that case, the multilingual aspects are less relevant than in the previous cases because data are numerical values coming from sensor networks. A first prototype appear at  http://aemet.linkeddata.es.
>>>> Several months ago we started to work on a paper about multilingualism in the web of Data. We just got the confirmation today that the paper is accepted to be published at the Journal of Web Semantics. In that paper we  present this vision of a multilingual Web of Data. We discuss challenges that need to be addressed to make this vision come true and discuss the role that techniques such as ontology localization, ontology mapping, and cross-lingual ontology-based information access and presentation will play in achieving this. Further, we propose an initial architecture and describe a roadmap that can provide a basis for the implementation of this vision.
>>>>
>>>> Finally, regarding Ontology Localization, We are participating in an EU  project named MONNET ( lead by  DERI). If you go to http://www.monnet-project.eu/Monnet/ you can find  deliverables and scientific papers on that topic.
>>>>
>>>> Kind regards
>>>> Asun
>>>>
>>>>
>>>>
>>>> El 01/09/2011 18:19, Bernadette Hyland escribió:
>>>>> Hi Asun,
>>>>> Thank you.  Perfect timing.  The issue of localization was just discussed on today's GLD WG call.  Bart van Leeuwen raised the topic with respect to vocabulary / Web Metadata Standards published by an LOD practioner.  Bart was asking how to leverage what his colleague has done, if useful to this WG given that it is prepared in Dutch.[1]
>>>>>
>>>>> We know that activities are happening within W3C on Internationalization + LOD but didn't know immediately who to reach out to for guidance.  Your experience may be relevant and I hope that we can discuss this when you next join a GLD WG telecon.
>>>>>
>>>>> FYI, during today's call, there was a brief general discussion ensued around bringing people together and they'll figure out a solution was mentioned by Michael Hausenblas re: work happening in Ireland and in Irish, etc.  Point is, SMEs and developers want/need content in their own language and how can we help with a framework to promote this important effort.
>>>>>
>>>>> [1] http://standaarden.overheid.nl/owms/4.0/doc/
>>>>>
>>>>> Cheers,
>>>>>
>>>>> Bernadette Hyland
>>>>> co-chair W3C Government Linked Data Working Group
>>>>> Charter: http://www.w3.org/2011/gld/
>>>>>
>>>>> On Sep 1, 2011, at 11:54 AM, Asunción Gómez Pérez wrote:
>>>>>
>>>>>> We call it Ontology Localization.  Just in case you are interested on the topoic, there are several papers published about the topic:
>>>>>> 	• M. Espinoza, A. Gómez-Pérez, and E. Mena. Enriching an ontology with multilingual information. In Proceedings of the European Semantic Web Conference (ESWC 2008), pages 333–347, 2008.
>>>>>> 	• M. Espinoza, A. Gómez-Pérez, and E. Montiel-Ponsoda. Multilingual and localization support for ontologies. In Proceedings of the European Semantic Web Conference (ESWC 2009), pages 821–825, 2009.
>>>>>> 	• M. Espinoza, E. Montiel-Ponsoda, and A. Gómez-Pérez. Ontology localization. In Proceedings of the 5th International. Conference on Knowledge Capture (KCAP), pages 33–40, 2009.
>>>>>> 	• P. Cimiano, E. Montiel-Ponsoda, P. Buitelaar, M. Espinoza, A. Gómez-Pérez. A Note on Ontology Localization - Journal of Applied Ontology 5(2), 2010.
>>>>>> Best
>>>>>>
>>>>>> Asun
>>>>>>
>>>>>>
>>>>>> El 01/09/2011 17:39, Hugh Glaser escribió:
>>>>>>> Nice.
>>>>>>> So should these be called translations?
>>>>>>> They are French labels etc. for the resources.
>>>>>>> I guess they were probably/possibly arrived at by translating from English, but now it has happened, what is the right name for it all?
>>>>>>> Or should we talk about the English translations?
>>>>>>> Best
>>>>>>>
>>>>>>> Hugh
>>>>>>>
>>>>>>> On 1 Sep 2011, at 14:35, "Dave Reynolds"
>>>>>>> <dave.e.reynolds@gmail.com>
>>>>>>>   wrote:
>>>>>>>
>>>>>>>
>>>>>>>> Thanks to Dominique Guardiola the org ontology [1][2] now has French
>>>>>>>> translations for the label/comment/title strings.
>>>>>>>>
>>>>>>>> It's good to see multi-lingual support in semantic web ontologies and
>>>>>>>> I'm very grateful to Dominique for volunteering to do this translation.
>>>>>>>>
>>>>>>>> Dave
>>>>>>>>
>>>>>>>> [1]
>>>>>>>> http://www.w3.org/ns/org#
>>>>>>>>
>>>>>>>> [2]
>>>>>>>> http://www.epimorphics.com/public/vocabulary/org.html
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>> -- 
>>>>>> Prof. Asunción Gómez-Pérez
>>>>>> Director of the Ontology Engineering Group
>>>>>> Facultad de Informática
>>>>>> Universidad Politécnica de Madrid
>>>>>> Campus de Montegancedo, sn
>>>>>> Boadilla del Monte, 28660, Spain
>>>>>> Home page:
>>>>>> www.oeg-upm.net
>>>>>>
>>>>>> Email:
>>>>>> asun@fi.upm.es
>>>>>>
>>>>>> Phone: (34-91) 336-7417
>>>>>> Fax: (34-91) 352-4819
>>>>>>
>>>> -- 
>>>> Prof. Asunción Gómez-Pérez
>>>> Director of the Ontology Engineering Group
>>>> Facultad de Informática
>>>> Universidad Politécnica de Madrid
>>>> Campus de Montegancedo, sn
>>>> Boadilla del Monte, 28660, Spain
>>>> Home page:
>>>> www.oeg-upm.net
>>>>
>>>> Email:
>>>> asun@fi.upm.es
>>>>
>>>> Phone: (34-91) 336-7417
>>>> Fax: (34-91) 352-4819
>>>>
>>> -- 
>>> Elena Montiel-Ponsoda
>>> Ontology Engineering Group (OEG)
>>> Departamento de Inteligencia Artificial
>>> Facultad de Informática
>>> Campus de Montegancedo s/n
>>> Boadilla del Monte-28660 Madrid, España
>>>
>>> www.oeg-upm.net
>>>
>>> Tel. (+34) 91 336 36 70
>>> Fax  (+34) 91 352 48 19
>>>

-- 
Elena Montiel-Ponsoda
Ontology Engineering Group (OEG)
Departamento de Inteligencia Artificial
Facultad de Informática
Campus de Montegancedo s/n
Boadilla del Monte-28660 Madrid, España
www.oeg-upm.net
Tel. (+34) 91 336 36 70
Fax  (+34) 91 352 48 19
Received on Saturday, 3 September 2011 08:48:14 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Saturday, 3 September 2011 08:48:15 GMT