- From: John P. McCrae <john.mccrae@insight-centre.org>
- Date: Tue, 6 Nov 2018 15:36:54 +0000
- To: Christian Chiarcos <chiarcos@informatik.uni-frankfurt.de>
- Cc: "public-ontolex@w3.org" <public-ontolex@w3.org>, Hinkelmanns Peter <peter.hinkelmanns@sbg.ac.at>, "christian.chiarcos@web.de" <christian.chiarcos@web.de>
- Message-ID: <CAHLDFnpw+jv-7-TpwG3drxEkK7o-yJn7xOgVLhswMtLObOQZUQ@mail.gmail.com>
Hi Peter, Yes, to summarize other authors, it is not expected that a sense should have multiple references, unless they are semantically equivalent (e.g., skos:exactMatch). This seems relatively straightforward if you think about... if you make a distinction in your SKOS thesaurus, why wouldn't the same distinction be necessary in the lexicon? Similarly, different parts-of-speech necessarily have different meanings, so again they should have distinct lexical entries and senses. The new lexicography module <https://www.w3.org/community/ontolex/wiki/Lexicography> will provide more modelling for group senses in order to represent the structure of existing dictionaries and may better suit your use case. Regards, John On 6 November 2018 at 07:17, Christian Chiarcos < chiarcos@informatik.uni-frankfurt.de> wrote: > Dear Peter, > > great to see the MHDBDB being converted to RDF. For your use case, I would > actually not model the conceptual system itself in lemon, but rather use > SKOS and provide only lexicalization information in lemon. In other words, > your concept hierarchy is the ONtology element in lemON and ONtoLex, and > this is actually the original use case of lemon. > > The relation for linking with an ontology is ontolex:reference. The spec > requires that "The lexical sense has a single lexical entry and a single > reference in the ontology." (As in other cases, I think this is too > restrictive.) So, your problem remains the same. > > The "proper" solution would be to create "lexicalized concepts" in your > thesaurus (which can have skos:broader relations to more than one skos > concept), and to connect these with your lexical entries via > ontolex:reference. > > A shortcut, if you will, would be to use blank nodes as part of the > ontolex linking: > > mhdbdbLex:sense4709 ontolex:reference [ skos:broader > mhdbdbThes:concept23250000, mhdbdbThes:concept23309000, > mhdbdbThes:concept23309020 ] . > > Technically, this would be valid, but it is a bit borderline, because the > blank node is not strictly speaking *in* the ontology/thesaurus. (It is > certainly not worse than not having an ontology, though, as in most > lexicography use cases.) > > From a modelling perspective, I would prefer the second solution despite > these difficulties, as otherwise, the thesaurus grows infinitely with every > novel word being added. Blank nodes created as part of the linking are easy > to filter out if you need to work with the categories alone. Also, imagine > a use case where you have different lexicalizations/lemon models to be > linked to a single ontology, and those lemon models may require fusing > concepts in a different way. Using blank nodes as part of the linking makes > sure that lexicalization- (lexicon-) specific concept fusion can be clearly > separated from the (language-independent) concept hierarchy. > > We have similar difficulties (and no better solution) for multiple parts > of speech for the same lexical entry, etc. > > Best, > Christian > > Am .11.2018, 12:16 Uhr, schrieb Hinkelmanns Peter < > peter.hinkelmanns@sbg.ac.at>: > > Dear Ontolex-List, > > > > I’m writing as a team member of the „Middle High German Conceptual > Database“ (MHDBDB, http://www.mhdbdb.sbg.ac.at), an onomasiological > dictionary of middle high German. We are planning to convert our 40.000+ > dictionary articles to Ontolex. An *article* consists of one or many > *senses* which refer to one or many * categories* of our conceptual > system: > > > > > > In the example above you can see that the entry ‘hûs’ has two senses which > each point to different categories in our conceptual system. The conceptual > system has already been converted to SKOS. My attempt to express the entry > ‘hûs’ with ontolex looks like this: > > > > <rdf….> > > <ontolex:Word rdf:about="http://www.mhdbdb.sbg.ac.at/entry#2933"> > > <ontolex:lexicalForm> > > <ontolex:Form> > > <ontolex:writtenRep>hûs</ontolex:writtenRep> > > </ontolex:Form> > > </ontolex:lexicalForm> > > <lexinfo:partOfSpeech rdf:resource="http://www. > mhdbdb.sbg.ac.at/grammar#NOM"/> > > <ontolex:sense rdf:resource="http://www.mhdbdb.sbg.ac.at/sense#4709"/> > > <ontolex:sense rdf:resource="http://www.mhdbdb.sbg.ac.at/sense#28403"/ > > > > </ontolex:Word> > > <ontolex:LexicalSense rdf:about="http://www.mhdbdb.sbg.ac.at/sense#4709 > "> > > <ontolex:isSenseOf rdf:resource="http://www. > mhdbdb.sbg.ac.at/entry#2933"/> > > <ontolex:isLexicalizedSenseOf rdf:resource="http://www. > mhdbdb.sbg.ac.at/concept#23250000"/> > > <ontolex:isLexicalizedSenseOf rdf:resource="http://www. > mhdbdb.sbg.ac.at/concept#23309000"/> > > <ontolex:isLexicalizedSenseOf rdf:resource="http://www. > mhdbdb.sbg.ac.at/concept#23309020"/> > > </ontolex:LexicalSense> > > <ontolex:LexicalSense rdf:about="http://www.mhdbdb.sbg.ac.at/sense#28403 > "> > > <ontolex:isSenseOf rdf:resource="http://www. > mhdbdb.sbg.ac.at/entry#2933"/> > > <ontolex:isLexicalizedSenseOf rdf:resource="http://www. > mhdbdb.sbg.ac.at/concept#23209000"/> > > <ontolex:isLexicalizedSenseOf rdf:resource="http://www. > mhdbdb.sbg.ac.at/concept#23309020"/> > > </ontolex:LexicalSense> > > </rdf> > > > > As you can see above, I’m doing something probably not intended by > Ontolex: A lexical sense consists of multiple *isLexicalizedSenseOf* > references, mirroring the MHDBDB datastructure. > > > > How could our article structure be transferred to Ontolex correctly > according to the scheme? > > > > Many thanks in advance and best regards > > Peter > > > > *Peter Hinkelmanns MA* > > Wissenschaftlicher Mitarbeiter > > Mittelhochdeutsche Begriffsdatenbank (MHDBDB) | IZMF > > Universität Salzburg | Erzabt-Klotz-Straße 1, A-5020 Salzburg | > > Tel. +43 662 8044 4339 | e-mail: peter.hinkelmanns@sbg.ac.at > > Internet: CV und Fodok > <http://mhdbdb.sbg.ac.at:8300/wordpress/hinkelmanns/> | mhdbdb.sbg.ac.at > | facebook.com/mhdbdb <https://www.facebook.com/mhdbdb> | twitter. > com/MHD_BDB > > > > *Please consider the environment before printing this email* > > > > > > > -- > Prof. Dr. Christian Chiarcos > Applied Computational Linguistics > Johann Wolfgang Goethe Universität Frankfurt a. M. > 60054 Frankfurt am Main, Germany > > office: Robert-Mayer-Str. 10, #401b > <https://maps.google.com/?q=Robert-Mayer-Str.+10,+%23401b&entry=gmail&source=g> > mail: chiarcos@informatik.uni-frankfurt.de > web: http://acoli.cs.uni-frankfurt.de > tel: +49-(0)69-798-22463 > fax: +49-(0)69-798-28931 >
Attachments
- image/jpeg attachment: 01-part
Received on Tuesday, 6 November 2018 15:37:20 UTC