RE: Issues about the semantics of the ontology-lexicon interface [was: Re: Why not to shortcut the "sense" object] from Armando Stellato on 2012-10-15 (public-ontolex@w3.org from October 2012)

From: Armando Stellato <stellato@info.uniroma2.it>
Date: Mon, 15 Oct 2012 20:23:12 +0200
To: "'Aldo Gangemi'" <aldo.gangemi@cnr.it>, "'public-ontolex'" <public-ontolex@w3.org>
Cc: "'John McCrae'" <jmccrae@cit-ec.uni-bielefeld.de>, "'Guido Vetere'" <gvetere@it.ibm.com>, "'Philipp Cimiano'" <cimiano@cit-ec.uni-bielefeld.de>
Message-ID: <043601cdab02$24312fe0$6c938fa0$@uniroma2.it>
A reply on the other points discussed so far.

In bold (not bald J ), the main sentence about my preferences. The rest is
motivations and rationale behind them (in my PoV). The use of terms such as
Sense, representedBy etc.. is not definitive. I'm using the ones currently
most used, and then provide my candidate ones.

 

---Creating the sense class---

Concerning Philipp's summary, firstly I agree with the decision (?not yet
approved, it seems?) of creating the intermediate Sense class: it's
obviously needed, either for making room for lexical senses (definitely to
be distinguished from ontology entities), or to be able to talk about senses
(reifications of the meaning function).

 

+1 from me (though it seems we all agree). In terms of motivation, I've seen
referring much debate about whether they are useful or not to define
something not representable by an ontology model (e.g. the discussion about
"Meaning and Semiotics - Issues for Modelling" of this August, and the
question: "Is there any semantic aspect of a word sense that cannot be
represented in an ontological model?" ). I don't think this is the main
motivation we may want to address.

While the above is interesting, I think that a mere direct connection
(OntologyEntry <-> LexicalEntry) would not be inappropriate if we think the
ontology as the starting point. In this sense, we (in my humble opinion)
shouldn't care if there's "something about the meaning of a word that we
cannot represent with an ontology model" (I don't delve into the matter of
whether this is true or false) as from our mission, we are defining an
ontology-lexicon interface, and not a linguistic model per se. The ontology
(to be better "linguistically represented") is the cause for our work, and
not the effect. Thus if there's a conjecture about some "dark matter" in the
coverage of ontologies wrt some "word sense", it is out of scope.

I would instead add a practical/social motivation: if we are thinking about
the LOD, we would think about lot of RDFized lexical resources (e.g. I would
like to see all multilingual wordnets publicly released as LOD, but that's
not the case at the moment. and also there are many more resources that are
or will be released as LOD: Framenet, Verbnet, Senso Comune etc..). They
provide lot of common sense which is not committed to any particular
ontology. All of these resources expose, to some extent, some sort of
"semantic layer" (I use this vague name by purpose, as I want to include all
potential different theories and models, and just want to state that they
are not just collections of terms/labels). Now the point is: if we want to
reuse explicitly these resources, we need to address their semantic layer.

The entities in such a non-ontology-committed semantic layer (e.g. synsets
in wordnet) would be a potentially useful lingua franca in many
mapping/matching scenarios, because they are still not a (domain) ontology
resource (thus with very easy to express, non-committing, relations between
ontology and lexicon), though they have the power to be much less ambiguous
than pure language. Their high shareability would act as a informal glue
between ontologies, with high potentials for a new generation of matchers
which are trained to expect this "formally stated informal content".

On the other side, having the possibility to state connections, for
instance, between semantic roles of a frame and elements in an ontology
under a shared vocabulary, would allow the creation of computational objects
and greatly support their distribution as LOD resources themselves and their
reuse in heterogeneous applications, as these will based on: a shared
ontological resource, a shared linguistic resource (expressed in a standard
RDF vocabulary) and the links between them (again expressed in a shared
vocabulary).

I would like to see patterns like the one depicted in:

http://art.uniroma2.it/publications/docs/2008_SWAP2008_LinguisticWatermark3.
0.pdf (upper part of pag. 8)

and the idea of an abstract vocabulary of "extension points" where different
lexical resources can easily fit.

After all, we too have a use case, that is: LLD (lexical linked data). So,
by simple modus ponens, I would say that if the LLD use case is valid, then
we necessarily need that Sense element, and in case, we should find space
for connecting it to ontology entities!

 

Concerning the name, I vote for "sense", because sememes, acceptations, and
others, are either very technical for the layman, or even wrong, as Philipp
reminds us about the original notion of sememe. The only real alternative
would be "meaning", but I'd rather keep that term for the top-level class of
a meaning taxonomy, as I suggest in the following.

 

Naming of elements in the PATH

I don't like "Sense" that much, for two reasons:

1)      To me (but I've not a strong background in linguistics) sense evokes
the idea of a relationship. I feel like I cannot speak of a sense without a
word. E.g. In wordnet, we may say that the first sense of "runner" is
"someone who imports or exports without paying duties" and that the second
is "someone who travels on foot by running" and this would be correct. But
these senses, as entities, are called synsets in wordnet and these, and only
these, are the semantic units adopted in wordnet. A synset plays the role of
a "sense" for a given word (or set of). Or better, the sense of a word is a
link to a synset (which, practically, in Wordnet is more than just a set of
words, and more close to a referent of the world).

2)      +1 on the idea of a Taxonomy, I did the same on previous works on
the matter, and was proposing the same to do here. In this sense (erm,
pardon the abuse of the word!), I think Sense is too much constraining what
we could put here.

I liked Sememe, as it seemed quite abstract to fit, but by reading the
definition, it seems not appropriate (it is atomic).

Was thinking about Meaning, though this world is not clearly used in
literature (e.g. think about its translation used alternatively in different
translations for both terms in the "Sinn and Bedeutung" dichotomy, but this
is not the only case.).

 

I'm more oriented towards a very abstract particle, such as
SemanticUnit/Index/Entry. I'm not really sure about Semantic Field or
Semantic Class, which have been already defined in linguistics. If they fit
this level of abstraction (e.g. would synset be a semantic field (/class?),
where the shared semantic property is dictated by the Leibniz's principle of
meaning invariance wrt lexical substitution?), then they may go even better.

 

Also, I think that we have to really state what we expect (intension and
extension, through examples) what we would like to see "hosted" by this
class. Though I suggested to come as soon as possible to some terminology,
to have a more agile talking, I think we may coin a temporary terminology
(for the said reason of ease-of-speak), check what we want to define with
it, and then verify if it still covers what we need. Rem tene.verba
sequentur (it is rumoured that someone heard Cato murmuring "hopefully" :-D
).

 

Concerning the property names, I'm ok with both LexicalEntry - meaning ->
Sense, and with Sense - representedBy -> OntologyEntity. 

Maybe we could get rid of multiple related uses of the "mean" notion, which
can be somehow disturbing: Meaning as a class, meaning as a property between
lexical entries and senses, means as a property between lexical entries and
ontology entities . it may look like we are playing with words . what about
following the conventional naming patterns that employs the name of the
property range? E.g. LexicalEntry - sense -> Sense ; LexicalEntry -
meansOntologyEntity -> OntologyEntity. The advantage of using this
apparently redundant naming is that at the instance level, the triple become
very clear, e.g. Saxophone - sense -> wordsense-saxophone-1 ; Saxophone -
hasOntologyEntity -> music:Saxophone.

I also prefer "representedBy" to "characterizes", because the second is very
generic and not attested in any related literature. 

 

Lexical Entry.is it not Lexical Unit more appropriate? An entry exists
somewhere, and an object may appear as an entry in more "wheres". A unit
exists per se. Again, pardon my ignorance if I touched some already existing
definition which is not appropriate.

I really like the 

 

 

---Property chaining over senses---

 

I cut it short: totally agree on the property chain, and with perfect
overlap with all of the motivations provided by Aldo (thanks for saving me
more words to write, and for expressing them much better than I would have
done :-) ).

 

Cheers,

 

Armando
Received on Monday, 15 October 2012 18:23:53 UTC