- From: Manuel Fiorelli <manuel.fiorelli@gmail.com>
- Date: Wed, 15 Jul 2015 22:17:14 +0200
- To: Philipp Cimiano <cimiano@cit-ec.uni-bielefeld.de>
- Cc: "public-ontolex@w3.org" <public-ontolex@w3.org>
- Message-ID: <CAGDmdGh7M-Gv3yLAQAzhdD60chroUwHDgOf0A999eA8fmBgKOg@mail.gmail.com>
Hi Philipp, All I have just uploaded the image you asked for. 2015-07-15 14:30 GMT+02:00 Manuel Fiorelli <manuel.fiorelli@gmail.com>: > Hi Philipp, > > please find my comments below. > > 2015-07-15 8:06 GMT+02:00 Philipp Cimiano <cimiano@cit-ec.uni-bielefeld.de > >: > >> Hi Manuel, >> >> replying to this, todos from last Friday.... >> >> Am 07.07.15 um 15:55 schrieb Manuel Fiorelli: >> >> Dear Philipp, All >> >> here are my preliminary comments. Most of them are minor typos, while >> other may seed further discussion. >> >> ----- >> >> In the introduction to example 1, the spec says: >> >> "As an example we may describe a simple lexicon using this property as >> well as properties from Dublin Core and VoID: " >> >> The example then contains also the actual lexical entries that constitute >> the lexicon. This is good for what concerns the self-explanatory nature of >> the example. However, we should make clear that in general the metadata >> only deals with the description of the lexicon as a whole, while the >> representation of its actual content is in the scope of other modules. This >> is particularly relevant to "lexicon catalogs", which may only be >> interested in indexing lexicons without the need to also host the actual >> content. >> >> I kept the example as is but added a sentence that makes clear that >> the metadata describes the lexicon as a whole as suggested by you. >> >> > It seems OK to me. > > >> ----- >> >> In the definition of LexicalizationSet, the classes Lexicon and Dataset need, >> respectively, the prefix ontolex and void. >> >> >> Fixed >> > > Good > >> >> ----- >> >> I am not sure about this statement: >> >> "The lexicalization set object should be unique for a given >> lexicon-ontology pair" >> >> Indeed, the statement above imply that there cannot be two different >> lexicalization sets for FOAF using the WordNet RDF lexicon. I think that >> this conclusion is false, so the previous statement should be retracted. >> >> This has been removed. >> > > Good > > >> >> ----- >> >> In the definition of lexicalizationModel, the disjunction is spelled OR, >> whereas in other cases it is spelled in lowercase. >> >> >> has been fixed by you I guess, thanks. >> > > Actually, I think that ontolex:Lexicon should not be part of the domain > of lexicalizationModel, unless we want to support lexica not expressed > through the OntoLex model. Indeed, this proeperty is meant to hold values > such as RDFS, SKOS, SKOS-XL, OntoLex,... while the use of a catalog of > linguistic annoatations is reported by the property linguisticModel. > >> >> ----- >> >> The definition of lime:references does not mention the fact that in a >> lexical linkset an ontology reference can be associated with a lexical >> concept. >> >> >> In order to avoid overlading, I would prefer to keep "references" as >> referring to the distinct number of resources ?o, that is: >> >> # of different ?o such that (?s,reference,?o) >> >> > I noticed that in the wiki the domain still contains > lime:LexicalizationSet. What have you decided to do? Maybe you want to > coin another property, but I have no suggestion for its name. > >> >> ----- >> >> Concerning Example2: >> - we should add the language "ja" to the lexicalizationSet resource >> - we may say that the ontology is an instance of voaf:Vocabulary, which >> is a subclass of void:Dataset to represent vocabularies (both RDFS >> Schemas and OWL Ontologies) >> - I would extend the introduction to the example. This is my attempt: >> >> <cite> >> In the following example, we describe a lexicalization set expressing how >> elements of an ontology can be verbalized in Japanese by means of entries >> from a supplied lexicon. The metadata clearly tells which ontology and >> lexicon are involved in the lexicalization sets, as well as the relevant >> natural language. The knowledge of these facts about the lexicalization set >> allows us to assess the usefulness of a lexicalization set for a given task >> as well to discover relevant lexicalization sets, when we are constrained >> by the choice of an ontology, lexicon or natural language. >> >> We model the ontology as an instance of the class voaf:Vocabulary that >> is a kind of void:Dataset representing vocabularies (bot RDFS Schemas >> and OWL Ontologies). We benefit from the more specific distinctions made by >> VOAF, by breaking down the total number of entities in the ontology (held >> by the property void:entities) into separate counts for the classes and >> properties (held by voaf:classNumber and voaf:propertyNumber, >> respectively). >> >> Similarly, we use terms from the Lime vocabulary to represent statistics >> about the linguistic content of the lexicon and the lexicalization set. >> Overall, the ontology defines 80 entities and the lexicon 100 lexical >> entries; however, only 20 entities from the target ontologies have been >> associated with a total of 50 lexical entries. >> </cite> >> >> ----- >> >> Great, I have added your text to the example. >> >> > Good. However, I have just noticed that my introduction assumed the use of > some VOAF terms in the example: I will add the missings statements in the > example late this afternoon. > > >> >> In the definition of avgNumOfLexicalizations, it occurs the word >> "define" while it should be "defines". >> >> >> I can not find this, sorry. >> >> But this brings me to another issues. The formula for >> avgNumOfLexicalizations could be improved to make it clearer as follows: >> >> avgNumOfLexicalizations = # lexicalizations / # ontology entities in the >> reference dataset >> >> What do you think? Can you possibly update the formula? That would be >> great. Thanks. >> >> > I agree that the formula should be updated. I will try this afternoon to > produce a new formula, but I don't know if I can replicate exactly the same > appreance. > > >> >> ----- >> >> I would postpone example 3 to end of the section, and I would modify it >> as follows: >> - reuse the same data as in example 2, and make this clear in the >> introduction to the example >> - then, use the properties lexicalizations, avgNumOfLexicalizations and percentage >> to "analyze" the scenario depicted in example 2. For instance, it is now >> possible to tell explicitly that only 25% of the reference ontology has >> been lexicalized. >> >> We can make the example more interesting playing with polisemy so that >> the ratios are not "obvious". >> >> >> Actually, I think that example 3 makes definitely sense here. The ratios >> are rather obvious, true, but this is good as a simple and clear example. >> > > I think we already agreed on this during the last call. > >> >> ----- >> >> In the definition of LexicalLinkset, the class dataset needs the prefix >> void. >> >> ----- >> >> OK, this has been fixed as far as I see. >> >> > Good > > Best regards > > Manuel Fiorelli > -- Manuel Fiorelli
Received on Wednesday, 15 July 2015 20:17:44 UTC