- From: Uschold, Michael F <michael.f.uschold@boeing.com>
- Date: Mon, 7 Mar 2005 12:07:39 -0800
- To: <rector@cs.man.ac.uk>
- Cc: <noy@SMI.Stanford.EDU>, "Christopher Welty" <welty@us.ibm.com>, <public-swbp-wg@w3.org>
- Message-ID: <823043AB1B52784D97754D186877B6CF05F5D16F@xch-nw-12.nw.nos.boeing.com>
Alan, If you wish, I will happily make any editorial changes that you agree with. This will save an hour or two of your effort. Let me know which ones to include/exclude. Mike -----Original Message----- From: Christopher Welty [mailto:welty@us.ibm.com] Sent: Sunday, March 06, 2005 4:14 PM To: Uschold, Michael F Cc: public-swbp-wg@w3.org; public-swbp-wg-request@w3.org; rector@cs.man.ac.uk Subject: Re: [OEP] Comments on Specified Values Note Alan, Just to be sure you understand - Mike was actioned by the WG to review the document and confirm that only editorial changes had been made since the last working draft. While Mike's suggestions look like improvements for the most part, you are under no obligation at this stage to make any more changes to the document, it was accepted by the WG as a note pending only the review from Mike OK'ing changes since the last WD. Mike, Thanks for the careful review, however you have not yet completed your action, you must send a note to the WG indicating that only editorial changes have been made since the last WD - the last WD is at [http://www.w3.org/TR/swbp-specified-values/] (3 Aug). -Chris Dr. Christopher A. Welty, Knowledge Structures Group IBM Watson Research Center, 19 Skyline Dr., Hawthorne, NY 10532 USA Voice: +1 914.784.7055, IBM T/L: 863.7055, Fax: +1 914.784.7455 Email: welty@watson.ibm.com, Web: http://www.research.ibm.com/people/w/welty/ "Uschold, Michael F" <michael.f.uschold@boeing.com> Sent by: public-swbp-wg-request@w3.org 03/05/2005 12:37 PM To <rector@cs.man.ac.uk> cc <public-swbp-wg@w3.org> Subject Re: [OEP] Comments on Specified Values Note GENERAL comments Are you going to redo [some of] the figures using a nicer tool? Some of the text is literally undreadable on my 1600x1200 laptop screen. Contentwise, they are fine. Perhaps Natasha and her MAC-only diagramming tool has set too high a standard - I'm now spoiled :-) . Below are a bunch of highly specific recommended changes, most are very minor to do with fine tuning. In a few places, I have added a bit more detail. You may disagree with some, due to different writing styles. Take what you like, and leave the rest. It would have been easier and faster to edit the file directly, but I did not want to edit a non-latest version, in case you are also working on it still. Also, it makes it easy for you see exactly what my suggestions are, and ignore the ones you don't agree with. NB: I did not check any of the referenced full code examples. Has someone else done that? Do we want to have this and Natasha's note use the same naming conventions? You use underscore to separate names. In one sense, it is a good idea to use different conventions, showing that this is real life. It could also be confusing for folk reading more than one of our notes. This is a general question for the OEP TF. I lean towards a consistent naming convention, it would even make sense to say exactly what it is. Also, we should point out that we are nto recommending this as the right one, just one among variuos that work just fine. SPECIFIC comments Abstract: Replace "officers and restricted" with "officers and be restricted" --- Replace "specifies the constraints on the values that the property can take on" with "specifies what values the property can take on" --- Replace "methods to represent" with "methods for representing" OR: replace "methods" with "ways". IMHO: 'way' is preferred, since you use that phrase later: "There are at least three different ways to represent such specified collections of values" --- GENERAL ISSUE Replace "health person." with "healthy person." --- You say: "It is a common requirement in developing ontologies to be able to represent notions such as a "small man", a "high ranking officer" or a "health person." There are many such "features"..." "health person" should be "healthy person". More substantively, This wording suggests that 'small man' is a feature. Is this what you mean? Later you say: "it is necessary to specify the constraints on the values for the "feature" - e.g. that size may be "small", " This suggests the feature is 'size'. Its the difference between the property itself (size) and a property=value assertion (size=small). I don't know what you mean. I'm not sure whetehr you want to gloss over this distinction here (with carful wording), or whether it is worth making. As it stands, it is a bit confusing. --- Replace: * As individuals whose enumeration makes up the parent class representing the feature; (See pattern 1). with: * As an enumerated set [or list?] of individuals (e.g. small, medium, large) which makes up the parent class representing the feature; (See pattern 1). Also, add an example to the next bullet like I did for this one. ALSO: in the phrase "representing the feature"I can't figure out which meaning of feature you are using (size? size=large?). It seems neither. Rather it seems to be referring to the value set of the the property size. Frankly, I don't know what the phrase: "the parent class representing the feature"means. Parent of what? Do you mean "the set of all possible values that the feature(e.g. size) can have"? This is the feature space, which you define later. What does parent have to do with this? --- Because of this terminology confusion, I think you need to either: 1. move the vocabulary section much earlier in the document. 2. include a forward reference to the vocabulary section very early in the document. I prefer 1. --- USE CASE EXAMPLES: Replace "e.g. it should be inconsistent (unsatisfiable) to be both slender and obese or in good health and poor health" with "e.g. an inference engine will flag an error if an individual is declared to be both slender and obese or in good health and in poor health" I replaced the theorem proving lingo with everyday language to convey the point. --- Replace: "The others follow analogously." with "The solutions for the other examples are exactly analogous." --- NOTE CONVENTIONS Much of this section could also go in an appendix. I don't have a preference, works ok either way. Your call. Ideally this will be pulled out and used by more than one note. I suggest adding a formal TODO on this point. See also my note to the list on this. --- I would put the vocabulary first, in the conventions section, it is more important. In fact, it might be better to put the vocabulary in the very beginning, because you use the terms right away. Also, put partition last, as it is less important (as a bonus, it would also be in alpha order...) --- Diagramming: I still don't know what an open or closed arrow means. It is not obvious to me by looking at the different arrows. Perhaps you mean the shape of the arrowhead? The terms open and closed dont convey that to me. Is this a standard terminology for arrows that I am not familiar with? --- 'arrowsor' and 'arrowsindicate'need spaces to separate words. --- In: "* Downwards facing braces are used to indicate pairwise disjointness between subclasses or owl:allDifferent for individuals. (All sibling classes are disjoint and all individuals of each type are different in these examples.)" What is the owl symbol if the pairwise disjointness is between classes? Does it ever appear in a diagram? You might address this something like as follows: Downwards facing braces are used to indicate pairwise disjointness between subclasses or between individuals. These are represented in OWL as owl:?? and allDifferent respectively. (All sibling classes are disjoint and all individuals of each type are different in these examples.)" This might be an incorrect wording for what you are trying to say. --- "SWBP" --> "SWBPD" (tut tut) --- Pattern 1: To focus on the postive first, replace: "The second is more complex but is more flexible. Some classifiers also work more reliably with Pattern 2 than Pattern 1." with "The second pattern is more flexible, and some classifiers work more reliably with it than with Pattern 1. However, Pattern 2 is more complex." --- PATTERN 1: "In this approach, the class Health_Value is". Why a definite article? This is first mention of this class. Below I suggest a rewording for this pattern which says what to do in more detail. I feel that the readers of this note will find it easier to create their own examples if we give more details on exactly what needs to be done, rather than mainlu just describing what is the case, once you have the pattern. Both are needed. You may think this too wordy... I made similar changes to approach 4 in the classes as values note. I notice that you have very nice wording in the comments to this effect when presenting the code, that is excellent. The sentence: "Values are sets of individuals." is slightly ambiguous. 'values' here refers to the feature space, all possible values. But each individual is also a value. I suggest using the term feature space here. Overall, I suggest the following rewording. Replace: "In this approach, the class Health_Value is considered as the enumeration of the individuals good_health, medium_health, and poor_health. Values are sets of individuals. To say that "John is is in good health", is to say that "John has the value good_health for health_status" This assumes that a value is just a unique symbol, and a value set is just a a set of such symbols." with "In this approach the feature space is represented as an enumerated set of individual values. The feature of interest is represented as an OWL property (e.g. health_status). We wish to restrict the range of this property to be one of three values corresponding to good, medium and poor health. To achieve this, we create a class that represents the set of all possible values for the given feature, and we enumerate each value. This class (e.g. Health_Value)is defined to consist of exactly three enumerated individuals: good_health, medium_health, and poor_health. The name includes the string "_value" to indicate that this class was specifically created for representing values. Each enumerated individual is an instance of the value class. To say that "John is is in good health", is to say that "John has the value good_health for health_status". For this pattern, a value is just a unique symbol, and a value class is just a set of such symbols." --- Are the values 'sets' or 'lists' of individuals? Sometimes order DOES matter, so from a common sense perspective, they are not sets (although, if the order is not formalized, then from a model-theoretic viewpoint, I guess they are sets). Anyway, be consistent. --- replace: {{The value set and make it equal to the enumeration of the three individual values}} with {{Create the value class and make it equal to the enumeration of the three individual values}} --- The comment: {{Define each of the individual values as an individual of type Health_value}} should be boldface, like the others. Check for other examples of font inconsistency. --- Replace: "Many people find this the more intuitive approach." with "Many people find this approach very intuitive." because you have not introduced the other pattern yet. --- Replace "There is no possibility of further subpartitioning of values." with "There is no possibility of further subpartitioning of values (e.g. good_health into very good and pristine health)." Replace: "It is not possible, as it is for classes, to say that one individual is equivalent to the the union (disjunction) of two other individuals." with "It is not possible to say that one individual is equivalent to the the union (disjunction) of two other individuals. An analogous statement can be made with classes." This may just be style preference; you may not like it. It is longer, but [possibly] slightly more accurate. The problem is, strictly speaking, you cannot, for classes, "say that one individual is equal to the union of two other individuals" You CAN say something very analogous. --- Replace "Because individuals cannot overlap, if Health_Value is defined as equivalent to enumeration of one list " with "Because individuals cannot overlap, if Health_Value is defined as being equivalent to the enumeration of one list " --- Replace "To do so would cause the reasoner to indicate a contradiction. (i.e that Health_Value was "unsatisfiable".)" with "To do so would cause the reasoner to find a contradiction and flag an error." Unless you really want this theorem-prover speak (is it germane to the note?). Whatever you decide, use the same convention throughout, this occurs in more than one place. --- The last disadvantage starts off sounding like an advantage. Replace: "The representation is in OWL-DL, and DL reasoners should eventually be expected to make correct inferences with individuals used in this way. However, neither FaCT nor Racer (the two most widespread open source reasoners in use today) perform all the expected inferences reliably. " with "Although the representation is in OWL-DL, currently, neither FaCT nor Racer (the two most widespread open source reasoners in use today) reliably perform all the expected inferences. In the future, reasoners will likely be able to make correct inferences with individuals used in this way. " Or shorten last sentence even more: "This limitation will likely go away with improved reasoners. --- PATTERN 2: In the title of this pattern, and in the first sentence, you say 'feature' when I think you mean 'feature space'. --- I don't think the word 'continuous' is accurate because it might not be continuous, it could still be discrete. Or if strictly speaking it is accuarate, it might be misleading. --- You say: "the Good_health_values partition". The vocabulary definition does not define partition as a noun, just as a verb. You need to amend the definition if you want to use 'partition' as one of the mutually disjoint collectively covering subclasses. --- The sentence: "Theoretically, there is an individual health value, Johns_health, but all we know about it is that it lies someplace in the Good_health_value partition. " will likely confuse people. I had to puzzle for a while about it, and after reading the code, realized that these two variants are just like the two variants of approach 4 in the classes as values note [exactly analogous, I think]. I think it may be clearer to present pattern 2 here as I [re-]presented approach 4 in the other note. That is: Just describe the first variant as if it is THE approach. And then say, by the way, there is a minor variant... This would entail 1) removing the sentence: "There are two variants presented: one in which the individual Johns_health is explicitly represented, the other in which it is implied by an existential restriction." or moving it to later, as a summary remark. 2) removing the header: "Representation variant 1: Using a fact about the individual" 3) adding an introductory remark for the 2nd variant. 4) do something about the Venn diagram. It will need to be moved to after the variant, or simplified to leave out the restriction. The restriction is pretty confusing, in that diagram, so I suggest the latter option. Have every arrow represent an simple instance of the binary relation: has_health_status. Maybe add more named individuals? btw: The 'J' in "Johns_health" is a different font. --- The Healthy_Person class seems an optional, rather than essential part of this pattern. Perhaps this could be indicated? --- Here is a suggested rewrite taking into account the above comments; it is modeled after the rewrite for pattern 1. I also emphasize what is similar and different about this pattern and the first. Replace: "In this approach we consider the feature as a class representing a continuous space that is partitioned by the values in the collection of values. To say that "John is in good health" is to say that his health is inside the Good_health_values partition of the Health_value feature. Theoretically, there is an individual health value, Johns_health, but all we know about it is that it lies someplace in the Good_health_value partition. The cass Healthy_Person is the class of all those persons who have a health in the Good_health_value partition." with === "In this approach, the feature space is represented as a value class partitioned by a group of subclasses (e.g. Good_health_value Poor_health_value). As for pattern 1, we represent the feature of interest (has_health_status) as an OWL property, and we create a value class (Health_Value) that is used to restrict the range of the property. The key difference is in how we represent the values and the value class. The allowed values (good, medium and poor health) are represented as subclasses, not individuals. The value class is defined as the union of these subclasses. Using this pattern, to say that "John is in good health" is to say that the value representing his health_status (e.g. Johns_health) is contained in the Good_health_value partition of the Health_value feature space. Formally, Johns_health is an instance of the value class: Good_health_value. We can also create a class called Healthy_Person, defined to be the class of all persons whose health_status has a value in the Good_health_value partition. In this case, the classifier can infer that John is in that class." === NB: Although I spent a lot of time on this, I am still not entirely happy with this wording, it loses some of the simplicity of your text, in the attempt to be more precise. Perhaps there is a best of both world middle ground that you can find? BTW: you are using plural for the names of some of the classes. I fixed them, they are correct in the figures (singular). --- Not all of the comments in the Code for this example are the same font. --- You use the term 'parent value class'. It is a parent class, from a graph-theoretic point of view, but from the user's point of view, it is the class representing the main/whole feature space. The 'child' value classes represent portions of the whole feature space. I suggest not using the word parent. [minor style point] --- Replace: "you make the individual a type of the restriction." with "you make the individual an instance of the restriction." --- Fig 3: you refer to "lists of values" change to sets? --- The wording: {{The disjoint axioms make the subclasses partitioning}} seems a bit akward. Maybe you were doing a dance with terminology... Perhaps try: to {{The disjoint axioms make the subclasses partition the value class: Health_Value}} Also, move this comment to immediately after the followign comment,because it applies to all three subclass definitions: "{{Define each of the subclasses that make up the partitioon and make them pairwise disjoint}}" Better yet, skip that comment, it is probably unnecessary, given the one above. And fix: 'partitioon' --- There is a missing full stop after the definition of "medium_health_value". --- In the comment for defining healthy person, note that it is not mandatory for the pattern (or if it IS, can you tell me why?). --- Suggested indenting tweak to make things more clear what the two classes are that are being intersected. It took me a while to puzzle it out. This might be too much work in general, if automatically generated... :Healthy_person a owl:Class ; owl:equivalentClass [ a owl:Class ; owl:intersectionOf (:Person [ a owl:Restriction ; owl:onProperty :has_health_status ; owl:someValuesFrom :Good_health_value ]) ] . --- Figure 4: minor tweak. Highlight exactly what is different with this figure, compared to figure 2. Make the new part blue, or in bold? Maybe add text somewhere or caption the figure to say exactly what changed with this variant: 1. instance now implied, not explicit (noted by dotted lines) 2. new arrow from john to good_health_value [maybe overkill, it is pretty clear just by looking at it...] --- reasoner and reasonable in same sentence (stylistic quibble). maybe use inference engine or classifier isntead of reasoner? --- Give a simple example of alternate partitionings for the same feature space. A real life example that turned up in Boeing was dividing up the fuselage into [front, middle, rear] and [front, back]. Each was a partition. The problem arose because we needed to do a semantic mapping, which of course is inherently ambiguous. --- The following seems contradictory: "If variant 2 is to be used as part of a database schema or similar, then a convention for creating anonymous instances in the database is required. (Logicians call such anonymous instances "skolem constants".) In practice, this can usually be ignored." If you can ignore it, then a convention is not required. This point is fairly lost on me, I can't follow what you are getting at. --- WELL, I don't know what happened, someone must have slipped a much finer toothed comb in my pocket - I didn't think there would this much more additional feedback... Despite all the verbiage, I really like the note, all this is just fine tuning. Mike
Received on Monday, 7 March 2005 20:08:14 UTC