- From: Pat Hayes <phayes@ihmc.us>
- Date: Wed, 13 Jun 2007 13:56:30 -0500
- To: Tim Berners-Lee <timbl@w3.org>
- Cc: "M.David Peterson" <m.david@xmlhacker.com>, "r.j.koppes" <rikkert@rikkertkoppes.com>, "Yuzhong Qu" <yzqu@seu.edu.cn>, "Sandro Hawke" <sandro@w3.org>, <semantic-web@w3.org>, <swick@w3.org>, "John Black" <JohnBlack@kashori.com>
>On 2007-06 -11, at 17:09, Pat Hayes wrote: > >>Tim, as this discussion gets to the heart of >>what Ive been trying to argue for several >>years, please take the comments below as >>intended in a spirit of analysis rather than >>just pins and angels. > >Ok. [takes a deep breath]. I'll go down this path with you. Thanks. > >> >>>On 2007-06 -11, at 13:53, John Black wrote: >>> >>>> >>>>Tim Berners-Lee wrote >>>>> >>>>> >>>>>On 2007-06 -09, at 21:22, M. David Peterson wrote: >>>>> >>>>>>On Sat, 09 Jun 2007 07:13:52 -0600, Tim Berners-Lee <timbl@w3.org> >>>>>>wrote: >>>>>> >>>>>>>No. It cannot identify both a document and a person. >>>>>> >>>>>>Tim: Will all due respect... WTF? >>>>> >>>>> >>>>>I am using the 'identify' in the strict sense of 'denote'. >>>>>The semantic web is like a logic language in which URIs are symbols. >>>> >>>>Do you believe that by claiming to use the >>>>strict, logical sense of the word 'denote' >>>>you thereby cause or require such denotations >>>>to be absolute and unambiguous? Where do >>>>think denotations (or identifications) come >>>>from? >>> >>>The architecture is that each URI is owned. >> >>OK >> >>> With HTTP URIs, this happens through the >>>domain name system and often delegation >>>within a domain. Unlike a word, a URI has an >>>owner. The owner attempts to make enough >>>information available that the URI can be used >>>by others without ambiguity in practical >>>situation. >> >>Well now, several points. First, do owners in >>fact do this? I havnt seen a great deal of such >>information-providing going on, myself. But OK, >>fair enough: perhaps you mean, they SHOULD make >>such information available. But second, more >>seriously, HOW would they do this? Take your >>example: >> >>>For example, W3C owns >>>http://www.w3.org/People/Berners-Lee/card#i >>>and has delegated to me the right to say what >>>that URI stands for. >> >>OK. So, what DOES that URI stand for? How will >>you tell someone what the referent is that you >>intend it to denote, so that they know what to >>use it for? Now of course, you and I being >>smart human native English speakers who are >>reasonably tech-savvy can look at this and >>figure out that it is probably meant to refer >>to you. But really, that does depend on us >>being this smart and savvy. > >Well, I put it in the Tabulator and I get out (among other stuff): > > Tim Berners-Lee > Assistant Amy van der Hiel > HomePage http://www.w3.org/People/Berners-Lee/ > Work > Address > City Cambridge > Country USA > PostalCode 02139 > Street 32 Vassar Street > Street2 MIT CSAIL Room 32-G524 > Phone tel:+1-617-253-5702 > Latitude 42.361860 > Longitude -71.091840 > Organization expandfetchWorld Wide Web Consortium > >which to many people gives a fairly lear indication of what is identified. No argument. Of course, as I said, people are smart enough to figure it out. But (1) machines aren't; and (2) for some purposes, this is still ambiguous. If someone or some thing makes a distinction between person-as-citizen and person-as-biological-system (say), then they might consider the above to still be ambiguous. No doubt, being competent human reasoners, they would not be unduly bothered by this, but the ambiguity of reference would still be there. > And in fact for non-english speakers, there are >words to the effect that "Sir Tim Berners-Lee is >geboren in Londen in 1955. Hij wordt gezien als >de bedenker en grondlegger van het World Wide >Web (WWW), dat was geïnspireerd door HyperCard. >Hij werkte hierin samen met de Belg Robert >Cailliau. Als directeur van het World Wide Web >Consortium houdt hij toezicht op de ontwikkeling >van webtalen en -protocollen als HTML, XML, CSS >en HTTP. Berners-Lee schreef ook de eerste >webbrowser, net als het web World Wide Web >genaamd, en de eerste webserver." and >Sir Timothy "Tim" John Berners-Lee, KBE (TimBL o >TBL). Nacido el 8 de junio de 1955 en Londres >Inglaterra, se licenció en Física en 1976 en el >Queen's College de la Universidad de Oxford. Sus >padres eran matemáticos. Trabajando como >investigador en el Laboratorio Europeo de Física >de Partículas (CERN) de Ginebra, concibió la >idea de un proyecto de hipertexto global, que >años más tarde se convertiría en la world wide >web."" > > >>It isn't absolutely obvious: and there are >>many, many web pages out there which I really >>have no idea what their owners would say they >>denote. > >(Web pages? Web pages are documents. I wouldn't >say they denote things. Symbols, like URIs, >denote. Sorry, I was careless. I should have said, there are many webpages out there which, if someone told me that they were intended to establish a referent for a URI, I would have no (or only the vaguest) idea what the referent was. >The URI ><http://www.w3.org/People/Berners-Lee/card#i> >denotes me, and the URI ><http://www.w3.org/People/Berners-Lee/card> >denotes the web page: an RDF one, but a web >page. That web page describes mainly me but >also to an extent other people and things. Yes, quite. In fact it can hardly do otherwise, if you think about how RDF works. >) > > >>>To use it for something else is an error. >> >>But surely if you make this ruling then it is >>up to you to tell me enough about what it is >>supposed to be used for, so that I can know >>when Im making an error. HOW?? > >Well, do you want the human way of the machine way? >- The human way is that you are tempted to use >it to represent my car but you see the stuff >above, and you realize that for example, people >would be concerned about the weight property and >so on. >- The machine way could be for example that you >operate with an OWL system which include the >belief that foaf:Person and dot:Car are distinct >classes, which will flag an error. That allows some inferences to be done which can make some distinctions, true. But it still doesn't get it actually attached to YOU. >> And what about a URI that I own and wish it >>to denote, say, the planet Venus, or my pet >>cat? What do I do, to attach the URI to my >>intended referent for it? >> >>>> >>>>In my opinion to denote (or to identify) is a >>>>verb, something that is done by the users of >>>>a symbol. After all, symbols (URI) are not >>>>agents, they don't wake up and choose to >>>>denote this or that. >>> >>>They have wonders which create them for a specific purpose. >> >>wonders/owners I presume. > >yes > >> >>But the key question to me is, how can they >>communicate the purpose to the reader? How does >>one publish an intention? How does one indicate >>a referent, other than by describing it? How >>much description is enough? > >Experience shows. This is engineering. >That question is I suppose one agonized over by working groups all the time. >Fortunately, there are social systems not only >for announcing that a UTI has been minted and >describing what it denotes, but also for getting >feedback from people who don't understand it, or >whose machines are not able to process it. This >feedback can lead to an adjustment of the >information out there, publication of tutorials, >and so on. I'm not getting my point across. I shouldn't have said 'how much is enough', which I think you took as a genuine question but I meant rhetorically (because its never enough.) >>>>Nor do I think denotation is an attribute or >>>>property of a symbol, somehow built in or >>>>attached when the symbol is first conceived. >>>>It is more like a dance. I use a symbol to >>>>denote something expecting you to interpret >>>>it to denote the same thing. And this >>>>coordination, this synchrony of >>>>interpretation by both sender and receiver, >>>>is not always easy. It requires real effort >>>>to sustain it. The minter of a URI cannot >>>>make it happen by declaration, nor can a >>>>research group or a standards body just >>>>decree it so. >>> >>>In many cases, the URI is defined by >>>connection to already well-defined sets of >>>things. In other cases, such as the terms in >>>the OWL ontology, there was a huge amount of >>>effort and discussion involved, and the >>>current term is supported by a lot of ongoing >>>tutorials and so on. No one said it was easy. >>>But it is a different architecture from dance >>>associated with natural language words. >> >>The huge effort was for defining the meanings >>of the OWL reserved vocabulary, and that is >>indeed about as defined as a vocabulary can >>get. But if we just take some OWL ontology and >>ask how well it constrains the meanings of the >>new URIs that occur in it, this is *very* much >>like the word dance that John refers to. In >>fact in all essentials it seems to me to be >>pretty much the same game. I publish (say) some >>stuff which I intend to mean something, you >>read (hear) it and get some insight into my >>meaning. You draw some conclusions and if you >>send (tell) them to me I can check to see if I >>agree and if not try to send (say) some more to >>you to make your grasp of my intentions more >>firm. > >Yes, of course the game happens. But for >example when I download my OFX bank statements, >then my computer processes them. Millions of >people do this. Very very few of them worry >about who >did the dance, or do the dance themselves. >Actually, some people from banks, Microsoft and >Inituit etc did the dance. Sure, in some cases it has all been worked out in enough detail and incorporated into existing protocols and so forth. I don't intend to deny that this can be, has been, done successfully. But take that example. Can this kind of thing be done for *all* referring names? Is there a *general*, *architectural* technique for attaching names to referents? Or do we need to wait for the astronomers to organize our ways of referring to things in the dark sky, and biologists to give us a biologists-agreed way of referring to cells and microtubules, and etc. for all the rest? > This is the "total cost of ontologies" argument. I confess Im not familiar with that yet. I'll search for it. >> >>It HAS to be similar, barring machine >>telepathy. There really isn't anything that >>either people or machines can do much beyond >>send symbols to one another: and symbols carry >>meaning only imperfectly, with inherent >>ambiguity. > >Yes. In theory, there is always a very small >possibility that I completely misunderstand the >way you use a word, say "car", and a moderate >probability that in some small very rare corner >case we haven't come across yet, we in pedantic >mode would disagree about its meaning. You are mocking my point as mere pedantry, and I agree it is pedantry; but its not mere. It is pedantry which is now built into the Web, because it arises from the way that SWeb languages have specified how they convey meaning, and the obscure kinds of ambiguity I'm pointing out are ones that arise immediately in actual ontological engineering practice. At this very time I'm having a parallel email argument with Barry Smith on another forum about the need (or not) to distinguish continuants from occurrents. This is pedantic, if you like, but its also engineering. Does your Tabulator output above tell me whether TimBL is a continuant or an occurrent? OK, neither you nor the bank needs to know: but the BOF foundational ontology (now being used in a lot of medical applications) does. Maybe at some point your life will depend on getting this right. >The extent to which I can behave, and build >systems, as though in fact the terms had >well-defied meanings depends on the amount of >dancing which has been done. As the dance is >done, the structures in my brain and those in >yours, while never identical, become very similar An aside: that is often claimed or assumed as obvious, but there is lots of evidence against it. >, specifically in that they develop an very >strong association between a class of real-world >object (car) or concept (TransitiveProperty). >Philosophy studies the dance. (Philosophy has >ion the past been distracted by a concern that >you can't measure the brain's structure, and you >can't measure an external TranstiveProperty >object. The fact htat philsophers have found >this difficult doesn't stop the fact that it >happens, and now we have fMRI machines we are >even relieving some of this angst. But I digress) Yes, you do; but so did I :-) >The important thing is that as the dance is >done, the probability of major disagreement, and >the degree of pedantic disagreement, >decrease very dramatically, to become negligible for engineering purposes. This is where we disagree. A lot more dancing is needed for ontological engineering to work on the SWeb. Its not negligible, and it hasn't all been done yet. And as ontologies get used more and more, this extra dancing (to resolve previously invisible ambiguities) will need to be done by more and more systems and people in more and more contexts. Its already happening. >>> >>>It is different by design. The semantic web >>>is an engineered system, not an observation of >>>nature. >> >>But it is a real system, and so it has to obey >>laws of nature. And there are laws of meaning >>as well as of physics. > >You bring up physics. Indeed. As an engineer, I >design my coffee mug to stay in one place: with >no wheels. Does it? It is a real system, and >so it has to obey laws of nature. Physics tells >as that at any time, there is a finite >possibility that any object might just be >measured to be in a different place, such as a >foot to the right and no longer around my >coffee. In fact if I study the construction of >a coffee mug from particles, all kinds of >questions arise, as the first few particles are >very difficult to manage. I can't even build a >tool which will tell me where the stupid >electron is, without my being completely unaware >of its momentum. Given that I can't actually >claim to have put an electron and proton >together with any degree of accuracy, how can I >claim to be able to build a mug which will stay >in one place. Well, it turns out that as the >number of particles becomes larger, these >effects, while still true, just become ignorable >for engineering purposes. > >The analogy is limited, of course. Yes, it is. I think I'll ignore it rather than demolish it. >>>>The reason this matters is that since it >>>>requires this effort to create a >>>>denotation/identification in the first place, >>>>it is far more sensible, to me at least, to >>>>expect that the final disambiguation of a >>>>symbol be accomplished in the same way, by >>>>coordinated effort of the parties using the >>>>symbol, not by declaration of the W3C >>>>specifications that all URIs be absolutely >>>>unambiguous. >>>>This seems to me to be, as my grandfather used to say, a vain task. >>> >>>Your grandfather would perhaps have suggested >>>that an attempt to define the meaning of >>>common words, as the Académie Française is set >>>up to do were a 'vain task'. Many would >>>agree. But given that his water came to him >>>though pipes connected, possibly, by half-inch >>>British Standard pipe-thread connections, and >>>he rode on rails set a certain distance apart >>>by some committee, and his TV came for better >>>or worse in 525 or 625 lines as decided by >>>other committees, he may have respected that >>>the creation of standards is a very valuable >>>function, and an essential to progress. >>> >>>When people meet to define W3C specifications >>>they are not doing it out of vanity. They >>>are performing coordinated effort of the >>>parties who would like to be able to use the >>>symbol. They are, in general, users and >>>representatives of users of the symbol. They >>>come together to allow those who follow them >>>to use it. They often work long hours, >>>receiving inadequate recognition for either >>>products shipped or papers published, the >>>conventional metrics of performance, so I >>>would not call it vanity. >> >>The point surely is that URIs used to refer >>(not as in HTTP, but as in OWL) do *not* have a >>standardized meaning. Standards are certainly a >>chore to create, but they only go so far. OWL >>defines the meanings of the OWL namespace, but >>it does not define the meanings of the FOAF >>vocabulary, or the URIrefs used in, say, >>ontologies published by the NIH or by JPL. > >Ah, so you accept that the OWL working group has >effectively given meanings to the the terms in >the OWL namespace, but you don't think that the >FOAF friends gave meaning to the FOAF namespace, >or the NCBI to its ontologies? How curious. Not at all. OWL has a (actually several, regrettably) semantics defined with mathematical precision, and corresponding operational systems related carefully to that semantics. The FOAF namespace is intended to refer things that have no mathematical description, and their meaning is some cases is not even loosely defined (what is a 'friend', exactly?) Now, don't get me wrong, I'm not meaning to complain about this. It is fine for a vocabulary like OWL to have little more than the way its being used, as a specification of what its supposed to mean. In fact, I think this will happen more and more; in some cases, like dc:author, the "socially" defined meaning now takes precedence over the declared intentions of the authors of the spec. But one cannot then rationally claim that these URIs have no ambiguity in what they refer to. They are wonderfully, usefully, creatively ambiguous. That is one reason why they are so useful, because their meaning has NOT been tied down too tightly. > OWL does indeed only go so far. >FOAF and OFX and so on go bit further, but >also only so far. They all create new terms >which, modulo arguments afterward an so on, >become with time an increasingly stable >foundation for communication between parties. Of course. I have never denied this. What I have been saying is that it is a mistake to think (and still more to authoritatively assert) that this stability of foundation for communication depends on, or arises from, the terms being unambiguous referents or identifiers. On the contrary, it arises precisely from the fact that they are often treated exactly like newly coined English words, rife with ambiguity and which slowly crystallize to a clearer (but still not unique or even necessarily fixed) meaning from the ways they are used (recent examples of this in English include "hacker" which in my youth was a term of respect, now having criminal associations, and more recently "wanker" which in US English has been stripped of its obscene UK-English connotation, and can be used safely in a variety of social settings.) >>The only way those meanings can be specified is >>by writing ontologies: and finite ontologies do >>not - cannot possibly - nail down referents >>*uniquely*. No amount of authority-delegating >>or standard-setting is going to change this >>basic fact. > >Fortunately, the trains continue to run, and >bank statements continue to generate graphs and >tax forms, more and more people say they >foaf:knows each other, despite the fact that >none of them or the relationships have been >defined really completely precisely to the >complete mutual understanding of all parties. Of course. I don't deny this. But what this illustrates is precisely what Ive been saying, which is that terms do not need to be defined tightly, still less defined so tightly that they must be considered to be uniquely specific identifiers, in order to be used successfully in communication. >This seems to me to be a very important bridge >between philosophy of microscopic linguistic >interactions of agents, and the large scale >world of communities with common terms which are >'good enough for government work'. I suppose I >am surprised it isn't in the textbooks. Im sure it is, somewhere. To be fair to linguists, many of them use a robust notion of "word meaning" which corresponds to your notion, in which for example "bank" has three meanings. But if you ask a computational linguist, they will probably tell you that "bank" has more like ten or fifteen distinguishable senses. And if you talk to someone who actually writes ontologies - you know, an ontology *engineer* - then the number of distinct things that could all be rendered by an English word like "bank" or (famous Cyc example) "cover", then you get answers in the dozens. This is mere pedantry for some people, but tough engineering practice for others. If URIs have to identify uniquely and unambiguously, and if URIs are universal and eternal, which of them gets to specify how finely divided those URI meanings have to be sliced and diced? Pat -- --------------------------------------------------------------------- IHMC (850)434 8903 or (650)494 3973 home 40 South Alcaniz St. (850)202 4416 office Pensacola (850)202 4440 fax FL 32502 (850)291 0667 cell phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes
Received on Wednesday, 13 June 2007 18:56:49 UTC