- From: pat hayes <phayes@ai.uwf.edu>
- Date: Fri, 18 May 2001 23:29:48 -0500
- To: "Ziv Hellman" <ziv@unicorn.com>
- Cc: www-rdf-logic@w3.org
- Message-Id: <v04210139b72b968b566f@[205.160.76.183]>
> > >I have no objection to binary predicates; I could
> > >even live with all predicates being binary if it would allow me to
> > >speak for lots of ontologists. :)
> >
> > The restriction to binary (plus unary, ie at-most-binary) predicates
> > is mildly inconvenient but quite live-with-able, I agree. That's two
> > ontologists on the list.
> >
> > Pat
>
>At the risk of being on the receiving end of a hailstorm of flames
>from the regulars on this list, I will toss a spanner into the works
>here and question the use of triples.
>
>As correctly pointed out above, using triples is essentially
>reducing everything to binary predicates. Now it is certainly
>provably true that every multi-ary relation can indeed be reduced to
>a collection of binary predicates, and this has been known for a
>very long time. The RDF spec even notes this and provides examples
>for doing so. The question is whether too high a price is paid in
>certain cases.
>
>On the one hand, essentially reducing the world to binary predicates
>is what the OO and XML communities have done for a long time, with
>the attributes assigned to objects really being binary predicates.
>This viewpoint can be understood as stemming from looking at most
>relations as functional, in the sense that, as the canonical RDF
>example puts it, if one asks "who is the creator of this resource?"
>and the answer is "Ora Lassila", then one is working with a binary
>predicate associating a specific resource with a specific person. So
>far so good.
>
>On the other hand, standard mathematics and logic, KIF, the
>relational data-base world, and even full-power UML, all permit the
>use of multi-ary relations and do not limit themselves to binary
>predicates. Why?
>
>I think the reason has to do with the fact that although it appears
>at first that one is gaining simplicity by using only binary
>predicates, or encoded triples, in practice when one is forced to
>exchange a straightforward n-ary predicate with a clumsy collection
>of binaries, the simplicity one has seemingly gained is more than
>lost in the translation. If we really are going to create a
>world-wide web of semantic meanings for a plethora of daily needs,
>this issue may need to be addressed again down the road.
>
>Take as simple an example as requesting a bank balance. This
>requires a relation that is at least 3-ary: at minimum one needs the
>account number and the date&time. The balance cannot be assigned as
>a simple attribute of the account, because its value changes with
>time, and it certainly is not an "attribute" of the date&time alone.
>For another, more complicated example that is a canonical one I use,
>consider a travel agent asked by a customer the flight seating
>he/she has been assigned. The travel agent will respond that in
>order to answer the question, one needs to know at minimum the
>quadruple of {name of the customer, the date of the flight, the
>airline carrier, the flight number} -- because the seating of a
>particular person on a particular flight is not an attribute of any
>one element in that list, but an attribute of the full quadruple.
>
>Again, I know that these examples can be reduced to encoded triples
>-- but is the resulting clumsiness worth it compared to the
>straightforward multi-ary statement? And perhaps more to the point,
>consider that in order to really take off, the SW will eventually
>have to come into contact with the data the world has stored in
>relational data-bases, which routinely make use of reams of tables
>representing very large multi-ary relations. If the industrial world
>is told that uploading/downloading this data through the SW will
>require painfully chopping up the tables into an explosion of
>triples, waiting for the transmission traffic to complete and then
>reconstituting from them the tables at the other end, one may fear
>that it will recoil in horror.
(Late reply, sorry)
I entirely agree with you, but I do not expect this view to win the
day, so I have decided to give this issue up without a fight.
This is a quarrel that has been repeated over and over again in many
different areas, including Krep, linguistics, and philosophical
logic, as well as several computer-science-related areas. Recently
the 'standard upper ontology' discussion lists have been rehashing
it. I can see good arguments both ways, unfortunately.
The good argument for the binary case has been given by Seth and
Sandro. It was used in linguistics by Davidson long ago, who said
that the best way to think of the meaning of a simple (one main verb)
English sentence was in terms of a single 'event' indicated by the
verb, and then a lot of relationships of various other things to that
event. The example he used was "He did it in the kitchen, quickly,
silently, at midnight, with a knife..." , where 'it' turns out to be
making a ham sandwich. The point being that it is impossible to say
how many extra qualifications you might get, and if you have to model
them as arguments or parameters, then the arity (number of arguments)
has to keep changing. Moreover, many of the binary relationships seem
to be already encoded in ordinary grammar, often called 'cases', so
there is an agent (the subject) and an object (the object of the
sentence) and maybe an instrument, and a time (tense) and a manner
(adverbs) and so on. This kind of analysis has ben very influential
in ontology design because it is so handy in this way, particularly
during the process of ontology design when things are changing. And,
indeed, any n-ary relation can be encoded in this way, with a little
artifice, since one can think of the relational n-tuple as the
'thing'. (Notice though that this reifies relational instances, not
sentences, so the RDF account of reification seems to fail here.)
That is why object-oriented or graph-based representations (like
semantic networks) aren't *obviously* useless.
On the other hand, it can be argued that for many purposes, this
flexibility is beside the point. Not all relationships are naturally
expressible as simple English sentences. Some relations are known to
be of a certain fixed arity, and there is no particular reason why
one should not be able to take advantage of that knowledge when it is
available, since it is about as easy to represent an n-ary relation
as it is to represent a binary one in a sentential or tabular form.
And since these formats do not rule out the binary case when that is
useful, why not allow them to be used when appropriate, with
concomitant savings in clarity and efficiency? (The binary expansion
in general makes a single atomic sentence into a sequence of n-2
atoms. Since search inefficiency tends to be exponential in inference
depth, this can be a major computational cost.) Allowing arbitrary
numbers of arguments simply allows everything to co-exist; it does
not prevent a Davidsonian analysis or a binary reduction if your
tastes run that way.
The problem with this arises when people become committed to a format
which is only capable of representing the binary case: graphs or OO
notations. Such enthusiasts often become so enamoured of their
particular formats, and so enthusiastic about their perceived
advantages, that they are hard to persuade; and since, as they will
never tire of pointing out, the general n-ary notations *can* be
translated into theirs, what rationale do we have for resisting their
case? It is often useless, I have found, to try to tell them that any
normal form is as good as any other, or to try to persuade them that
a more eclectic approach has its advantages. The fact that a simple
format is universal is indeed rather cute, and it can be bewitching
when you first encounter it.
Jim Hendler declared at the beginning of the DAML work that 'purely
aesthetic' arguments would not be permitted to influence the design
of the language, which applied in this context is a pre-emptive
strike against any arguments based on the observations you produce.
The fact that the entire world of mathematics, logic, and database
engineering has chosen to use relations freely, is in the end only an
aesthetic argument. It is *possible* to get used to the ugliness,
inefficiency and style-cramping awkwardness that a purely binary
language imposes, rather in the way that it is possible to get used
to midwestern cooking. Transmission speeds are so fast, and memory so
cheap, that any linear losses in information density do not have any
really nasty economic consequences; so I have decided to let the
clowns win this particular battle. If people wish to automatically
translate an efficient notation into an inefficient one, just let
them do it. Microsoft will do it anyway, whatever we decide.
I personally will continue to use relational languages in my own
ontology work (in fact, KIF allows for variably polyadic relations,
which can take any number of arguments, a distinct expressive
advantage which makes many axiomatizations wonderfully compact: kudos
to Mike Genesereth for thinking of it) but I doubt if the Semantic
Web will.
Best wishes
Pat Hayes
---------------------------------------------------------------------
IHMC (850)434 8903 home
40 South Alcaniz St. (850)202 4416 office
Pensacola, FL 32501 (850)202 4440 fax
phayes@ai.uwf.edu
http://www.coginst.uwf.edu/~phayes
Received on Saturday, 19 May 2001 00:29:48 UTC