- From: pat hayes <phayes@ihmc.us>
- Date: Tue, 15 Jul 2003 18:20:10 -0500
- To: www-tag@w3.org
- Cc: Pat Hayes <phayes@ai.uwf.edu>
- Message-Id: <p06001215bb39faa42145@[10.0.100.23]>
Gentlemen, I would like to ask you to please clarify the meaning of
the terms 'resource' and 'representation' in
http://www.w3.org/TR/2003/WD-webarch-20030627/.
Allow me to elaborate. Your introductory example asserts the following:
"Objects in the networked information system called resources are
identified by Uniform Resource Identifiers ( URIs ). "
and later the document says:
"URIs identify resources. When a representation of one resource
refers to another resource with a URI, a link is formed between the
two resources. The networked information system is built of linked
resources, and the large-scale effect is a shared information space.
The value of the Web grows exponentially as a function of the number
of linked resources (the "network effect"). "
These, and other pieces of text concerning 'resources' published by
other W3C authorities, seem to clearly indicate that the word
"resource" is intended to refer to the entities *in* the networked
information system: they are the kind of thing we use words like
'website', 'client' and 'server' to describe; they are things with a
computational state, things with which one can communicate, things
which send and receive information which can be transmitted along
optical fibers and twisted pairs, things than can be linked to one
another.
So far this is clear; and the account of 'representation' given in
the document is also then reasonably clear:
"Agents (such as servers, browsers and multimedia players)
communicate resource state through a non-exclusive set of data
formats, used separately or in combination (e.g., XHTML, CSS, PNG,
XLink, RDF/XML, SVG, SMIL animation). In the travel scenario, Dan's
user agent uses the URI to request a representation of the identified
resource. In this scenario, the representation consists of XHTML with
embedded weather maps in SVG. "
On this picture, the information (which Dan, in your introductory
example, reads on his screen, and which is in some sense all about
the weather in Oaxaca) is a representation of the (current state of)
some entity *in the WWW itself*: a resource in the global information
network: the state of some computer system, or maybe some abstraction
of a computer system.
However, it is also clear that neither the weather in Oaxala, nor
Oaxala itself, are entities of this kind: weather and cities in
Mexico are not the kind of entities which can be thought of as
'objects on the networked information system'. Other examples abound,
eg http://chandra.harvard.edu/photo/2003/ngc1068/index.html is in
clearly about a galaxy containing a supermassive black hole, which is
also not something one would expect to find as part of an networked
information system, given the likely physical constraints on network
architecture.
It seems that there is a systematic ambiguity between two senses of
'resource' (or maybe two senses of 'representation') here. In your
first example, I doubt very much that Dan, when looking at his screen
after telling his browser to retrieve
http://weather.example.com/oaxaca, thinks of what he is reading as in
any sense about the state of something on the WW information network.
Certainly if I were in his shoes, I would be reading it as being
about Oaxala and weather: that is why he is reading it, presumably:
to find out something about the weather in Oaxala. So what this
representation is *about* is not, apparently a resource: so it is not
a representation of a resource, in the usual sense of
'representation' and what is apparently your sense of 'resource'.
Similarly, http://chandra.harvard.edu/photo/2003/ngc1068/index.html
sure reads to me like it is about NGC 1068. But this means that
either it is a 'representation' which is not about what it is 'of',
or else that NGC 1086 is an 'object in the networked information
system'; neither of which seem to me to be remotely plausible as
factual claims using the ordinary senses of the words, and kind of
brain-damaged as attempts at a formal definition of some kind of
architectural/semantic theory.
Now, this could be just a matter of philosophical opinion, were it
not for the fact that semantic web languages like RDF and OWL have
been given *formal* semantic theories which have direct architectural
consequences for Web agents, and which depend crucially on notions
like the term 'about' I have used rather loosely above. RDF uses URI
references as *names* to *refer* to entities. So if a web page such
as http://chandra.harvard.edu/photo/2003/ngc1068/index.html were to
include RDF markup, one might expect to find things like this in it:
<rdf:Description
rdf:about="http://chandra.harvard.edu/NGC/ngc1068"
rdf:type="http://chandra.harvard.edu/AOtype/Activegalaxy7"
</rdf:Description>
where the URIs refer respectively to a galaxy and an RDFS class of
galaxy types. This is completely incompatible with what your document
says about resources and representations. Using the URI in this way
does not create any kind of link between anything on this planet and
NGC 1086 (which is, fortunately, about 50 million light-years away).
But RDF/RDFS/DAML/OWL/OIL and all the other emerging Semantic Web
formalisms *require* that URIs be used in this way, as *referring
expressions*, not as informational links in a global architecture.
The RDF/RDFS/OWL semantics assumes that URI references refer to
"resources" , but it explicitly denies that this word "resource" is
limited to the kinds of resource that you seem to be talking about.
On the SW view, *anything* is a resource: galaxies, regions of
France, kinds of wine, sodium atoms, classes, mathematical
abstractions, even fictional entities: anything that can be referred
to by a name. None of these can possibly be "objects in a networked
information system". So whatever you are talking about, and whatever
they are talking about, y'all cannot possibly be using the words
"resource" and "representation" in the same sense.
As a result, several of the assertions you make in this document are
not correct. For example
2.8.2
"merging Semantic Web technologies, including "DAML+OIL" [ DAMLOIL ]
and "Web Ontology Language (OWL)" [ OWL10 ], define RDF properties
such as equivalentTo and FunctionalProperty to state -- or at least
claim -- formally that two URIs identify the same resource. "
is incorrect. These assertions claim that two URI references *denote*
the same entity in all interpretations. That is not the same notion
as 'identify'.
In fact, there is no such notion as 'identify' in RDF/RDFS/OWL
semantics; and the first principle in section 2 ("All important
resources SHOULD be identified by a URI ") is meaningless when taken
literally in the context of semantic web languages, as URIs there
typically cannot be said to identify anything: they act as names
whose possible referents are constrained by the assertions made using
them, but they are not 'linked' to anything, not 'bound' to anything,
and are not obliged to 'identify' anything; and the universes of
discourse may contain entities which cannot possibly be all
identified or even referred to by URIs, since there are too many of
them, or it is physically impossible to identify them with enough
precision, or simply because it is impractical to do so.
------
Sorry this comes across so negatively, but there seems to be a
central misunderstanding right at the center of several architectural
accounts of the Web, and I think it is important to get it sorted out.
There are two distinct models of how names refer. In some ways, URIs
are like file names in a programming language: they provide a way to
access a piece of information, a global address of some entity which
delivers information on demand. In this sense they provide a link
between network components, can be considered to be unique, and it
is reasonable to claim that every one of the resources they link
should be identified by a URI. This is the old sense of URLs, which
of course has now been generalized: but URIs, particularly when
discussed in an architectural framework, seem to retain a kind of
shadow of this URL heritage. In other ways, emphasized more recently
and particularly by the semantic web languages, URIs are more like
referring names in an assertional language: they simply denote
things. In this sense they do not provide links (naming something
does not establish a link to it, eg one can name entities which no
longer exist or could possibly exist, such as the 19th century); they
are not unique; and it is ridiculous to claim that every entity
should have a name (does every grain of sand on Pensacola beach have
a URI? But it is easy to invent a URI for the rdfs:ClassOf all the
sand grains on Pensacola Beach, and to assert - possibly in a rule
language - that every such grain is made of quartz. This requires
that every grain be considered to be a 'resource'.). Many of these
architectural requirements make sense only if we interpret the
language of the documents as though URIs were slight generalizations
of URLs. For example - due to Roy Fielding - consider a webcam which
delivers a view of a room when suitably pinged; then one might say
that the room is the resource 'identified' by the webcam's URI; this
kind of generalization of 'resource' allows the edges of the Web to
extend a little further into the world surrounding what is usually
thought of as the global network. But names, and the idea of
reference, extend much further than this: they extend to the entire
universe of things that exist, will exist, have ever existed or could
possibly exist. Most of what is said about URIs *does not make
sense* when one tries to read the language of the documents as though
URIs are general referring names; and yet the semantic web standards
are being written based on this assumption.
We need to get clear on this issue, or else we will continue to be
mired in confusion.
Let me suggest that it would be worth distinguishing between what a
representation is *about*, and what resource *produced* it. The
document currently says that URIs are used to retrieve
representations 'of' a resource. It is easy to read this as saying
that the representation is 'about' the resource: that it 'refers to'
or 'describes' the resource; but this is evidently incompatible with
the notion of a resource as something that must be 'part of' an
informational network. This point has nothing particularly to do
with the semantic web, by the way: it is just as true of the current
web, as the Chandra example (and indeed your own Oaxala weather
example) shows. The *source* of a representation and what might be
called the *topic or content* of the representation need have very
little to do with one another (although trust may be based on a
judgement of the authority of the former to make claims about the
latter). Right now the two ideas seem to be confused; I think it
would be clearer if they could be explicitly separated. This
shouldn't make any deep difference to the purely architectural issues
you are describing, but it will greatly help to clarify the semantic
issues which depend in part on them, and which we still have not
managed to fully harmonize with them.
Pat Hayes
--
---------------------------------------------------------------------
IHMC (850)434 8903 or (650)494 3973 home
40 South Alcaniz St. (850)202 4416 office
Pensacola (850)202 4440 fax
FL 32501 (850)291 0667 cell
phayes@ihmc.us http://www.ihmc.us/users/phayes
Received on Tuesday, 15 July 2003 19:20:20 UTC