- From: pat hayes <phayes@ai.uwf.edu>
- Date: Tue, 29 Apr 2003 11:57:55 -0500
- To: "Roy T. Fielding" <fielding@apache.org>
- Cc: uri@w3.org
- Message-Id: <p05210610bad4300844a8@[10.0.100.12]>
(To cut to the chase, go to @@) >>Skolemized URIs work just fine. The existing Web protocols and >>conventions are perfectly adequate, in fact, to handle this. They >>are BETTER than the current RFC 2396 prose says they are. > >You have said that a few times, but the problem isn't what RFC 2396 says >but rather what you assume it means. It was, and is, the best I could do by reading the text as carefully as I know how. I didn't assume anything: I tried to follow what it was trying to say. >You asked for a definition of >identity, and I gave it, at which point you asked me to define all >of the words within that definition. That is nonsensical. Im afraid I do not regard it as nonsensical, if the vocabulary used by your definition is more obscure than the original, which it was. I share your frustration with what could be an endless lexicographic discussion, which is why my suggestion was to remove the potentially contentious wording, rather than to attempt to define or clarify it. >Furthermore, >your entire argument boils down to that we should consider changing >the definition, which of course is why the issue is on the issues list >and was listed as "pending". Everything that has been said in this >discussion is just a verbose variant of what was already said and >summarized within that issue. That is why I have been and will continue >to be brusque in my responses: I am sick of reading fifty paragraph >mails for a request that has already been conveyed in one paragraph. I must apologize for contributing to the some extended email interchanges which were aside from the main editorial point. I will focus on issues of wording. >To answer your real question: the definition in RFC 2396 means exactly >what you wish it to mean. Resources are things that have identity -- >not one unique identity, but an identity that is being referenced for >the sake of denotation. Oh dear, that makes things worse. Now I REALLY do not know what you are talking about. > That is a common English word and the way I >used it in the sentence does not allow for any other reasonable >interpretation I have no idea what your intended interpretation was (or is). >, including the one Miles described about the identity >axiom in mathematics. It really isn't enough to say that it is a common English word. In that role it has many meanings, and several of the more common, as in the useage 'identity theft', apply only to persons, so become meaningless when applied to a larger domain of just about anything. (What would it mean to apply identity theft to my toaster, or to a sodium atom?) And I do not know of ANY sense in which something can be said to have more than one identity, except perhaps metaphorically ("Alphonse, SuperSpy, Man of Many Identities") > After reading this discussion, it appears that >you grasp the concept without any problem Not at all. I honestly do not know, even after a great deal of reading, what you mean. The best I can do is to assume that you are trying to say several different things at the same time; but even when I get the picture reasonably clear, the ground seems liable to shift unexpectedly. The above, for example, suggests that you think of things as having many possible identities, and that one refers to one of these rather than to the thing itself. This comes out of left field for me. I find this an extraordinary concept, and I have no idea what kind of ontological framework would arise from taking it seriously. So I'm afraid that I still have no confidence that I am really grokking you. >, but you are being faced with >other arguments that fail to grasp it or are using the multiple potential >definitions of the term "identity" as some bizarre excuse to claim that >a URI defines the resource's identity. Okay, that was Miles point as >well, so we can replace use of that term with the definition. However, >I am not going to proceed recursively down the ontological tree until >everyone is satisfied -- I just want to get it to the point where the >intended definition is distinguished from those that are not intended, >even if I can't understand how people could confuse them now. > >I do not accept the argument that nothing should be said about resources >in the spec. The URI-WG tried that originally and failed miserably -- >dozens of identifier schemes and related projects were created based on >false assumptions about the limits of resource. Years of effort were >wasted in attempts to create "better identifiers" that didn't suffer >from the imagined limitations of existing identifiers. That is why >the definitions were added in 1997, and I'd rather hash this out in >one place than have to explain it again in every single working group. > >In fact, I don't think the problem you are describing has to do with >confusion over "identity" at all, but rather what it means to "identify" >a resource. That is a separate question, though some folk apparently understand the first to be closely related in meaning to the second. I have worries about both of these, in fact, and I am not sure whether or not the document intends to imply that they are related. >In that sense, it is unfortunate that they are called >identifiers, but I can improve that definition as well. In fact, I >already did that yesterday: > >http://www.apache.org/~fielding/uri/rev-2002/rfc2396bis.html#rfc.section.1.1 > >If anyone has comments on that text, please be specific. OK, will do. The first sentence of Resource says A resource is a source of supply or support within the context of a given system. This seems to go off in an entirely new direction. The only way I can understand this, it would rule out all kinds of things from being resources. I can refer to entities which have never existed, or which have existed but do not exist now, or which will exist in the future, or which are too small, too large, or too remote to serve as 'sources' of anything. So this seems to directly contradict the next sentence, if that is being understood in the way it has been up to now. Taken together, the first three sentences of your current draft seem to clearly indicate that the term 'resource' is reserved to mean only things which can be in some sense causally influence the system, things it can in some (perhaps broad) sense interact with, that the system can 'use' (fourth sentence). So anything which it cannot causally interact with cannot be a resource? That seems like a much more restrictive notion than the one we were discussing previously. Is that really your intent? If so, thanks for being clearer, but I would urge you to reconsider. The second paragraph seems to contain a number of presuppositions. Please allow me to list them in order to clarify whether or not they are intended: 1. Resources are represented at times 2. Resources endure through (all?) time, retaining their identity. 3. A single, enduring, resource may (must?) have different representations at different times 4. There are mappings, called 'conceptual mappings' from resources (at times?) to representations. 5. The system 'views' a resource as being a conceptual mapping from a (from that?) resource to representations. 6. resources 'supply' representations to the system. Now, I have a lot of trouble understanding exactly what this all means. First, where did this idea of representations come from? This section is supposed to be about resources. Does 'representation' here refer to *any* kind of representation, including for example descriptions, graphical renderings, etc., or is it restricted to a particular class of representations (which class?) Second, there seems to be an implicit assumption here of a kind of traffic between the resource and the 'system', involving representations. Resources are represented by representations, but they also emit representations and supply them to the system, providing a 'view' of resources (of themselves?). I can make out here, obviously, in rather blurry outline, a sketch of the Web; but the document would benefit by this picture, if it is indeed intended, being made more explicit, since it could be interpreted in other ways. But also, if I do interpret these paragraphs in this way, the notion of resource becomes, if anything, murkier: for the word seems to be being used to refer both to the Web sources of information - the functional entities which return information when pinged for example by a GET - *and* to the entities that the resulting representations are 'about', *and* to the mappings from the former things to representations (and we are not told about the relationship of these to resources). Does it in fact refer to all of these, or is the text meant to suggest that these are all the same category? But surely the whole point of the Web is that it is used to convey information about far more than its own servers. (I know that this is returning to ancient debates which run on for ever. My point is that the wording being used here does not help us avoid these debates: it only serves to forever cause them to be re-opened, since it can be understood ambiguously in apparently conflicting ways, and it can also be read as asserting that these categories are the same. ) Third, I (still) find the sentence "This conceptual mapping of resource to representations is what the system identifies as a resource." quite opaque in meaning. It seems to say that a resource *is* a mapping from itself (at a time?) to a representation of itself, which is simply incoherent. Read more carefully, perhaps it only says that this is the systems' view of the resource: but what does that mean? Fourth, what 'system' are we talking about here? (The Web? a Web browser? Something else?) If resources can be almost anything, what has 'the system' - in fact, ANY system - got to do with the identity of a resource? Please don't tell me that 'system' is just an English word. Fifth, why bring time into it? This seems to add needless complexity to an already complex issue, and also to impose a particular ontological perspective on the entire discussion which might be too limiting. There are already deployed industrial ontology standards (eg. EPISTLE) which reject, for good technical reasons, the 'common-sense' picture of things enduring through time without changing their identity; and you are immediately embroiled in potential debates about whether such endurants can be said to endure for ever, or when it is appropriate to decide what the temporal boundaries might be, etc. etc. (There's enough discussion of this stuff in the technical ontological literature to fill a library, never mind philosophy and linguistics.) In any case, there may well be transient - non-enduring - resources which it is still useful to refer to and exchange information about. At the very least, I would suggest choosing words which do not seem to imply that *all* resources must have this kind of enduring temporal identity. The next paragraph, by the way, although clearly intended to be an clarifying example, seems to say that a single representation can be of two distinct resources at the same time. In fact, in the case indicated, it *must* be. (It might help if you could indicate what the 'conceptual mapping' is for this example. I presume it must be something like a function from times to camera views (?); but in this case the same function identifies both resources, apparently.) (BTW, issues of identity get very delicate when we start to talk about mappings and times. In particular, it simply doesn't make sense to say that a mapping takes times as an argument and also itself endures. This is like asking how fast time goes by: the question embodies a temporal contradiction. Either say that the mapping is itself temporal, so that the mapping of the camera changes every time a picture is taken, or say that the mapping - which itself now exists outside of time altogether - is from the camera and a time to a picture.) Concerning the text of 'Identifier': can you indicate the intended distinction between "define" and "provide sufficient data to distinguish....from all other things..." ? I have trouble distinguishing these concepts. Finally, the idea that a URI carries data to identify the referent seems wrong to me: surely what the URI identifies is the 'owner' of the URI, which is often the authoritative source of information concerning the intended meaning or referent of the URI. But the distinction seems important. This is why I have been howling at Graham and Patrick on your email list, because the idea that URIs always contain enough information in their syntax to uniquely specify a single *referent* is incoherent; but the idea that they do so uniquely identify a *source of information* is not only coherent, but seems to be close to correct. And of course, in some cases - as when the resource in question IS the source of information - the URI might indeed identify the resource. But not in all. Pat Hayes @@ PS. suggested modified text (based on http://www.apache.org/~fielding/uri/rev-2002/rfc2396bis.html#rfc.section.1.1 "Resource The Web supports traffic in representations, and resources are whatever is indicated or referred to by a representation. In general, a resource could be anything. This specification does not place any limits on the nature of a resource. In practice, a system's resources will usually include the things to which it can apply actions in the future. For example, if a system is performing an information retrieval action, then things that can provide information to that system are considered resources for the system. Familiar examples of a resource include an electronic document, an image, a service (e.g., "today's weather report for Los Angeles"), and a collection of other resources. Not all resources are network "retrievable"; e.g., human beings, corporations, and bound books in a library can also be considered resources. For some purposes, resources may include abstractions, non-physical things or fictional objects. A common misunderstanding of resources, particularly in information systems, is to equate a resource with a representation of that resource as observed at a particular instant in time. The same resource may endure though time and have different representations at different times; a change in the representation need not be considered a change in the resource. For example, a resource that is defined to be "a live camera view from my window" may continue to exist as the same resource, even though the view is changing. Identifier An identifier is an object that carries identifying data. The data may identify a resource or a source of information about a resource. A URI is an identifier object consisting of a sequence of characters that matches the restricted syntax defined by this specification. ------ I have tried to keep to the spirit of the original without saying anything that would cause damage. I avoided the 'conceptual mapping' language in order to avoid getting into potential tar-pits about whether or not mappings endure through time, etc. etc., and tried to use 'might' and 'often' style language rather than 'do' and 'must' style language. The two sentences starting "In practice, ..." clearly serve a useful role in the explanation, but I worry that by being used so centrally in the definition, particularly with the original 'however', they convey the implicit message that ALL resources are of this 'actionable' variety. I have tried to add words to guard against this interpretation; but it might be easier to just cut this out altogether. (It seems to me that to say this stuff properly requires getting into issues like one resource being a source of information about other resources, an area I know you have thought about more extensively than most people have. I guess the point is that this stuff needs to either be said explicitly or not said at all, since to kind of half-say it only creates more potential confusion than clarity.) PPS. I feel reluctant to engage in writing someone else's prose for them, which is considered highly rude and intrusive behavior in the circles I move in; so please don't be offended by my rewriting your words for you. I gather this is the right way to behave here. ------- -- --------------------------------------------------------------------- IHMC (850)434 8903 or (650)494 3973 home 40 South Alcaniz St. (850)202 4416 office Pensacola (850)202 4440 fax FL 32501 (850)291 0667 cell phayes@ai.uwf.edu http://www.coginst.uwf.edu/~phayes s.pam@ai.uwf.edu for spam
Received on Tuesday, 29 April 2003 12:57:59 UTC