Re: RDF and Resources from Dan Brickley on 1999-06-06 (www-rdf-comments@w3.org from April to June 1999)

From: Dan Brickley <Daniel.Brickley@Bristol.ac.uk>
Date: Sun, 6 Jun 1999 11:18:12 +0100 (BST)
To: Paul Prescod <paul@prescod.net>
cc: xml-dev <xml-dev@ic.ac.uk>, www-rdf-comments@w3.org
Message-ID: <Pine.GHP.4.02A.9906061054010.5860-100000@mail.ilrt.bris.ac.uk>
Interesting point. The RDF spec says...

> > not directly accessible via the Web; e.g. a printed book. Resources 
> > are always named by URIs plus optional anchor ids (see [URI]). 

...when it perhaps ought to say "always named by URI References", per
section 4.1 of the RFC. 

I believe this is a confusion rather than a bug in the spec, in that
nothing I can find in either doc tells us that the things picked out
using URI References, ie. the #'d view into the retrieved object, cannot
themselves be resources. If they have identity (cue separate debate)
they're resources. And hence RDF describable.



Some excerpts from ftp://ftp.isi.edu/in-notes/rfc2396.txt

 Uniform Resource Identifiers (URI) provide a simple and extensible
   means for identifying a resource. 
[...]
   Resource
         A resource can be anything that has identity.  Familiar
         examples include an electronic document, an image, a service
         (e.g., "today's weather report for Los Angeles"), and a
         collection of other resources.  
[...]
4. URI References

   The term "URI-reference" is used here to denote the common usage of a
   resource identifier.  A URI reference may be absolute or relative,
   and may have additional information attached in the form of a
   fragment identifier.  However, "the URI" that results from such a
   reference includes only the absolute URI after the fragment
   identifier (if any) is removed and after any relative URI is resolved
   to its absolute form.  Although it is possible to limit the
   discussion of URI syntax and semantics to that of the absolute
   result, most usage of URI is within general URI references, and it is
   impossible to obtain the URI from such a reference without also
   parsing the fragment and resolving the relative form.

      URI-reference = [ absoluteURI | relativeURI ] [ "#" fragment ]

   The syntax for relative URI is a shortened form of that for absolute
   URI, where some prefix of the URI is missing and certain path
   components ("." and "..") have a special meaning when, and only when,
   interpreting a relative path.  The relative URI syntax is defined in
   Section 5.

[...]
 the optional fragment identifier, separated from
   the URI by a crosshatch ("#") character, consists of additional
   reference information to be interpreted by the user agent after the
   retrieval action has been successfully completed.  As such, it is not
   part of a URI, but is often used in conjunction with a URI.




So... on my reading...

--- the URI is the bit before the # in a URI or URI Reference
--- the entire 'thing' picked out by a URI Reference might still be a 
	resource and have 'identity' (and might eg. have a first-class
	URI in another scheme, eg uuid: or urn:). Even if just a section
	of a larger object (video frame etc).
--- for any URI reference, there are potentially two objects (resources)
    implicated; the one picked out by the URI proper, and the one picked 
    out by the URI reference, if there is an optional '#'.
    RDF uses the latter when the #foo is present. The Web doesn't give 
    a name to the relationship between these two things, though the 
    Web Characterisation activity seems to be making a good start in 
    that direction.

I see nothing in this to suggest that that RDF can't describe bits of
non-XML content, since those components for eg might well have URIs
(eg. urn, uuid...) of their own. What I do see is a rather ugly glitch
(doubtless motivated but still scary) in the section of the URI RFC
which defines fragment identifiers:

4.1 tells us:

	The semantics of a fragment identifier is a property of the data
   resulting from a retrieval action, regardless of the type of URI used
   in the reference.  Therefore, the format and interpretation of
   fragment identifiers is dependent on the media type [RFC2046] of the
   retrieval result.


This is gross as it relativises the semantics of frag identifiers
to the mime time of the object as retrieved. Since that is itself a
(possibly time-sensitive) function of content-negotiation between client
and server, fragment identification issues get tangled up with
possibility of the server offering multiple
(language/format) negotiable versions or renderings (or manifestations
-- pick your terminology) of the same object.

When this wierdness in web architecture is cleared up, and the different
renderings available (eg. over http, http-ng) are more explicitly
differentiated, RDF and XML will be living in a happier less ambiguous
world. 

Dan

 On Sun, 6 Jun 1999, Paul Prescod wrote:

> RDF's definition of resource seems to be incompatible with the URI
> specification that it references:
> 
> > All things being described by RDF expressions are called resources. 
> > A resource may be an entire Web page; such as the HTML document 
> > "http://www.w3.org/Overview.html" for example. A resource may be a 
> > part of a Web page; e.g. a specific HTML or XML element within the 
> > document source. A resource may also be a whole collection of pages; 
> > e.g. an entire Web site. A resource may also be an object that is 
> > not directly accessible via the Web; e.g. a printed book. Resources 
> > are always named by URIs plus optional anchor ids (see [URI]). 
> 
> According to RFC 2396, a resource is something addressable by a URI
> (without fragment identifier) -- a complete HTML or XML document, not a
> part of it.
> 
> Also, I find it interesting to note that RDF does not seem to be able to
> attach metadata to components of things that are not XML: frames of a
> video, records in a database, paragraphs in a Word document and so forth.
> We need the ability to do so.
Received on Sunday, 6 June 1999 06:18:16 UTC