Re: Resources and URIs from Dan Brickley on 1999-12-08 (www-rdf-interest@w3.org from December 1999)

From: Dan Brickley <danbri@w3.org>
Date: Tue, 7 Dec 1999 20:41:09 -0500 (EST)
To: Sergey Melnik <melnik@DB.Stanford.EDU>
cc: RDF Interest Group <www-rdf-interest@w3.org>, Gabe Beged-Dov <begeddov@jfinity.com>
Message-ID: <Pine.LNX.4.20.9912072006230.17859-100000@tux.w3.org>
On Tue, 7 Dec 1999, Sergey Melnik wrote:

> > For anonymous resources, I don't think your proposal can work in the way
> > you hope.
> 
> I am deeply convinced that we need the ability to refer to anonymous
> resources. Refer to
> 
> 	http://www.bootstrap.org/augment-132082.htm#11K
> 
> for supporting arguments.

We agree, and we disagree. I think I've a better idea how we disagree now:

Excerpt from:	http://www.bootstrap.org/augment-132082.htm#11K 
	Every Object Addressable -- in principle, every object that someone might
	validly want/need to cite
	should have an unambiguous address (capable of being portrayed in a manner
	as to be human readable and
	interpretable). (E.g., not acceptable to be unable to link to an object
	within a "frame" or "card.") 11K 

This principle is also fundamental to the Web architecture. If it matters
to the Web, it deserves a Web identifier. We agree totally on the
importance of identifiers for all resources.

When you say that we need the ability to refer to 'anonymous' resources, I
have problems. Being "anonymous" is not intrinsic to the resource, but a
property of a _mention_ of a resource in some bundle of RDF statements.
Being able to name these mentions is also important of course...


Example 3:
 Some RDF might say: "the eyecolour of the hairdresser of my boss's niece
is brown". This is a (perhaps!) useful piece of information that we might
exchange using XML and RDF. Or "the price of my boss's car". In both
cases, the RDF statements would be anchored to resources using URIs at
some point, eg. a person: URI for the speaker.

[person:uk:nx9300001]--boss--> [ ] ---niece--> [ ] ---hairdresser--> [ ] --eyecol-> "brown"
[person:uk:nx9300001]--boss--> [ ] --price--> "30,000 ukp" 

So here's where I think we disagree: the object/thing/entity/resource that
is my boss (lets assume I have exactly one) is not intrinsically
anonymous. There is a URI name for that resource, we just don't see it in
every piece of data that tells us things about the resource in question.
(another eg: "my hometown is the rainyest city in England"...)

Within the context of these two information fragments, we make anonymous
_mention_ of various resources. Others may do likewise
elsewhere. Resources aren't anonymous; mentions may be.

So, we can admit a desire to acquire knowledge of the identifiers for all
kinds of things. We can also admit to wanting to be able to point into
documents of all kinds at various levels of granularity. So we might want
to give a name to my mention of the boss, the hairdresser, and so forth. These
aren't the same thing though. Both are challenging (assigning identifiers
to people is a politically loaded business to be in, eg. recent fuss over
cookie data); giving identifiers to the claims made in a document, the
structural parts of a document, and the things defined or identified by
that document is also pretty hairy. 

> > URIs are just more information about a resource, as are
> > temporary placeholder URIs.
> 
> Remember my disclaimer about the semantics: it is out of scope.

Disclaim away - I didn't mention semantics! ;-)

Resources have URIs; we don't always know those URIs. We may learn them
later or not at all. 

> >          "it is essential that every parser is guaranteed to produce the same URI
> >         for the resource. Period."
> > 
> > This is a very strong statement, and one that could be qualified in a
> > number of ways.
> 
> Right. The strength of the statement depends on this qualification. 

Exactly

> > Do you mean: produce the same generated URI for the same (anonymous) resource
> > 
> >         - given byte-for-byte identical XML input (from same source)
> >         - given byte-for-byte identical XML input (from different sources)
> >         - given statement-for-statement identical graphs from varying sources
> 
> In my view, the above statement must be true *at least* for the first
> case: given byte-for-byte identical XML input from same source, the same

Yes. For some sense of source (eg. might be a business-2-business
transaction rather than the http:// uri of some static XML file). 
(Neither of us have yet said what we mean by 'source' exactly...)

This is the same (important) issue as that raised previously, ie. that we
need conventions for identifying the provenance of our triples. So, I'd claim
that whatever convention emerges for giving the (URI-named) provenance
of a triple, such as transaction/message identifier, will serve for the
notion of 'source' we're handwaving vaguely about here.

> noname URI should be produced. This is the weakest possible assumption,
> and it is easily satisfiable. I don't think the uniqueness assumption
> makes sense for different sources. 

Can you characterise what counts as a 'difference' source for incoming
RDF? 

> Your Example 1 illustrates that very well.
> 
> > ...
> > Example 1: same XML bytes, different servers
> > ...
> 
> However, I do believe that a unique noname URI makes sense in your
> second example:

Possibly a dumb example. I shouldn't have introduced time-changing
resources as these are problematic enough to think about on their own.

> 
> > Example 2: same XML bytes, same server, different resource.
> > 
> >         <rdf:RDF>
> >         <WeatherReport>
> >         <todayCloudCoverPhoto rdf:resource="/weathertoday.png"/>
> >         <description>this is a daily weather report; the
> >         todaysCloudCoverPhoto property is a reference to a dynamically changing
> >         resource, created by a different person each day</description>
> >         </WeatherReport>
> >         </rdf:Description>
> >         </rdf:RDF>
> > 
> > Feeding this to an XML/RDF parser we'd get an anonymous resource of type
> > "WeatherReport". On different days we might get descriptions of different
> > (anonymous) WeatherReports.
> 
> In the above example, you would get the same URI for the instance of
> "WeatherReport" whenever you stopped by at the server. And, if you
> insist on considering the semantics of this resource, it would identify
> the "current" whether report from the given datasource, whatever
> "current" means. When presented to the user at runtime by a weather
> report application, the report would indeed reflect the current whether
> situation.

I'm not sure we can say that. There may be a series of new WeatherReports,
with author etc. If we go to http://.../rdf/brief.rdf we might get the
above RDF each day, if we go to http://.../rdf/verbose.rdf imaging a more
complex chunk of RDF as above but giving more info  (price, author
etc) for each instance of WeatherReport. This I think is enough to show
that the WeatherReports might change daily.

 > 
> > stuck. Writing out parsers so they try to always generate the same
> > identifier given sufficent context (server, date, content-negotiation,
> > language-negotiation, bytes-to-parse etc) seems to me fraught with
> > difficulty.
> 
> I agree there are unresolved issues here. However, I tend to believe
> Doug Engelbart (see URL above) that this is a crucial requirement. 

Resource identification, sure. Resource-mention identification, sure. I
think the current discussion (and in fairness, much thinking (including my
own) about uris, rdf and web architecture) conflate the two. 

> By
> restricting the context say to the origin and content of the RDF
> description the problem can be solved relatively easily.

I'm still not convinced. Are you claiming that anonymity is a property
intrinsic to a resource rather than it's mention in some context?

cheers,

Dan

--
danbri@w3.org
Received on Tuesday, 7 December 1999 20:41:18 UTC