W3C home > Mailing lists > Public > w3c-sgml-wg@w3.org > January 1997

Re: Base

From: Michael Sperberg-McQueen <U35395@UICVM.UIC.EDU>
Date: Thu, 23 Jan 97 09:23:54 CST
Message-Id: <199701231529.KAA00348@www10.w3.org>
To: W3C SGML Working Group <w3c-sgml-wg@www10.w3.org>
On Thu, 23 Jan 1997 07:47:53 -0500 Tim Bray said:
>At the time I was a robot writer and my robot was looking for help in
>de-duping, i.e. how many slightly-variant mirror copies of the Jargon file
>do you really want to index?  But beyond this narrow interest, I then
>believed, and still do, strongly, that it would be useful for there
>to be a standard way for an object to assert "Here is the canonical
>address which I request to be used in retrieving the object you are now
>reading".  Use in hot-lists and by crawlers being just two obvious
>applications.  I think this would be a highly useful, cheap-to-specify,
>easy-to-implement item to put in XML-LINK.

I half agree and half violently disagree.  Maybe it's just the

>Any reason not to do this? - Tim

Yeah:  often there *is* no canonical address stuff should be
retrieved by:  if you're on the East Coast get it from the site in
North Carolina; if on the West Coast, from the Petaluma site; if in
Europe, from Oxford; if ...

It seems to me that what one needs for deduping is a unique
identifier:  i.e. an (F)PI or a URN.  Would the problem of deduping
not be solved just by allowing a doc to say 'By the way, this is my

Or have I misunderstood?

Received on Thursday, 23 January 1997 10:29:38 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 20:25:06 UTC