- From: Sandro Hawke <sandro@w3.org>
- Date: Wed, 05 Feb 2003 14:35:06 -0500
- To: dbooth@w3.org
- Cc: www-archive@w3.org
[ some notes ]
1. Graph Serialization
Be able to convey information about cyclic relationships, like:
Sam admires Betty
Betty admires Geoff
Geoff admires Sam
Anything which does this is an "identifier". In XML, ID/IDREF can
do this. In programming languages they are usually called
identifiers.
Without identifiers, at best you have a tree structure:
<something> admires
<something> which admires
<something> which admires ....
In an abstract sense, any transmittable object from a very large
space of objects (>=32 bits) should suffice. Strings work well,
as do structures which can be serialized as strings. In another
sense, identifiers are a class of lexemes (tokens) in the
language.
These are local-scope or document-scope identifiers. In RDF they
are NodeIDs.
2. Graph Merging
Be able to convey cyclic information from multiple sources
Doc1: Sam admires Betty
Doc2: Betty admires Geoff
Doc3: Geoff admires Sam
and we know the geoff in Doc2 and Doc3 are the same individual.
When you merge texts, you need the identifiers to be in the same
namespace (have the same mapping from identifier-to-individual).
It can be possible (with some sort of "imports" directive) to
include-and-remap, but that hasn't been the tradition in RDF.
These all-in-one-big-namespace strings are global or universal
identifiers. We have two kinds of theses:
[2a] Mintable Identifiers. When you need an identifier for
something, and you don't already have one, you make up a new one
using an algorithm that produces new identifiers. In AI terms
these are Skolem constants; the UUID and tann/taguri algorithms
work in many cases, as do crptographically random numbers.
[2b] One-True-Name Identifiers. Every reference to something is
made using the same identifier. If you don't know the
one-true-name, go look it up somewhere. If there is no name, then
you can add one. In AI terms, this corresponds to the
unique-names assumption. We know of no practical approach to
providing this kind of identifier in the general case, so this is
not actually a feature of any identifier plan. It is listed here,
however, because it would obviously be useful and this lets us
be clear about schemes which might provide it in certain limitted
domains.
3. Document-Reference and Self-Reference
Documents (web pages) should be able to refer to and convey
information about themselves and each other.
Doc1: Doc1 and Doc2 were last updated (by Sam) in 1997.
Documents, in this sense, are maintainable, mutable things, not
fixed artifacts.
There is some interplay between Document-Reference and Graph
Merging: the pair <DocumentId, NodeID> functions as a MintableID
for graph merging.
4. Auxiliary Information Retreival
Given a document D which uses an identifier X, be able to find more
information (IX) about X. There are different qualities associated
with how retreival might work:
- Deterministic: the mapping from (D,X) to IX is specified and
well-understood (ie not Google)
- Distributed: the mapping does not go through a central point,
(ie no SemanticLookupService.net)
- Maintainable: the information IX can be updated over time
(ie D and X can not include the secure hash of IX)
- Secure: the mapping from (D,X) to IX can only be affected by
authorized parties (eg D or X includes the secure hash of the
public key used to sign Y)
- Authoritative: the mapping is determined by the social entity
who minted X and/or the one who wrote D.
- Annotatable: additional mappings from X to IX2, IX3, ..., may
be contributed by 3rd parties. (lots of issues here; maybe
this needs to be separated out.)
- Clickable: identifier works as a web address in deployed
browsers to get IX. That is D is not used; X->IX is web
dereference function. Should conneg work between RDF and HTML?
The big split in approaches is:
- X gives URL for IX (eg "hash")
- D gives URL for IX (eg rdfs:isDefinedBy in instance data)
- Someone else maps X to IX (eg rdfs:isDefinedBy in other doc)
Received on Wednesday, 5 February 2003 14:36:28 UTC