Resource Description Framework (RDF):
Overview and Abstract Data Model

Editors' Working Draft 25 July 2002

This version:: http://www.ninebynine.org/wip/RDF-basics/2002-07-25/Overview.htm
Latest version:: http://www.ninebynine.org/wip/RDF-basics/Current/Overview.htm
Previous version:: http://www.ninebynine.org/wip/RDF-basics/2002-07-23/Overview.htm
Editors:: Graham Klyne (Clearswift and Nine by Nine); Jeremy Carroll (Hewlett Packard Labs)
Series editor:: Brian McBride (Hewlett Packard Labs)

Abstract

The Resource Description Framework (RDF) is a data format for representing metadata about Web resources, and other information. This document describes the abstract graph syntax on which RDF is based, and which serves to link its XML serialization to its formal semantics. It also describes some other technical aspects of RDF that do not fall under the topics of formal semantics, XML serialization syntax or RDF schema and vocabulary definitions (which are eacyh covered by a separate document in this series).

Status of this Document

The following text refers to the intended status of this document: it has not yet been approved by the RDF core working group or W3C director and has no status other than editors' working draft at this time.

This is a W3C RDF Core Working Group Working Draft produced as part of the W3C Semantic Web Activity (Activity Statement).

This document is being released for review by W3C Members and other interested parties to encourage feedback and comments, especially with regard to how the changes affect existing implementations and content.

This is a public W3C Working Draft and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use W3C Working Drafts as reference material or to cite as other than "work in progress". A list of current W3C Recommendations and other technical documents can be found at http://www.w3.org/TR/.

Comments on this document are invited and should be sent to the public mailing list www-rdf-comments@w3.org. An archive of comments is available at http://lists.w3.org/Archives/Public/www-rdf-comments/.

1. Introduction
2. RDF overview
3. Graph syntax
4. Additional technical considerations
5. Acknowledgments
6. References
- 6.1 Normative References
- 6.2 Informational References
Appendix W: Issues addressed in this document
Appendix X: Outstanding issues for this document
Appendix Y: Change log

1. Introduction

The Resource Description Framework (RDF) is a data format for representing metadata about Web resources, and other information. The normative documentation of RDF falls broadly into the following areas:

XML serialization syntax
Formal semantics
RDF vocabulary definition language (RDF schema)
Abstract graph syntax
Other technical details

This document addresses the last two of these items. The first three are covered by separate documents ([RDF-SYNTAX], [RDF-SEMANTICS], [RDF-VOCABULARY]).

In section 2, some background to the design goals and rationale of RDF is presented. There is also some discussion of the intended implications of publishing an RDF document (section 2.3).

RDF is based on a graph syntax, which is typically serialized using XML. This graph syntax captures the fundamental structure of RDF, independently of any serialization syntax that may be used. The formal semantics of RDF are defined in terms of the graph syntax. The graph syntax is defined in section 3 of this document

Section 4 presents a number of technical issues that don't clearly fall into any of the more explicit areas noted above.

2. RDF overview

RDF uses well established ideas from various data and knowledge representation communities, with recognizable relationships to Conceptual Graphs, logic-based knowedge representation, frames, and relational databases [Sowa,CG,KIF,Hayes,Luger,Gray].

RDF builds on XML, which provides a syntactic framework for representing documents and other information. It has a simple graph-based data model and formal semantics with a rigorously defined notion of entailment, which in turn provides a basis for well founded deductions in RDF data.

The real value of RDF comes not so much from any single application, but from the possibilities for sharing data between applications. The value of information thus increases as it becomes accessible to more and more applications across the entire Internet.

2.1 Motivation

The development of RDF has been motivated by the following uses, among others:

Web metadata: providing information about web resources and the systems that use them (e.g. content rating, capability descriptions, privacy preferences, etc.)
Applications that require open rather than constrained information formats (e.g. scheduling activities, describing organizational processes, annotation of Web resources, etc.)
To do for machine processable information (application data) what the World Wide Web has done for hypertext: to allow data to be processed outside the particular environment in which it was created, in a fashion that can work at Internet scale.
Interworking between applications: combining data from several applications to arrive at new information.
Automated processing of Web information by software agents: the Web is moving from having just human-readable information to being a world-wide network of cooperating processes. RDF provides a world-wide lingua franca for these processes.

2.2 Design goals

The design of RDF is intended to meet the following goals:

A simple data model
Formal semantics and well-founded inference
Extensible URI-based vocabulary
XML-based syntax
Use XML schema datatypes
Anyone can say anything about anything
A basis for legally binding agrements
Universal expression of ground facts

2.2.1 A simple data model

RDF has a simple data model that is easy for applications to process and manipulate. The data model is independent of any specific serialization syntax.

NOTE: the term "model" used here in "data model" has a completely different sense to its use in the term "model theory". See the RDF model theory specification [RDF-SEMANTICS] or a textbook on logical semantics (e.g. [HUNTER,DAVIS]) for more information about what logicians call "model theory".

2.2.2 Formal semantics and well-founded inference

RDF has a formal semantics which provides a sound basis for reasoning about the meaning of an RDF expression. In particular, it supports rigorously defined notions of entailment which provide a basis for defining reliable rules of inference in RDF data.

2.2.3 Extensible URI-based vocabulary

The vocabulary is fully extensible, being based on URIs with optional fragment identifiers (URIrefs). URIrefs are used for naming all kinds of things in RDF data. The only other kind of label that appears in RDF data is a literal string.

2.2.4 XML-based syntax

RDF has an XML-based serialization form which, if used appropriately, allows a wide range of "ordinary" XML data to be interpreted as RDF [STRIPEDRDF].

2.2.5 Use XML schema datatypes

RDF can be used with XML schema datatypes [XML-SCHEMA2], thus assisting the exchange of information between RDF and other XML applications.

[[[Add comment here if goal not fully achieved]]]

2.2.6 Anyone can say anything about anything

To allow operation at Internet scale, RDF is an open-world framework that allows anyone to say anything about anything. In general, it is not assumed that all information about any topic is available. A consequence of this is that RDF cannot prevent anyone from making nonsensical or inconsistent assertions, and applications that build upon RDF must find ways to deal with conflicting sources of information. (This is where RDF departs from the XML approach to data representation, which is generally quite prescriptive and aims to present an application with information that is well-formed and complete for the application's needs.)

2.2.7 Universal expression of ground facts

Through its use of extensible URI-based vocabularies, RDF aims to provide for universal expression of ground facts; i.e. assertions of specific properties about specific named things.

RDF itself does not provide the machinery of inference, but provides the raw data upon which such machinery can operate. Other work is looking for ways to build more expressive expressions on the basic capabilities of the RDF core language.

2.2.8 A basis for binding agrements

RDF is intended to convey assertions that are meaningful to the extent that they may, in appropriate contexts, be used to express the terms of binding agreements.

This goal is explored further in section 2.3 below.

2.3 Meaning of RDF documents

The RDF specification emphasizes the formal structure and meaning of RDF. But there is also a social dimension that is easily overlooked when dealing with such formal aspects.

2.3.1 Formal semantics

[[[These words adapted from Primer 7.1]]]

RDF is a language designed to support the Semantic Web, in much the same way that HTML is the language that supports the original Web. The Semantic Web aims for data to be shared and processed by automated tools as well as by people. To serve this purpose, certain meanings of RDF statements must be defined in a very precise manner; this is provided by the RDF Model Theory [RDF-SEMANTICS].

Model-theoretic semantics assumes that a language refers to a 'world', and describes the minimal conditions that a world must satisfy in order to assign an appropriate meaning for every expression in the language. A particular world is called an interpretation, so that model theory might be better called 'interpretation theory'. The idea is to provide an abstract, mathematical account of the properties that any such interpretation must have, making as few assumptions as possible about its actual nature or intrinsic structure. The RDF model theory is couched in the language of set theory simply because that is the normal language of mathematics - for example, the model theory assumes that names denote things in a set IR called the 'universe' - but the use of set- theoretic language is not supposed to imply that the things in the universe are set-theoretic in nature.

The chief utility of such a semantic theory is not to suggest any particular processing model, or to provide any deep analysis of the nature of the things being described by the language (in our case, the nature of resources), but rather to provide a technical tool to analyze the semantic properties of proposed operations on the language; in particular, to provide a way to determine when they preserve meaning.

The RDF model theory treats RDF as a simple assertional language, in which each triple makes a distinct assertion, and the meaning of any triple is not changed by adding other triples. Based on the semantics defined in the model theory, it is simple to translate an RDF graph into a logical expression with essentially the same meaning.

2.3.2 Social meaning

[[[Adapted words from DanBri/PatHayes]]]

RDF/XML documents, i.e. encodings of RDF graphs, can be used to make representations of claims or assertions about the world. RDF graphs may be asserted to be true, and such an assertion should be understood to carry the same social import and responsibilities as an assertion in any other format. A combination of social (e.g. legal) and technical machinery (protocols, file formats, publication frameworks) provide the contexts that fix the intended meanings of the vocabulary of some piece of RDF, and which distinguish assertions from other uses (e.g. citations, denals or illustrations).

A media type, application/rdf+xml has been registered for indicating the use of RDF/XML as an assertional representation in this way (see section 3.7).

2.3.3 Interaction between social and formal meaning

To support logical entailments, formal RDF meaning is based on a model theory (see section 2.3.1). The notion of truth here is crucial: a possible world may correspond to some RDF if and only if the RDF statement is true in that world. This leads to consideration of what makes a statement be true:

logical truths: statements which are true (in any MT interpretation) completely by view of their grammatical form and formal interpretation rules of the language. These statements actually tell you nothing about the real world, as they are logically true of all possible worlds;
assumed truths; roughly, according to Quine [QUINE] "statements that a native speaker of the language would regard as obvious". These might be regarded as axioms of the native speaker's world. In general, these will be truths that cannot be obtained by purely logical means, but rather are assessed by social means and institutions;
assumed deductions: determined by some informal process (assumed to be valid) from some other truths that have already been accepted;
mixed truths: truths that can be determined by some logical entailment from some other truths that have already been accepted.

It is presumed here that any interesting statement about the world or human afairs must ultimately depend on assumed truths. Having accepted such an assumed truth into one's worldview, other interesting truths may be deduced by logical means. Semantic web vocabulary gains currency through use, so also do semantic web deductions ultimately have force through acceptance by people. There is a combination of logical and social (non-logical) dimensions in which semantic web deduction must operate.

The RDF code language provides a way to make simple formal assertions, with very no machinery for formalizing allowable inferences. Inferences are performed by processes, embedded in software implentations, whose validity is not formally demonstrable, and must be assumed or trusted to be valid (in relation to the world and/or human affairs). It is expected that semantic web languages layered on RDF will give formal expression to allowable inference, and to allow provable deductions by generic software modules to replace the individual ad-hoc implemenations.

2.3.4 Implications of asserting RDF

When an RDF graph is asserted in the web, its publisher is saying something about their view of the world. (The mechanism for deciding whether or not a graph is asserted is not defined here, but it is presumed that the publisher's intent will be clear in some way -- social convention or logical deduction.)

When a user invokes an application, there is also a social and technical context of invocation that determines some set of RDF assertions that will be assumed to be true: the application itself, and any RDF files that are passed to it. Garbage-in, garbage-out applies: if the initial assumed facts are wrong or meaningless, the results will have little value. No specfic mechanisms for deciding or evaluating the validity of any such assertions are defined here.

An assertion tells us something about "the world" and human affairs, through the normal model theoretic possible-world constraint mechanisms. Some of the truths that are asserted may be logical truths that can be evaluated using logical machinery. Others may be assumed truths that cannot be evaluated logically, but can be determined by human interpretation. So when we assert an RDF graph, one is stating a constraint on the real world, saying that both the logically testable and humanly interpretable truths in the graph are indeed true in that world.

In accordance with appropriately sanctioned logical entailment, it is intended that inferences may be used to deduce new RDF statements with the same force of assertion as the explicitly statements from which they are derived.

Noting that there is no single human opinion about the truth of some statements, the graph may further contain commentary for human interpreters to indicate the realm of human interpretation that should be applied. This means a graph may contain "defining information" that is opaque to logical reasoners. This information may be used by human interpreters of RDF informaton, or programmers writing software to perform specialized forms of deduction in the Semantic Web.

2.4 RDF concepts

RDF uses the following key concepts:

Graph data model
URI-based vocabulary
XML serialization syntax

2.4.1 Graph data model

The underlying structure of any RDF expression is a directed labelled graph (or multigraph), which consists of nodes and labelled directed arcs that link pairs of nodes. The formal semantics for RDF is defined in terms of this graph syntax. An RDF expression is sometimes called an RDF graph. The graph can conveniently be represented as a set of triples, where each triple contains two node labels and an arc label:

Each arc corresponds to a startement that asserts a relationship between the nodes that it links. The meaning of an RDF graph is the conjunction (i.e. logical AND) of all the statements that it contains.

2.4.2 URI-based vocabulary

Nodes in an RDF graph are labelled with URIs with optional fragment identifiers (URIrefs), literal strings, or nothing at all. Arcs are labelled with URIrefs.

The label on a node indicates what that node is meant to represent. The label on an arc names the relationship that is asserted to hold between the nodes connected by that arc. Some URIrefs may indicate web resources, and a node thus labelled is presumed to denote that resource. Other URIrefs may represent abstract ideas or values rather than a retreivable Web resource. RDF thus leverages the universal naming space of URIs [URIS].

2.4.3 XML serialization syntax

RDF has a specific serialization syntax based on XML. There are several ways in which a given RDF graph can be prepresented in XML: these various forms allow RDF to be represented in ways that are amenable to specific XML applications. In this way, XML application data can easily be designed to be accessible to generic RDF processors [XML-AS-RDF].

Other syntaxes for RDF graphs are possible (e.g. [NOTATION3]), but only the XML syntax is normatively specified and recommended for use to exchange information between Internet applications.

2.5 RDF core URI vocabulary and namespaces

RDF uses URIs to label resources and properties. Certain URIs are reserved for use by RDF, and may not be used for any purpose not sanctioned the RDF specifications. Specifically, URIs with the following leading substrings are reserved for RDF core vocabulary:

http://www.w3.org/1999/02/22-rdf-syntax-ns# (conventionally associated with prefix rdf:)
http://www.w3.org/2000/01/rdf-schema# (conventionally associated with prefix rdfs:)

Used with the RDF/XML serialization, these URI prefix strings correspond to XML namespaces [XML-NS] associated with the RDF core vocabulary terms.

NOTE: these namespace URIs are the same as those used in earlier RDF documents [RDF-MS, RDF-SCHEMA]. The URIs have not been updated because the working group feels its work has been to clarify the earlier work rather than to change it.

2.5.1 RDF defined vocabulary terms

The vocabulary terms are listed here using QName syntax. The corresponding URI reference is formed by concatenating the URI corresponding to the prefix (see above) with the given local name.

rdf:RDF
rdf:Description
rdf:ID
rdf:about
rdf:resource
rdf:parseType
rdf:type
rdf:Bag
rdf:Seq
rdf:Alt
rdf:li
rdf:_n (for decimal numerals n)
rdf:Statement
rdf:Property
rdf:subject
rdf:predicate
rdf:object
rdf:bagID
rdf:value
rdfs:Resource
rdfs:Class
rdfs:ConstraintResource
rdfs:ConstraintProperty
rdfs:Literal
rdfs:subClassOf
rdfs:subPropertyOf
rdfs:comment
rdfs:label
rdfs:seeAlso
rdfs:isDefinedBy
rdfs:range
rdfs:domain
rdfs:Container
rdfs:ContainerMembershipProperty
rdfs:member

Informal descriptions of some of these terms are given in the RDF vocabularies document [RDF-VOCABULARY]. Where formal semantics are defined, they are given in the RDF formal semantics document [RDF-SEMANTICS].

Some of the above vocabulary terms are used for purely syntactic purposes in the RDF/XML serialization, and do not appear in the abstract graph syntax (see section 3.1). Any other use of these names is considered to be an error.

Other names from the rdf: and rdfs: namespaces (i.e. starting with one of the URI strings noted above) should be used only if they are defined by the RDF specification. Processors encountering unrecognized names in these namespaces should issue a warning, then continue to process them as any other vocabulary.

2.5.2 RDF deprecated vocabulary

The following RDG core vocabulary terms defined in previous RDF specification documents have been deprecated for future use:

rdf:aboutEach
rdf:aboutEachPrefix
rdfs:ConstraintResource
rdfs:ConstraintProperty

3. Graph syntax

This section defines the RDF graph syntax. The RDF graph is sometimes referred to as the (data) model of RDF (see the RDF Primer [RDF-PRIMER], and RDF Model & Syntax [RDF-MS]). In brief, the RDF graph is a directed graph with labelled edges and partially labelled nodes.

A goal of this section is the precise definition of equality between RDF graphs. This benefits interoperability (two conformant implementations are more likely to be practically interoperable if they have a precise conception of the way in which they are the same). It is required for the specification of the RDF Test Cases [RDF-TESTS], which depend on testing equality of RDF graphs for their execution. It is required by the RDF Model Theory [RDF-SEMANTICS] which assigns the same meaning to any pair of equal RDF graphs.

Note: Many RDF applications and frameworks do not need to implement RDF graph equality. They do need to respect equality when assigning meaning to RDF graphs. RDF recommendations do not define conformance or compliance levels.

The specification of the RDF graph commences with the labels used in the graph, which can be uri references, string literals, or XML literals; equality is defined for each. It then proceeds to describing arcs (triples), a complete graph and graph equality.

[[[Need to liaise with PatH to ensure graph description is consistent with MT --action jjc]]]

3.1 URI References

Within an RDF graph, URI reference labels are drawn from the lexical space of the anyURI datatype as defined for XML schema datatypes [XML-SCHEMA2], constrained to be an absolute rather than a relative URI reference, and to be in Unicode Normal Form C [NFC] in conformance with [CHARMOD].

Precisely, a URI Reference Label within an RDF graph is a Unicode string [UNICODE] that:

is in Normal Form C [NFC] and
would produce a US-ASCII string that is an absolute URI reference in the form defined by [URIS] as modified by [RFC-2732], when:
- encoded in UTF-8, and
- with disallowed characters escaped according to the percent escaping algorithm below

The disallowed characters that must be %-escaped include all non-ASCII characters, the excluded characters listed in Section 2.4 of [URIS], except for the number sign (#) and percent sign (%) characters and the square bracket characters re-allowed in [RFC-2732].

Disallowed characters must be escaped as follows:

Each disallowed character is converted to UTF-8 [RFC-2279] as one or more bytes.
Any bytes corresponding to a disallowed character are escaped with the URI escaping mechanism (that is, converted to %HH, where HH is the hexadecimal notation of the byte value).
The original character is replaced by the resulting character sequence.

Two URI reference labels within RDF are equal if and only if they compare as equal, character by character, as Unicode strings. A URI reference label is not equal to a string literal label or an XML literal label.

See the following test cases, per [RDF-TESTS]:

3.2 RDF Literals

An RDF literal is either an XML literal or a string literal.

Two RDF literals are equal if and only if they are either both XML literals and equal or both string literals and equal.

3.2.1 String Literals

A string literal label in an RDF graph is composed of a Unicode string [UNICODE] that is in Normal Form C [NFC], and a language identifier that is either null or as specified below.

Two string literals are equal if both components are equal. The Unicode string components are compared on a character by character basis. The language tag components are equal if both are null or if both are defined and equal as language identifiers.

Allowable language identifiers are the legal values for xml:lang as specified by section 2.12, Language Identification, in [XML], or null. Equality of language identifiers (as specified in [RFC-3066]) is defined by case insensitive character by character comparison.

Note: This direct comparison between language identifiers is appropriate for the purpose of defining equality between RDF graphs, but is linguistically naive. [RFC-3066] suggests more advanced comparison techniques.

Note: Literals beginning with a composing character (as defined by [CHARMOD]) are allowed however they may cause interoperability problems, particularly with XML version 1.1 [XML 1.1].

See the following test cases, per [RDF-TESTS]:

[[[Subject to WG disposition of test cases]]]

3.2.2 XML Literals

Within an RDF graph, an XML literal is a Unicode [UNICODE] string paired with a language identifier. The string is well-balanced, self-contained XML element content [XML].

Two XML literals are equal if both components are equal. Comparison of XML literal is described below. The language identifiers are equal if both are null or if both are defined and equal as language identifiers, per [RFC-3066].

Within an RDF graph, an XML literal is a Unicode [UNICODE] string paired with a language identifier.

[[[Edit following to incorporate xml:lang]]]

An XML literal can be used to form an XML document by enclosing it with <tag> and </tag> and encoding the resulting string in UTF-8. No escaping is applied in this process. This resulting document is a well-formed XML document [XML] that also conforms to XML Namespaces [XML-NS].

Note: If compatibility with XML version 1.1 is desired, then XML literals in RDF graphs must be restricted to those that are fully normalized according to [XML 1.1].

Two XML literals are equal if both components are equal. Comparison of XML literals is described below. The language identifiers are equal if both are null or if both are defined and equal as language identifiers, per [RFC-3066].

3.2.2.1 Equality of XML Literals

The definition of equality for XML literals is not precisely defined by this specification. The description given here is used by the RDF Test Cases [RDF-TESTS], and also constrains any implementation defined equality.

Two XML literals may be comparted by the following steps:

Form an XML document from each literal by enclosing it with <tag> and </tag> and encoding the resulting string in UTF-8.
Take the exclusive canonicalization without comments [XC14N] of the element content of the root element of each document.
Compare the two resulting UTF-8 strings byte by byte.

Implementations may specialize this definition of equality (i.e. if two XML literals compare equal according to an implementation then they must compare equal according to this definition, but not conversely).

In particular, implementations may treat XML comments as significant, and may treat namespaces that are in scope but not visibly utilized (as defined by [XC14N]) as significant.

[[[should this para be moved to a longer non-normative appendix which would have the goal of showing the DPH that they can do nearly nothing and still conform with this rather opaque requirement. @@@@ The use of character by character equality between XML literals is discouraged, except in the case where XML literals have already been canonicalized with an appropriate treatment of namespaces. @@@ would a test case showing where naivity is insuifficient be helpful, it would contain two character-by-character identical XML literals which had qnames with namespace prefixes bound to different namespaces]]]

See the following test cases, per [RDF-TESTS]:

[[[Subject to WG disposition of test cases]]]

3.3 Nodes

An RDF graph is defined using a set of nodes. Many of the nodes are blank, and some of the nodes are labelled with RDF literals or RDF URI references, i.e. there is a partial labelling function from the set of nodes to the union of the set of RDF literals and RDF URI references.

A tidy set of nodes is one in which no two nodes have equal labels. A tidy set of nodes may have any number of distinct blank nodes.

Two nodes are equal if and only if they are the same node. In particular, two different blank nodes are not equal.

3.4 RDF triples

An RDF triple describes an arc in an RDF graph. It contains three components:

a subject node
an predicate, labelling the arc, which is a URI reference, and
an object node

The set containing the subject and object nodes of a triple is tidy (per definition in section Nodes).

The subject must not be labelled with an RDF literal.

Two RDF triples are equal if and only if their subjects are equal, their predicates are equal, and their objects are equal.

3.5 RDF graph

An RDF graph is a collection of RDF triples.

The nodes of an RDF graph are the set of nodes that are either subject or object of some triple in the graph.

The set of nodes of an RDF graph is tidy (per definition in section Nodes).

Note: The definition of an RDF graph diverges from the definition of a directed graph in a standard text such as [[[missing ref]]] in that: (a) all nodes must be in at least one arc; (b) all the arcs are labelled; (c) some of the nodes are labelled; (d) labels on nodes are required to be distinct; (e) some labels are shared between nodes and arcs.

3.6 Graph Equality

Two RDF graphs are equal if and only if they are isomorphic. An RDF graph isomorphism is a directed graph isomorphism that respects the labels on both arcs and nodes.

An RDF Graph isomorphism I between two graphs G and G' is a bijection between the nodes of G and the nodes of G', such that:

I(n) is blank if and only if n is blank,
the label (if any) on I(n) equals the label on n, and
the triple (I(s),p,I(o)) is in G' if and only if the triple (s,p,o) is in G,

for all nodes n, s, o in G and all RDF URI references p.

@@@@ I note that I have used a system of typed objects with identity and with named components without introducing or defining it. (Notice that an XML Literal xml"foo"-"en" is distinct from a String Literal "foo"-"en"). Personally I think that introducing and defining such a system will confuse more than enlighten. I could be persuaded ...

3.7 Reserved URIs in graph syntax

@@@Suggestion that this subsection to be deleted, in favour of expressing this as a constraint in the serialization section of the RDF/XML syntax doc.

The following elements of RDF vocabulary have syntactic significance only in the XML serialization, and never appear in the RDF abstract graph syntax:

rdf:RDF
rdf:Description
rdf:ID
rdf:bagID
rdf:about
rdf:resource
rdf:parseType

4. Additional technical considerations

4.1 Character normalization

For the processing of character data that can be represented in different ways, RDF processors are required to conform to Early Uniform Normalization, as described by Character Model for the World Wide Web 1.0 [CHARMOD].

4.2 Fragment identifiers

How should RDF treat a URI reference with a fragment identifier? Conventional web architecture has that the meaning of a fragment identifier is dependent on the MIME type of a resource that is obtained by dereferencing the URI part. URIs without fragment identifiers are generally presumed to map to some resource for which a Web representation (or several) can be retrieved. But RDF has no concept of a fragment identifier separate from a URI: RDF treats a URI reference as an opaque identifier that denotes some resource [RDF-SEMANTICS]. Further, an RDF resource identifier may denote something that is not web-retrievable; e.g. a car, or a Unicorn.

These apparently conflicting interpretations can be reconciled if:

we assume that the URI part (i.e. excluding fragment identifier) of any URI reference used in an RDF document indicates a Web resource with an RDF representation. (If it is not dereferencable as such on the web, so be it, but we must assume its notional existence.) So when someurl#frag is used in an RDF document, someurl is presumed to designate an RDF document.
when used in an rdf document, someurl#frag means the thing that is indicated, according to the rules of the application/rdf+xml mime type as a "fragment" or "view" of the RDF document at someurl. If the document doesn't exist, or can't be retrieved, then exactly what that view may be is somewhat undetermined, but that doesn't stop us from using RDF to say things about it.
the RDF interpretation of a fragment identifier allows it to indicate a thing that is entirely external to the document, or even to the "shared information space" known as the Web. That is, it can be an abstract idea, like my car or a mythical Unicorn.
thus, an RDF document acts as an intermediary between some web retrievable documents (itself, at least, also any other web-retrievable URIs that it may use, including schema URIs and references to other RDF documents), and some set of abstract or non-Web entities that the RDF may describe.

This provides a handling of URI referencess and their denotation that is consistent with the RDF model theory and usage, and also with conventional web axioms. This approach somewhat extends the idea of a "fragment" or "view" beyond the common idea (when handling web documents) that it is a physical part of a containing document.

In view of this, it is reasonable to consider that URIs without fragment identifiers are most helpfully used for indicating web-retrievable resources (when used in RDF), and URIs with fragment identifiers are used for abstract ideas that don't have a direct web representation. This is not a hard-and-fast distinction, as the line between resources having or not having a web-retrievable representation is sometimes hard to draw precisely.

4.3 Forming a URI reference from a Qname

The RDF/XML syntax uses QName syntax [XML-NS] to identify various resources, notably RDF properties. But the RDF graph syntax contains only URI references, and does not recognize QName forms.

Mostly, the handling of QNames is a matter for RDF parsers. But there are some occasions where an RDF writer needs to know the correspondence between QNames and URI references (e.g. when using a typed node production). The mapping is described in [RDF-SYNTAX], sections 3.1.2 or 3.1.4.

5. Acknowledgments

The editors acknowledge valuable contributions of the following:

Frank Manola
Pat Hayes
Dan Brickley

This document is a product of extended deliberations by the RDFcore working group, whose members have included:

Art Barstow, W3C
Dave Beckett, ILRT
Dan Brickley, ILRT
Dan Connolly, W3C
Jeremy Carroll, Hewlett Packard
Ron Daniel, Interwoven Inc
Bill dehOra, InterX
Jos De Roo, AGFA
Jan Grant, ILRT
Graham Klyne, Clearswift and Nine by Nine
Frank Manola, MITRE Corporation
Brian McBride, Hewlett Packard
Stephen Petschulat, IBM
Patrick Stickler, Nokia
Aaron Swartz, HWG
Mike Dean, BBN Technologies / Verizon
R. V. Guha, Alpiri Inc
Pat Hayes, IHMC
Sergey Melnik, Stanford University
Martyn Horner, Profium Ltd

This specification also draws upon an earlier RDF Model and Syntax document edited by Ora Lassilla and Ralph Swick, and RDF Schema edited by Dan Brickley and R. V. Guha. RDF and RDF Schema Working group members who contributed to this earlier work are:

Nick Arnett (Verity)
Tim Berners-Lee (W3C)
Tim Bray (Textuality)
Dan Brickley (ILRT / University of Bristol)
Walter Chang (Adobe)
Sailesh Chutani (Oracle)
Dan Connolly (W3C)
Ron Daniel (DATAFUSION)
Charles Frankston (Microsoft)
Patrick Gannon (CommerceNet)
RV Guha (Epinions, previously of Netscape Communications)
Tom Hill (Apple Computer)
Arthur van Hoff (Marimba)
Renato Iannella (DSTC)
Sandeep Jain (Oracle)
Kevin Jones, (InterMind)
Emiko Kezuka (Digital Vision Laboratories)
Joe Lapp (webMethods Inc.)
Ora Lassila (Nokia Research Center)
Andrew Layman (Microsoft)
Ralph LeVan (OCLC)
John McCarthy (Lawrence Berkeley National Laboratory)
Chris McConnell (Microsoft)
Murray Maloney (Grif)
Michael Mealling (Network Solutions)
Norbert Mikula (DataChannel)
Eric Miller (OCLC)
Jim Miller (W3C, emeritus)
Frank Olken (Lawrence Berkeley National Laboratory)
Jean Paoli (Microsoft)
Sri Raghavan (Digital/Compaq)
Lisa Rein (webMethods Inc.)
Paul Resnick (University of Michigan)
Bill Roberts (KnowledgeCite)
Tsuyoshi Sakata (Digital Vision Laboratories)
Bob Schloss (IBM)
Leon Shklar (Pencom Web Works)
David Singer (IBM)
Wei (William) Song (SISU)
Neel Sundaresan (IBM)
Ralph Swick (W3C)
Naohiko Uramoto (IBM)
Charles Wicksteed (Reuters Ltd.)
Misha Wolf (Reuters Ltd.)
Lauren Wood (SoftQuad)

6. References

6.1 Normative References

[RDF-SYNTAX]: (RDF syntax...)
[RDF-SEMANTICS]: RDF Model Theory, P. Hayes, Editor. Work in progress. World Wide Web Consortium, 14 February 2002. This version of the RDF Model Theory is http://www.w3.org/TR/2002/WD-rdf-mt-20020214. The latest version of the RDF Model Theory is at http://www.w3.org/TR/rdf-mt/.
[RDF-VOCABULARY]: (RDF vocabulary and schema...)
[RDF-DATATYPES]: (RDF datatypes...)
[RDF-MIME-TYPE]: (RDF MIME type registration...)
[RDF-TESTS]: RDF Test Cases, A. Barstow and D. Beckett, Editors. Work in progress. World Wide Web Consortium, 15 November 2001. This version of the RDF Test Cases is http://www.w3.org/TR/2001/WD-rdf-testcases-20011115/. The latest version of the RDF Test Cases is at http://www.w3.org/TR/rdf-testcases.
[XML]: Extensible Markup Language (XML) 1.0, Second Edition, T. Bray, J. Paoli, C.M. Sperberg-McQueen and E. Maler, Editors. World Wide Web Consortium. 6 October 2000. This version is http://www.w3.org/TR/2000/REC-xml-20001006. latest version of XML is available at http://www.w3.org/TR/REC-xml.
[XML-NS]: Namespaces in XML, T. Bray, D. Hollander and A. Layman, Editors. World Wide Web Consortium. 14 January 1999. This version is http://www.w3.org/TR/1999/REC-xml-names-19990114. The latest version of Namespaces in XML is available at http://www.w3.org/TR/REC-xml-names.
[URIS]: RFC 2396 - Uniform Resource Identifiers (URI): Generic Syntax, T. Berners-Lee, R. Fielding and L. Masinter, IETF, August 1998. This document is http://www.isi.edu/in-notes/rfc2396.txt.
[RFC-2732]: (RFC 2732...)
[RFC-2279]: (RFC 2279...)
[UNICODE]: (Unicode...)
[NFC]: (Unicode normal form C...)
[CHARMOD]: Character Model for the World Wide Web 1.0, M. Dürst, F. Yergeau, R. Ishida, M. Wolf, A. Freytag, T Texin, Editors, World Wide Web Consortium Working Draft, work in progress, 20 February 2002. This version of the Character Model is http://www.w3.org/TR/2002/WD-charmod-20020220/. The latest version of the Character Model is at http://www.w3.org/TR/charmod/.
[RFC-3066]: (RFC 3066 -- language tags...)
[XC14N]: (XML canonicalization...)
[KEYWORDS]: RFC 2119 - Key words for use in RFCs to Indicate Requirement Levels, S. Bradner, IETF. March 1997. This document is http://www.ietf.org/rfc/rfc2119.txt.
[RFC-3023]: RFC 3032 - XML Media Types, M. Murata, S. St.Laurent, D.Kohn, IETF. January 2001. This document is http://www.ietf.org/rfc/rfc3023.txt.

6.2 Informational References

[RDF-PRIMER]: RDF Primer, F. Manola, E. Miller, Editors, World Wide Web Consortium W3C Working Draft, work in progress, 19 March 2002. This version of the RDF Primer is http://www.w3.org/TR/2002/WD-rdf-primer-20020319/. The latest version of the RDF Primer is at http://www.w3.org/TR/rdf-primer/.
[XML-1.1]: Extensible Markup Language (XML) 1.1, John Cowan, Editor. World Wide Web Consortium Working Draft 25 April 2002. (Work in progress)
[XML-INFOSET]: (XML infoset document...)
[XML-SCHEMA0]: XML Schema Part 0: Primer - W3C Recommendation, World Wide Web Consortium, 2 May 2001.
[XML-SCHEMA1]: XML Schema Part 1: Structures - W3C Recommendation, World Wide Web Consortium, 2 May 2001.
[XML-SCHEMA2]: XML Schema Part 2: Datatypes - W3C Recommendation, World Wide Web Consortium, 2 May 2001.
[SOWA]: John Sowa, Knowledge Representation, ...
[CG]: Conceptual Graphs (spec)
[KIF]: KIF
[LUGER]: Luger and Subblefield, Artificial Intelligence
[HAYES]: Pat Hayes... in defense of logic
[GRAY]: Peter Gray, Logic, Algebra and Databases,...
[HUNTER]: Geoffrey Hunter, Metalogic ...
[DAVIS]: Ruth E. Davis, Truth, Deduction and Computation ...
[STRIPEDRDF]: RDF: Understanding the Striped RDF/XML Syntax, D. Brickley, W3C, 2001. This document is http://www.w3.org/2001/10/stripes/.
[XML-AS-RDF]: (example of generic XML data that is RDF compatible - draft-klyne-xxx-rfc822-xml-xxx is one...)
[NOTATION3]: Tim Berners-Lee, DesignIssues note on N3, ...
[RDF-MS]: Resource Description Framework (RDF) Model and Syntax Specification, O. Lassila and R. Swick, Editors. World Wide Web Consortium. 22 February 1999. This version is http://www.w3.org/TR/1999/REC-rdf-syntax-19990222. The latest version of RDF M&S is available at http://www.w3.org/TR/REC-rdf-syntax.
[RDF-SCHEMA]: (Original 2000 version of RDF schema...)

Appendix W: Issues addressed in this document

[[[For reviewers' reference. This appendix will be removed on final publication.]]]

For source information, see paragraph-numbered original documents and issue list:

HTML embedding: Section: 6.2
Graph model: Section: 4
MIME content type registration: Section: 3.7
rdf-namespace-change: Section: 3.1. The working group seeks community feedback concerning its decision not to change the namespace URIs, and the rationale noted here.
rdf-charmod-uris: Section: 4.1
rdf-charmod-literals: Section: 4.2
rdf-equivalent-representations: Section 4: graph syntax is primary, other serializations are related to that.
rdfms-xmllang: Section: 4.2.1, 4.2.2
rdfms-assertion: Section: 2.3, 2.3.4
rdfms-literal-is-xml-structure: Section: 4.2.2
rdfms-identity-anon-resources: Section 4: addressed by graph syntax, and also model theory
rdfms-graph: Section 4: (liaise with PatH about possible overlap)
rdfms-literalsubjects: Section 4: graph syntax says literals cannot be subjects.
rdfms-uri-substructure: Section 4: the graph syntax has no QNames. Formation of URI reference from a QName is a syntax issue -- but section 6.4 contains also contains a note.
rdfms-identity-of-statements: Section 4: the graph syntax in this document defines triple equality.
rdfms-fragments: Section 6.3: description of interpretation of fragment identifiers.
rdfms-replace-value: Section 5.5: refers to description in schema doc.
rdfms-validating-embedded-rdf: Non-issue for this document: issue postponed by WG.
rdfms-rdf-names-use: Section 3.1.1: Use of syntactic names is an error, and other undefined rdf(s) names should generate a warning.
[RDF-MS] Literals: Section: 4, 6.1. I18N and character set issues: M&S paras 216-220 are covered in graph syntax and note of character normalization.
[RDF-MS] Transporting RDF: Dropped: Appendix B of M&S contains discussion that doesn't seem to really add any value.
[RDF-MS] XML literals: Section 4.2.2: graph syntax distinguishes string and XML literals. M&S para 203 discusses rdf:parseType; this is now covered by the RDF/XML syntax document.
[RDF-MS] xml:lang: Section 4.2: M&S para 221 covered in graph syntax.
[RDF-MS]: reserved vocab: Sections 3.1, 4.7: M&S para 223 mentions RDF vocabulary that is not available for properties.
[RDF-SCHEMA]: description of container types: Section 5.1: selected material from paras 90, 91 and section 3.5.
[RDF-SCHEMA]: rdfms-boolean-values-properties: Section 6.5: use classes to achieve the same effect.
----

Appendix X: Outstanding issues for this document

[[[For reviewers' reference. This appendix will be removed on final publication.]]]

See: http://lists.w3.org/Archives/Public/www-archive/2001Jun/att-0021/00-part and http://www.w3.org/2000/03/rdf-tracking/.

[RDF-MS] introduction: Sections: 1, 2.1, para60 [[[awaiting WG consensus on shape of this document]]]
References: Complete all reference entries
Review graph definitions: Check for consistency with self, and with other documents
Address inline editorial comments: These are marked with 'class="todo"' in the HTML source.
Test case references: Where used, check that the test case references used are correct and relevant. Also, add hyperlinks.
Section 4, tidiness of literals: Review for consistency with final group decisions on tidiness of literals.
xxxx: xxxx
xxxx: xxxx

Appendix Y: Change log

[[[For reviewers' reference. This appendix will be removed on final publication.]]]

$Log: Overview.htm,v $
Revision 1.17  2002/07/25 13:31:00  graham
Folded in jjc changes to section 3

Revision 1.16  2002/07/23 11:27:12  graham
Remove sections that will be included in the primer:
- RDF in HTML
- Boolean values

Revision 1.15  2002/07/23 11:01:50  graham
Removed "RDF specification" section (was section 3)
Removed "RDF vocabulary" section (was section 5)
Previous section 3.1 listing RDF vocabulary moved to section 2.5.
Drafted abstract
Drafted introduction section.

Revision 1.14  2002/06/29 10:02:08  graham
Add rdf:bagID to syntax-reserved vocabulary
Remove note of character normalization in 4.2.1 (covered later)
Correct reference sub-section numbers

Revision 1.12  2002/06/27 16:53:31  graham
Minor editorial changes
Regenerate table of contents

Revision 1.11  2002/06/27 15:55:09  graham
Added graph equality description

Revision 1.10  2002/06/26 22:05:33  graham
Completed initial cut of all issues
Only introduction and abstract to do

Revision 1.9  2002/06/26 21:41:14  graham
Completed initial coverage of intended semantics
Completed additional technical issues section
Added proposal for fragment identifier handling
Reorganized and cross-reference issues list

Revision 1.7  2002/06/26 10:24:01  graham
Added text for graph syntax, excerpted from:
http://lists.w3.org/Archives/Public/w3c-rdfcore-wg/2002May/att-0089/01-RDF-XML_Syntax_Specification__Revised_.htm

Revision 1.6  2002/06/25 17:49:34  graham
Filled in section 3 content
Included acknowledgements from original RDF documents

Revision 1.5  2002/06/24 16:50:03  graham
Saved 2002-06-24 working copy

Revision 1.4  2002/06/24 16:39:24  graham
Completed initial cut of section 2 text:
- 2.3.1 Semantics from Primer 7.1
- 2.3.2 social meaning adapted from text by DanBri
- 2.3.3-4 from text discussed at face-to-face
Some further renaming of sections

Revision 1.3  2002/06/24 13:27:16  graham
Update current/previous version links

Revision 1.2  2002/06/24 13:22:24  graham
Transcribe initial issue list to appendix X.
Rearrange outline with new sections for graph syntax
and informal semamntics for RDF vocabulary.

Revision 1.1  2002/06/21 14:57:22  graham
Update document name

Revision 1.3  2002/06/21 14:45:34  graham
Futher rearrangement of outline, to accommodate:
- list of RDF vocabulary terms
- RDF-in-HTML
- RDF namespaces
- Addressed issues appendix
- Note about pure syntax vocabulary (e.f. rdf:Description)
Renamed some section titles

Revision 1.2  2002/06/21 10:21:23  graham
Rearranged outline to accommodate material
from the primer on formal semantics

Revision 1.1  2002/06/20 20:47:03  graham
Initial version of document

Resource Description Framework (RDF): Overview and Abstract Data Model

6.1 Normative References

6.2 Informational References

Resource Description Framework (RDF):
Overview and Abstract Data Model