This document is also available in these non-normative formats: PostScript version, PDF version, ZIP archive, and Gzip'd TAR archive.
The English version of this specification is the only normative version. Non-normative translations may also be available.
Copyright © 2007-2008 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark and document use rules apply.
The modern Web is made up of an enormous number of documents that have been created using HTML. These documents contain significant amounts of structured data, which is largely unavailable to tools and applications. When publishers can express this data more completely, and when tools can read it, a new world of user functionality becomes available, letting users transfer structured data between applications and web sites, and allowing browsing applications to improve the user experience: an event on a web page can be directly imported into a user's desktop calendar; a license on a document can be detected so that users can be informed of their rights automatically; a photo's creator, camera setting information, resolution, location and topic can be published as easily as the original photo itself, enabling structured search and sharing.
RDFa is a specification for attributes to be used with languages such as HTML and XHTML to express structured data. The rendered, hypertext data of XHTML is reused by the RDFa markup, so that publishers don't need to repeat significant data in the document content. This document only specifies the use of the RDFa attributes with XHTML. The underlying abstract representation is RDF [RDF-PRIMER], which lets publishers build their own vocabulary, extend others, and evolve their vocabulary with maximal interoperability over time. The expressed structure is closely tied to the data, so that rendered data can be copied and pasted along with its relevant structure.
The rules for interpreting the data are generic, so that there is no need for different rules for different formats; this allows authors and publishers of data to define their own formats without having to update software, register formats via a central authority, or worry that two formats may interfere with each other.
RDFa shares some use cases with microformats. Whereas microformats specify both a syntax for embedding structured data into HTML documents and a vocabulary of specific terms for each microformat, RDFa specifies only a syntax and relies on independent specification of terms (RDF Classes and Properties) by others. RDFa allows terms from multiple independently-developed vocabularies to be freely intermixed and is designed such that the language can be parsed without knowledge of the specific term vocabulary being used.
This document is a detailed syntax specification for RDFa, aimed at:
For those looking for an introduction to the use of RDFa and some real-world examples, please consult the RDFa Primer.
If you are already familiar with RDFa, and you want to examine the processing rules — perhaps to create a parser — then you'll find the Processing Model section of most interest. It contains an overview of each of the processing steps, followed by more detailed sections, one for each rule.
If you are not familiar with RDFa, but you are familiar with RDF, then you might find reading the Syntax Overview useful, before looking at the Processing Model since it gives a range of examples of XHTML mark-up that use RDFa. Seeing some examples first should make reading the processing rules easier.
If you are not familiar with RDF, then you might want to take a look at the section on RDF Terminology before trying to do too much with RDFa. Although RDFa is designed to be easy to author—and authors don't need to understand RDF to use it—anyone writing applications that consume RDFa will need to understand RDF. There is a lot of material about RDF on the web, and a growing range of tools that will support RDFa, so all we try to do in this document is provide enough background on RDF to make the goals of RDFa clearer.
And finally, if you are not familiar with either RDFa or RDF, and simply want to add RDFa to your documents, then you may find the RDFa Primer [RDFaPRIMER] to be a better introduction.
This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.
This is an interim Editor's Draft produced jointly by the Semantic Web Deployment Working Group and the XHTML 2 Working Group. The goals of the Semantic Web Deployment Working Group are discussed in the Semantic web Deployment Working Group Charter and the goals of the XHTML 2 Working Group are discussed in the XHTML 2 Working Group charter.
The design described in this Working Draft is nearly final. The draft contains some editorial notes with distinct markup (see, e.g., the section RDFa Processor Conformance). Some open technical issues are also identified with the same markup. These include an open issue on the interpretation of @instanceof when @about is present, and These include the handling of some unprefixed @rel and @rev values, whether @src sets the subject, and whether @instanceof can apply to @resource. Some open issues not fully identified in this Working Draft may be described in the RDFa issue tracking data.
This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the XHTML 2 group; and another public list of any patent disclosures made in connection with the deliverables of the Semantic Web Deployment Working Group; those pages also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
Publication as a Working Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
Please report errors in this specification to public-rdf-in-xhtml-tf@w3.org (archive).
This section is informative.
RDF/XML [RDF-SYNTAX] provides sufficient flexibility to represent all of the abstract concepts in RDF [RDF-CONCEPTS]. However, it presents two challenges; first it is difficult or impossible to validate documents that contain RDF/XML using XML Schemas or DTDs, which makes it difficult to import RDF/XML into other markup languages. Whilst newer schema languages such as RELAX NG [RELAXNG] do provide a way to validate documents that contain arbitrary RDF/XML, it will be a while before they gain wide support.
Second, even if one could add RDF/XML directly into an XML dialect like XHTML, there would be significant data duplication between the rendered data and the RDF/XML structured data. It would be far better to add RDF to a document without repeating the document's existing data. For example, an XHTML document that explicitly renders its author's name in the text—perhaps as a byline on a news site—should not need to repeat this name for the RDF expression of the same concept: it should be possible to supplement the existing markup in such a way that it can also be interpreted as RDF.
Third, as users often want to transfer structured data from one application to another, sometimes to or from a non-web-based application, it is highly beneficial to express the web data's structure "in context." The user experience could then be enhanced, for example by providing contextual information about specific rendered data, perhaps when the user "right-clicks" on an item of interest.
[It's also a gigantic pain in the ass to author RDF/XML by hand...]
In the past, many attributes were 'hard-wired' directly into the markup language to represent specific concepts. For example, in XHTML 1.1 [XHTML11] and HTML [HTML4] there is @cite; the attribute allows an author to add information to a document which is used to indicate the origin of a quote.
However, these 'hard-wired' attributes make it difficult to define a generic process for extracting metadata from any document since a parser would need to know about each of the special attributes. One motivation for RDFa has been to devise a means by which documents can be augmented with metadata in a general rather than hard-wired manner. This has been achieved by creating a fixed set of attributes and parsing rules, but allowing those attributes to contain properties from any of a number of the growing range of available RDF vocabularies. The values of those properties are in most cases the information that is already in an author's XHTML document.
RDFa alleviates the pressure on XML format authors to anticipate all the structural requirements users of their format might have, by outlining a new syntax for RDF that relies only on XML attributes. This specification deals specifically with the use of RDFa in XHTML, and defines an RDF mapping for a number of XHTML attributes, but RDFa can be easily imported into other XML-based markup languages.
This section is informative.
The following examples are intended to help readers who are not familiar with RDFa to quickly get a sense of how it works. For a more thorough introduction, please read the RDFa Primer [RDFaPRIMER].
RDFa in XHTML makes use of a number of XHTML attributes, as well as providing a few new ones. Attributes that already exist in XHTML will have the same meaning as in XHTML, although their syntax may be slightly modified. For example, in XHTML, @rel already defines the relationship between one document and another. However, in XHTML there is no clear way to add new values; RDFa sets out to explicitly solve this problem, and does so by allowing URIs as values. It also introduces the idea of 'compact URIs'—referred to as CURIEs in this document—which allow a full URI value to be expressed succinctly.
The XHTML attributes that are relevant are:
The new—RDFa-specific—attributes are:
For a normative definition of these attributes see the XHTML Metainformation Attributes Module.
As an XHTML author you will already be familiar with using meta
and link
to add additional information to your documents:
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>Page 7</title>
<meta name="author" content="Mark Birbeck" /> <link rel="prev" href="page6.html" /> <link rel="next" href="page8.html" /> </head> <body>...</body> </html>
RDFa makes use of this concept, enhancing it with the ability to make use of other vocabularies by using compact URIs:
<html
xmlns="http://www.w3.org/1999/xhtml"
xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:dc="http://purl.org/dc/elements/1.1/" > <head> <title>My home-page</title> <meta property="dc:creator" content="Mark Birbeck" />
<link rel="foaf:workplaceHomepage" href="http://www.formsPlayer.com/" />
</head>
<body>...</body>
</html>
Although not widely used, XHTML already supports the use of @rel and @rev on the a
element. This becomes more useful in RDFa with
the addition of support for different vocabularies:
This document is licensed under a
<a xmlns:cc="http://creativecommons.org/licenses/" rel="cc:license" href="http://creativecommons.org/licenses/by/nc-nd/3.0/"> Creative Commons License </a>.
Not only can URLs in the document be re-used to provide metadata, but so can inline text:
<html
xmlns="http://www.w3.org/1999/xhtml"
xmlns:cal="http://www.w3.org/2002/12/cal/ical#" > <head><title>Jo's Friends and Family Blog</title></head> <body> <p> I'm holding <span property="cal:summary">
one last summer Barbecue </span>, on September 16th at 4pm. </p> </body> </html>
If some displayed text is different to the actual 'value' it represents, more precise values can be added, which can optionally include datatypes:
<html
xmlns="http://www.w3.org/1999/xhtml"
xmlns:cal="http://www.w3.org/2002/12/cal/ical#"
xmlns:xsd="http://www.w3.org/2001/XMLSchema" > <head><title>Jo's Friends and Family Blog</title></head> <body> <p> I'm holding <span property="cal:summary"> one last summer Barbecue </span>, on <span property="cal:dtstart" content="20070916T1600-0500" datatype="xsd:datetime">
September 16th at 4pm
</span>.
</p>
</body>
</html>
In many cases a block of mark-up will contain a number of properties that relate to the same item; it's possible with RDFa to indicate the type of that item:
<html
xmlns="http://www.w3.org/1999/xhtml"
xmlns:cal="http://www.w3.org/2002/12/cal/ical#"
xmlns:xsd="http://www.w3.org/2001/XMLSchema"
>
<head><title>Jo's Friends and Family Blog</title></head>
<body>
<p instanceof="cal:Vevent">
I'm holding
<span property="cal:summary">
one last summer Barbecue
</span>,
on
<span property="cal:dtstart" content="20070916T1600-0500"
datatype="xsd:datetime">
September 16th at 4pm
</span>.
</p>
</body>
</html>
The metadata features available in XHTML only allow information to be expressed about the document itself. RDFa provides a means of referring to other documents and resources:
<html
xmlns="http://www.w3.org/1999/xhtml"
xmlns:bib="http://example.org/" [This should be xmlns:biblio=http://example.org/biblio/0.1/ to match the URL provided later in the document]
>
<head>
<title>Books by Marco Pierre White</title>
</head>
<body>
I think
<span about="urn:ISBN:0091808189" instanceof="bib:book"> [This should be biblio:Book]
White's book 'Canteen Cuisine'
</span>
is well worth getting since although it's quite advanced stuff, he
makes it pretty easy to follow. You might also like his
<span about="urn:ISBN:1596913614" instanceof="bib:book"[This should be biblio:Book]>
autobiography
</span>.
</body>
</html>
This section is informative.
The previous section gave examples of typical mark-up in order to illustrate what RDFa in XHTML looks like. But what RDFa in XHTML represents is RDF. In order to author RDFa in XHTML you do not need to understand RDF, although it would certainly help. However, if you are building a system that consumes the RDF output of an RDFa in XHTML document you will almost certainly need to understand RDF. In this section we introduce the basic concepts and terminology of RDF. For a more thorough explanation of RDF, please refer to the RDF Concepts document [RDF-CONCEPTS] and the RDF Sytax Document [RDF-SYNTAX].
The structured data that RDFa provides access to is a collection of statements. A statement is a basic unit of information that has been constructed in a specific format to make it easier to process. In turn, by breaking large sets of information down into a collection of statements, even very complex metadata can be processed using simple rules.
To illustrate, suppose we have the following set of facts:
Albert was born on March 14, 1879, in Germany. There is a picture of him at
the web address, http://en.wikipedia.org/wiki/Image:Albert_Einstein_Head.jpg.
This would be quite difficult for a machine to interpret, and it is certainly not in a format that could be passed from one data application to another. However, if we convert the information to a set of statements it begins to be more manageable. The same information could therefore be represented by the following shorter 'statements':
Albert was born on March 14, 1879.
Albert was born in Germany.
Albert has a picture at
http://en.wikipedia.org/wiki/Image:Albert_Einstein_Head.jpg.
To make this information machine-processable, RDF defines a structure for these statements. A statement is formally called a triple, meaning that it is made up of three components. The first is the subject of the triple, and is what we are making our statements about. In all of these examples the subject is 'Albert'.
The second part of a triple is the property of the subject that we want to define. In the examples here, the properties would be 'was born on', 'was born in', and 'has a picture at'. These are more usually called predicates in RDF.
The final part of a triple is called the object. In the examples here the three objects have the values 'March 14, 1879', 'Germany', and 'http://en.wikipedia.org/wiki/Image:Albert_Einstein_Head.jpg'.
Breaking complex information into manageable units helps us be specific about our data, but there is still some ambiguity. For example, which 'Albert' are we talking about? If another system has more facts about 'Albert', how could we know whether they are about the same person, and so add them to the list of things we know about that person? If we wanted to find people born in Germany, how could we know that the predicate 'was born in' has the same purpose as the predicate 'birthplace' that might exist in some other system? RDF solves this problem by replacing our vague terms with URI references.
URIs are most commonly used to identify web pages, but RDF makes use of them as a way to provide unique identifiers for concepts. For example, we could identify the subject of all of our statements (the first part of each triple) by using the DBPedia [?ref] [Where's this reference?] URI for Albert Einstein, instead of the ambiguous string 'Albert':
<http://dbpedia.org/resource/Albert_Einstein>
has the name
Albert Einstein.
<http://dbpedia.org/resource/Albert_Einstein>
was born on
March 14, 1879.
<http://dbpedia.org/resource/Albert_Einstein>
was born in
Germany.
<http://dbpedia.org/resource/Albert_Einstein>
has a picture at
http://en.wikipedia.org/wiki/Image:Albert_Einstein_Head.jpg.
URI references are also used to uniquely identify the objects in metadata statements (the third part of each triple). The picture of Einstein is already a URI, but we could also use a URI to uniquely identify the country Germany. At the same time we'll indicate that the name and date of birth really are literals (and not URIs), by putting quotes around them:
<http://dbpedia.org/resource/Albert_Einstein>
has the name
"Albert Einstein".
<http://dbpedia.org/resource/Albert_Einstein>
was born on
"March 14, 1879".
<http://dbpedia.org/resource/Albert_Einstein>
was born in
<http://dbpedia.org/resource/Germany>.
<http://dbpedia.org/resource/Albert_Einstein>
has a picture at
<http://en.wikipedia.org/wiki/Image:Albert_Einstein_Head.jpg>.
URI references are also used to ensure that predicates are unambiguous; now we can be sure that 'birthplace', 'place of birth', 'Lieu de naissance' and so on, all mean the same thing:
<http://dbpedia.org/resource/Albert_Einstein>
<http://xmlns.com/foaf/0.1/name> "Albert Einstein". <http://dbpedia.org/resource/Albert_Einstein> <http://dbpedia.org/property/dateOfBirth> "March 14, 1879". <http://dbpedia.org/resource/Albert_Einstein> <http://dbpedia.org/property/birthPlace> <http://dbpedia.org/resource/Germany>. <http://dbpedia.org/resource/Albert_Einstein> <http://xmlns.com/foaf/0.1/depiction> <http://en.wikipedia.org/wiki/Image:Albert_Einstein_Head.jpg>.
Although URI resources are always used for subjects and predicates, the object part of a triple can be either a URI or a literal. In the example triples, Einstein's name is represented by a plain literal, which means that it is a basic string with no type or language information:
<http://dbpedia.org/resource/Albert_Einstein>
<http://xmlns.com/foaf/0.1/name> "Albert Einstein".
Some literals, such as dates and numbers, have very specific
meanings, so RDF provides a mechanism for indicating the type of a
literal. A typed literal is indicated by attaching a URI
to the end of a plain literal, and this URI indicates the literal's
datatype. This URI is usually based on datatypes defined in the XML
Schema Datatypes specification [XMLSCHEMA]. The following syntax would be used to unambiguously express Einstein's date of birth as a literal of type http://www.w3.org/2001/XMLSchema#date
:
<http://dbpedia.org/resource/Albert_Einstein>
<http://dbpedia.org/property/dateOfBirth> "1879-03-14"^^<http://www.w3.org/2001/XMLSchema#date>.
RDF itself does not have one set way to express triples, since the key ideas of RDF are the triple and the use of URIs, and not any particular syntax. However, there are a number of mechanisms, such as RDF/XML, N-Triples [N-TRIPLES], and of course RDFa. Most discussions of RDF make use of the N-Triple syntax to explain their ideas, since it is quite compact. The examples we have just seen are already using this syntax, and we'll continue to use it throughout this document when we need to talk about the RDF that could be generated from some RDFa. There is one small change that we make to N-Triples, which is to allow long URIs to be abbreviated by using a URI mapping. This is indicated by expressing a compact URI as follows:
<http://dbpedia.org/resource/Albert_Einstein>
foaf:name "Albert Einstein" .
<http://dbpedia.org/resource/Albert_Einstein>
p:dateOfBirth "1879-03-14"^^xsd:date .
<http://dbpedia.org/resource/Albert_Einstein>
p:birthPlace <http://dbpedia.org/resource/Germany>.
<http://dbpedia.org/resource/Albert_Einstein>
foaf:depiction <http://en.wikipedia.org/wiki/Image:Albert_Einstein_Head.jpg>.
Here 'p:' has been mapped to the URI for DBPedia, and 'foaf:' has been mapped to the URI for the 'Friend of a Friend' taxonomy. [p: is too short and could be confusing - use dbp: instead]
Note that this is merely a way to make examples more compact and the actual triples generated would always use the full URIs.
When writing examples, you will often see the following URI:
<>
This indicates the 'current document', i.e., the document being processed. In reality there would always be a full URI, but this serves to make examples more compact.
A collection of triples is called a graph.
For more information on the concepts described above, see [RDF-CONCEPTS]. RDFa additionally defines the following terms:
In order to allow for the compact expression of RDF statements, RDFa allows the contraction of all [URI reference]s into a form called a 'compact URI', or [CURIE]. Until recently QNames [QNames] have been the most common way to abbreviate URIs, but there is a well-known limitation that the syntax for QNames does not allow all possible [URI reference]s to be expressed. CURIEs have been specificially designed to look like QNames, but at the same time to get around their syntactic limitations.
Note that CURIEs are only used in the mark-up and N-Triples examples, and will never appear in the generated [triple]s, which will always use [URI reference]s.
The following is a brief description of RDFa in terms of the RDF terminology introduced here. It may be useful to readers with an RDF background:
The aim of RDFa is to allow a single [RDF graph]s to be carried in XML documents of any type, although this specification deals only with RDFa in XHTML. An [RDF graph] comprises [node]s linked by relationships. The basic unit of a graph is a [triple], in which a subject [node] is linked to an object [node] via a [predicate]. The subject [node] is always either an [URI reference] or a [blank node], the predicate is always an [URI reference], and the object of a statement can be an [URI reference], a [literal], or a [blank node].
In RDFa, a subject [URI reference] is generally indicated using @about, and predicates are represented using one of @property, @rel, or @rev. Objects which are [URI reference]s are represented using @href, @resource or @src, whilst objects that are [literal]s are represented either with @content (with an optional [datatype] expressed using @datatype), or the content of the element in question.
This section is normative.
The keywords "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119].
Note that all examples in this document are informative, and are not meant to be interpreted as normative requirements.
A strictly conforming XHTML+RDFa document is a document that requires only the facilities described as mandatory in this specification. Such a document MUST meet all the following criteria:
The document must conform to the constraints expressed in the schemas in Appendix B - XHTML+RDFa Document Type Definition.
The local part of the root element of the document must be html
.
The start tag of the root element of the document must explicitly contain an xmlns
declaration for the XHTML namespace [XMLNAMES]. The namespace URI for XHTML is defined to be http://www.w3.org/1999/xhtml
.
Sample root element
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" >
There MUST be a DOCTYPE declaration in the document prior to the root element. If present, the public identifier included in the DOCTYPE declaration must reference the DTD found in Appendix B - XHTML+RDFa Document Type Definition using its Public Identifier. The system identifier may [Should this be MAY] be modified appropriately. [Is the DOCTYPE strictly required, I thought we discussed that it SHOULD be there, not MUST be there... what if someone wants to cut/paste a snipped of RDFa into their HTML document?]
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"
"http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">
head
element that includes the value http://www.w3.org/ns/rdfa
.Example of an XHTML+RDFa 1.0 document
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"
"http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd" >
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
<head>
<title>Virtual Library</title>
</head>
<body>
<p>Moved to <a href="http://example.org/">example.org</a>.</p>
</body>
</html>
Note that in this example, the XML declaration is included. An XML declaration like the one above is not required in all XML documents. XHTML document authors should use XML declarations in all their documents. XHTML document authors must use an XML declaration when the character encoding of the document is other than the default UTF-8 or UTF-16 and no encoding is specified by a higher-level protocol.
XHTML + RDFa documents SHOULD be labeled with the Internet Media Type "application/xhtml+xml" as defined in [RFC3236]. For further information on using media types with XHTML family markup languages, see the informative note [XHTMLMIME]. [Just curious - why are we using this instread of "application/xhtml+rdfa"? Isn't xhtml+xml sort of redundant? I'm guessing that it's because browsers wouldn't recognize xhtml+rdfa?]
A conforming user agent MUST support all of the features required in this specification. A conforming user agent must also support the User Agent conformance requirements as defined in XHTML Modularization [XHTMLMOD] section on "XHTML Family User Agent Conformance".
A conforming RDFa Processor MUST make available to a consuming application a single RDF [graph] containing all possible triples generated by using the rules in the Processing Model section. This is the 'default [graph]'. [Should this be [default graph]?]
A conforming RDFa Processor MAY make available additional triples that have been generated using rules not described here, but these triples MUST be made available in one or more additional RDF [graph]s, and not in the default [graph]. [Should this be [default graph]?]
A conforming RDFa Processor MUST process whitespace according to the rules of [CSS2]. Note that this same requirement is imposed upon conforming User Agents via [XHTMLMOD].
Test Suite
We have a test suite - we should likely reference it here, but I need to find the exact way to do that. -ShaneAssertion Annotation
All the assertions in this spec need to be annotated with the appropriate markup (must, should, etc.).This section is normative.
This section looks at a generic set of processing rules for creating a set of triples that represent the structured data present in an XHTML+RDFa document. Processing need not follow the DOM traversal technique outlined here, although the effect of following some other manner of processing must be the same as if the processing outlined here were followed. The processing model is explained using the idea of DOM traversal which makes it easier to describe (particularly in relation to the 'evaluation context').
Parsing a document for RDFa triples is carried out by starting at the document object, and then visiting each of its child elements in turn, in document order, applying processing rules. Processing is recursive in that for each child element the processor also visits each of its child elements, and applies the same processing rules.
(Note that in some environments there will be little difference between starting at the root element of the document, and starting at the document object itself. However, we define it this way since in some environments important information is present at the document object level which is not present on the root element.)
As processing continues, rules are applied which will either generate triples, or change the [evaluation context] information which will be used in subsequent processing. Some of the rules will be determined by the host language—in this case XHTML—and some of the rules will be part of RDFa.
Note that we don't say anything about what should happen to the triples generated, or whether more triples might be generated during processing than are outlined here. However, to be conformant, an RDFa processor needs to act as if at a minimum the rules in this section are applied.
During processing, each rule is applied within an 'evaluation context'. Rules may further modify this evaluation context, or create triples that can be established by making use of this evaluation context. The context itself consists of the following pieces of information:
base
element. The important
thing is that it establishes a URL against which relative paths can be evaluated.base
, but it will usually change during the course of processing.In addition, during the course of processing a number of locally scoped values are needed, as follows:
Note that these locally scoped values do not need to be preserved when traversing the tree, so do not need to be part of any stored evalutation context.
RDFa has the notion of 'chaining' which aims to combine statements together in as intuitive a way as possible, so as avoid unnecessary repetition of mark-up. For example, if an author were to add statements as children of an object that was a resource, these statements should be interpreted as being about that resource: [Please use dbp: instead of p: in all these examples]
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:birthPlace" resource="http://dbpedia.org/resource/Germany">
<span property="p:conventionalLongName">Federal Republic of Germany</span> </div> <img rel="foaf:depiction" src="http://en.wikipedia.org/wiki/Image:Albert_Einstein_Head.jpg" /> </div>
In this example we can see that an object resource has become the subject for nested statements. It also illustrates the basic chaining pattern of 'A has a B has a C'.
It's also possible for the subject of nested statements to provide the object for containing statements--essentially the reverse of the example we have just seen. To illustrate, we'll take an example of the type of chaining just described, and show how it could be marked up more efficiently. To start, we mark up the fact that Albert Einstein had both German and American citizenship:
<div about="http://dbpedia.org/resource/Albert_Einstein">
<div rel="p:citizenship" resource="http://dbpedia.org/resource/Germany"></div> <div rel="p:citizenship" resource="http://dbpedia.org/resource/United_States"></div> </div>
Now, we show the same information, but this time we create an 'incomplete triple' from the citizenship part, and then use any number of further subjects to 'complete' that triple, as follows:
<div about="http://dbpedia.org/resource/Albert_Einstein" rel="p:citizenship">
<span about="http://dbpedia.org/resource/Germany"></span> <span about="http://dbpedia.org/resource/United_States"></span> </div>
In this example, the 'unfinished triple' actually gets completed twice, once for Germany and once for the USA, giving exactly the same information as we had in the earlier example:
<http://dbpedia.org/resource/Albert_Einstein>
p:citizenship <http://dbpedia.org/resource/Germany> .
<http://dbpedia.org/resource/Albert_Einstein>
p:citizenship <http://dbpedia.org/resource/United_States> .
Processing would normally begin after the document to be parsed has
been completely loaded. However, there is no requirement for this to be
the case, and it is certainly possible to use a
stream-based approach, such as SAX [http://en.wikipedia.org/wiki/SAX]
to extract the RDFa information. However, if some approach other than
the DOM traversal technique defined here is used, it is
important to ensure that any meta
or link
elements processed in the head
of the document honour any occurrences of base
which may appear
after
those elements. (In other words, XHTML processing rules must still be
applied, even if document processing takes place in a non-HTML
environment such as a search indexer.)
At the beginning of processing, the [current evaluation context] is initialised as follows:
base
element, if present;Processing begins with the document object, and all elements in the tree that contain one or more RDFa attributes (i.e., any attribute that features in the section The RDFa Attributes), are processed according to the rules described below, depth-first.
head
may
contain information that determines how the body should be parsed, the
[evaluation context] as set at the end of processing
head
is then passed to the processing of body
. It is as if instead of the DOM looking like this:
document object
|
|__ html
|
|__ head
|
|__ body
document object
|
|__ html
|
|__ head
|
|__ body
The processing rules are:
Whitespace normalising
So far we defer to CSS2, but I think we should copy the prose into here, so that it's clearer.rdf:XMLLiteral
. The value of the [XML literal] is a string created from the inner content of the [current element], i.e., not including the
element itself, with the datatype of rdf:XMLLiteral
.This section is normative.
This section provides an in-depth examination of the processing steps described in the previous section. It also includes examples which may help clarify some of the steps involved.
In the following examples, for brevity, assume that the following namespace prefixes have been defined:
biblio: | http://example.org/biblio/0.1 |
cc: | http://creativecommons.org/ns# |
dc: | http://purl.org/dc/elements/1.1/ |
ex: | http://example.org/ |
foaf: | http://xmlns.com/foaf/0.1/ |
dbp: | http://dbpedia.org/property/ |
rdf: | http://www.w3.org/1999/02/22-rdf-syntax-ns# |
rdfa: | http://www.w3.org/ns/rdfa/ |
rdfs: | http://www.w3.org/2000/01/rdf-schema# |
svg: | http://www.w3.org/2000/svg |
taxo: | http://purl.org/rss/1.0/modules/taxonomy/ |
xh11: | http://www.w3.org/1999/xhtml |
xsd: | http://www.w3.org/2001/XMLSchema# |
The key to processing is that a triple is generated whenever a predicate/object combination is detected. The actual triple generated will include a subject that may have been set previously, so this is tracked in the [current evaluation context] and is called the [current subject]. Since the subject will default to the current document if it hasn't been set explicitly, then a predicate/object combination is always enough to generate one or more triples.
The attributes for setting a predicate are @rel, @rev and @property, whilst the attributes for setting an object are @resource, @href, @content, and @src [@src is under debate]. @instanceof is unique in that it sets both a predicate and an object at the same time, and inline content might set an object if @content is not present, but @property is [I thought @instanceof only applied to @about?].
When triples are created they will always be in relation to the [current subject].
When parsing begins, the [current subject] will be the URI of the document being parsed, or a value as set by base
. This means that any metadata found in the head
of the
document will concern the document itself:
<html>
<head>
<title>Jo's Friends and Family Blog</title>
<link rel="foaf:primaryTopic" href="#bbq" /> <meta property="dc:creator" content="Jo" /> </head> <body> ... </body> </html>
This would generate the following triples:
<> foaf:primaryTopic <#bbq> .
<> dc:creator "Jo" .
It is possible for the data to appear elsewhere in the document:
<html>
<head>
<title>Jo's Blog</title>
</head>
<body>
<h1><span property="dc:creator">Jo</span>'s blog</h1>
<p>
Welcome to my blog.
</p>
</body>
</html>
which would still generate the triple:
<> dc:creator "Jo" .
The value of base
may change the initial value of [current subject]:
<html>
<head>
<base href="http://www.example.org/jo/blog" /> <title>Jo's Friends and Family Blog</title> <link rel="foaf:primaryTopic" href="#bbq" /> <meta property="dc:creator" content="Jo" /> </head> <body> ... </body> </html>
A parser should now generate the following triples, regardless of the URL from which the XHTML document is served:
<http://www.example.org/jo/blog> foaf:primaryTopic <#bbq> .
<http://www.example.org/jo/blog> dc:creator "Jo" .
As processing progresses, any @about attributes will change the [current subject]. The value of @about
is a URI or a CURIE. If it is a relative
URI then it needs to be resolved against the current [base] value. To
illustrate how this affects the statements, note in this mark-up how
the properties inside the body
element become
part of a new calendar event object, rather than referring to the document as they do in the head of the document:
<html>
<head>
<title>Jo's Friends and Family Blog</title>
<link rel="foaf:primaryTopic" href="#bbq" />
<meta property="dc:creator" content="Jo" />
</head>
<body>
<p about="#bbq" instanceof="cal:Vevent">[Should @instanceof be highlighted in red here too?]
I'm holding
<span property="cal:summary">
one last summer barbecue
</span>,
on
<span property="cal:dtstart" content="20070916T1600-0500"
datatype="xsd:datetime">
September 16th at 4pm
</span>.
</p>
</body>
</html>
With this mark-up a parser should generate the following triples:
<> foaf:primaryTopic <#bbq> .
<> dc:creator "Jo" .
<#bbq> rdf:type cal:Vevent . <#bbq> cal:summary "one last summer barbecue" . <#bbq> cal:dtastart "20070916T1600-0500"^^xsd:datetime .
Other kinds of resources can be used to set the [current subject], not just references to web-pages. Although not advised, email addresses might be used to represent a person:
John knows
<a about="mailto:john@example.org" rel="foaf:knows" href="mailto:sue@example.org">Sue</a>. Sue knows <a about="mailto:sue@example.org" rel="foaf:knows" href="mailto:jim@example.org">Jim</a>.
This should generate the following triples:
<mailto:john@example.org>
foaf:knows <mailto:sue@example.org> .
<mailto:sue@example.org>
foaf:knows <mailto:jim@example.org> .
Similarly, authors may make statements about images:
<div about="photo1.jpg">
this photo was taken by
<span property="dc:creator">Mark Birbeck</span>
</div>
which should generate the following triples:
<photo1.jpg> dc:creator "Mark Birbeck" .
Whilst @about explicitly creates a new context for statements, @instanceof does so implicitly. @instanceof works
differently to other ways of setting a predicate since the predicate is always rdf:type
, which means that the processor only requires one attribute, the value of the type.
Since @instanceof is setting the type of an item, this means that if no item exists one should automatically be created. This involves generating a new bnode, and is examined in more detail below; it is mentioned here because the bnode used by the new item will become the subject for further statements.
For example, an author may wish to create mark-up for a person using the FOAF vocabulary, but without having a clear identifier for the item:
<div instanceof="foaf:Person">
<span property="foaf:name">Albert Einstein</span>
<span property="foaf:givenname">Albert</span>
</div>
This mark-up would cause a bnode to be created which has a 'type' of foaf:Person
, as well as name and given name properties:
_:a rdf:type foaf:Person .
_:a foaf:name "Albert Einstein" .
_:a foaf:givenname "Albert" .
_:a
as being distinct for _:b
. But by not exposing these values to any external software, it is possible to have complete control over the
identifier, as well as preventing further statements being made about the item.As described in the previous two sections, @about will always take precedence and mark a new subject, but if no @about value is available then @instanceof will do the same job, although using an implied identifier, or bnode. [This is a bit confusing... do you mean "using an implied identifier, which is a bnode", here? If so, couldn't you just say "using a bnode"?]
But if neither @about or @instanceof are present, there are a number of ways that the subject could be arrived at. One of these is to 'inherit' the subject from the containing statement, with the value to be inherited set either explicitly, or implicitly.
The most usual way that an inherited subject might get set would be when the parent statement has an object that is a resource. Returning to the earlier example, in which the long name for Germany was added, the following mark-up was used:
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:birthPlace" resource="http://dbpedia.org/resource/Germany" />
<span about="http://dbpedia.org/resource/Germany"
property="p:conventionalLongName">Federal Republic of Germany</span> </div>
In an earlier illustration the subject and object for Germany were elided by removing the @resource, relying on the @about to set the object:
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:birthPlace"> <span about="http://dbpedia.org/resource/Germany" property="p:conventionalLongName">Federal Republic of Germany</span> </div> </div>
but it is also possible for authors to achieve the same effect by removing the @about and leaving the @resource:
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:birthPlace" resource="http://dbpedia.org/resource/Germany">
<span property="p:conventionalLongName">Federal Republic of Germany</span>
</div>
</div>
In this situation, all statements that are 'contained' by the object resource representing Germany (the value in @resource) will have the same subject, making it easy for authors to add additional statements: [While I agree with allowing the author to do this, I don't see how we prevent the following from happening: <div resource="#betty" rel="foaf:knows"><span resource="#phil"></span></div> -- and I'm pretty sure we never talked about allowing something like this to happen... maybe I wasn't there for that discussion?]
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:birthPlace" resource="http://dbpedia.org/resource/Germany">
<span property="p:conventionalLongName">Federal Republic of Germany</span>
<span rel="p:capital" resource="http://dbpedia.org/resource/Berlin" /> </div> </div>
Looking at the triples that a parser would generate, we can see that we actually have two groups of statements; the first group are set to refer to the @about that contains them:
<http://dbpedia.org/resource/Albert_Einstein> foaf:name "Albert Einstein" .
<http://dbpedia.org/resource/Albert_Einstein> p:dateOfBirth "1879-03-14"^^xsd:date .
<http://dbpedia.org/resource/Albert_Einstein> p:birthPlace <http://dbpedia.org/resource/Germany> .
whilst the second group refer to the @resource that contains them:
<http://dbpedia.org/resource/Germany>
p:conventionalLongName "Federal Republic of Germany" .
<http://dbpedia.org/resource/Germany>
p:capital <http://dbpedia.org/resource/Berlin> .
Note also that the same principle described here applies to @src and @href. [So doesn't this mean we can also do: <div href="#betty" rel="foaf:knows"><span href="#phil"></span></div> and it would generate <#betty> <foaf:knows> <#phil> . ?]
There will be occasions when the the author wants to elide the subject and object as shown above, but is not concerned to name the resource that is common to the two statements (i.e., the object of the first statement, which is the subject of the second). For example, to indicate that Einstein was influenced by Spinoza the following mark-up could well be used:
<div about="http://dbpedia.org/resource/Baruch_Spinoza" rel="p:influenced">
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
</div>
</div>
A parser should generate the following triples:
<http://dbpedia.org/resource/Baruch_Spinoza>
p:influenced <http://dbpedia.org/resource/Albert_Einstein> .
<http://dbpedia.org/resource/Albert_Einstein> foaf:name "Albert Einstein" .
<http://dbpedia.org/resource/Albert_Einstein> p:dateOfBirth "1879-03-14"^^xsd:date .
However, an author could just as easily say that Spinoza influenced something by the name of Albert Einstein, that was born on March 14th, 1879:
<div about="http://dbpedia.org/resource/Baruch_Spinoza" rel="p:influenced">
<div> <span property="foaf:name">Albert Einstein</span> <span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span> </div> </div>
In RDF terms, the item that 'represents' Einstein is anonymous, since it has no URI to identify it. However, the item is given an automatically generated bnode, and it is onto this idenfifier that all child statements are attached:
A parser should generate the following triples:
<http://dbpedia.org/resource/Baruch_Spinoza> p:influenced _:a .
_:a foaf:name "Albert Einstein" .
_:a p:dateOfBirth "1879-03-14"^^xsd:date .
Note that the div
is superfluous, and a parser should create the intermediate object even if the element is removed:
<div about="http://dbpedia.org/resource/Baruch_Spinoza" rel="p:influenced">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
</div>
An alternative pattern is to keep the div
and move the @rel onto it:
<div about="http://dbpedia.org/resource/Baruch_Spinoza">
<div rel="p:influenced">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
</div>
</div>
From the point of view of the mark-up, this latter layout is to be preferred, since it draws attention to the 'hanging rel'. But from the point of view of a parser, all of these permutations need to be supported.
When a new subject is calculated, it is also used to complete any incomplete triples that are pending. This situation arises when the author wants to 'chain' a number of statements together. For example, an author could have a statement that Albert Einstein was born in Germany:
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:birthPlace" resource="http://dbpedia.org/resource/Germany" /> </div>
and then a further statement that the 'long name' for Germany is the Federal Republic of Germany:
<span about="http://dbpedia.org/resource/Germany"
property="p:conventionalLongName">Federal Republic of Germany</span>
RDFa allows authors to insert this statement as a self-contained unit into other contexts:
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:birthPlace" resource="http://dbpedia.org/resource/Germany" />
<span about="http://dbpedia.org/resource/Germany"
property="p:conventionalLongName">Federal Republic of Germany</span> </div>
But it also allows authors to avoid unnecessary repetition and to 'normalise' out duplicate identifiers, in this case the one for Germany:
<div about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:birthPlace"> <span about="http://dbpedia.org/resource/Germany" property="p:conventionalLongName">Federal Republic of Germany</span> </div> </div>
When this happens the @rel for 'birth place' is regarded as a 'hanging rel' because it has not yet generated any triples, but these 'incomplete triples' are completed by the @about that appears on the next line. The first step is therefore to store the two parts of the triple that the parser does have, but without an object:
<http://dbpedia.org/resource/Albert_Einstein> p:birthPlace ? .
Then as processing continues, the parser encounters the subject of the statement about the long name for Germany, and this is used in two ways. First it is used to complete the 'incomplete triple':
<http://dbpedia.org/resource/Albert_Einstein> p:birthPlace <http://dbpedia.org/resource/Germany> .
and second it is used to generate its own triple:
<http://dbpedia.org/resource/Germany> p:conventionalLongName "Federal Republic of Germany" .
Note that each occurrence of @about will complete any incomplete triples. For example, to mark up the fact that Albert Einstein had both German and American citizenship, an author need only specify one @rel value that is then used with multiple @about values:
<div about="http://dbpedia.org/resource/Albert_Einstein" rel="p:citizenship">
<span about="http://dbpedia.org/resource/Germany" /> <span about="http://dbpedia.org/resource/United_States" /> </div>
In this example there is This example starts with one incomplete triple:
<http://dbpedia.org/resource/Albert_Einstein> p:citizenship ? .
When the processor meets each of the @about values, this triple is completed, giving:
<http://dbpedia.org/resource/Albert_Einstein>
p:citizenship <http://dbpedia.org/resource/Germany> .
<http://dbpedia.org/resource/Albert_Einstein>
p:citizenship <http://dbpedia.org/resource/United_States> .
These examples show how @about completes triples, but there are other situations that can have the same effect. For example, when @instanceof creates a new bnode (as described above), that will be used to complete any incomplete triples' [Is that trailing ' gramatically correct - I don't know...]. To illustrate, to indicate that Spinoza influenced both Einstein and Schopenhauer, the following mark-up could be used:
<div about="http://dbpedia.org/resource/Baruch_Spinoza">
<div rel="p:influenced">
<div instanceof="foaf:Person">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
</div>
<div instanceof="foaf:Person">
<span property="foaf:name">Arthur Schopenhauer</span>
<span property="p:dateOfBirth" datatype="xsd:date">1788-02-22</span>
</div>
</div>
</div>
First the following incomplete triple is stored:
<http://dbpedia.org/resource/Baruch_Spinoza> p:influenced ? .
Then when the parser processes the two occurences of @instanceof, each generates a bnode, which is used to both complete the 'incomplete triple', and to set the subject for further statements:
<http://dbpedia.org/resource/Baruch_Spinoza"> p:influenced _:a .
_:a rdf:type foaf:Person .
_:a foaf:name "Albert Einstein" .
_:a p:dateOfBirth "1879-03-14"^^xsd:date .
<http://dbpedia.org/resource/Baruch_Spinoza"> p:influenced _:b .
_:b rdf:type foaf:Person .
_:b foaf:name "Arthur Schopenhauer" .
_:b p:dateOfBirth "1788-02-22"^^xsd:date .
Triples are also 'completed' if any one of @property, @rel or @rev are present. However, unlike the situation when @about or @instanceof are present, all predicates are attached to one bnode:
<div about="http://dbpedia.org/resource/Baruch_Spinoza" rel="p:influenced">
<span property="foaf:name">Albert Einstein</span>
<span property="p:dateOfBirth" datatype="xsd:date">1879-03-14</span>
<div rel="p:citizenship">
<span about="http://dbpedia.org/resource/Germany" />
<span about="http://dbpedia.org/resource/United_States" />
</div>
</div>
This example has two 'hanging rels', and so two situations when 'incomplete triples' will be created. Processing would proceed as follows; first an incomplete triple is stored:
<http://dbpedia.org/resource/Baruch_Spinoza> p:influenced ? .
Next, the parser processes the predicate values for foaf:name
, p:dateOfBirth
and p:citizenship
, but note that only the first needs to 'complete' the
'hanging rel'. So processing foaf:name
generates two triples:
<http://dbpedia.org/resource/Baruch_Spinoza> p:influenced _:a .
_:a foaf:name "Albert Einstein" .
but processing p:dateOfBirth
generates only one:
_:a p:dateOfBirth "1879-03-14"^^xsd:date .
Processing p:citizenship
also uses the same bnode, but note that it also generates its own 'incomplete triple':
_:a p:citizenship ? .
As before, the two occurrences of @about complete the 'incomplete triple', once each:
_:a p:citizenship <http://dbpedia.org/resource/Germany> .
_:a p:citizenship <http://dbpedia.org/resource/United_States> .
The entire set of triples that a parser should generate are as follows:
<http://dbpedia.org/resource/Baruch_Spinoza> p:influenced _:a .
_:a foaf:name "Alber Einstein" .
_:a p:dateOfBirth "1879-03-14"^^xsd:date .
_:a p:citizenship <http://dbpedia.org/resource/Germany> .
_:a p:citizenship <http://dbpedia.org/resource/United_States> .
Although objects have been discussed in the previous sections, as part of the explanation of subject resolution, chaining, evaluation contexts, and so on, this section will look at objects in more detail.
There are two types of object, [URI resource]s and [literal]s.
A [literal] object can be set by using @property to express a [predicate], and then using either @content, or the inline text of the element that @property is on.
A [URI resource] object can be set using one of @rel or @rev to express a [predicate], and then either using one of @href, @resource or @src [@src debate] to provide an object resource explicitly, or using the chaining techniques described above to obtain an object from a nested subject, or from a bnode.
An [object literal] will be generated when @property is present. @property provides the predicate, and the following sections describe how the actual literal to be generated is determined.
@content can be used to indicate a [plain literal], as follows:
<meta about="http://internet-apps.blogspot.com/"
property="dc:creator" content="Mark Birbeck" />
The [plain literal] can also be specified by using the content of the element:
<span about="http://internet-apps.blogspot.com/"
property="dc:creator">Mark Birbeck</span>
Both of these examples give the following triple:
<http://internet-apps.blogspot.com/> dc:creator "Mark Birbeck" .
The value of @content is given precedence over any element content, so the following would give exactly the same triple:
<span about="http://internet-apps.blogspot.com/"
property="dc:creator" content="Mark Birbeck">John Doe</span>
RDF allows [plain literal]s to have a language tag, as illustrated by the following example from [RDFTESTS-RDFMS-XMLLANG-TEST006]:
<http://example.org/node>
<http://example.org/property> "chat"@fr .
In RDFa the XML language attribute @xml:lang is used to add this information, whether the plain literal is designated by @content, or by the inline text of the element:
<meta about="http://example.org/node"
property="ex:property" xml:lang="fr" content="chat" />
Note that the language value can be inherited as defined in [XML-LANG], so the following syntax will give the same triple as above:
<html xmlns="http://www.w3.org/1999/xhtml"
xmlns:ex="http://www.example.com/ns/" xml:lang="fr">
<head>
<title xml:lang="en">Example</title>
<meta about="http://example.org/node"
property="ex:property" content="chat" />
</head>
...
</html>
[Literal]s can be given a data type using @datatype.
This can be represented in RDFa as follows:
<span property="cal:dtstart" content="20070916T1600-0500"
datatype="xsd:datetime">
September 16th at 4pm
</span>.
The triples that this mark-up generates include the datatype after the literal:
<> cal:dtstart "20070916T1600-0500"^^xsd:datetime .
XML documents cannot contain XML mark-up in their attributes, which means it is not possible to represent XML within @content (the following would cause an XML parser to generate an error):
<head>
<meta property="dc:title"
content="E = mc<sup>2</sup>: The Most Urgent Problem of Our Time" />
</head>
It does not help to escape the content, since the output would simply be a string of text containing numerous ampersands:
<> dc:title "E = mc<sup>2&lt;/sup>: The Most Urgent Problem of Our Time" .
RDFa therefore supports the use of normal mark-up to express XML literals, by using @datatype: [The markup that follows and the triple that is generated depends on how the browser cononicalizes the whitespace, we shouldn't assume a certain canonicalization - this example should not have carraige returns in it.]
<h2 property="dc:title" datatype="rdf:XMLLiteral">
E = mc<sup>2</sup>: The Most Urgent Problem of Our Time </h2>
This would generate the following triple, with the XML preserved in the literal [Again, this would only generate the following triple if we were using Internet Explorer... Firefox would preserve the \n if we were using Javascript to parse the DOM]:
<> dc:title "E = mc<sup>2</sup>: The Most Urgent Problem of Our Time"^^rdf:XMLLiteral .
Note that this requires that a URI mapping for the prefix rdf
has been defined. To make authoring easier, if there are child elements and no @datatype
attribute, then the effect is the same as if @datatype have been explicitly set to rdf:XMLLiteral
:
<h2 property="dc:title">
E = mc<sup>2</sup>: The Most Urgent Problem of Our Time
</h2>
In the examples given here the sup
element is actually
part of the meaning of the literal, but there will be situations where
the extra mark-up means nothing, and can therefore be
ignored. In this situation an empty @datatype value can be used to override the XML literal behaviour:
<p>You searched for <strong>Einstein</strong>:</p>
<p about="http://dbpedia.org/resource/Albert_Einstein">
<span property="foaf:name" datatype="">Albert <strong>Einstein</strong></span>
(March 14, 1879 –[UTF-8 issue for the previous 3 characters? Is the problem in my browser?] April 18, 1955) was a German-born theoretical physicist.
</p>
Although the rendering of this page has highlighted the term the user searched for, setting @datatype to nothing ensures that the data is interpreted as a plain literal, giving the following triples:
<http://dbpedia.org/resource/Albert_Einstein> foaf:name "Albert Einstein" .
Note that the value of this [XML Literal] is the exclusive canonicalization of the RDFa element's value.
Although the RDFa processing model requires visiting each element in the tree, if the processor meets an [XML literal] then it MUST NOT process any further down the tree. This is to prevent triples being generated from mark-up that is not actually in the hierarchy. For example, we might want to set the title of something to some XHTML that itself includes RDFa:
<h2 property="dc:title">
Example 3: <span about="#bbq" instanceof="cal:Vevent">...</span>
</h2>
In this example the nested RDFa should not be parsed. This effectively means that the presence of @property without @content will inhibit any further processing, so authors should watch out for stray attributes, especially if they find that they are getting fewer triples than they had expected.
Most of the rules governing the processing of objects that are resources are to be found in the processing descriptions given above, since they are important for establishing the subject. This section aims to highlight general concepts, and anything that might have been missed.
One or more [URI object]s are needed when @rel or @rev is present. Each attribute will cause triples to be generated when used with @href, @resource or @src, or with the subject value of any nested statement if none of these attributes are present.
@rel and @rev are essentially the inverse of each other; whilst @rel establishes a relationship between the [current subject] as subject, and the [object resource] as the object, @rev does the exact opposite, and uses the [object resource] as the subject, and the [current subject] as the object.
RDFa provides the @resource attribute as a way to set the object of statements. This is particularly useful when referring to resources that are not themselves navigable links:
<html>
<head>
<title>On Crime and Punishment</title>
<base href="http://www.example.com/candp.xhtml" />
</head>
<body>
<blockquote about="#q1" rel="dc:source" resource="urn:ISBN:0140449132" >
<p id="q1">
Rodion Romanovitch! My dear friend! If you go on in this way
you will go mad, I am positive! Drink, pray, if only a few drops!
</p>
</blockquote>
</body>
</html>
The blockquote
element generates the following triple:
<http://www.example.com/candp.xhtml#q1>
<http://purl.org/dc/elements/1.1/source> <urn:ISBN:0140449132>.
If no @resource is present, then @href is next in priority order, for setting the object.
When a predicate has been expressed using @rel, the @href on the [RDFa statement]'s element is used to identify the object with a [URI reference]. Its type is a URI:
<link about="mailto:john@example.org"
rel="foaf:knows" href="mailto:sue@example.org" />
It's also possible to use both @rel and @rev at the same time on an element. This is particularly useful when two things stand in two different relationships with each other, for example when a picture is taken by Mark, but that picture also depicts him:
<p>This photo was taken by
<a about="photo1.jpg" rel="dc:creator" rev="foaf:img" href="http://www.blogger.com/profile/1109404">Mark Birbeck</a>.</p>
which then yields two triples:
<photo1.jpg>
dc:creator <http://www.blogger.com/profile/1109404> .
<http://www.blogger.com/profile/1109404>
foaf:img <photo1.jpg> .
If neither @resource nor @href is present, then @src is next in priority order, for setting the object. For example,
we could indicate an image to use to represent a person, with the foaf:img
predicate, as follows: [Just flagging this because @src generating an object needs to be discussed further]
<img about="http://www.blogger.com/profile/1109404" rel="foaf:img" src="photo1.jpg" />
which then yields two triples:
<http://www.blogger.com/profile/1109404> foaf:img <photo1.jpg> .
When a triple predicate has been expressed using @rel or @rev, but no @href, @src, or @resource exists on the same [RDFa element], there is a 'hanging rel'. This causes the current subject and all possible predicates (with an indicator of whether they are 'forwards, i.e., @rel values, or not, i.e., @rev values), to be stored as 'incomplete triples' pending discovery of a subject that could be used to 'complete' those triples.
This process is described in more detail in Completing 'Incomplete Triples'.
To establish relationships between [blank node]s, the [unique anonymous ID] must be set explicity using a CURIE [blank node] as subject or object. For example, if our desired output is the following [triple]s:
_:a foaf:mbox <mailto:john@example.org> .
_:b foaf:mbox <mailto:sue@example.org> .
_:a foaf:knows _:b .
we could use the following XHTML:
<link about="[_:a]" rel="foaf:mbox"
href="mailto:john@example.org" />
<link about="[_:b]" rel="foaf:mbox"
href="mailto:sue@example.org" />
<link about="[_:a]" rel="foaf:knows"
href="[_:b]" />
or, alternatively, if we wish to partly render the information in XHTML:
<div about="[_:a]">
JohnSmith can be reached via
<a rel="foaf:mbox"
href="mailto:john@example.org">
</a>.
<span rel="foaf:knows" resource="[_:b]">He knows Sue.</span>
</div>
<div about="[_:b]">
Sue can be reached via
<a rel="foaf:mbox"
href="mailto:sue@example.org">
</a>
</div>
This section is normative.
The key component of RDF is the URI, but they are usually long and unwieldy. RDFa therefore supports a mechanism by which URIs can be abbreviated, called 'compact URIs' or simply, CURIEs.
A CURIE is comprised of two components, a prefix which maps to a URI, and a reference. The prefix is separated from the reference by a colon (:
). It is possible to
omit the prefix, and make use of the default prefix. It is also possible to omit both the prefix and the colon, leaving just a reference.
curie := [ [ prefix ] ':' ] reference prefix := NCName reference := irelative-ref (as defined in [IRI])
In some situations an attribute will allow either a CURIE, or a normal IRI. Since it is difficult to distinguish between CURIEs and IRIs, RDFa adds the notion of a [safe CURIE]. The syntax is simply to surround the CURIE with square brackets:
safe_curie := '[' curie ']'
To evaluate CURIEs during processing the following context needs to be set: [This next part is confusing to me. It seems like you're asking questions (beside the bullets) and then immediately answering them (in the blue parts). I had to read this next bit 3 times to understand what was going on. Please reorganize and just state each blue section as fact. For example: "The mapping to use with the default prefix, for example :next, is the current default namespace." Even that is a little confusing. What I'd really like to see is a "Here's how you setup the context mappings:" section.]
:p
); [I don't understand where ':p' came from or what it means. Use ':next' as an example.]
p
); [I don't understand where 'p' came from or what it means. Use 'next' as an example]
http://www.w3.org/1999/xhtml#
._:p
). ['_:p' doesn't make sense, use something like _:bnode0 as an example]
clarify the 'no prefix' situation
The advantage of setting the 'no prefix' mapping to the XHTML namespace is that we no longer need a preprocessing step to handle XHTML link types, such as anext
. However, this does have
the effect of moving all other values into the XHTML namespace, such as openid.delegate
. An alternative is to prohibit unprefixed CURIEs, other than those defined by XHTML.A CURIE is a representation of a full IRI. This
IRI is obtained by taking the currently in-scope mapping that is
associated with prefix
, and concatenating it with
the reference
. The result MUST be a syntactically valid IRI [IRI].
This section is normative.
The XHTML+RDFa document type is a fully functional document type with rich semantics. It is a superset of [XHTML11]. See that document for the details of the underlying language.
The XHTML+RDFa 1.0 document type is made up of the following XHTML modules. The elements, attributes, and content models associated with these modules are defined in "XHTML Modularization" [XHTMLMOD]. The elements are listed here for information purposes, but the definitions in "XHTML Modularization" should be considered authoritative. In the on-line version of this document, the module names in the list below link into the definitions of the modules within the current versions of "XHTML Modularization".
body, head, html, title
abbr, acronym, address, blockquote, br, cite, code, dfn, div,
em, h1, h2, h3, h4, h5, h6, kbd, p, pre, q, samp, span, strong, var
a
, and @href is available on all elements.dl, dt, dd, ol, ul, li
object, param
b, big, hr, i, small, sub, sup, tt
del, ins
bdo
button, fieldset, form, input, label, legend, select, optgroup, option, textarea
caption, col, colgroup, table, tbody, td, tfoot, th, thead, tr
img
area, map
ismap
on img
meta
noscript, script
style
elementlink
base
XHTML+RDFa also uses the Ruby Annotation module as defined in [RUBY]:
ruby, rbc, rtc, rb, rt, rp
There are no additional definitions required by this document type. An implementation of this document type as an XML DTD is defined in Appendix B.
This section is normative.
The Metainformation Attributes Module defines the Metainformation
attribute collection. This collection allows elements to be annotated with metadata throughout an XHTML-family
document. When this module is included in a markup language, this collection is added to the Common
attribute collection as defined in [XHTMLMOD].
Some of the attributes in this section use the following datatype:
Data type | Description |
---|---|
CURIE | A Compact URI or CURIE. |
URIorCURIE | A URI or safe_curie. |
This attribute specifies a URIorCURIE that indicates which resource has a specified property.
<meta about="http://www.example.com/" property="dc:created">2004-03-20</meta>
This attribute specifies a value of type CDATA that defines the metadata associated
with an element. If not specified, then the metadata for an element is its content. If it is specified, and there is no property
attribute, then the property is considered to be
reference
.
<meta about="http://www.example.com/" property="dc:created" content="2004-03-20"/>
This attribute defines as a CURIE the datatype of the content metadata of the element. If the attribute is not specified, then the default
value is string
as defined by [XMLSCHEMA].
<meta about="http://www.example.com/" property="dc:created" datatype="xsd:date">2004-03-20</meta>
This attribute indicates the rdf:type
of the associated resource(s).
[We need an @instanceof example here]
This attribute specifies a space-separated list of CURIEs that indicates which property is being defined by the element.
<meta about="http://www.example.com/" property="dc:creator">John Smith</meta>
The list of predefined values (in the XHTML namesapace)
is given below. Users may extend this collection of relationships,
however new values must be defined in their own vocabulary, and the
relationship names must be referenced in documents as CURIEs (e.g., dc:creator
for the Dublin Core "creator" relationship).
<html .... xmlns:dc="http://purl.org/dc/elements/1.1/">
Note that previous versions of XHTML included an author
property; this has now been replaced with the Dublin Core creator
property.
Note that the @title attribute is just a shorthand for a common case:
<a href="Jakob.html" title="Author biography">Jakob Nielsen</a>'s Alertbox for January 11, 1998
is equivalent to:
<h2 about="#jakob" property="title">Author biography</h2>
<p><a href="Jakob.html" id="jakob">Jakob Nielsen</a>'s Alertbox for January 11, 1998</p>
This allows you to specify richer, marked-up text for a title when needed.
This attribute describes the relationship between the resource specified by @about (or its default value) and the resource referred to by @href as defined in XHTML. The type for this attribute is a space-separated list of CURIEs.
<link href="top.html" rel="contents" />
This example defines a link to a table of contents for the current document.
<link href="doc.ps"
rel="alternate" media="print" type="application/postscript" />
This example defines a link to an alternate version of the document especially suited to printing.
Authors may use the following relationship names, listed here with their conventional interpretations.
User agents, search engines, etc. may interpret these relationships in a variety of ways. For example, user agents may provide access to linked documents through a navigation bar.
Users may extend this collection of relationships. However, extensions must be defined in their own vocabulary, and the relationship names must be referenced in documents as CURIEs (e.g., dc:creator for the Dublin Core "creator" relationship).
Note that in order to reference relationship definitions via CURIE, their prefix must be defined via an xmlns attribute somewhere suitable:
<html .... xmlns:dc="http://purl.org/dc/elements/1.1/">
Refers to a resource that defines a citation. In the following example, the cite
is used to reference the book from which the quotation is taken:
cite as book reference
As Gandalf the White said in
<span rel="cite" about="http://www.example.com/books/the_two_towers">
The Two Towers
</span>,
<quote xml:lang="en">"The hospitality of
your hall is somewhat lessened of late, Theoden King."</quote>
cite to reference another specification
More information can be found in
<span property="cite" about="http://www.w3.org/TR/REC-xml">[XML]</cite>.
No end or last value
We have a value of "start", but no corresponding "end" value. Do we need one?This attribute takes a URIorCURIE, and can be used to define the resource referenced by a @rel, @rev, or @property attribute. When provided, the value of @resource supercedes any value for the @href attribute on the same element.
This attribute is the complement of the @rel attribute and describes the reverse relationship between the resource specified by the @about attribute (or its default value) and the resource referred to by the @href attribute [That's not the only thing @rev does, it applies to more than @about and @href... @instanceof and @resource, for example... take this out, it's confusing and we state pretty clearly what @rev applies to earlier in the document]. Its value is a space-separated list of CURIEs. For a list of relationship names, see the @rel attribute [ref or link back to 9.2.6?].
<link href="doc.html" rev="contents"/>
This example states that the current document is the table of contents for the referenced document shown in the @href.
An implementation of this module can be found in Appendix B.
This section is informative.
While outside the scope of this specification, RDFa is intended to be extensible for use in languages beyond XHTML 1.1. The module defined in Metainformation Attibutes Module is compatible with XHTML Modularization [XHTMLMOD], and can be used to construct other such languages.
If a language includes @xml:base [XMLBASE], an RDFa parser for that host language must process it, and use its value to set [base].
An example follows to show how @xml:base affects the subject: [Both about=""s below are superfluous, we should remove them]
<span xml:base="http://internet-apps.blogspot.com/">
<span about="" rel="dc:creator" href="http://www.blogger.com/profile/1109404" />
<span about="" property="dc:title" content="Internet Applications" />
</span>
The triples generated would be as follows:
<http://internet-apps.blogspot.com/>
dc:creator <http://www.blogger.com/profile/1109404> .
<http://internet-apps.blogspot.com/>
dc:title "Internet Applications" .
This appendix is normative.
This appendix includes an implementation of the XHTML+RDFa 1.0 language as an XML DTD. It is implemented by by combining the XHTML 1.1 DTD with the XHTML Metainformation Attribute Module. This is done by using a content model module, and then a driver module:
<!-- ...................................................................... -->
<!-- XHTML MetaAttributes Module ......................................... -->
<!-- file: xhtml-metaAttributes-1.mod
This is XHTML-RDFa, modules to annotate XHTML family documents.
Copyright 2007-2008 W3C (MIT, ERCIM, Keio), All Rights Reserved.
Revision: $Id: Overview.html,v 1.2 2008/01/03 18:51:01 jigsaw Exp $
This DTD module is identified by the PUBLIC and SYSTEM identifiers:
PUBLIC "-//W3C//ENTITIES XHTML MetaAttributes 1.0//EN"
SYSTEM "http://www.w3.org/MarkUp/DTD/xhtml-metaAttributes-1.mod"
Revisions:
(none)
....................................................................... -->
<!-- Common Attributes
This module declares a collection of meta-information related
attributes.
%NS.decl.attrib; is declared in the XHTML Qname module.
This file also includes declarations of "global" versions of the
attributes. The global versions of the attributes are for use on
elements in other namespaces.
-->
<!ENTITY % QName.datatype "CDATA" >
<!ENTITY % QNames.datatype "CDATA" >
<!ENTITY % about.attrib "about %URI.datatype; #IMPLIED" > <![%XHTML.global.attrs.prefixed;[ <!ENTITY % XHTML.global.about.attrib "%XHTML.prefix;:about %URI.datatype; #IMPLIED" > ]]> <!ENTITY % instanceof.attrib "instanceof %QName.datatype; #IMPLIED" > <![%XHTML.global.attrs.prefixed;[ <!ENTITY % XHTML.global.instanceof.attrib "%XHTML.prefix;:instanceof %QName.datatype; #IMPLIED" > ]]> <!ENTITY % property.attrib "property %QNames.datatype; #IMPLIED" > <![%XHTML.global.attrs.prefixed;[ <!ENTITY % XHTML.global.property.attrib "%XHTML.prefix;:property %QNames.datatype; #IMPLIED" > ]]> <!ENTITY % resource.attrib "resource %URI.datatype; #IMPLIED" > <![%XHTML.global.attrs.prefixed;[ <!ENTITY % XHTML.global.resource.attrib "%XHTML.prefix;:resource %URI.datatype; #IMPLIED" > ]]> <!ENTITY % content.attrib "content CDATA #IMPLIED" > <![%XHTML.global.attrs.prefixed;[ <!ENTITY % XHTML.global.content.attrib "%XHTML.prefix;:content CDATA #IMPLIED" > ]]> <!ENTITY % datatype.attrib "datatype %QName.datatype; #IMPLIED" > <![%XHTML.global.attrs.prefixed;[ <!ENTITY % XHTML.global.datatype.attrib "%XHTML.prefix;:datatype %QName.datatype; #IMPLIED" > ]]> <!ENTITY % rel.attrib "rel %QNames.datatype; #IMPLIED" > <![%XHTML.global.attrs.prefixed;[ <!ENTITY % XHTML.global.rel.attrib "%XHTML.prefix;:rel %QNames.datatype; #IMPLIED" > ]]> <!ENTITY % rev.attrib "rev %QNames.datatype; #IMPLIED" > <![%XHTML.global.attrs.prefixed;[ <!ENTITY % XHTML.global.rev.attrib "%XHTML.prefix;:rev %QNames.datatype; #IMPLIED" > ]]> <!ENTITY % Metainformation.extra.attrib "" >
<!ENTITY % Metainformation.attrib "%about.attrib; %content.attrib; %datatype.attrib; %instanceof.attrib; %property.attrib; %rel.attrib; %resource.attrib; %rev.attrib; %Metainformation.extra.attrib;" > <!ENTITY % XHTML.global.metainformation.extra.attrib "" >
<![%XHTML.global.attrs.prefixed;[
<!ENTITY % XHTML.global.metainformation.attrib "%XHTML.global.about.attrib; %XHTML.global.content.attrib; %XHTML.global.datatype.attrib; %XHTML.global.instanceof.attrib; %XHTML.global.property.attrib; %XHTML.global.rel.attrib; %XHTML.global.resource.attrib; %XHTML.global.rev.attrib; %XHTML.global.metainformation.extra.attrib;" > ]]> <!ENTITY % XHTML.global.metainformation.attrib "" >
<!-- end of xhtml-metaAttributes-1.mod -->
<!-- ....................................................................... -->
<!-- XHTML+RDFa Document Model Module ..................................... -->
<!-- file: xhtml-rdfa-model-1.mod
This is XHTML+RDFa.
Copyright 1998-2008 W3C (MIT, ERCIM, Keio), All Rights Reserved.
Revision: $Id: Overview.html,v 1.2 2008/01/03 18:51:01 jigsaw Exp $ SMI
This DTD module is identified by the PUBLIC and SYSTEM identifiers:
PUBLIC "-//W3C//ENTITIES XHTML+RDFa Document Model 1.0//EN"
SYSTEM "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-model-1.mod"
Revisions:
(none)
....................................................................... -->
<!-- XHTML+RDFa Document Model
This module describes the groupings of elements that make up
common content models for XHTML elements.
XHTML has three basic content models:
%Inline.mix; character-level elements
%Block.mix; block-like elements, eg., paragraphs and lists
%Flow.mix; any block or inline elements
Any parameter entities declared in this module may be used
to create element content models, but the above three are
considered 'global' (insofar as that term applies here).
The reserved word '#PCDATA' (indicating a text string) is now
included explicitly with each element declaration that is
declared as mixed content, as XML requires that this token
occur first in a content model specification.
-->
<!-- Extending the Model
While in some cases this module may need to be rewritten to
accommodate changes to the document model, minor extensions
may be accomplished by redeclaring any of the three *.extra;
parameter entities to contain extension element types as follows:
%Misc.extra; whose parent may be any block or
inline element.
%Inline.extra; whose parent may be any inline element.
%Block.extra; whose parent may be any block element.
If used, these parameter entities must be an OR-separated
list beginning with an OR separator ("|"), eg., "| a | b | c"
All block and inline *.class parameter entities not part
of the *struct.class classes begin with "| " to allow for
exclusion from mixes.
-->
<!-- .............. Optional Elements in head .................. -->
<!ENTITY % HeadOpts.mix "( %script.qname; | %style.qname; | %meta.qname; | %link.qname; | %object.qname; )*" > <!-- ................. Miscellaneous Elements .................. --> <!-- ins and del are used to denote editing changes --> <!ENTITY % Edit.class "| %ins.qname; | %del.qname;" >
<!-- script and noscript are used to contain scripts
and alternative content
-->
<!ENTITY % Script.class "| %script.qname; | %noscript.qname;" >
<!ENTITY % Misc.extra "" >
<!-- These elements are neither block nor inline, and can
essentially be used anywhere in the document body.
-->
<!ENTITY % Misc.class "%Edit.class; %Script.class; %Misc.extra;" > <!-- .................... Inline Elements ...................... --> <!ENTITY % InlStruct.class "%br.qname; | %span.qname;" >
<!ENTITY % InlPhras.class "| %em.qname; | %strong.qname; | %dfn.qname; | %code.qname; | %samp.qname; | %kbd.qname; | %var.qname; | %cite.qname; | %abbr.qname; | %acronym.qname; | %q.qname;" > <!ENTITY % InlPres.class "| %tt.qname; | %i.qname; | %b.qname; | %big.qname; | %small.qname; | %sub.qname; | %sup.qname;" > <!ENTITY % I18n.class "| %bdo.qname;" >
<!ENTITY % Anchor.class "| %a.qname;" >
<!ENTITY % InlSpecial.class "| %img.qname; | %map.qname; | %object.qname;" > <!ENTITY % InlForm.class "| %input.qname; | %select.qname; | %textarea.qname; | %label.qname; | %button.qname;" > <!ENTITY % Inline.extra "" >
<!ENTITY % Ruby.class "| %ruby.qname;" >
<!-- %Inline.class; includes all inline elements,
used as a component in mixes
-->
<!ENTITY % Inline.class "%InlStruct.class; %InlPhras.class; %InlPres.class; %I18n.class; %Anchor.class; %InlSpecial.class; %InlForm.class; %Ruby.class; %Inline.extra;" > <!-- %InlNoRuby.class; includes all inline elements except ruby, used as a component in mixes --> <!ENTITY % InlNoRuby.class "%InlStruct.class; %InlPhras.class; %InlPres.class; %I18n.class; %Anchor.class; %InlSpecial.class; %InlForm.class; %Inline.extra;" > <!-- %NoRuby.content; includes all inlines except ruby --> <!ENTITY % NoRuby.content "( #PCDATA | %InlNoRuby.class; %Misc.class; )*" > <!-- %InlNoAnchor.class; includes all non-anchor inlines, used as a component in mixes --> <!ENTITY % InlNoAnchor.class "%InlStruct.class; %InlPhras.class; %InlPres.class; %I18n.class; %InlSpecial.class; %InlForm.class; %Ruby.class; %Inline.extra;" > <!-- %InlNoAnchor.mix; includes all non-anchor inlines --> <!ENTITY % InlNoAnchor.mix "%InlNoAnchor.class; %Misc.class;" > <!-- %Inline.mix; includes all inline elements, including %Misc.class; --> <!ENTITY % Inline.mix "%Inline.class; %Misc.class;" > <!-- ..................... Block Elements ...................... --> <!-- In the HTML 4.0 DTD, heading and list elements were included in the %block; parameter entity. The %Heading.class; and %List.class; parameter entities must now be included explicitly on element declarations where desired. --> <!ENTITY % Heading.class "%h1.qname; | %h2.qname; | %h3.qname; | %h4.qname; | %h5.qname; | %h6.qname;" > <!ENTITY % List.class "%ul.qname; | %ol.qname; | %dl.qname;" >
<!ENTITY % Table.class "| %table.qname;" >
<!ENTITY % Form.class "| %form.qname;" >
<!ENTITY % Fieldset.class "| %fieldset.qname;" >
<!ENTITY % BlkStruct.class "%p.qname; | %div.qname;" >
<!ENTITY % BlkPhras.class "| %pre.qname; | %blockquote.qname; | %address.qname;" > <!ENTITY % BlkPres.class "| %hr.qname; " >
<!ENTITY % BlkSpecial.class "%Table.class; %Form.class; %Fieldset.class;" > <!ENTITY % Block.extra "" >
<!-- %Block.class; includes all block elements,
used as an component in mixes
-->
<!ENTITY % Block.class "%BlkStruct.class; %BlkPhras.class; %BlkPres.class; %BlkSpecial.class; %Block.extra;" > <!-- %Block.mix; includes all block elements plus %Misc.class; --> <!ENTITY % Block.mix "%Heading.class; | %List.class; | %Block.class; %Misc.class;" > <!-- ................ All Content Elements .................. --> <!-- %Flow.mix; includes all text content, block and inline --> <!ENTITY % Flow.mix "%Heading.class; | %List.class; | %Block.class; | %Inline.class; %Misc.class;" > <!-- end of xhtml-rdfa-model-1.mod -->
<!-- ....................................................................... -->
<!-- XHTML 1.1 + RDFa DTD ................................................. -->
<!-- file: xhtml-rdfa-1.dtd
-->
<!-- XHTML 1.1 + RDFa DTD
This is an example markup language combining XHTML 1.1 and the RDFa
modules.
XHTML+RDFa
Copyright 1998-2008 World Wide Web Consortium
(Massachusetts Institute of Technology, European Research Consortium
for Informatics and Mathematics, Keio University).
All Rights Reserved.
Permission to use, copy, modify and distribute the XHTML DTD and its
accompanying documentation for any purpose and without fee is hereby
granted in perpetuity, provided that the above copyright notice and
this paragraph appear in all copies. The copyright holders make no
representation about the suitability of the DTD for any purpose.
It is provided "as is" without expressed or implied warranty.
-->
<!-- This is the driver file for version 1 of the XHTML + RDFa DTD.
Please use this public identifier to identify it:
"-//W3C//DTD XHTML+RDFa 1.0//EN"
-->
<!ENTITY % XHTML.version "-//W3C//DTD XHTML+RDFa 1.0//EN" >
<!-- Use this URI to identify the default namespace:
"http://www.w3.org/1999/xhtml"
See the Qualified Names module for information
on the use of namespace prefixes in the DTD.
Note that XHTML namespace elements are not prefixed by default,
but the XHTML namespace prefix is defined as "xhtml" so that
other markup languages can extend this one and use the XHTML
prefixed global attributes if required.
-->
<!ENTITY % NS.prefixed "IGNORE" >
<!ENTITY % XHTML.prefix "xhtml" >
<!-- Be sure to include prefixed global attributes - we don't need
them, but languages that extend XHTML 1.1 might.
-->
<!ENTITY % XHTML.global.attrs.prefixed "INCLUDE" >
<!-- Reserved for use with the XLink namespace:
-->
<!ENTITY % XLINK.xmlns "" >
<!ENTITY % XLINK.xmlns.attrib "" >
<!-- For example, if you are using XHTML 1.1 directly, use the public
identifier in the DOCTYPE declaration, with the namespace declaration
on the document element to identify the default namespace:
<?xml version="1.0"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"
"http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">
<html xmlns="http://www.w3.org/1999/xhtml"
xml:lang="en">
...
</html>
Revisions:
(none)
-->
<!-- reserved for future use with document profiles -->
<!ENTITY % XHTML.profile "" >
<!-- ensure XHTML Notations are disabled -->
<!ENTITY % xhtml-notations.module "IGNORE" >
<!-- Bidirectional Text features
This feature-test entity is used to declare elements
and attributes used for bidirectional text support.
-->
<!ENTITY % XHTML.bidi "INCLUDE" >
<!-- ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: -->
<!-- Pre-Framework Redeclaration placeholder .................... -->
<!-- this serves as a location to insert markup declarations
into the DTD prior to the framework declarations.
-->
<!ENTITY % xhtml-prefw-redecl.module "IGNORE" >
<!ENTITY % xhtml-prefw-redecl.mod "" >
<![%xhtml-prefw-redecl.module;[
%xhtml-prefw-redecl.mod;
<!-- end of xhtml-prefw-redecl.module -->]]>
<!-- we need the datatypes now -->
<!ENTITY % xhtml-datatypes.module "INCLUDE" >
<![%xhtml-datatypes.module;[
<!ENTITY % xhtml-datatypes.mod PUBLIC "-//W3C//ENTITIES XHTML Datatypes 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-datatypes-1.mod" > %xhtml-datatypes.mod;]]> <!-- bring in the RDFa attributes cause we need them in Common --> <!ENTITY % xhtml-metaAttributes.module "INCLUDE" >
<![%xhtml-metaAttributes.module;[
<!ENTITY % xhtml-metaAttributes.mod PUBLIC "-//W3C//ENTITIES XHTML MetaAttributes 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-metaAttributes-1.mod" > %xhtml-metaAttributes.mod;]]> <!ENTITY % xhtml-events.module "INCLUDE" >
<!ENTITY % Common.extra.attrib "href %URI.datatype; #IMPLIED %Metainformation.attrib;" > <!-- Inline Style Module ........................................ --> <!ENTITY % xhtml-inlstyle.module "INCLUDE" >
<![%xhtml-inlstyle.module;[
<!ENTITY % xhtml-inlstyle.mod PUBLIC "-//W3C//ELEMENTS XHTML Inline Style 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-inlstyle-1.mod" > %xhtml-inlstyle.mod;]]> <!-- declare Document Model module instantiated in framework --> <!ENTITY % xhtml-model.mod PUBLIC "-//W3C//ENTITIES XHTML+RDFa Document Model 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-model-1.mod" > <!-- Modular Framework Module (required) ......................... --> <!ENTITY % xhtml-framework.module "INCLUDE" >
<![%xhtml-framework.module;[
<!ENTITY % xhtml-framework.mod PUBLIC "-//W3C//ENTITIES XHTML Modular Framework 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-framework-1.mod" > %xhtml-framework.mod;]]> <!-- Post-Framework Redeclaration placeholder ................... --> <!-- this serves as a location to insert markup declarations into the DTD following the framework declarations. --> <!ENTITY % xhtml-postfw-redecl.module "IGNORE" >
<!ENTITY % xhtml-postfw-redecl.mod "">
<![%xhtml-postfw-redecl.module;[
%xhtml-postfw-redecl.mod;
<!-- end of xhtml-postfw-redecl.module -->]]>
<!-- Text Module (Required) ..................................... -->
<!ENTITY % xhtml-text.module "INCLUDE" >
<![%xhtml-text.module;[
<!ENTITY % xhtml-text.mod PUBLIC "-//W3C//ELEMENTS XHTML Text 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-text-1.mod" > %xhtml-text.mod;]]> <!-- Hypertext Module (required) ................................. --> <!ENTITY % a.attlist "IGNORE" >
<!ENTITY % xhtml-hypertext.module "INCLUDE" >
<![%xhtml-hypertext.module;[
<!ENTITY % xhtml-hypertext.mod PUBLIC "-//W3C//ELEMENTS XHTML Hypertext 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-hypertext-1.mod" > %xhtml-hypertext.mod;]]> <!ATTLIST %a.qname; %Common.attrib; charset %Charset.datatype; #IMPLIED type %ContentType.datatype; #IMPLIED accesskey %Character.datatype; #IMPLIED tabindex %Number.datatype; #IMPLIED > <!-- Lists Module (required) .................................... --> <!ENTITY % xhtml-list.module "INCLUDE" >
<![%xhtml-list.module;[
<!ENTITY % xhtml-list.mod PUBLIC "-//W3C//ELEMENTS XHTML Lists 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-list-1.mod" > %xhtml-list.mod;]]> <!-- ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: --> <!-- Edit Module ................................................ --> <!ENTITY % xhtml-edit.module "INCLUDE" >
<![%xhtml-edit.module;[
<!ENTITY % xhtml-edit.mod PUBLIC "-//W3C//ELEMENTS XHTML Editing Elements 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-edit-1.mod" > %xhtml-edit.mod;]]> <!-- BIDI Override Module ....................................... --> <!ENTITY % xhtml-bdo.module "%XHTML.bidi;" >
<![%xhtml-bdo.module;[
<!ENTITY % xhtml-bdo.mod PUBLIC "-//W3C//ELEMENTS XHTML BIDI Override Element 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-bdo-1.mod" > %xhtml-bdo.mod;]]> <!-- Ruby Module ................................................ --> <!ENTITY % Ruby.common.attlists "INCLUDE" >
<!ENTITY % Ruby.common.attrib "%Common.attrib;" >
<!ENTITY % xhtml-ruby.module "INCLUDE" >
<![%xhtml-ruby.module;[
<!ENTITY % xhtml-ruby.mod PUBLIC "-//W3C//ELEMENTS XHTML Ruby 1.0//EN" "http://www.w3.org/TR/ruby/xhtml-ruby-1.mod" > %xhtml-ruby.mod;]]> <!-- Presentation Module ........................................ --> <!ENTITY % xhtml-pres.module "INCLUDE" >
<![%xhtml-pres.module;[
<!ENTITY % xhtml-pres.mod PUBLIC "-//W3C//ELEMENTS XHTML Presentation 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-pres-1.mod" > %xhtml-pres.mod;]]> <!ENTITY % link.attlist "IGNORE" >
<!-- Link Element Module ........................................ -->
<!ENTITY % xhtml-link.module "INCLUDE" >
<![%xhtml-link.module;[
<!ENTITY % xhtml-link.mod PUBLIC "-//W3C//ELEMENTS XHTML Link Element 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-link-1.mod" > %xhtml-link.mod;]]> <!ATTLIST %link.qname; %Common.attrib; charset %Charset.datatype; #IMPLIED hreflang %LanguageCode.datatype; #IMPLIED type %ContentType.datatype; #IMPLIED media %MediaDesc.datatype; #IMPLIED > <!-- Document Metainformation Module ............................ --> <!ENTITY % meta.attlist "IGNORE" >
<!ENTITY % xhtml-meta.module "INCLUDE" >
<![%xhtml-meta.module;[
<!ENTITY % xhtml-meta.mod PUBLIC "-//W3C//ELEMENTS XHTML Metainformation 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-meta-1.mod" > %xhtml-meta.mod;]]> <!ATTLIST %meta.qname; %Common.attrib; http-equiv NMTOKEN #IMPLIED name NMTOKEN #IMPLIED scheme CDATA #IMPLIED > <!-- Base Element Module ........................................ --> <!ENTITY % xhtml-base.module "INCLUDE" >
<![%xhtml-base.module;[
<!ENTITY % xhtml-base.mod PUBLIC "-//W3C//ELEMENTS XHTML Base Element 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-base-1.mod" > %xhtml-base.mod;]]> <!-- Scripting Module ........................................... --> <!ENTITY % xhtml-script.module "INCLUDE" >
<![%xhtml-script.module;[
<!ENTITY % xhtml-script.mod PUBLIC "-//W3C//ELEMENTS XHTML Scripting 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-script-1.mod" > %xhtml-script.mod;]]> <!-- Style Sheets Module ......................................... --> <!ENTITY % xhtml-style.module "INCLUDE" >
<![%xhtml-style.module;[
<!ENTITY % xhtml-style.mod PUBLIC "-//W3C//ELEMENTS XHTML Style Sheets 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-style-1.mod" > %xhtml-style.mod;]]> <!-- Image Module ............................................... --> <!ENTITY % xhtml-image.module "INCLUDE" >
<![%xhtml-image.module;[
<!ENTITY % xhtml-image.mod PUBLIC "-//W3C//ELEMENTS XHTML Images 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-image-1.mod" > %xhtml-image.mod;]]> <!-- Client-side Image Map Module ............................... --> <!ENTITY % area.attlist "IGNORE" >
<!ENTITY % xhtml-csismap.module "INCLUDE" >
<![%xhtml-csismap.module;[
<!ENTITY % xhtml-csismap.mod PUBLIC "-//W3C//ELEMENTS XHTML Client-side Image Maps 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-csismap-1.mod" > %xhtml-csismap.mod;]]> <!ATTLIST %area.qname; %Common.attrib; shape %Shape.datatype; 'rect' coords %Coords.datatype; #IMPLIED nohref ( nohref ) #IMPLIED alt %Text.datatype; #REQUIRED tabindex %Number.datatype; #IMPLIED accesskey %Character.datatype; #IMPLIED > <!-- Server-side Image Map Module ............................... --> <!ENTITY % xhtml-ssismap.module "INCLUDE" >
<![%xhtml-ssismap.module;[
<!ENTITY % xhtml-ssismap.mod PUBLIC "-//W3C//ELEMENTS XHTML Server-side Image Maps 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-ssismap-1.mod" > %xhtml-ssismap.mod;]]> <!-- Param Element Module ....................................... --> <!ENTITY % xhtml-param.module "INCLUDE" >
<![%xhtml-param.module;[
<!ENTITY % xhtml-param.mod PUBLIC "-//W3C//ELEMENTS XHTML Param Element 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-param-1.mod" > %xhtml-param.mod;]]> <!-- Embedded Object Module ..................................... --> <!ENTITY % xhtml-object.module "INCLUDE" >
<![%xhtml-object.module;[
<!ENTITY % xhtml-object.mod PUBLIC "-//W3C//ELEMENTS XHTML Embedded Object 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-object-1.mod" > %xhtml-object.mod;]]> <!-- Tables Module ............................................... --> <!ENTITY % xhtml-table.module "INCLUDE" >
<![%xhtml-table.module;[
<!ENTITY % xhtml-table.mod PUBLIC "-//W3C//ELEMENTS XHTML Tables 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-table-1.mod" > %xhtml-table.mod;]]> <!-- Forms Module ............................................... --> <!ENTITY % xhtml-form.module "INCLUDE" >
<![%xhtml-form.module;[
<!ENTITY % xhtml-form.mod PUBLIC "-//W3C//ELEMENTS XHTML Forms 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-form-1.mod" > %xhtml-form.mod;]]> <!-- Target Attribute Module .................................... --> <!ENTITY % xhtml-target.module "INCLUDE" >
<![%xhtml-target.module;[
<!ENTITY % xhtml-target.mod PUBLIC "-//W3C//ELEMENTS XHTML Target 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-target-1.mod" > %xhtml-target.mod;]]> <!-- Legacy Markup ............................................... --> <!ENTITY % xhtml-legacy.module "IGNORE" >
<![%xhtml-legacy.module;[
<!ENTITY % xhtml-legacy.mod PUBLIC "-//W3C//ELEMENTS XHTML Legacy Markup 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-legacy-1.mod" > %xhtml-legacy.mod;]]> <!-- Document Structure Module (required) ....................... --> <!-- <!ENTITY % html.attlist "IGNORE" > -->
<!ENTITY % head.attlist "IGNORE" >
<!ENTITY % title.attlist "IGNORE" >
<!ENTITY % xhtml-struct.module "INCLUDE" >
<![%xhtml-struct.module;[
<!ENTITY % xhtml-struct.mod PUBLIC "-//W3C//ELEMENTS XHTML Document Structure 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-struct-1.mod" > %xhtml-struct.mod;]]> <!ENTITY % profile.attrib "profile %URI.datatype; '%XHTML.profile;'" > <!-- <!ATTLIST %html.qname; %Common.attrib; %XSI.schemaLocation.attrib; %XHTML.version.attrib; > --> <!ATTLIST %head.qname; %Common.attrib; %profile.attrib; > <!ATTLIST %title.qname; %Common.attrib; > <!-- end of XHTML-RDFa DTD ................................................ --> <!-- ....................................................................... -->
This section contains the SGML Open Catalog-format definition [CATALOG] of the public identifiers for XHTML+RDFa 1.0.
-- .......................................................................... --
-- File catalog ............................................................ --
-- XHTML+RDFa Catalog Data File
Revision: $Revision: 1.2 $
See "Entity Management", SGML Open Technical Resolution 9401 for detailed
information on supplying and using catalog data. This document is available
from OASIS at URL:
<http://www.oasis-open.org/html/tr9401.html>
--
-- .......................................................................... --
-- SGML declaration associated with XHTML .................................. --
OVERRIDE YES
SGMLDECL "xml1.dcl"
-- :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: --
-- XHTML+RDFa modules .............................................. --
PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "xhtml-rdfa-1.dtd"
PUBLIC "-//W3C//ENTITIES XHTML+RDFa Document Model 1.0//EN" "xhtml-rdfa-model-1.mod"
PUBLIC "-//W3C//ENTITIES XHTML MetaAttributes 1.0//EN" "xhtml-metaAttributes-1.mod"
-- End of catalog data ..................................................... --
-- .......................................................................... --
This section is normative.
This section is informative.
This section is informative.
2007-12-??: Updated to reflect latest task-force thinking re: processing model, in particular regarding 'chaining', and the behaviour of @instanceof. [MarkBirbeck]
2007-10-19: Updated to reflect latest task-force thinking re: processing model. Integrated XHTML Module definition and hybrid markup language. Completed development as First Public Working Draft. [ShaneMcCarron], [MarkBirbeck]
2007-09-04: Migrated to XHTML 2 Working Group Publication System. Converted to a format that is consistent with REC-Track documents. Updated to reflect current processing model. Added normative definition of CURIEs. Started updating prose to be consistent with current task force agremeents. [ShaneMcCarron], [StevenPemberton], [MarkBirbeck]
2007-04-06: fixed some of the language to talk about "structure" rather than metadata. Added note regarding space-separated values in predicate-denoting attributes. [BenAdida]
2006-01-16: made the use of CURIE type for @rel, @rev, @property consistent across document (particularly section 2.4 was erroneous). [BenAdida]
This section is informative.
This section is informative.
At the time of publication, the participants in the XHTML 2 Working Group were: