W3C home > Mailing lists > Public > w3c-rdfcore-wg@w3.org > July 2003

Re: OOPS: (was: first pass parseType="Literal" text for primer)

From: Graham Klyne <gk@ninebynine.org>
Date: Mon, 28 Jul 2003 16:56:23 +0100
Message-Id: <>
To: Dave Beckett <dave.beckett@bristol.ac.uk>
Cc: w3c-rdfcore-wg <w3c-rdfcore-wg@w3.org>

At 14:34 28/07/03 +0100, Dave Beckett wrote:
>The XML Canonicalization documents define
>not only "canonical form of an XML document" (the octets) but
>"canonical XML" which is XML written in the canonical form.
>-- http://www.w3.org/TR/2001/REC-xml-c14n-20010315#Terminology

Got it, thanks.

Hmmm... it seems to say the right things, but it's not as clear as I'd like.

"The term canonical XML refers to XML ..." so far, so good, because XML is 
a s3equence of Unicode characters, right?

"... that is in canonical form."  But they've just said the canonical form 
is the UTF-8 encoding.

I'd like to believe this works, but maybe some belt-and-braces would help 
us to to be sure, such as:
The string used as the lexical form of the XML Literal is the Canonical XML 
obtained by applying Exclusive XML Canonicalization [XML-XC14N]), with 
comments and with empty InclusiveNamespaces PrefixList, to the literal text 
l, i.e. the entire element content of this property element.  That is, it 
is a Unicode string whose UTF-8 encoding is the Canonical XML form of the 

Hmph!  That's pretty heavy going.  Maybe just add the last sentence to what 
you already have?


Graham Klyne
PGP: 0FAA 69FF C083 000B A2E9  A131 01B9 1C7A DBCA CB5E
Received on Monday, 28 July 2003 12:36:18 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 20:24:24 UTC