Re: OOPS: (was: first pass parseType="Literal" text for primer) from Graham Klyne on 2003-07-28 (w3c-rdfcore-wg@w3.org from July 2003)

From: Graham Klyne <gk@ninebynine.org>
Date: Mon, 28 Jul 2003 16:56:23 +0100
To: Dave Beckett <dave.beckett@bristol.ac.uk>
Cc: w3c-rdfcore-wg <w3c-rdfcore-wg@w3.org>
Message-Id: <5.1.0.14.2.20030728165600.026757c0@127.0.0.1>

At 14:34 28/07/03 +0100, Dave Beckett wrote:
>The XML Canonicalization documents define
>not only "canonical form of an XML document" (the octets) but
>"canonical XML" which is XML written in the canonical form.
>-- http://www.w3.org/TR/2001/REC-xml-c14n-20010315#Terminology

Got it, thanks.

Hmmm... it seems to say the right things, but it's not as clear as I'd like.

"The term canonical XML refers to XML ..." so far, so good, because XML is 
a s3equence of Unicode characters, right?

"... that is in canonical form."  But they've just said the canonical form 
is the UTF-8 encoding.

I'd like to believe this works, but maybe some belt-and-braces would help 
us to to be sure, such as:
[[
The string used as the lexical form of the XML Literal is the Canonical XML 
obtained by applying Exclusive XML Canonicalization [XML-XC14N]), with 
comments and with empty InclusiveNamespaces PrefixList, to the literal text 
l, i.e. the entire element content of this property element.  That is, it 
is a Unicode string whose UTF-8 encoding is the Canonical XML form of the 
literal.
]]

Hmph!  That's pretty heavy going.  Maybe just add the last sentence to what 
you already have?

#g

-------------------
Graham Klyne
<GK@NineByNine.org>
PGP: 0FAA 69FF C083 000B A2E9  A131 01B9 1C7A DBCA CB5E

Received on Monday, 28 July 2003 12:36:18 UTC