W3C home > Mailing lists > Public > w3c-rdfcore-wg@w3.org > February 2002

RE: Outstanding Issues - rdfms-literal-is-xml-structure

From: Jeremy Carroll <jjc@hplb.hpl.hp.com>
Date: Wed, 20 Feb 2002 11:21:45 -0000
To: "Brian McBride" <bwm@hplb.hpl.hp.com>, "RDF Core" <w3c-rdfcore-wg@w3.org>
Message-ID: <JAEBJCLMIFLKLOJGMELDGEAPCDAA.jjc@hplb.hpl.hp.com>

> rdfms-literal-is-xml-structure :
> A literal containing XML markup is not a simple string, but is an XML

> This issue was put on hold pending the outcome of the datatypes

Again before this was on hold I suggested using XML Canonicalization.



+ MAY have their Unicode string component as given by the
  Unicode string of the XML Canonicalization of the document
  subset consisting of the element content. See XML
  Canonicalization section 2.4.
  The XML Canonicalization specifies a UTF-8 string, the
  RDF Literal is the encoded Unicode string.
  Such a canonicalization MAY or MAY NOT include comments.

For maximum interoperability RDF processors are RECOMMENDED
to use XML canonicalization without comments as the string
in the RDF Literal formed by the rdf:parseType="Literal"
property element production.

I would now modify this a bit.

1: Use the latest canonicalization spec.
  Exclusive XML Canonicalization
  see particularly:

   This has a parameter the InclusiveNamespaces PrefixList
   which should, for RDF/XML processing, generally be empty.

2: I would try to phrase it so that RDF/XML applications may return any
representation that can be transformed into the above "canonical"
representation; without requiring that conformant RDF processors actually do
the canonicalization.


"Literals arising from rdf:parseType="Literal" correspond to an XML
structure and MAY be represented as a canonical XML string using Exclusive
XML Canonicalization. Two such literals are equal whenever their Exclusive
XML Canonicalizations are equal."

Thus RDF processors that want to worry about equality of literals need to do
the works, whereas the DPH who isn't worried is provided with a semantics
for rdf:parseType="Literal" in terms of the abstract graph syntax  (it is
the canonical form as a representation of an XML structure), but the DPH has
no obligation to actually canonicalize.

Received on Wednesday, 20 February 2002 06:22:17 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 20:24:10 UTC