- From: <noah_mendelsohn@us.ibm.com>
- Date: Tue, 2 Sep 2008 10:43:48 -0400
- To: Ben Adida <ben@adida.net>, public-rdf-in-xhtml-tf@w3.org, Shane McCarron <shane@aptest.com>
- Cc: Jonathan Rees <jar@creativecommons.org>
I will be working in the coming days to pull together a consensus TAG
position on issues relating to a Schema Datatype for CURIEs, or else to
agree that the TAG has no such formal position. Of course, TAG members
such as Jonathan are welcome to pursue informal disucssions in parallel. I
did want to clarify my understanding of the status of lexical and value
spaces for anyURI in schema. The intention in Schema 1.0 was to
anticipate IRIs, which is why there is a reference to the XLink mappings.
XSD 1.1 mentions IRI's explicitly.
Jonathan Rees writes:
> You can't support your assertion using the document you cite! It says
> quite clearly that the lexical and value spaces are arbitrary strings,
> and that this mapping is the identity mapping:
Yes, for Schema 1.1, a bit less clear in Schema 1.0. From Schema 1.0 [1]:
"The ·lexical space· of anyURI is finite-length character sequences which,
when the algorithm defined in Section 5.4 of [XML Linking Language] is
applied to them, result in strings which are legal URIs according to [RFC
2396], as amended by [RFC 2732]. " The use of the "algorithm" tends to
ensure that internationalized forms are acceptable. I don't think it's
fair to say that this accepts any string, because of the appeal to RFC
2396. Then again, it's a little vague as to whether, for example, you are
allowed to enforce scheme-specific rules. Anyway, in the as yet
unapporved Schema 1.1, the strings are indeed unconstrainted [2]:
"The ·lexical space· of anyURI is the set of possibly empty finite-length
character sequences."
--and--
"This type should be used when the value fulfills the role of an IRI, as
defined in [RFC 3987] or its successor(s) in the IETF Standards Track. "
So, Schema 1.0 at least tries to enforce some checking, but to my reading
does it in a vague way. In schema 1.1, the lexical and value spaces are
arbitrary strings. In both versions the mapping from lexical to value is
the identity mapping. In either version, it is a misuse of anyURI to
apply it to lexical forms or values that don't conform to IRI, but that
restriction is not completely enforced by the definition of the type in
1.0, and is not enforced at all in 1.1. Both versions accept forms that
we would now recognize as IRIs, notwithstanding the fact that Schema 1.0
was published before the IRI RFC.
Noah
[1] http://www.w3.org/TR/2004/PER-xmlschema-2-20040318/#anyURI
[2] http://www.w3.org/TR/xmlschema11-2/#anyURI
--------------------------------------
Noah Mendelsohn
IBM Corporation
One Rogers Street
Cambridge, MA 02142
1-617-693-4036
--------------------------------------
Jonathan Rees <jar@creativecommons.org>
08/31/2008 01:36 PM
To: Shane McCarron <shane@aptest.com>
cc: Ben Adida <ben@adida.net>, public-rdf-in-xhtml-tf@w3.org,
Noah Mendelsohn <noah_mendelsohn@us.ibm.com>
Subject: Re: quick ping - ISSUE-104
On Aug 30, 2008, at 6:05 PM, Shane McCarron wrote:
> The lexical space of anyURI is the complete collection of URIs as
> defined in RFC 3986 (previously 2396 / 2732) [1].
> [1] http://www.w3.org/TR/2008/WD-xmlschema11-2-20080620/#anyURI
You can't support your assertion using the document you cite! It says
quite clearly that the lexical and value spaces are arbitrary strings,
and that this mapping is the identity mapping:
The ·lexical space· of anyURI is the set of possibly
empty finite-
length character sequences.
The ·lexical mapping· for anyURI is the identity mapping.
Even if it did support what you are saying, this is just a working
draft, and I don't think you can cite it in your document.
I'm not saying this is an easy rhetorical problem to fix - I'm just
saying that it is consequential, and your obligation to those who will
use your document in the future is to be careful and to dispel the
smoke. You've disagreed with me repeatedly on both points, which
leaves me uncertain as to how to proceed.
You need to live up to the rhetorical standard set by the document you
cite [1]. Put the following, normatively, in Appendix A of the CURIE
document:
The lexical space of CURIE is [fill in the blank, with
citations and/
or internal references as needed].
The lexical mapping for CURIE is ...
The lexical space of URIorSafeCURIE is ...
The lexical mapping for URIorSafeCURIE is ...
Otherwise, when five years from now someone tries to figure out how to
add CURIEs to SPARQL or OWL or RDF/XML or some other beast, they
will ... shall I just say, be less than happy.
Jonathan
[1] http://www.w3.org/TR/2008/WD-xmlschema11-2-20080620/#anyURI
Received on Tuesday, 2 September 2008 14:43:00 UTC