- From: Dave Peterson <davep@iit.edu>
- Date: Wed, 9 Jun 2004 18:17:54 -0400
- To: Daniel Barclay <daniel@fgm.com>, www-xml-schema-comments@w3.org
At 4:13 PM -0400 040609, Daniel Barclay wrote:
>Regarding the draft at
>http://www.w3.org/TR/2004/PER-xmlschema-2-20040318/:
>
>Section 2.5.1.2 says:
>
> The ·lexical space· of a ·list· datatype is a set of literals whose
> internal structure is a space-separated sequence of literals of
> the ·atomic· datatype of the items in the ·list·.
>
>It doesn't seem to to specify whether "space-separated" means
>"separated by space characters" or "separated by space" (each
>contiguous group of space characters).
To quote from the section on the whiteSpace facet:
>replace
>All occurrences of #x9 (tab), #xA (line feed) and #xD (carriage
>return) are replaced with #x20 (space)
>collapse
>After the processing implied by replace , contiguous sequences
>of #x20's are collapsed to a single #x20, and leading and trailing
>#x20's are removed.
>For all datatypes ·derived ·by ·list ·the value of whiteSpace is collapse
The point of all this is that whitespace normalization occurs *before*
you get to the lexical space, so in the lexical representations of
lists there is never more than one space (#x20) character. I think that
the 1.0 editors were pretty consistent in saying "space" meaning one
space (#x20) character, and "whitespace" when they meant a sequence
of any or all. Mayhap the 1.1 revision will be more explicit.
At 4:22 PM -0400 040609, Daniel Barclay wrote ("XML schema draft
comments: is list canonical form underspecified?"):
>Section 2.5.1.2 says:
>
> The canonical-lexical-representation for the ·list· datatype is
> defined as the lexical form in which each item in the ·list· has
> the canonical lexical representation of its ·itemType·.
>
>Is that canonical form underspecified? Specifically, doesn't
>it need to specify canonical form of space-separating the list
>of item lexical values?
>
>_If_ "a b" and "a b" (two spaces) are both legal lexical values,
>which is the canonical lexical representation?
See above.
Hope this helps.
--
Dave Peterson
SGMLWorks!
davep@iit.edu
Received on Wednesday, 9 June 2004 21:09:51 UTC