- From: <bugzilla@wiggum.w3.org>
- Date: Wed, 14 Sep 2005 18:11:45 +0000
- To: www-xml-schema-comments@w3.org
- Cc:
http://www.w3.org/Bugs/Public/show_bug.cgi?id=2173
Summary: R-175: Questions re: normalizedString
Product: XML Schema
Version: 1.0
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P2
Component: XSD Part 2: Datatypes
AssignedTo: cmsmcq@w3.org
ReportedBy: sandygao@ca.ibm.com
QAContact: www-xml-schema-comments@w3.org
The value space of normalizedString allows all characters except xD, xA, and
x9. The lexical space allows all characters except xD and x9. What is the
mapping from the lexical space to the value space: what happens to an xA
character in the lexical space (is it removed? replaced by an x20?). The
canonical lexical representation, presumably, is the same as the string in the
value space: I think we should be told.
Presumably the lexical space represents the value after the XML parser has done
its normalization. So in practice, a tab character is allowed in an attribute
of type normalizedString (because the XML parser will turn it to a space), but
a tab character is not allowed in an element of type normalizedString (because
the XML parser will leave it unchanged). Is this interpretation correct?
I find it hard to understand why the lexical space doesn't allow any string,
with a mapping to the value space achieved by normalizing whitespace
characters. Alternatively, the lexical space should be identical to the value
space. The current definition seems nonsensical.
See question 1 from the following:
http://lists.w3.org/Archives/Public/www-xml-schema-comments/2002JulSep/0026.html
Henry's response:
http://lists.w3.org/Archives/Public/www-xml-schema-comments/2002JulSep/0036.html
Received on Wednesday, 14 September 2005 18:11:52 UTC