- From: Henry Zongaro <zongaro@ca.ibm.com>
- Date: Tue, 31 Aug 2004 16:45:10 -0400
- To: Stephen Buxton <Stephen.Buxton@oracle.com>
- Cc: public-qt-comments@w3.org
Steve,
In [1], you submitted the following comment on the Last Call Working
Draft of XSLT 2.0 and XQuery 1.0 Serialization:
<<
SECTION 3: Serialization parameters
Phase 2, "Character markup", fourth bullet, mentions
escaping of special characters such as <. You could
also mention here the creation of character references
for characters that are not representable in the encoding.
>>
Thank you for this comment. The XSL and XML Query Working Groups
discussed your comment, and decided, because of the interactions between
Unicode normalization and creation of character references, to fold
together character expansion
and Unicode normalization, and at the same time, add creation of character
references to the character expansion phase.
Specifically, the working groups decided to replace the second and
third bullets of Section 4 of Serialization with
the following text:
<<
2. Character expansion is concerned with the representation of
characters appearing in text and attribute nodes in the
instance of the data model. The substitution processes that
may apply are listed below, in priority order: a character
that is handled by one process in this list will be
unaffected by processes appearing later in the list, except
that a character affected by Unicode normalization may be
affected by creation of CDATA sections and by character
escaping
o URI escaping (in the case of URI-valued attributes in the
HTML and XHTML output methods), as determined by the
escape-uri-attributes parameter
o Character mapping, as determined by the use-character-maps
parameter. Text nodes that are children of elements
specified by the cdata-section-elements parameter are not
affected by this step.
o Unicode Normalization, if requested by the
normalization-form parameter. Unicode normalization is
applied to the character stream that results after all
markup generation and character expansion has taken place.
For the definitions of the various normalization forms,
see [Character Model for the World Wide Web 1.0]
The meanings associated with the possible values of the
normalization-form parameter are as follows:
o NFC specifies the serialized result should be in Unicode
Normalization Form C.
o NFD specifies the serialized result should be in Unicode
Normalization Form D.
o NFKC specifies the serialized result should be in Unicode
Normalization Form KC.
o NFKD specifies the serialized result should be in Unicode
Normalization Form KD.
o fully-normalized specifies the serialized result should
be in fully normalized form.
o none specifies that no Unicode normalization should be
applied.
o An implementation-defined value has an implementation-
defined effect.
o Creation of CDATA sections, as determined by the
cdata-section-elements parameter. Note that this is also
affected by the encoding parameter, in that characters not
present in the selected encoding cannot be represented in
a CDATA section.
o Escaping according to XML or HTML rules of special
characters and of characters that cannot be represented in
the selected encoding. For example replacing < by <.
>>
The Unicode Normalization phase becomes the third step of character
expansion. Character mapping becomes the second step, with the
clarification that it does not affect elements to which
cdata-section-elements applies. This was done to make it clear that any
characters affected by character mapping are not affected by Unicode
Normalization. The lead-in to the bulleted list will be modified so that
CDATA section creation and escaping still apply to characters affected by
Unicode Normalization - this is a consequence of trying to fold the two
together. Finally, the last bullet will be modified to make it clear that
not only special characters, but characters that can't be represented in
the selected encoding are affected by that final step.
As a representative of Oracle was present when this decision was
made, I will assume the response is acceptable to you.
Thanks,
Henry [On behalf of the XSL and XML Query Working Groups]
[1]
http://lists.w3.org/Archives/Public/public-qt-comments/2004Feb/1040.html
------------------------------------------------------------------
Henry Zongaro Xalan development
IBM SWS Toronto Lab T/L 969-6044; Phone +1 905 413-6044
mailto:zongaro@ca.ibm.com
Received on Tuesday, 31 August 2004 20:45:40 UTC