- From: Liam R. E. Quin <liam@w3.org>
- Date: Wed, 06 May 2015 13:48:03 -0400
- To: Richard Ishida <ishida@w3.org>
- Cc: www International <www-international@w3.org>
On Wed, 2015-05-06 at 14:14 +0100, Richard Ishida wrote:
> -------- Forwarded Message --------
> Date: Sat, 02 May 2015 01:30:10 +0000
> From: Alexey Neyman <stilor@att.net>
> To: xml-editor@w3.org, www-i18n-comments@w3.org,
>     www-international@w3.org
> [...]
>
> The table with XML examples in section 3.3.2, however, assumes CDATA
> sections are also considered language "include" mechanism: the 3rd
> row from the bottom has the text "suc<![CDATA[,on]]>" (I replaced
> cedilla with a regular comma so that it's displayed properly in most
> email clients) listed as not include-normalized - which means,
> reversing the definition in 3.2.3, that "the text contains character
> escapes or includes whose expansion would cause the text to become
> no longer Unicode-normalized", thus implying that the CDATA section
> is an 'include'.

No, it is a character escaping mechanism, the first clause in "the text
contains character escapes or includes", so I think it is not a problem.

Liam

> I think this needs to be remedied in one of the two ways:
> - The XML 1.1 specification [1] can be changed to define the term
>   'include' to apply to both the entities replaced with their
>   replacement text and to the CDATA section content.
> - The above mentioned example in [2] can be corrected to describe
>   that string as "Unicode-normalized, include-normalized, NOT fully
>   normalized".
>
> I think the first approach would be more appropriate, given that XML
> Information Set specification [4] considers character information
> items equally, regardless of whence they came, be it from a CharData
> production, character/entity reference, or CDATA section.
>
> Regards,
> Alexey.
>
> [1] http://www.w3.org/TR/2006/REC-xml11-20060816/
> [2] http://www.w3.org/TR/2005/WD-charmod-norm-20051027/
> [3] http://www.w3.org/TR/2014/WD-charmod-norm-20140715/
> [4] http://www.w3.org/TR/2004/REC-xml-infoset-20040204/#infoitem.character
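For readers who want to see the behaviour under discussion, here is a minimal Python sketch. It is only an illustration, not anything taken from the specifications. It assumes the character replaced by a comma in the quoted example is U+0327 COMBINING CEDILLA, and it uses NFC as the normalization form. The helpers `expand_cdata` and `is_nfc` are hypothetical names, and the regular expression is a simplification, not a real XML parser.

```python
import re
import unicodedata

# Simplified matcher for a CDATA section (assumption: no real XML parsing).
CDATA = re.compile(r"<!\[CDATA\[(.*?)\]\]>", re.DOTALL)


def expand_cdata(text: str) -> str:
    """Replace each CDATA section with its literal content."""
    return CDATA.sub(lambda m: m.group(1), text)


def is_nfc(text: str) -> bool:
    """Return True if the text is unchanged by NFC normalization."""
    return unicodedata.normalize("NFC", text) == text


# The quoted example, with the combining cedilla restored in place of the comma.
raw = "suc<![CDATA[\u0327on]]>"
expanded = expand_cdata(raw)

# In the raw markup the cedilla follows "[", so nothing composes.
print(is_nfc(raw))
# After expansion the cedilla follows "c", which NFC would compose to U+00E7.
print(is_nfc(expanded))
```

Under these assumptions the raw markup is already in NFC, while the text after removing the CDATA delimiters is not. That matches the table entry being listed as not include-normalized, whichever of the two terms (escape or include) is applied to CDATA sections.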
Received on Wednesday, 6 May 2015 17:48:23 UTC