Re: Discrepancy between XML1.1 and Character Model specifications

On Wed, 2015-05-06 at 14:14 +0100, Richard Ishida wrote:
> -------- Forwarded Message --------
> Date: Sat, 02 May 2015 01:30:10 +0000
> From: Alexey Neyman <stilor@att.net>
> To: xml-editor@w3.org, www-i18n-comments@w3.org, 
> www-international@w3.org
> 
[...]
> 
> 
> The table with XML examples in section 3.3.2, however, assumes that
> CDATA sections are also considered a language "include" mechanism:
> the 3rd row from the bottom lists the text "suc<![CDATA[,on]]>" (I
> replaced the cedilla with a regular comma so that it displays
> properly in most email clients) as not include-normalized. Reversing
> the definition in 3.2.3, that means "the text contains character
> escapes or includes whose expansion would cause the text to become
> no longer Unicode-normalized", thus implying that the CDATA section
> is an 'include'.

No, a CDATA section is a character escaping mechanism, so it falls 
under the first clause of "the text contains character escapes or 
includes". I think it is not a problem.
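
To make the distinction concrete, here is a minimal Python sketch. It 
is only an illustration, not taken from either specification. It 
assumes the comma in the quoted example stands for U+0327 COMBINING 
CEDILLA and that "Unicode-normalized" means NFC; the wrapper element 
name "w" and the helper is_nfc() are hypothetical.

```python
import unicodedata
import xml.etree.ElementTree as ET


def is_nfc(text: str) -> bool:
    """Return True if text is already in Unicode Normalization Form C."""
    return unicodedata.normalize("NFC", text) == text


# Source text as it appears in the document, with the combining cedilla
# (U+0327) as the first character inside the CDATA section.
source = "suc<![CDATA[\u0327on]]>"

# Parse it inside a hypothetical wrapper element so that the parser
# strips the CDATA delimiters and returns the character data.
expanded = ET.fromstring("<w>" + source + "</w>").text

# In the source, U+0327 follows "[", and no precomposed character
# exists for that pair, so the source is unchanged by NFC.
print(is_nfc(source))

# After expansion, U+0327 follows "c", and NFC composes the pair into
# U+00E7, so the expanded text is no longer normalized.
print(is_nfc(expanded))
print(unicodedata.normalize("NFC", expanded) == "su\u00e7on")
```

The source string passes the NFC check while the text obtained after 
removing the CDATA delimiters does not, which is the situation the 
table row describes.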

Liam

> I think this needs to be remedied in one of two ways:
> - The XML 1.1 specification [1] can be changed to define the term
> 'include' to apply to both the entities replaced with their 
> replacement text and to the CDATA section content.
> - The above-mentioned example in [2] can be corrected to describe
> that string as "Unicode-normalized, include-normalized, NOT fully
> normalized".
> 
> I think the first approach would be more appropriate, given that XML 
> Information Set specification [4] considers character information 
> items equally, regardless of whence they came, be it from a CharData 
> production, character/entity reference, or CDATA section.
> 
> Regards,
> Alexey.
> 
> [1] http://www.w3.org/TR/2006/REC-xml11-20060816/
> [2] http://www.w3.org/TR/2005/WD-charmod-norm-20051027/
> [3] http://www.w3.org/TR/2014/WD-charmod-norm-20140715/
> [4] http://www.w3.org/TR/2004/REC-xml-infoset-20040204/#infoitem.character

Received on Wednesday, 6 May 2015 17:48:23 UTC