Full Range of Unicode Code Points Not Allowed in XML

This is a last call comment from Cliff Schmidt (cschmidt@microsoft.com) on
the Character Model for the World Wide Web 1.0
(http://www.w3.org/TR/2002/WD-charmod-20020430/).

Semi-structured version of the comment:

Submitted by: Cliff Schmidt (cschmidt@microsoft.com)
Submitted on behalf of (maybe empty): Microsoft
Comment type: editorial
Chapter/section the comment applies to: 3.5 Reference Processing Model
The comment will be visible to: public
Comment title: Full Range of Unicode Code Points Not Allowed in XML
Comment:
--------------------------------------------------------------------------
"[S] Specifications SHOULD allow the use of the full range of Unicode code points from U+0000 to U+10FFFF inclusive; code points above U+10FFFF MUST NOT be used."
--------------------------------------------------------------------------
CONCERN:
If this is truly a goal for text on the Web, users should understand why XML is unable to achieve this.  As a high profile W3C spec, readers are likely to notice the inconsistent message.  Does this mean that I18N believes that XML (1.1 or some later version) should support the characters 0x0-0x1F?

RECOMMENDATION:
If XML 1.1 is unable to achieve this goal, the Character Model spec should either remove this requirement or explain the discrepancy.



Structured version of  the comment:

<lc-comment
  visibility="public" status="pending"
  decision="pending" impact="editorial">
  <originator email="cschmidt@microsoft.com" represents="Microsoft"
      >Cliff Schmidt</originator>
  <charmod-section href='http://www.w3.org/TR/2002/WD-charmod-20020430/#sec-RefProcModel'
    >3.5</charmod-section>
  <title>Full Range of Unicode Code Points Not Allowed in XML</title>
  <description>
    <comment>
      <dated-link date="2002-06-06"
        >Full Range of Unicode Code Points Not Allowed in XML</dated-link>
      <para>--------------------------------------------------------------------------
"[S] Specifications SHOULD allow the use of the full range of Unicode code points from U+0000 to U+10FFFF inclusive; code points above U+10FFFF MUST NOT be used."
--------------------------------------------------------------------------
CONCERN:
If this is truly a goal for text on the Web, users should understand why XML is unable to achieve this.  As a high profile W3C spec, readers are likely to notice the inconsistent message.  Does this mean that I18N believes that XML (1.1 or some later version) should support the characters 0x0-0x1F?

RECOMMENDATION:
If XML 1.1 is unable to achieve this goal, the Character Model spec should either remove this requirement or explain the discrepancy.
</para>
    </comment>
  </description>
</lc-comment>

Received on Thursday, 6 June 2002 12:27:26 UTC