W3C home > Mailing lists > Public > www-xml-linking-comments@w3.org > July to September 2004

Issue against the XPointer Framework: Definition of UnicodeChar.

From: Thompson, Bryan B. <BRYAN.B.THOMPSON@saic.com>
Date: Fri, 20 Aug 2004 08:12:11 -0400
Message-Id: <D24D16A6707B0A4B9EF084299CE99B3912CB4317@mcl-its-exs02.mail.saic.com>
To: "'www-xml-linking-comments@w3.org'" <www-xml-linking-comments@w3.org>
Cc: "'bebee, thompsonbry, duerst@w3.org, michelsu@microsoft.com'" <public-xml-core-wg@w3.org,>

The legal character range specified in the XPointer Framework [1] for
the Unicode production is given as:

[9] UnicodeChar ::= [#x0-#x10FFFF]

This production includes Unicode code points which have been
explicitly identified as "noncharacters" and which are forbidden for
interchange [2], [3].

   The Unicode Standard sets aside 66 noncharacter code points. The
   last two code points of each plane are noncharacters: U+FFFE and
   U+FFFF on the BMP, U+1FFFE and U+1FFFF on Plane 1, and so on, up to
   U+10FFFE and U+10FFFF on Plane 16, for a total of 34 code points. In
   addition, there is a contiguous range of another 32 noncharacter
   code points in the BMP: U+FDD0..U+FDEF. [3]

[1] http://www.w3.org/TR/xptr-framework/
[2] http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf
[3] http://www.unicode.org/versions/Unicode4.0.0/ch15.pdf
[4] http://www.ietf.org/internet-drafts/draft-duerst-iri-09.txt
Received on Friday, 20 August 2004 12:12:33 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 27 October 2009 08:39:45 GMT