- From: Thompson, Bryan B. <BRYAN.B.THOMPSON@saic.com>
- Date: Fri, 20 Aug 2004 08:12:11 -0400
- To: "'www-xml-linking-comments@w3.org'" <www-xml-linking-comments@w3.org>
- Cc: "'bebee, thompsonbry, duerst@w3.org, michelsu@microsoft.com'" <public-xml-core-wg@w3.org,>
The legal character range specified in the XPointer Framework [1] for the UnicodeChar production is given as:

    [9] UnicodeChar ::= [#x0-#x10FFFF]

This production includes Unicode code points which have been explicitly identified as "noncharacters" and which are forbidden for interchange [2], [3].

The Unicode Standard sets aside 66 noncharacter code points. The last two code points of each plane are noncharacters: U+FFFE and U+FFFF on the BMP, U+1FFFE and U+1FFFF on Plane 1, and so on, up to U+10FFFE and U+10FFFF on Plane 16, for a total of 34 code points. In addition, there is a contiguous range of another 32 noncharacter code points in the BMP: U+FDD0..U+FDEF. [3]

[1] http://www.w3.org/TR/xptr-framework/
[2] http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf
[3] http://www.unicode.org/versions/Unicode4.0.0/ch15.pdf
[4] http://www.ietf.org/internet-drafts/draft-duerst-iri-09.txt
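
To make the noncharacter set described in this message concrete, here is a minimal Python sketch that tests whether a code point in the range [#x0-#x10FFFF] is one of the 66 noncharacters. The function name `is_noncharacter` is an illustrative assumption and is not taken from the XPointer Framework or the Unicode Standard; only the code point ranges come from the text above.

```python
MAX_CODE_POINT = 0x10FFFF  # upper bound of the UnicodeChar production


def is_noncharacter(cp: int) -> bool:
    """Return True if cp is a Unicode noncharacter code point.

    Hypothetical helper for illustration; the ranges follow the
    description quoted in the message.
    """
    if not 0 <= cp <= MAX_CODE_POINT:
        raise ValueError(f"code point out of range: {cp:#x}")
    # Contiguous block of 32 noncharacters in the BMP: U+FDD0..U+FDEF.
    if 0xFDD0 <= cp <= 0xFDEF:
        return True
    # Last two code points of each of the 17 planes: U+nFFFE and U+nFFFF.
    return (cp & 0xFFFE) == 0xFFFE


# Enumerate every noncharacter admitted by [#x0-#x10FFFF].
noncharacters = [cp for cp in range(MAX_CODE_POINT + 1) if is_noncharacter(cp)]
print(len(noncharacters))  # 66 = 34 plane-final code points + 32 in U+FDD0..U+FDEF
```

A processor that wanted to reject such code points could apply a check like this to each character admitted by the production; whether it should do so is the question this message raises.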
Received on Friday, 20 August 2004 12:12:33 UTC