- From: Karlsson Kent - keka <keka@im.se>
- Date: Mon, 27 Jul 1998 16:54:46 +0200
- To: "'xml-editor@w3.org'" <xml-editor@w3.org>
XML grammar rule: [2] Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] /* any Unicode character, excluding the surrogate blocks, FFFE, and FFFF. */ Comments: a. The code for "DEL" (7F) should be excluded, since it is a control "character". b. 80-9F are unallocated, but are *permanently* unallocated in Unicode/10646, since these codes are reserved for those who wish to continue to support some additional control codes used in some older character encodings (EBCDIC...). (My understanding is that it is some kind of private use area for old control "characters". Nobody is not supposed to define new ones.) c. ISO/IEC 10646 excludes all codes that in hexadecimal end with FFFE or FFFF. I.e., also 1FFFE, 1FFFF, 2FFFE, 2FFFF, etc. ================ New [2] Char ::= #x9 | #xA | #xD | [#x20-#7E] | [#xA0-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x1FFFD] | [#x20000-#x2FFFD] | [#x30000-#x3FFFD] | [#x40000-#x4FFFD] | [#x50000-#x5FFFD] | [#x60000-#x6FFFD] | [#x70000-#x7FFFD] | [#x80000-#x8FFFD] | [#x90000-#x9FFFD] | [#xA0000-#xAFFFD] | [#xB0000-#xBFFFD] | [#xC0000-#xCFFFD] | [#xD0000-#xDFFFD] | [#xE0000-#xEFFFD] | [#xF0000-#xFFFFD] | [#x100000-#x10FFFD] /* Any Unicode character, excluding most control codes, the surrogate blocks, and codes that end with FFFE or FFFF in hexadecimal (up to plane 16). */ ================ Kind regards /kent k
Received on Monday, 27 July 1998 10:54:52 UTC