W3C home > Mailing lists > Public > xml-editor@w3.org > July to September 1998

XML spec. grammatical production 2

From: Karlsson Kent - keka <keka@im.se>
Date: Mon, 27 Jul 1998 16:54:46 +0200
Message-Id: <C110A2268F8DD111AA1A00805F85E58D024A21@ntgbg1>
To: "'xml-editor@w3.org'" <xml-editor@w3.org>

XML grammar rule:
[2]  Char ::=  #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] |
	[#x10000-#x10FFFF]
	/*  any Unicode character, excluding the surrogate blocks,
	FFFE, and FFFF. */ 

Comments:

a. The code for "DEL" (7F) should be excluded, since it is a control
"character".

b. 80-9F are unallocated, but are *permanently* unallocated in
Unicode/10646, since these codes are reserved for those who wish to
continue to support some additional control codes used in some older
character encodings (EBCDIC...).  (My understanding is that it is some
kind of private use area for old control "characters".  Nobody is not
supposed to define new ones.)

c. ISO/IEC 10646 excludes all codes that in hexadecimal end with FFFE or
FFFF.  I.e., also 1FFFE, 1FFFF, 2FFFE, 2FFFF, etc.

================
New
[2]  Char ::=  #x9 | #xA | #xD | [#x20-#7E] |
	[#xA0-#xD7FF] | [#xE000-#xFFFD] |
	[#x10000-#x1FFFD] | [#x20000-#x2FFFD] |
 	[#x30000-#x3FFFD] | [#x40000-#x4FFFD] |
 	[#x50000-#x5FFFD] | [#x60000-#x6FFFD] |
 	[#x70000-#x7FFFD] | [#x80000-#x8FFFD] | 
	[#x90000-#x9FFFD] | [#xA0000-#xAFFFD] | 
	[#xB0000-#xBFFFD] | [#xC0000-#xCFFFD] | 
	[#xD0000-#xDFFFD] | [#xE0000-#xEFFFD] | 
	[#xF0000-#xFFFFD] | [#x100000-#x10FFFD] 
	/*  Any Unicode character, excluding most control codes,
	   the surrogate blocks, and codes that end with FFFE or
	   FFFF in hexadecimal (up to plane 16). */
================

			Kind regards
			/kent k
Received on Monday, 27 July 1998 10:54:52 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 7 December 2009 10:59:29 GMT