Re: 12. Are C1 controls and Unicode non-characters disallowed? from Liam R E Quin on 2012-09-09 (public-microxml@w3.org from September 2012)

From: Liam R E Quin <liam@w3.org>
Date: Sun, 09 Sep 2012 01:43:35 -0400
To: James Clark <jjc@jclark.com>
Cc: John Cowan <cowan@mercury.ccil.org>, public-microxml@w3.org
Message-ID: <1347169415.14263.42.camel@localhost.localdomain>

On Sun, 2012-09-09 at 10:50 +0700, James Clark wrote:
> Writing the production for char like this would, I think, make the logic
> behind the definition clearer:

[..]


> nonCharacterCodePoint ::= [#xFDD0-#xFDEF] | [#xFFFE-#xFFFF] |
> [#x1FFFE-#x1FFFF]
>                      | [#x2FFFE-#x2FFFF] | [#x3FFFE-#x3FFFF] |
> [#x4FFFE-#x4FFFF]

etc

It might be even clearer to ignore history :) and list the codepoints
individually where they are two-character ranges:

nonCharacterCodePoint ::= [#xFDD0-#xFDEF] | #xFFFE | #xFFFF |
#x1FFFE | #x1FFFF | ....

Liam

-- 
Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org freenode/#xml
Co-author, "Frightening children with XML"

Received on Sunday, 9 September 2012 05:44:00 UTC