Re: 12. Are C1 controls and Unicode non-characters disallowed?

On Sun, 2012-09-09 at 10:50 +0700, James Clark wrote:
> Writing the production for char like this would, I think, make the logic
> behind the definition clearer:

[..]


> nonCharacterCodePoint ::= [#xFDD0-#xFDEF] | [#xFFFE-#xFFFF] | [#x1FFFE-#x1FFFF]
>                      | [#x2FFFE-#x2FFFF] | [#x3FFFE-#x3FFFF] | [#x4FFFE-#x4FFFF]

etc.

It might be even clearer to ignore history :) and list the code points
individually wherever a range covers only two of them:

nonCharacterCodePoint ::= [#xFDD0-#xFDEF] | #xFFFE | #xFFFF |
#x1FFFE | #x1FFFF | ....
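
For anyone who wants to generate the list rather than type it, here is a
rough Python sketch. It assumes the Unicode Standard's definition of
noncharacters (the block U+FDD0..U+FDEF plus the last two code points of
each of the 17 planes); the function names are made up for illustration
and are not taken from any spec or library.

```python
def non_character_code_points():
    """Return the Unicode noncharacter code points in ascending order.

    Assumption: noncharacters are U+FDD0..U+FDEF plus U+nFFFE and U+nFFFF
    for every plane n from 0 to 16, per the Unicode Standard.
    """
    points = list(range(0xFDD0, 0xFDEF + 1))  # the contiguous block of 32
    for plane in range(0x11):                 # planes 0 through 16
        base = plane << 16
        points.append(base | 0xFFFE)
        points.append(base | 0xFFFF)
    return points


def as_production():
    """Format the same set in the style of the production above.

    The contiguous block stays a range; the per-plane pairs are listed
    individually instead of as two-code-point ranges.
    """
    alternatives = ["[#xFDD0-#xFDEF]"]
    for plane in range(0x11):
        base = plane << 16
        alternatives.append("#x%X" % (base | 0xFFFE))
        alternatives.append("#x%X" % (base | 0xFFFF))
    return "nonCharacterCodePoint ::= " + " | ".join(alternatives)


if __name__ == "__main__":
    print(as_production())
```

Calling as_production() gives a single long line, which would need
wrapping by hand before it went into a spec.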

Liam

-- 
Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org freenode/#xml
Co-author, "Frightening children with XML"

Received on Sunday, 9 September 2012 05:44:00 UTC