12. Are C1 controls and Unicode non-characters disallowed?

I've added a new issue: 12. Are C1 controls and Unicode non-characters
disallowed?

In XML 1.0 Third Edition, the following text was added to section 2.2,
Characters:

    The characters defined in the following ranges are discouraged. They
    are either control characters or permanently undefined Unicode
    characters:

    [#x7F-#x84], [#x86-#x9F], [#xFDD0-#xFDDF],
    [#x1FFFE-#x1FFFF], [#x2FFFE-#x2FFFF], [#x3FFFE-#x3FFFF],
    [#x4FFFE-#x4FFFF], [#x5FFFE-#x5FFFF], [#x6FFFE-#x6FFFF],
    [#x7FFFE-#x7FFFF], [#x8FFFE-#x8FFFF], [#x9FFFE-#x9FFFF],
    [#xAFFFE-#xAFFFF], [#xBFFFE-#xBFFFF], [#xCFFFE-#xCFFFF],
    [#xDFFFE-#xDFFFF], [#xEFFFE-#xEFFFF], [#xFFFFE-#xFFFFF],
    [#x10FFFE-#x10FFFF].

These code points are either not very useful in interchange (DEL and
the C1 controls, [#x7F-#x84] and [#x86-#x9F], because Unicode doesn't
say what they mean) or are non-characters: code points permanently
reserved from being assigned to characters and meant for internal use
only (all the rest).
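
For concreteness, here is a minimal Python sketch of what checking for
these code points could look like. It assumes the ranges exactly as
quoted above from XML 1.0; the names is_discouraged and find_discouraged
are hypothetical and not part of any XML or MicroXML API.

```python
# Ranges below U+10000, taken verbatim from the quoted list (inclusive).
DISCOURAGED_LOW = [(0x7F, 0x84), (0x86, 0x9F), (0xFDD0, 0xFDDF)]


def is_discouraged(cp):
    """Return True if code point cp falls in one of the quoted ranges."""
    if any(lo <= cp <= hi for lo, hi in DISCOURAGED_LOW):
        return True
    # The remaining entries are the last two code points of each
    # supplementary plane: #x1FFFE-#x1FFFF up to #x10FFFE-#x10FFFF.
    # (#xFFFE and #xFFFF are not in the quoted list; the XML 1.0 Char
    # production already excludes them.)
    return 0x10000 <= cp <= 0x10FFFF and (cp & 0xFFFF) >= 0xFFFE


def find_discouraged(text):
    """List (index, '#xHEX') pairs for discouraged characters in text."""
    return [
        (i, "#x{:X}".format(ord(ch)))
        for i, ch in enumerate(text)
        if is_discouraged(ord(ch))
    ]
```

A parser that banned these code points would reject any input for which
find_discouraged returns a non-empty list; one that merely discouraged
them could report the same list as warnings.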

They couldn't be banned from XML 1.0 because of backward compatibility,
but I'd like to consider banning them from MicroXML.

Comments?

-- 
As you read this, I don't want you to feel      John Cowan
sorry for me, because, I believe everyone       cowan@ccil.org
will die someday.                               http://www.ccil.org/~cowan
        --From a Nigerian-type scam spam

Received on Saturday, 8 September 2012 02:19:35 UTC