- From: John Cowan <cowan@mercury.ccil.org>
- Date: Fri, 7 Sep 2012 22:19:13 -0400
- To: public-microxml@w3.org
I've added a new issue: 12. Are C1 controls and Unicode non-characters
disallowed?
In XML 1.0 3e, the following text was added to 2.2, Characters:
The characters defined in the following ranges are discouraged. They
are either control characters or permanently undefined Unicode
characters:
[#x7F-#x84], [#x86-#x9F], [#xFDD0-#xFDDF],
[#1FFFE-#x1FFFF], [#2FFFE-#x2FFFF], [#3FFFE-#x3FFFF],
[#4FFFE-#x4FFFF], [#5FFFE-#x5FFFF], [#6FFFE-#x6FFFF],
[#7FFFE-#x7FFFF], [#8FFFE-#x8FFFF], [#9FFFE-#x9FFFF],
[#AFFFE-#xAFFFF], [#BFFFE-#xBFFFF], [#CFFFE-#xCFFFF],
[#DFFFE-#xDFFFF], [#EFFFE-#xEFFFF], [#FFFFE-#xFFFFF],
[#10FFFE-#x10FFFF].
These codepoints are either not very useful in interchange (the C1
controls [#x7F-#x84] and [#x86-#x9F], because Unicode doesn't say
what they mean) or are non-characters, code points permanently reserved
from being assigned to characters and meant for internal use only (all
the rest).
They couldn't be banned from XML 1.0 because of backward compatibility,
but I'd like to consider banning them from MicroXML.
Comments?
--
As you read this, I don't want you to feel John Cowan
sorry for me, because, I believe everyone cowan@ccil.org
will die someday. http://www.ccil.org/~cowan
--From a Nigerian-type scam spam
Received on Saturday, 8 September 2012 02:19:35 UTC