Feedback on EXI: Appendix E deriving character sets from XML schema regular expressions

I think the terminology used In Appendix E can be improved.

 From the title, you would expect this appendix to give a function for 
guessing the character set: e.g. ISO 9959-1, or Latin1 or whatever.

It is much better to use the industry jargon from the Unicode Encoding 
Character Model: http://unicode.org/reports/tr17/
 
Using that terminology, the appendix should substitute "character 
repertoire" where is currently uses "character set", and "repertoire" 
instead of "charset". 

Flowing through,  7.1.10.1 Restricted Character Sets should have "coded 
character set (CCS) for restricted character repertoire" rather than 
"restricted character set". 

Cheers
Rick Jelliffe

Received on Saturday, 20 September 2008 13:48:02 UTC