Re: 12. Are C1 controls and Unicode non-characters disallowed?

On 09/12/2012 09:16 AM, Tony Graham wrote:
>
> I don't know whether this has been discussed, but while the current draft
> specifies UTF-8 only, another way to simplify the character processing
> (post-parser) would be to also specify Normalization Form C [2][3], which
> would mean there would be only one way in MicroXML documents to represent
> particular characters.
>    
NFC is called out in the Editor's Draft. I think the idea is that you can
use whatever form you want, but parsers are free to normalize, so caveat
emptor: you might not get what you expect unless you use NFC.  At least
that was my breezy interpretation :) Read the spec if you want precision...
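
For anyone who wants to see what "only one way to represent a character" means in practice, here is a minimal sketch using Python's standard unicodedata module. It is only an illustration of NFC itself, not of anything the MicroXML draft requires, and the helper name to_nfc is made up for this example.

```python
import unicodedata


def to_nfc(text: str) -> str:
    """Return text in Unicode Normalization Form C (hypothetical helper)."""
    return unicodedata.normalize("NFC", text)


# Two spellings of "e with acute accent":
composed = "\u00e9"      # single precomposed code point U+00E9
decomposed = "e\u0301"   # "e" followed by combining acute accent U+0301

# As raw code point sequences they differ...
assert composed != decomposed

# ...but after NFC normalization they are the same string.
assert to_nfc(composed) == to_nfc(decomposed)
```

So if a parser normalizes and your document used the decomposed form, string comparisons or lengths computed against the original text may not match what comes out the other end.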

-Mike S

Received on Wednesday, 12 September 2012 13:26:01 UTC