Re: UTF-8 and BOM

At 00/08/22 17:41 -0400, wrote:
>      Why do we warn people about BOM but not about surrogates, anyway?  One
>is no more appropriate than the other in canonicalized UTF-8.

The difference is that surrogate pairs are explicitly disallowed
by the relevant specs (ISO 10646, Unicode, RFC 2279), whereas the BOM
issue is not mentioned in RFC 2279 and is, as far as I remember,
explicitly allowed in ISO 10646 and Unicode.
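
A rough sketch of the distinction above, for illustration only. It assumes
Python 3's strict "utf-8" codec as a stand-in for a conforming decoder; the
helper name decodes_as_utf8 is made up for this example and comes from none
of the specs.

```python
# Illustration only: Python 3's strict "utf-8" codec stands in for a
# conforming decoder. The helper name below is hypothetical.

def decodes_as_utf8(data: bytes) -> bool:
    """Return True if the strict UTF-8 codec accepts the byte sequence."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False

# U+FEFF (the BOM) encoded as UTF-8.
BOM = b"\xef\xbb\xbf"

# U+D800 and U+DC00, each surrogate encoded separately as three bytes.
SURROGATE_PAIR = b"\xed\xa0\x80\xed\xb0\x80"

print(decodes_as_utf8(BOM + b"abc"))    # BOM followed by ordinary text
print(decodes_as_utf8(SURROGATE_PAIR))  # individually encoded surrogates
```

With this codec the BOM is accepted and comes through as an ordinary U+FEFF
character at the start of the text, while the separately encoded surrogates
are rejected outright.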

Regards,  Martin.

