Re: [charmod-norm] Canonically equivalent but may have different appearance.

If it is a broken font, it is compliant with the Unicode standard.  The renderer would only be non-compliant if it deliberately rendered them differently.  However, I would say that sensitivity to the codepoint sequence means it is not correctly displayed for all canonically equivalent forms.

However, there is one major problem with this approach.  Following the example of the Unicode standards, compatibility ideographs are often rendered differently to their canonical equivalents, e.g. 金 U+F90A CJK COMPATIBILITY IDEOGRAPH-F90A v. 金 U+91D1 CJK UNIFIED IDEOGRAPH-91D1.  Rendering them the same would be inferior, as many pairs are distinguished in a national standard.  This is a problem with the Unicode model.  The modern resolution to this mess is to use variation sequences. 

-- 
GitHub Notification of comment by Richard57
Please view or discuss this issue at https://github.com/w3c/charmod-norm/issues/163#issuecomment-388163247 using your GitHub account

Received on Thursday, 10 May 2018 19:42:34 UTC