W3C home > Mailing lists > Public > www-international@w3.org > January to March 2016

[charmod-norm] 2.2.1 Canonical vs. Compatibility Equivalence vs Canonical non-equivalence

From: klensin via GitHub <sysbot+gh@w3.org>
Date: Thu, 04 Feb 2016 04:15:31 +0000
To: www-international@w3.org
Message-ID: <issues.opened-131235953-1454559330-sysbot+gh@w3.org>
klensin has just created a new issue for 
https://github.com/w3c/charmod-norm:

== 2.2.1 Canonical vs. Compatibility Equivalence vs Canonical 
non-equivalence ==
Given your Latin-based examples, should the text comment on the 
relationship between U+00F8 and the combination U+006F U+0337.  When 
correctly displayed, they have the same visual appearance.  However, 
normalization is of no help at all.

This is a distant relative of the relationship between U+08A1 and the 
sequence U+0628 U+0654 and between U+0681 and U+076C and the sequences
 that can be used to form the same graphemes.  In none of these cases 
(and many others, some subtle) is normalization helpful.  In many of 
them, there are distinctions between the precomposed and combining 
sequence forms that are a function of language or locale within the 
same script.


See https://github.com/w3c/charmod-norm/issues/69
Further comments on this issue will NOT be notified to this list. If 
you'd like to follow the discussion, please do so by subscribing to 
the issue via the above link. Do not reply to this email.
Received on Thursday, 4 February 2016 04:15:34 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 21 September 2016 22:37:40 UTC