[charmod-norm] Not all precomposed characters are reachable by NFC (#190)

r12a has just created a new issue for https://github.com/w3c/charmod-norm:

== Not all precomposed characters are reachable by NFC ==
https://w3c.github.io/charmod-norm/#composition_decomposition

> Text in a Unicode character encoding form (such as UTF-8 or UTF-16) is said to be in NFC if it doesn't contain any combining sequence that could be replaced with a precomposed character ...

Not entirely true. E.g., Indic characters such as U+09DC BENGALI LETTER RRA decompose into consonant + nukta, but are not recomposed by NFC.
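
A quick way to see this, sketched with Python's standard `unicodedata` module (the helper name `codepoints` is just for illustration, and the exact behaviour depends on the Unicode data bundled with the interpreter): U+09DC comes out of NFC still decomposed, whereas a character like U+00E9 round-trips back to its precomposed form.

```python
import unicodedata


def codepoints(text):
    """Format a string as a space-separated list of U+XXXX code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


rra = "\u09DC"  # BENGALI LETTER RRA

# NFD splits RRA into DDA (U+09A1) followed by NUKTA (U+09BC).
rra_nfd = unicodedata.normalize("NFD", rra)

# NFC decomposes first, then recomposes where allowed; RRA is not recomposed.
rra_nfc = unicodedata.normalize("NFC", rra)

# For contrast: U+00E9 decomposes under NFD and is recomposed by NFC.
e_acute = "\u00E9"  # LATIN SMALL LETTER E WITH ACUTE
e_nfd = unicodedata.normalize("NFD", e_acute)
e_roundtrip = unicodedata.normalize("NFC", e_nfd)

print("RRA NFD:       ", codepoints(rra_nfd))
print("RRA NFC:       ", codepoints(rra_nfc))
print("e-acute NFD:   ", codepoints(e_nfd))
print("e-acute NFD->NFC:", codepoints(e_roundtrip))
```

So text containing U+09A1 U+09BC is in NFC even though a precomposed character with that canonical decomposition exists.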

Is it worth tweaking the text to accommodate that?

Please view or discuss this issue at https://github.com/w3c/charmod-norm/issues/190 using your GitHub account

Received on Thursday, 17 January 2019 17:39:32 UTC