Re: [charmod-norm] which characters, exactly, should be removed in the matching algorithm?

The text in that section says of ZWJ/ZWNJ: "Their original use 
was to control ligature formation—either preventing the formation of 
undesirable ligatures or encouraging the formation of desirable ones."
This does not match the history. They **_started_** as "join 
controls" for the Arabic script, where, for example in Persian, they 
are needed to control joining behavior that affects how a word is read 
(that is, its meaning).
Only later were they generalized to break (ZWNJ) or request (ZWJ) 
ligatures, and also to affect conjunct formation in Indic scripts.

-- 
GitHub Notification of comment by asmusf
Please view or discuss this issue at 
https://github.com/w3c/charmod-norm/issues/117#issuecomment-275885015 
using your GitHub account

Received on Sunday, 29 January 2017 00:29:42 UTC