Re: [charmod-norm] Does ZWJ/ZWNJ affect meaning?

@Ladsgroup Thanks for the comment. I'm not quite clear on the example 
at the end. Does the comment:

> make the work ineligible like "بهرهوری" (nonsense)

... mean that the text literally makes no sense without ZWNJ?

Reading this thread today, I'm thinking: should I be getting rid of 
the whole "affects the semantics" idea here? It seems tricky to define
 the boundary between whether a character's effect is "semantic" or 
not. Instead of saying:

> Unicode provides a number of special purpose control characters and 
other invisible markers that 
> help document authors control the appearance or performance of text.
 In poorly implemented 
> applications, these characters can interfere with string matching 
because, while they are not 
> semantically part of the text, they do form part of the encoded 
character sequence. 

I might instead say:

> Unicode provides a number of special purpose control characters and 
other invisible markers that 
> help document authors control the appearance or performance of text.
 Because these characters
> are invisible, users are not always aware of their presence or 
absence. As a result, these
> characters can interfere with string matching when they are part of 
the encoded character 
> sequence but the expected matching text does not include them.

We can then go on to eliminate the "ZW[N]J special case" discussion 
and just talk about each type of invisible control (what it is used 
for, etc.)

-- 
GitHub Notification of comment by aphillips
Please view or discuss this issue at 
https://github.com/w3c/charmod-norm/issues/44#issuecomment-172030905 
using your GitHub account

Received on Friday, 15 January 2016 17:46:02 UTC