Re: [i18n-activity] UTF-16 code points for addressable characters

Whatever it is supposed to mean -- and I think someone is trying to 
get an approximation to a character count, the units should be Unicode
 code points or, if something insists, UTF-32 code units.  I'm not 
quite sure what a UTF-16 code unit is but strongly suspect it invites 
getting tied up with surrogate pairs in a way that would not be very 
helpful.  Of course, it the text is really looking for a count of what
 we used to call "orint positions", the above won't do it.

-- 
GitHub Notification of comment by klensin
Please view or discuss this issue at 
https://github.com/w3c/i18n-activity/issues/216#issuecomment-244523342
 using your GitHub account

Received on Saturday, 3 September 2016 03:14:49 UTC