- From: klensin via GitHub <sysbot+gh@w3.org>
- Date: Sat, 03 Sep 2016 03:14:43 +0000
- To: public-i18n-core@w3.org
Whatever it is supposed to mean -- and I think someone is trying to get an approximation to a character count, the units should be Unicode code points or, if something insists, UTF-32 code units. I'm not quite sure what a UTF-16 code unit is but strongly suspect it invites getting tied up with surrogate pairs in a way that would not be very helpful. Of course, it the text is really looking for a count of what we used to call "orint positions", the above won't do it. -- GitHub Notification of comment by klensin Please view or discuss this issue at https://github.com/w3c/i18n-activity/issues/216#issuecomment-244523342 using your GitHub account
Received on Saturday, 3 September 2016 03:14:49 UTC