W3C home > Mailing lists > Public > public-i18n-mongolian@w3.org > October to December 2015

RE: U+1885 / U+1886 changed from Letter to Mark

From: Greg Eck <greck@postone.net>
Date: Sat, 14 Nov 2015 02:41:30 +0000
To: "public-i18n-mongolian@w3.org" <public-i18n-mongolian@w3.org>
Message-ID: <BN4PR10MB093254923C8A61BD39B9FFB3AF100@BN4PR10MB0932.namprd10.prod.outlook.com>
Andrew, Richard W,
Do you have time to comment on the situation with regard to the U+1885/U+1886?
Problem is that I don't know how we can implement these two further as a diacritic unless we modify their feature set.
In the BAITI implementation, the U+18A9 Dagalga shapes correctly (needs a small bit of refinement - but overall is correctly spaced on the left side of the preceeding character).
I think we are in agreement from the earlier posted images of the Baluda/s that the Baluda should be placed to the right side of the preceeding character.
It would seem that the category change to match the U+18A9 of "Mark, Non-spacing" would be appropriate.
I am not sure what the COMBINE does.
Does the BIDI parameter only affect sort/search?
Thanks,
Greg

>>>>>
Sent: Tuesday, November 10, 2015 1:10 AM
Subject: U+1885 / U+1886 changed from Letter to Mark

I had said earlier that the two Baludas (U+1885/1886) would probably be better processed as marks rather than letters.
I find the following differences between the two Baludas and the one unquestionable mark in the Mongolian block - U+18A9 Dagalga ...



U+18A9

U+1885/1886

CATEGORY

Mark, Nonspacing (MN)

Letter, Other

COMBINE

228

0 what does this do?

BIDI

Non-Spacing Mark

Left-to-Right

Character.getDirectionality()

Directionality_Nonspacing_Mark[8]

Left_to_Right 0

Character.getType()

6

5

Character.isJavaIdentifierStart()

No

Yes

Character.isLetter()

No

Yes

Character.isLetterOrDigit()

No

Yes

Character.isUnicodeIdentifierStart()

No

Yes


Given that the Baluda stations itself to the right of an existent vertical letter in similar fashion to the Dagalga stationing itself on the left side of the given vertical letter, I would say that we recommend redefining the features associated with the two Baludas to match the Dagalga. Then test it to verify that shaping behavior is as expected.

If we made the above changes to the feature set of the U+1885/1886 would this allow us to shape the Baludas like we do the Dagalga?

ArabicShaping.txt does not seem to make any distinction between the mark U+18A9 and the two Baludas.

Greg
>>>>>
Received on Saturday, 14 November 2015 02:42:52 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:07:45 UTC