RE: NNBSP-MVS Impact

I just ran the two methods Mr. Jirimutu mentioned under Java Standard Edition 8 and find the following ...
            Character.isSpaceChar(c180e) = true
            Character.isWhitespace(c180e) = true
            Character.isSpaceChar(c202f) = true
            Character.isWhitespace(c202f)) = false
So, we probably still need an Oracle/Java contact to inform.
Greg



From: Badral S. [mailto:badral@bolorsoft.com]
Sent: Wednesday, July 8, 2015 12:30 PM
To: Greg Eck; jrmt@almas.co.jp; public-i18n-mongolian@w3.org
Subject: Re: NNBSP-MVS Impact

Hi all,
MVS was classified temporally into space class between Unicode 5.0 and Unicode 6.3. So, it's no problem anymore.
There is nothing against it to check previous character of MVS, if we have no performance problems.

Badral

On 08.07.2015 06:07, Greg Eck wrote:
I would add even U+1822-I to the list { O | U | OE | UE + MVS + A | E } as we have about 10 occurrences { I + MVS + A | E } in our database.

The Unicode data on MVS seems to describe its relationship to spacing correctly ...
http://unicode.org/cldr/utility/character.jsp?a=180E
It does appear that the JAVA methods need to be updated.
Character.isSpaceChar() Yes
Character.isWhitespace() Yes
Does anyone have a reference on who could help update JAVA method information on character U+180E?

Greg

From: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp> [mailto:jrmt@almas.co.jp]
Sent: Wednesday, July 8, 2015 7:27 AM
To: 'Badral S.'; public-i18n-mongolian@w3.org<mailto:public-i18n-mongolian@w3.org>
Subject: RE: NNBSP-MVS Impact

Hi Badral,

>Actually, the separated A|E is only used under N | G | H | M | L | Y | R | W in modern Mongolian, maybe additionally have { J + MVS A | E } for only one word.
>but someone raises there are additional { S | SH + MVS A|E}  in ancient Mongolian.
In some case the { W + MVS A|E}, have to be handled as { O | U | OE | UE + MVS + A | E } according to some linguistic professionals.

For example , LINGHU-A, CINU-A etc.

Regards,


Jirimutu
==========================================================
Almas Inc.
101-0021 601 Nitto-Bldg, 6-15-11, Soto-Kanda, Chiyoda-ku, Tokyo
E-Mail: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp>   Mobile : 090-6174-6115
Phone : 03-5688-2081,   Fax : 03-5688-2082
http://www.almas.co.jp/   http://www.compiere-japan.com/
==========================================================



From: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp> [mailto:jrmt@almas.co.jp]
Sent: Wednesday, July 8, 2015 7:53 AM
To: 'Badral S.'; public-i18n-mongolian@w3.org<mailto:public-i18n-mongolian@w3.org>
Subject: RE: NNBSP-MVS Impact

Hi Badral,

>This symbol is used only before final variant of U1820 (A) and U1821 (E) to illustrate ORKHITS. Is it necessary to consider any prefixes? NO.
Actually, the separated A|E is only used under N | G | H | M | L | Y | R | W in modern Mongolian, maybe additionally have { J + MVS A | E } for only one word.
but someone raises there are additional { S | SH + MVS A|E}  in ancient Mongolian.

In my personal opinion, I don't want to see the appearance of other characters followed by MVS + A|E shows unreadable Mongolian.
It will destroy the original Mongolian script's harmony even if it is a wrong word.
It seems an accident like the pen ink breaks suddenly.

In my personal opinion, all of this irregular appearance of MVS should be filtered and , it should be same with  the original forms.
For example  { I + M + T + MVS + A|E  } = { I + M + T + A|E } .

It is just my personal opinion. It seems most of Mongolian can bear it in their mind.

>The only thing, we should consider for MVS is this character never be classified into space or bounder class! It should be in formatter, joiner or other in rendering systems.
I agree this. But actually, the MVS is classified in Space now. I am not sure who is the proper person to raise this issue to correct it.

For example, according to following URL
http://www.fileformat.info/info/unicode/char/180E/index.htm

The Java properties defines.
Character.isSpaceChar() Yes
Character.isWhitespace() Yes


Thanks and Best Regards

Jirimutu
==========================================================
Almas Inc.
101-0021 601 Nitto-Bldg, 6-15-11, Soto-Kanda, Chiyoda-ku, Tokyo
E-Mail: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp>   Mobile : 090-6174-6115
Phone : 03-5688-2081,   Fax : 03-5688-2082
http://www.almas.co.jp/   http://www.compiere-japan.com/
==========================================================



From: Badral S. [mailto:badral@bolorsoft.com]
Sent: Wednesday, July 8, 2015 12:39 AM
To: public-i18n-mongolian@w3.org<mailto:public-i18n-mongolian@w3.org>
Subject: Re: NNBSP-MVS Impact

Hi all,
I don't understand, why MVS is so complicated to handle?
This symbol is used only before final variant of U1820 (A) and U1821 (E) to illustrate ORKHITS. Is it necessary to consider any prefixes? NO. It makes also no sense FVS1, FVS2, FVS3 after MVS because we don't have any variant of MVS. If FVS1-3 occured before MVS, these are only relevant to the previous letter of that characters. The only thing, we should consider for MVS is this character never be classified into space or bounder class! It should be in formatter, joiner or other in rendering systems.
Please redirect me, if I am wrong.

regards,
Badral

On 07.07.2015 14:24, jrmt@almas.co.jp<mailto:jrmt@almas.co.jp> wrote:
Hi Greg Eck,

Thank you very much for your clarification.
I understand your explanation for NNBSP status.
I think there is three direction to select.

1.       Add some supplementary definition to NNBSP to fit the Mongolian Encoding requirement. (maybe it is impossible because of the other languages)

2.       Replace NNBSP usage in Mongolian with other Mongolian Block character, as I mentioned for example U+180F

3.       Bear the restriction of the NNBSP usage and consider to use NNBSP in Mongolian encoding.

(We need to persuade each application one by one to fit Mongolian requirements just like Dr. Quejingzhabu had done with Microsoft)
Anyone have good idea or solution to resolve the issue ?

Anyway we need to continue to discuss the model to solve the Mongolian Suffix Model either use NNBSP or other character.

Let me send out my NNBSP Model Discussion paper to all members. In the discussion paper, I have included the NNBSP replacement proposals as well.
If you do not agree it, just please ignore it and skip into following discussion contents.
I am also updated the MVS Model discussion paper and included Greg's response as well as my comments in it.

Please let me know if anyone have any question about it.

Thanks and Best Regards,


Jirimutu
==========================================================
Almas Inc.
101-0021 601 Nitto-Bldg, 6-15-11, Soto-Kanda, Chiyoda-ku, Tokyo
E-Mail: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp>   Mobile : 090-6174-6115
Phone : 03-5688-2081,   Fax : 03-5688-2082
http://www.almas.co.jp/   http://www.compiere-japan.com/
==========================================================



From: Greg Eck [mailto:greck@postone.net]
Sent: Tuesday, July 7, 2015 5:57 PM
To: public-i18n-mongolian@w3.org<mailto:public-i18n-mongolian@w3.org>
Cc: 'Behdad Esfahbod'
Subject: RE: NNBSP-MVS Impact

Mr. Jirimutu,

I am not the one to answer about redefining the NNBSP since it is defined by the Unicode Consortium.  However, in reading the past posts from Badral and others, it appears that redefinition would be problematic at this point as it is tied to other languages such as French and Russian - stabilization would be the issue. Can someone else answer this question as stated below?

I can represent Dr. Quejingzhabu in his request that upper-level layout features in MS Word dealing with the NNBSP have been broken for some time. I think we had a discussion on this at least two years back. His bug report dealt with word-count across NNBSP-bounded suffixes, along with CTL-RIGHT/CTL-LEFT word skipping across the same NNBSP-bounded suffixes.

Greg

Are there other problems that other some of you have found in dealing with the current implementation of the NNBSP? Let's see if we can put together a list of problems found in the current NNBSP implementation ...
Spell-checking - Badral
Xxx - Mr. Jirimutu
Word-count - Prof. Quejingzhabu
Word-skip (CTL-RIGHT/LEFT) - Prof. Quejingzhabu
...



From: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp> [mailto:jrmt@almas.co.jp]
Sent: Monday, July 6, 2015 9:17 AM
To: Greg Eck; public-i18n-mongolian@w3.org<mailto:public-i18n-mongolian@w3.org>
Cc: 'Behdad Esfahbod'
Subject: RE: NNBSP-MVS Impact

Hi Greg Eck,

Thank you very much. I appreciate your attention.
I will prepare my discussion paper for NNBSP include our AAT implementation logic.
We are discussing in our team how to propose for NNBSP. I will send our discussion paper in one or two days.

As Mr. Badral's mail, the NNBSP is the big problem in the Mongolian Encodings now.
Before explain our solution of NNBSP on AAT,
I would like to ask if is it possible to redefine the Suffix Joiner ?

If it is possible to redefine the Suffix Joiner, I would like to propose U+180F to be the Mongolian Suffix Joiner.
That will be pretty on the code structure and simple on the handle.

If it is impossible to redefine, we have to use NNBSP, I would like to participate into the discussion.

Thanks and Best Regards,

Jirimutu
==========================================================
Almas Inc.
101-0021 601 Nitto-Bldg, 6-15-11, Soto-Kanda, Chiyoda-ku, Tokyo
E-Mail: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp>   Mobile : 090-6174-6115
Phone : 03-5688-2081,   Fax : 03-5688-2082
http://www.almas.co.jp/   http://www.compiere-japan.com/
==========================================================




From: Greg Eck [mailto:greck@postone.net]
Sent: Monday, July 6, 2015 1:17 AM
To: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp>; public-i18n-mongolian@w3.org<mailto:public-i18n-mongolian@w3.org>
Cc: 'Behdad Esfahbod'
Subject: RE: NNBSP-MVS Impact

Mr. Jirimutu,
Thank you for the explanation on AAT's handling of the character preceding the MVS.
Please do let us know how the character preceding the NNBSP is handled also - that will be very helpful.
Greg


From: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp> [mailto:jrmt@almas.co.jp]
Sent: Monday, July 6, 2015 12:04 AM
To: Greg Eck; public-i18n-mongolian@w3.org<mailto:public-i18n-mongolian@w3.org>
Cc: 'Behdad Esfahbod'
Subject: RE: NNBSP-MVS Impact

Hi Greg Eck,

I would like to send my some discussion about the MVS today.
I will follow up the NNBSP soon if I have any comment.

>It would be nice to find out whether Apple rendering systems follow suit.
>If anyone knows of an Apple engineer that we could ask,
>I will follow up on the matter. Or if there are other rendering systems >that we should consider, please bring them up.

We created the Apple AAT Font on our site and Apple handle this as an static machine stream.
In the case all of the rules we defined working well on Mac OS X and iOS system.
The MVS model processing on our AAT font is that

the string <MONG_INITIAL><MONG_MEDIAL>< MONG_LETTER ><MVS><U+1820 | U+1821> where MONG is the range U+1820 - U+18AA,

Our font will tag the MONG_LETTER as <fina>  only if the <MONG_LETTER> is one of <U+1828-n>, <U+182C-n>, <U+182D-g>,

<U+182E-m>, <U+182F-l>, <U+1830-s>, <U+1831-sh>, <U+1835-j>, <U+1836-y>, <U+1837-r>, <U+1838-w> as well as

<U+1823-o>, <U+1824-u>, <U+1825-oe>, <U+1826-ue>. Which Is the Mongolian MVS requested characters.

Note: Apple system site had one bug on the Mac OS X 10.8-10.10.2 and iOS 8.0-8.2

rdar://problem/18483089> REGRESSION: iOS8 did not able to correctly rendering Mongolian vowel separator(180E)

this bug had been fixed on Mac OS X 10.10.3 and iOS 8.3 now.


Thanks and Best Regards,


Jirimutu
==========================================================
Almas Inc.
101-0021 601 Nitto-Bldg, 6-15-11, Soto-Kanda, Chiyoda-ku, Tokyo
E-Mail: jrmt@almas.co.jp<mailto:jrmt@almas.co.jp>   Mobile : 090-6174-6115
Phone : 03-5688-2081,   Fax : 03-5688-2082
http://www.almas.co.jp/   http://www.compiere-japan.com/
==========================================================




From: Greg Eck [mailto:greck@postone.net]
Sent: Saturday, July 4, 2015 11:34 PM
To: public-i18n-mongolian@w3.org<mailto:public-i18n-mongolian@w3.org>
Cc: Behdad Esfahbod
Subject: RE: NNBSP-MVS Impact

Behdad Esfahbod, as the Harfbuzz designer,  was kind enough to answer my  questions of how Harfbuzz handles the MVS/NNBSP context. He confirmed my understanding that Harfbuzz follows the Microsoft Universal Shaping Engine in the following ... Thank you Behdad ...

>>>>>

Given the string <MONG_INITIAL><MONG_MEDIAL>< MONG_LETTER ><NNBSP><MONG_SUFFIX> where MONG is the range U+1820 - U+18AA, Harfbuzz applies the <fina> tag applied to MONG_LETTER

The same processing holds for <MONG_INITIAL><MONG_MEDIAL>< MONG_LETTER ><MVS><U+1820 | U+1821>
>>>>>

It would be nice to find out whether Apple rendering systems follow suit. If anyone knows of an Apple engineer that we could ask, I will follow up on the matter. Or if there are other rendering systems that we should consider, please bring them up.

The DS00 charts have been updated. I am also attaching two files that have been helpful to me in considering the range of usage of the MVS / NNBSP. DS04 deals with MVS usage. DS05 deals with NNBSP usage. Comments/corrections/questions are welcome.

Let's go on to consider six cases where the MVS/NNBSP affect the shaping behavior of the character immediately preceeding/following the MVS/NNBSP - U+1820, U+1828, U+182C, U+182D, U+1835, and U+1836.

Greg Eck



--

Badral Sanlig, Software architect

www.bolorsoft.com<http://www.bolorsoft.com> | www.badral.net<http://www.badral.net>

Bolorsoft LLC, Selbe Khotkhon 40/4 D2, District 11, Ulaanbaatar




--

Badral Sanlig, Software architect

www.bolorsoft.com<http://www.bolorsoft.com> | www.badral.net<http://www.badral.net>

Bolorsoft LLC, Selbe Khotkhon 40/4 D2, District 11, Ulaanbaatar

Received on Wednesday, 8 July 2015 06:47:52 UTC