RE: UAX #29, Unicode Text Segmentation, update to improve Mongolian word segmentation from Greg Eck on 2015-10-03 (public-i18n-mongolian@w3.org from October to December 2015)

From: Greg Eck <greck@postone.net>
Date: Sat, 3 Oct 2015 14:23:46 +0000
To: Richard Ishida <ishida@w3.org>, "public-i18n-mongolian@w3.org" <public-i18n-mongolian@w3.org>
Message-ID: <SN1PR10MB09432034F82E13D65DDF3D14AF4A0@SN1PR10MB0943.namprd10.prod.outlook.com>

Hi Richard (Ishida),

Could you convey my thanks to the UTC, as a representative of the discussion group, for their pushing through on the NNBSP changes. This will go a long way. We look forward to the changes to be seen in Mongolian applications' handling of NNBSP-words in the areas of words not breaking as these changes are implemented around the world.

Thanks,
Greg

-----Original Message-----
From: Richard Ishida [mailto:ishida@w3.org] 
Sent: Thursday, October 1, 2015 6:43 PM
To: public-i18n-mongolian@w3.org
Subject: Fwd: UAX #29, Unicode Text Segmentation, update to improve Mongolian word segmentation

FYI

>>>>
-------- Forwarded Message --------
Subject:  UAX #29, Unicode Text Segmentation, update to improve
Mongolian word segmentation
Date:  Wed, 30 Sep 2015 14:04:45 -0700
From:  announcements@unicode.org
Reply-To:  root@unicode.org
To:  announcements@unicode.org

/Unicode Standard Annex #29, Unicode Text Segmentation/, will be updated for Unicode 9.0. A draft of the proposed update <http://www.unicode.org/review/pri306/> is available for general public review and comment.

The Word_Break classification of U+202F NARROW NO-BREAK SPACE (NNBSP) is revised to correct the text segmentation behavior of U+202F for Mongolian usage. For further background on this issue and possible ways to address it, see PRI #308 <http://www.unicode.org/review/pri308/>,
/Property Change for U+202F NARROW NO-BREAK SPACE (NNBSP)/.

In this revision, the formerly empty Prepend class of the Grapheme_Cluster_Break property is redefined to consist of all prefixed format control characters and a few other characters with certain Indic_Syllabic_Category property values.

The corresponding property value changes will be incorporated in the UCD data files for Unicode 9.0.

http://blog.unicode.org/2015/09/uax-29-unicode-text-segmentation-update.html

>>>>>

Received on Saturday, 3 October 2015 14:24:39 UTC