W3C home > Mailing lists > Public > public-i18n-mongolian@w3.org > October to December 2015

Fwd: UAX #29, Unicode Text Segmentation, update to improve Mongolian word segmentation

From: Richard Ishida <ishida@w3.org>
Date: Thu, 1 Oct 2015 11:42:32 +0100
To: "public-i18n-mongolian@w3.org" <public-i18n-mongolian@w3.org>
Message-ID: <560D0E18.1090002@w3.org>
FYI


-------- Forwarded Message --------
Subject: 	UAX #29, Unicode Text Segmentation, update to improve
Mongolian word segmentation
Date: 	Wed, 30 Sep 2015 14:04:45 -0700
From: 	announcements@unicode.org
Reply-To: 	root@unicode.org
To: 	announcements@unicode.org



/Unicode Standard Annex #29, Unicode Text Segmentation/, will be updated 
for Unicode 9.0. A draft of the proposed update 
<http://www.unicode.org/review/pri306/> is available for general public 
review and comment.

The Word_Break classification of U+202F NARROW NO-BREAK SPACE (NNBSP) is 
revised to correct the text segmentation behavior of U+202F for 
Mongolian usage. For further background on this issue and possible ways 
to address it, see PRI #308 <http://www.unicode.org/review/pri308/>, 
/Property Change for U+202F NARROW NO-BREAK SPACE (NNBSP)/.

In this revision, the formerly empty Prepend class of the 
Grapheme_Cluster_Break property is redefined to consist of all prefixed 
format control characters and a few other characters with certain 
Indic_Syllabic_Category property values.

The corresponding property value changes will be incorporated in the UCD 
data files for Unicode 9.0.

http://blog.unicode.org/2015/09/uax-29-unicode-text-segmentation-update.html
Received on Thursday, 1 October 2015 10:42:41 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:07:44 UTC