
Re: Fw: [css3-text] Issues that need i18n help

From: Uma Umamaheswaran <umavs@ca.ibm.com>
Date: Tue, 18 Jun 2013 13:50:41 -0400
To: fantasai.lists@inkedblade.net
Cc: www-international@w3.org
Message-ID: <OF7AC8531C.590DB682-ON85257B8E.006110BA-85257B8E.00620695@ca.ibm.com>
>>(Ref: fantasai <fantasai.lists@inkedblade.net> posting of 2013-06-14 
07:20 ....

Issue 3: Anyone know of further information on line-breaking rules
for languages other than Japanese? We have a note collecting references
to further information that could use some help from i18n folks:


Our Thai expert Nattapong Sirilappanich (natta@th.ibm.com) has provided 
the following feedback ... 

The only mandatory line-breaking rule for Thai is "Never break within 
a grapheme cluster".
If this rule is broken, Thai cannot be said to be supported. For example, 
"" can never be the last character on a line and "" can never be the 
first character of a line.
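
A minimal sketch of the mandatory rule, in Python, under stated 
assumptions: clusters are approximated by attaching combining marks 
(Unicode categories Mn, Mc, Me) and SARA AM (U+0E33) to the preceding 
character. This is not the full grapheme cluster algorithm of UAX #29, 
and the names clusters() and allowed_breaks() are hypothetical. The 
offsets it returns are only positions that do not split a cluster; they 
are not real line-break opportunities, which need word segmentation.

```python
import unicodedata

# Assumption: SARA AM (U+0E33) has category Lo, but is treated here as
# extending the preceding cluster.
EXTRA_EXTENDERS = {"\u0e33"}


def is_extender(ch):
    """True if ch should stay attached to the character before it."""
    return unicodedata.category(ch) in ("Mn", "Mc", "Me") or ch in EXTRA_EXTENDERS


def clusters(text):
    """Split text into approximate grapheme clusters."""
    out = []
    for ch in text:
        if out and is_extender(ch):
            out[-1] += ch  # keep the mark with its base character
        else:
            out.append(ch)
    return out


def allowed_breaks(text):
    """Code point offsets at which a break would not split a cluster."""
    offsets, pos = [], 0
    for cluster in clusters(text)[:-1]:
        pos += len(cluster)
        offsets.append(pos)
    return offsets
```

For the six code points U+0E17 U+0E35 U+0E48 U+0E19 U+0E35 U+0E48, 
allowed_breaks() reports only offset 3, the boundary between the two 
consonant-plus-marks clusters.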

If more advanced linguistic tools were available, the rules would become:
- If the line contains more than one word, break before the last word, 
except if the last word consumes more than 10-15% of the line space; then:
-- If the line contains more than one syllable, break before the last 
syllable and hyphenate.
-- Otherwise, break at the grapheme level.
(The problem is rule #2. I don't see any API providing such functionality.)
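
The decision order in these rules could be sketched roughly as below. 
This is only an illustration under several assumptions: choose_break() 
is a hypothetical name, the caller supplies word and syllable 
segmentation (the part no known API provides), width is approximated by 
counting code points, and the threshold defaults to the upper end of the 
10-15% range. A line with a single word is not covered explicitly by the 
rules; here it falls through to the syllable and grapheme cases.

```python
def choose_break(words, line_width, threshold=0.15):
    """Pick a break strategy for one line.

    words: list of words, each given as a list of syllable strings
           (hypothetical segmentation supplied by the caller).
    line_width: available line space, in the same unit as len().
    """
    last_word_len = sum(len(syllable) for syllable in words[-1])

    # More than one word, and the last word is small enough to move down.
    if len(words) > 1 and last_word_len <= threshold * line_width:
        return "before_last_word"

    # Last word is too large (or alone): try a syllable break.
    syllable_count = sum(len(word) for word in words)
    if syllable_count > 1:
        return "before_last_syllable_with_hyphen"

    # Nothing better available: fall back to grapheme cluster boundaries.
    return "at_grapheme_cluster"
```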

I hope this helps..  Uma

V.S. UMAmaheswaran, Ph.D.
Globalization Centre of Competency, IBM Toronto Lab
A3/SZ8, 8200 Warden Avenue, Markham, ON, Canada, L6G1C7; +1 905 413 3474; 
Fax: +1 905 413 4751; TieLine 313-3474; email: umavs@ca.ibm.com
Received on Tuesday, 18 June 2013 17:53:14 UTC
