Re: [csswg-drafts] [css-text-3] Allow breaking anywhere when dictionary is missing for SEA scripts (#4284)

I think that in Lao, there should be a way to detect the syllables. ICU supports word boundary analysis/word tokenization, so this is less of an issue now except on the few applications not using ICU (Firefox doesn't yet use ICU's line-breaker, though they use other parts of ICU it seems). 

But, if we want to be safe, and not assume that the browser has ICU support, it would be much more desirable to break at the syllable, rather than cutting one in half. Some vowels in Lao (Thai, Khmer, Burmese) wrap around the nuclear consonant, so if you break it at the wrong place, it cuts your vowel in half between 2 lines (very difficult to read). There is a document I will link to that helps explain this... 

[panl10n.net Syllabification of Lao Script for Line Breaking](http://www.panl10n.net/english/outputs/Working%20Papers/Laos/Microsoft%20Word%20-%206_E_N_296.pdf)

-- 
GitHub Notification of comment by rober42539
Please view or discuss this issue at https://github.com/w3c/csswg-drafts/issues/4284#issuecomment-532042718 using your GitHub account

Received on Tuesday, 17 September 2019 03:26:45 UTC