- From: Richard Ishida <ishida@w3.org>
- Date: Thu, 13 Mar 2014 06:46:44 +0000
- To: public-i18n-indic@w3.org
In 13/03/2014 06:14, Cibu Johny (സിബു) wrote: > One quick feedback on the regular expression approach to decide a > grapheme cluster. In many Indic scripts, whether to display a sequence > contianing <..consonant, virama, consonant..> as a single cluster or to > split them with explicit visible virama is font dependent. > > For example, in Malayalam, sequence S-KHA (സ്ഖ) would be displayed with > with explicit virama in a reformed script font and as a single unit in > traditional script font. I think the important question is whether the whole conjunct should continue to be treated as a unit for first-letter styling, line breaking, vertical arrangements, etc, whether or not the conjunct is expressed using a visible virama (actually, in fact, whether the orthographic syllable continues to be the unit, since it may also include vowel signs and such). Are there any cases where it would not? RI
Received on Thursday, 13 March 2014 06:47:12 UTC