Fwd: Fwd: Re: CSS Text issues

Forwarding an email from Swaran Lata containing information that is not 
yet in the latest version of the indic layout requirements doc.

RI



-------- Original Message --------
Subject:  Fwd: Re: CSS Text issues
Date:  Fri, 27 Jun 2014 17:05:58 +0530
From:  Swaran Lata <slata@deity.gov.in>
To:  member-i18n-core@w3.org <member-i18n-core@w3.org>
CC:  Koji Ishii <kojiishi@gluesoft.co.jp>, "Phillips, Addison"
<addison@lab126.com>, fantasai <fantasai@inkedblade.net>, Richard Ishida
<ishida@w3.org>



   Dear All ,
Specific requirements for Indic languages requirments on issue 316  and
334 are as below. May pl be considered.
*[css-text] I18N-ISSUE-316: Line breaking defaults*

In Indian languages, it is recommended that line should be break
preferably at word boundary .If unavoidable the guiding principles are
as below:

Rule 1: New line cannot begin with following symbols/Punctuation marks.
Also these should be retain with the associated text

·Closing brackets

·Devanagari Danda /Purnaviram

·Commas

·Visarga

·Decimal symbols

·Semicolon

·Repetition of punctuation marks such as semicolon with closing
brackets, Semicolon with single/ Double quotes , Closing brackets with
commas/Semicolon etc

·Mathematical operators

Rule 2: The definition of Indic syllable may be used to break the line
and a hyphen should be at the breaking point so that word can be read
intuitively

Rule 3: The hyphenated words can be broken at the hyphen e.g.:नर-नारी
should be treated as:नर- on the first line and नारी on the next line

Rule 4: Expression with mathematical symbol should be treated as single
unit so that at the end of the line expression shouldn’t breaks at
operator level

Rule 5: Breaking should not be allowed at numerical values such as
currency values, year etc. e.g.“100.00” or “10,000”, nor in “12:59”

**

*2.**[css-text] I18N-ISSUE-334: 'letter-spacing' and Indic*

**

For letter spacing in Indian languages it is recommended that spacing
should follow Indic syllable definition. The ABNF form of Indic syllable
is given below :

*V[m] |{CH}C[v][m]|CH*

Rule 1 : V[m]

Rule 2 : {CH}C[v][m]

Rule 3 : CH(This rule is applicable only at the end of the word)

V(Upper case) is complete vowel

m is modifier(Anusvara/Visarga/Chandrabindu)

C is Consonant as per Unicode definition which may or may not include nukta

v (lower case) is any dependent vowel or vowel sign (mātrā)

H is halant / virama

| is a rule seperator

[ ] - The enclosed items is optional under this bracket

{} - The enclosed item/items occurs once or repeated multiple times

Here is the some examples of letter spacing that based on abovedefinition :

1.अं त र्रा ष्ट्री य क र ण

2.स्वा ग त म्

3.सु स ज्जि त

4.स म्प्र ति

The above examples corresponds to Unicode tailored grapheme cluster
instead of extended grapheme cluster. So it is recommended that the
letter spacing in Indian languages should based on the Indic syllable
definition.

The issues may be discussed further.
/Swaran Lata.

Received on Monday, 14 July 2014 17:16:33 UTC