I18n comment: defining letter

Comment from the i18n review of:
http://www.w3.org/TR/2005/WD-css3-selectors-20051215/

Comment 20
At http://www.w3.org/International/reviews/0601-css3-selectors/
Editorial/substantive: S
Location in reviewed document:
Sec. 7.2 [http://www.w3.org/TR/2005/WD-css3-selectors-20051215/#pseudo-elements]

Comment:
There should be a definition of what constitutes a 'letter'. We propose that this should be the first *grapheme cluster* (after any punctuation such as you list).

We believe that the specification must, as a minimum, require a letter to be defined as a 'default grapheme cluster', as defined by Unicode 
http://www.unicode.org/reports/tr29/#Grapheme_Cluster_Boundaries [http://www.unicode.org/reports/tr29/#Grapheme_Cluster_Boundaries]
.

This would mean, for example, that 0065: e LATIN SMALL LETTER E + 0301: ́ COMBINING ACUTE ACCENT would be handled appropriately.


The specification should then also state that implementors should implement additional rules based on the language of the text where necesssary.

This may be particularly important for scripts such as those of south asia, where an initial character appears as part of a syllabic group and may be preceded on the display by a later character, or where the initial character is a ligature.


The i18n WG will make some attempts toinvestigate further the applicability of ::first-letter styling to other scripts and report back.



> From: Daniel Glazman 
> [mailto:daniel.glazman@disruptive-innovations.com] 
> Sent: 20 January 2006 16:00

> #20 oh, no please... That's certainly something I don't want 
> us to dive into.
>      We could link to a definition, but adding one ourselves 
> is too dangerous.

Received on Saturday, 21 January 2006 10:52:35 UTC