W3C home > Mailing lists > Public > www-international@w3.org > October to December 2013

I18N-ISSUE-309: Haphazard use of term 'character' [.prep-CSS3-text]

From: Internationalization Working Group Issue Tracker <sysbot+tracker@w3.org>
Date: Fri, 06 Dec 2013 17:40:38 +0000
Message-Id: <E1VozOI-0005LU-VT@stuart.w3.org>
To: www-international@w3.org
I18N-ISSUE-309: Haphazard use of term 'character' [.prep-CSS3-text]


Raised by: Addison Phillips
On product: .prep-CSS3-text

Several members of the I18N WG are concerned about the following paragraph and subsequent use of the term "character" in the current draft:

Within this specification, the ambiguous term character is used as a friendlier synonym for grapheme cluster. See Characters and Properties for how to determine the Unicode properties of a character.

While "character" is a friendlier synonym and other CSS documents do the same "redefinition" for convenience of Spec users, this particular spec really does need to distinguish more clearly between character-as-a-term-or-art and character-as-a-grapheme-synonym.

For example, slightly after the definition above we see the first "ambiguous" use of "character":

The rendering characteristics of a character divided by an element boundary is undefined: it may be rendered as belonging to either side of the boundary, or as some approximation of belonging to both. Authors are forewarned that dividing grapheme clusters by element boundaries may give inconsistent or undesired results.

We would be happier if the words "character" and "grapheme cluster" (particularly in this instance) were used more strictly. As written, this paragraph is not completely clear. An illustration of "a character" being so divided might help. Other cases exist where it isn't clear whether grapheme clusters or code points are intended.
Received on Friday, 6 December 2013 17:40:40 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 22:41:03 UTC