- From: r12a via GitHub <sysbot+gh@w3.org>
- Date: Fri, 09 Sep 2016 16:26:24 +0000
- To: www-international@w3.org
r12a has just labeled an issue for https://github.com/w3c/svgwg as "i18n": == UTF-16 code points for addressable characters == 11.1.1. Definitions https://svgwg.org/svg2-draft/text.html#Definitions > addressable character > A character that is addressable by text positioning attributes and SVG DOM text methods. Characters discarded during layout such as collapsed white space characters are not addressable, neither are characters within an element with a value of none for the display property. Characters are **addressed by their index measured in UTF-16 code units**, prior to applying any text-transform conversions, as described for the methods in the SVGTextContentElement interface; **as a result, a single Unicode character may map to multiple addressable characters**. We hope that it will not be possible to address a single surrogate character, and think that this should be stated in the definition. We also suggest "a single Unicode code point may map to" to clarify the meaning better. We would actually prefer that accessible characters be defined in terms of Unicode code points, rather than UTF-16 code units. Addison proposes the following alternative text for "Characters are addressed..." > The address of a given Unicode character (codepoint) is measured in UTF-16 code units, prior to applying any text-transform conversions, as described for the methods in the SVGTextContentElement interface; as a result, a single Unicode code point may be represented by multiple UTF-16 code units. Finally, a quick example of what it means to address a character means would probably help readers understand the purpose of an addressable character. We struggled with that. See https://github.com/w3c/svgwg/issues/259
Received on Friday, 9 September 2016 16:26:35 UTC