Re: [css3-gcpm] value of content()

On Fri, Oct 07, 2011 at 03:58:29PM +1100, I wrote:

> one ... issue to consider [is] whitespace handling.

A related issue is the question of what language the resulting
string(...) content should be assumed to have.  This affects things like
glyph choice, the behaviour of text-transform (e.g. what the uppercase of
"i" is), and (of lesser importance, for gcpm purposes) pronunciation if
spoken.  The corresponding string-set text might include multiple
languages.  Thus, it seems clear that the best behaviour (ignoring
implementation cost) would be if language information were considered a
part of the "unstyled text".

If we were to take that approach for language, then an obvious proposal
for the white-space issue would be for whitespace significance
information to be similarly considered part of the "unstyled text" that
gets copied.

[I describe this approach as an "obvious" proposal only by considering
 the XML's native mechanisms for representing language and whitespace
 significance information, which are very similar to each other (xml:lang
 and xml:space attributes).  Whereas when considering HTML, it's not
 quite so natural or appealing a proposal.]

As for how important it is for language-related behaviour to be preserved
("behave correctly"), for purposes of trading off against implementation
costs and likelihood/promptness of this and other features getting
implemented): it's quite natural for headings to include snippets of
foreign-language text, and I imagine (without having had a browse in a
library to check) that it's fairly common to want to apply text-transform
in margin box text.  With the usual disclaimer that I'm not very familiar
with authoring practice & needs, I'd estimate the importance as greater
than the importance of doing the "correct" thing for white-space; though
I'd nevertheless expect that it affects only a very small proportion of
documents, and (as with the white-space issue) there's a workaround in
the form of using running elements instead of named strings.

pjrm.

Received on Wednesday, 12 October 2011 12:52:06 UTC