- From: <ishida@w3.org>
- Date: Fri, 15 Apr 2016 14:31:31 +0100
- To: W3C Digital Publishing IG <public-digipub-ig@w3.org>, www International <www-international@w3.org>
On 06/04/2016 19:48, ishida@w3.org wrote: > This is a question about how quotation marks should behave in > multilingual text. I created some tests for the q element in HTML, with results for major browsers. See https://www.w3.org/International/tests/repo/results/the-q-element SUMMARY [1] all major desktop browsers put quote marks around content using the q element [2] they all use two different quotation marks for nested quotations, but they don't rotate the usage - they just use the secondary quote marks for third and more nested qs [3] Chrome, Opera, Safari and Edge all change quote marks as the language changes. Firefox doesn't. [4] The browsers that are language-sensitive follow the recommendations in the HTML5 spec (taken from CLDR) wrt which quotes are used by which language, except that Safari doesn't do the whole list, and occasionally inverts the primary and secondary quote marks. [5] Only in Firefox however do the quote marks match the HTML5 recommendation when no language is specified. [6] Browsers that are language-sensitive use the CSS rules defined in the HTML5 spec for the default presentation. This means that the quote marks used around a quotation are based on the language of the quotation itself - even at the top level. This means that in an English paragraph our example <p>But Lucy replies: <q lang="fr-CA">Embrassez George de ma part - seulement une fois. Dites-lui, <q>Embrouille.</q></q>.</p> will produce: But Lucy replies: «Embrassez George de ma part - seulement une fois. Dites-lui, ‹Embrouille.› ». instead of what people on this thread expect, which is But Lucy replies: “Embrassez George de ma part - seulement une fois. Dites-lui, ‘Embrouille.’”. A FLY IN THE OINTMENT Florian's CSS :lang(fr-CA) > *:not(q *) { quotes: "« " " »" "‹ " " ›" } :lang(en) > *:not(q *) { quotes: "“" "”" "‘" "’" } produces what's needed for the example as written above. However, there is one situation where it fails, and where afaict, there's not actually a way to produce the desired result. If we change our markup slightly, changing the first q into a span, we introduce the change in surrounding language before the q element is encountered. <p>But Lucy replies: <span lang="fr-CA">Embrassez George de ma part - seulement une fois. Dites-lui, <q>Embrouille.</q></span>.</p> In this case, the quotation is displayed with the quote marks appropriate for the surrounding language, but the surrounding language is not the language of the reader (ie. that of the paragraph, ie. en). The result is: But Lucy replies: Embrassez George de ma part - seulement une fois. Dites-lui, « Embrouille. ». ie. we now have French quotes around the word Embrouille. As failures go, i could imagine far worse. It's still possible to argue that there's some logic in having the French quotes here, even if it's not really what we want, so it's not a complete disaster. I can't really see a way around it, other than to *only* set the default quote marks on the basis of the language declared on the html tag. (The user would have to replicate that using CSS for elements lower down the hierarchy if they have a multilingual document.) Thoughts? ri
Received on Friday, 15 April 2016 13:31:40 UTC