W3C home > Mailing lists > Public > www-international@w3.org > October to December 2011

Re: Possible issues with the CLDR quote marks info

From: Martin J. Dürst <duerst@it.aoyama.ac.jp>
Date: Sat, 05 Nov 2011 16:37:05 +0900
Message-ID: <4EB4E7A1.5080003@it.aoyama.ac.jp>
To: Richard Ishida <ishida@w3.org>
CC: www International <www-international@w3.org>
Hello Richard,

I suggest you write to the Unicode CLDR TC as soon as possible, telling 
them about this use of their data. I also suggest you submit your 
observations on individual data items to CLDR with the bug reporting 
system they have.

Regards,   Martin.

On 2011/11/05 0:33, Richard Ishida wrote:
> Before I send a note to the Unicode Consortium, I thought I'd check for
> feedback here. Looking through the list of quotation marks that Ian
> Hickson {1} just added to the HTML5 spec I noticed one or two things
> that look like anomalies (in the Unicode data). (That table is generated
> automatically from the CLDR XML files.)
>
> [1] A couple of locales have non-paired punctuation marks for secondary
> quotes. They are af and tg. tg is not yet confirmed, but af is. Is this
> really correct?
>
> [2] The arabic entry has the following:
>
> '\201c' '\201d' '\2018' '\2019'
>
> ie.
>
> “ ‎U+201C LEFT DOUBLE QUOTATION MARK
> ” ‎U+201D RIGHT DOUBLE QUOTATION MARK
> ‘ ‎U+2018 LEFT SINGLE QUOTATION MARK
> ’ ‎U+2019 RIGHT SINGLE QUOTATION MARK
>
> which corresponds to
>
> quotationStart quotationEnd alternateQuotationStart alternateQuotationEnd
>
> I think this is wrong. Since these are not mirrored characters in
> Unicode, surely the order should be
>
> ” ‎U+201D RIGHT DOUBLE QUOTATION MARK
> “ ‎U+201C LEFT DOUBLE QUOTATION MARK
> ’ ‎U+2019 RIGHT SINGLE QUOTATION MARK
> ‘ ‎U+2018 LEFT SINGLE QUOTATION MARK
>
> Same applies for Hebrew and i assume other languages when they are
> written in rtl scripts.
>
> (Note, btw, that these assignments are only default settings. They can
> be changed using CSS if desired, eg. to substitute angle brackets for
> quotes in Arabic text.)
>
> Any thoughts on this?
>
> RI
>
>
> PS: (I guess I need to say ;-) Please keep replies to the questions
> above, rather than moving the discussion (at least in this thread) to
> whether the q element should or should not automatically apply quotation
> marks and if so all the pitfalls that that may entail.
>
>
> {1} http://dev.w3.org/html5/spec/rendering.html#quotes
>
>
Received on Saturday, 5 November 2011 07:40:26 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Saturday, 5 November 2011 07:40:27 GMT