W3C home > Mailing lists > Public > www-international@w3.org > October to December 2011

Re: Possible issues with the CLDR quote marks info

From: Kent Karlsson <kent.karlsson14@telia.com>
Date: Sun, 06 Nov 2011 23:57:25 +0200
To: "Martin J. Dürst" <duerst@it.aoyama.ac.jp>, Richard Ishida <ishida@w3.org>
CC: www International <www-international@w3.org>
Message-ID: <CADCCF65.1BD2B%kent.karlsson14@telia.com>
I added a comment to a (new for today) CLDR bug by Mark:
<http://unicode.org/cldr/trac/ticket/4201#comment:2>.

    /Kent K


Den 2011-11-05 09:37, skrev "Martin J. Dürst" <duerst@it.aoyama.ac.jp>:

> Hello Richard,
> 
> I suggest you write to the Unicode CLDR TC as soon as possible, telling
> them about this use of their data. I also suggest you submit your
> observations on individual data items to CLDR with the bug reporting
> system they have.
> 
> Regards,   Martin.
> 
> On 2011/11/05 0:33, Richard Ishida wrote:
>> Before I send a note to the Unicode Consortium, I thought I'd check for
>> feedback here. Looking through the list of quotation marks that Ian
>> Hickson {1} just added to the HTML5 spec I noticed one or two things
>> that look like anomalies (in the Unicode data). (That table is generated
>> automatically from the CLDR XML files.)
>> 
>> [1] A couple of locales have non-paired punctuation marks for secondary
>> quotes. They are af and tg. tg is not yet confirmed, but af is. Is this
>> really correct?
>> 
>> [2] The arabic entry has the following:
>> 
>> '\201c' '\201d' '\2018' '\2019'
>> 
>> ie.
>> 
>> ³ U+201C LEFT DOUBLE QUOTATION MARK
>> ² U+201D RIGHT DOUBLE QUOTATION MARK
>> ΠU+2018 LEFT SINGLE QUOTATION MARK
>> ¹ U+2019 RIGHT SINGLE QUOTATION MARK
>> 
>> which corresponds to
>> 
>> quotationStart quotationEnd alternateQuotationStart alternateQuotationEnd
>> 
>> I think this is wrong. Since these are not mirrored characters in
>> Unicode, surely the order should be
>> 
>> ² U+201D RIGHT DOUBLE QUOTATION MARK
>> ³ U+201C LEFT DOUBLE QUOTATION MARK
>> ¹ U+2019 RIGHT SINGLE QUOTATION MARK
>> ΠU+2018 LEFT SINGLE QUOTATION MARK
>> 
>> Same applies for Hebrew and i assume other languages when they are
>> written in rtl scripts.
>> 
>> (Note, btw, that these assignments are only default settings. They can
>> be changed using CSS if desired, eg. to substitute angle brackets for
>> quotes in Arabic text.)
>> 
>> Any thoughts on this?
>> 
>> RI
>> 
>> 
>> PS: (I guess I need to say ;-) Please keep replies to the questions
>> above, rather than moving the discussion (at least in this thread) to
>> whether the q element should or should not automatically apply quotation
>> marks and if so all the pitfalls that that may entail.
>> 
>> 
>> {1} http://dev.w3.org/html5/spec/rendering.html#quotes
>> 
>> 
> 
Received on Tuesday, 8 November 2011 12:23:52 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 8 November 2011 12:23:53 GMT