W3C home > Mailing lists > Public > public-i18n-cjk@w3.org > April to June 2014

Re: [i18N-ISSUE-255] CJK Compatibility ideographs

From: Asmus Freytag <asmusf@ix.netcom.com>
Date: Wed, 30 Apr 2014 16:22:41 -0700
Message-ID: <536185C1.8090108@ix.netcom.com>
To: Richard Ishida <ishida@w3.org>, hyunyoung kim <corolla.kim@gmail.com>, public-i18n-cjk@w3c.org
CC: 임순범 <sblim@sookmyung.ac.kr>
On 4/30/2014 9:52 AM, Richard Ishida wrote:
> The description of this issue can be found at 
> http://www.w3.org/International/track/issues/255
>
>
While I have no quibbles with the problem statement "While CJK 
compatibility ideographs (U+F900~ and up) are mentioned several times, 
there's no mention of CJK ideographs (block U+4E00~ and others). Korean 
nowadays is normally written in Hangul, but still uses CJK ideographs 
occasionally", I believe neither kind of ideograph is "punctuation".

So, I would not expect either one to be listed under heading 2.1.2 (at 
least not without wording change of the heading).

Compatibility *ideographs* are something different from compatibility 
*Symbols and Punctuation*....

So, for 2.1.2, the mention of F900 is questionable and should not be 
compounded by adding 4E00... more I can't tell without delving deeper 
into the document and its history than I am able at this point.

A./
> On 30/04/2014 01:48, hyunyoung kim wrote:
>> Hello
>>
>> You are right. Korean is frequently using Both CJK compatibility
>> ideographs (U+F900~ and up) and CJK ideographs (block U+4E00~ and 
>> others).
>>
>> So I added "CJK ideographs" at 3 places. Please check the followings;
>>
>> *2.1.2 Hangul Punctuation Mark Code Ranges based on Unicode*
>>
>> Following punctuation marks are used in a Hangul environment. (Refer to
>> Appendix A for the code table.)
>>
>>   * Basic Latin (U+0020~U+007F): Latin alphabet and numerals
>>   * General Punctuation (U+2010~)
>>   * Superscripts and Subscripts (U+2070~)
>>   * Currency Symbols (U+20A0~)
>>   * Letterlike Symbols (U+2100~)
>>   * Number Forms (U+2050~)
>>   * Arrows (U+2190~)
>>   * Mathematical Operators (U+2200~)
>>   * Enclosed Alphanumerics (U+2460~)
>>   * Box Drawing (U+2500~)
>>   * Block Elements (U+2580~)
>>   * Geometric Shapes (U+25A0~)
>>   * Miscellaneous Symbols (U+2600~)
>>   * Dingbats (U+2700~)
>>   * CJK Symbols and Punctuation (U+3000~)
>>   * Enclosed CJK Letters and Months (U+3200~)
>>   * CJK Ideographs (U+4E00~)
>>   * CJK Compatibility Ideographs (U+F900~)
>>   * CJK Compatibility Symbols and Punctuation for Vertical Writing
>>     (U+FE30~FE48)
>>
>>
>> *3.1.2 Examples for Grouping by Typographic Characteristic of Characters
>> and Symbols*
>>
>> In a Hangul environment, characters and symbols are classified by
>> typographic characteristics, into 32 classes.
>>
>> *cl20. Hanja (CJK Ideographic Characters) *
>>
>> (U+F900~)
>>
>> *cl21. Proportional Width Latin Alphabet *
>>
>> (U+0041~U+005A, U+0061~U+007A)
>>
>> *cl22. Full-Width Unit Symbols *
>>
>> ㎥; mainly used for vertical writing
>>
>> *cl23. Latin Alphabet Space *
>>
>> (U+0020)
>>
>> *cl24. Latin Alphabet Characters *
>>
>> (U+002C~… ~U+261E)
>>
>> *cl25. Proportional-Width Numerals *
>>
>> (U+0030~U+0039)
>>
>> *cl26. Fixed Width Half-width Numerals *
>>
>> (U+0030~U+0039)
>>
>> *cl27. Fixed Width Full-width Numerals *
>>
>> (U+0020~U+007F)
>>
>> *cl28. Fixed Width Full-width Numerals *
>>
>> (U+FF21~U+FF5A)
>>
>> *cl29. CJK Ideographs*
>>
>>                    (U+4E00~U+9FFF)
>>
>>
>> *3.2.1 Character Types in Hangul Writing*
>>
>>  1. Hangul Compatibility Jamo (U+3130~)
>>  2. Enclosed CJK Characters and Numerals (U+3200~)
>>  3. CJK Ideographs (U+4E00~)
>>  4. Hangul Jamo Extended-A (U+A960~)
>>  5. Hangul Precomposed Syllables (U+AC00~U+D7A3)
>>  6. Hangul Jamo Extended-B (U+D7B0~)
>>  7. CJK Compatibility Ideographs (U+F900~)
>>
>>
>> Regards
>> HyunYoung Kim
>
>
Received on Wednesday, 30 April 2014 23:22:57 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 15:59:20 UTC