W3C home > Mailing lists > Public > public-zhreq@w3.org > December 2014

Solutions to unify middle dot usage in Traditional Chinese

From: Bobby Tung <bobbytung@wanderer.tw>
Date: Wed, 10 Dec 2014 22:23:26 +0800
Message-Id: <68A076EC-022A-4848-99A2-D977D6FCB068@wanderer.tw>
Cc: CJK discussion <public-i18n-cjk@w3.org>, 中文HTML5同樂會ML <public-html-ig-zh@w3.org>, Ken Lunde <lunde@adobe.com>
To: public-zhreq@w3.org
Hello,

There's a problem I found about the middle dot usage in Traditional Chinese.

--Usage

Middle dot for Traditional Chinese has 3 usages list below: 

1, separates translated latin name in Hanzi, e.g. 理查・石田

2, as decimal point in Hanzi e.g. 三・一四

3, separates book, chapter, title e.g.  詩經・魏風・碩鼠

In Traditional Chinese, the Middle dot should be full-width and a filled round dot in the middle.

--Codepoint

There's some codepoints general used for the middle dot in Traditional Chinese.

·	U+00B7 	MIDDLE DOT
‧	U+2027 	HYPHENATION POINT
・	U+30FB 	KATAKANA MIDDLE DOT
.	U+FF0E 	FULLWIDTH FULL STOP

And in Simplified Chinese usage, the middle dot is U+00B7.

U+00B7 from A150 and U+2027 from A145 on BIG 5 code table[1]. 

But I think U+00B7's definition more suitable for the middle dot than U+2027 / U+FF0E. 

--Solutions

Considering about interoperability and codepoint definition, I have 2 proposals.

1. use U+00B7 as general middle dot, if authors want to let it full-width, use U+30FB. But most Chinese fonts do not have the glyph, certainly fallback to Japanese font. [2]

2. use U+00B7 as general middle dot, and in Traditional Chinese subset, let glyph be full-width. 


=====


各位,我發現繁體字的中點在使用上相當混亂,想藉寫中文排版需求時把標準訂下來,提出兩個方案。

先提出繁體字「連接號」(舊稱音節號)使用的狀況:

1, 用來分隔漢譯姓與名,例如:理查・石田

2, 作為漢字數字的小數點,例如:三・一四

3, 用來分隔書、章、作品名,例如:詩經・魏風・碩鼠

而在繁體字的用法上,連接號應該為全形/全角,為置中的實心點。

再來從實際的文件上,會發現有最常使用的四個Codepoints:

·	U+00B7 	MIDDLE DOT
‧	U+2027 	HYPHENATION POINT
・	U+30FB 	KATAKANA MIDDLE DOT
.	U+FF0E 	FULLWIDTH FULL STOP

簡體字則是統一使用U+00B7,而U+00B7來自BIG 5的A150,但我認為U+00B7的定義比較符合使用狀況,所以不考慮使用U+2027與U+FF0E。

所以提出的方案如下:

1, 使用U+00B7作為標準中點,若作者想要全形,則使用U+30FB,但因為這個Codepoint許多中文字型沒有造,所以幾乎一定會Fallback到日文字型。

2, 使用U+00B7作為標準中點,但在繁體字字型中,將其造為全形。


[1]: http://www.khngai.com/chinese/charmap/tblbig.php?page=0 <http://www.khngai.com/chinese/charmap/tblbig.php?page=0>
[2]: http://www.unicode.org/reports/tr11/ <http://www.unicode.org/reports/tr11/>



WANDERER Digital Publishing Inc.
Bobby Tung @bobtung
Mobile:+886-975068558
bobbytung@wanderer.tw
http://wanderer.tw


Received on Wednesday, 10 December 2014 14:24:03 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 19:43:32 UTC