- From: Xidorn Quan <me@upsuper.org>
- Date: Wed, 27 Jul 2016 00:01:06 +1000
- To: Koji Ishii <kojiishi@gmail.com>, 董福興 <bobbytung@wanderer.tw>
- Cc: CJK discussion <public-i18n-cjk@w3.org>, Makoto Kato <m_kato@ga2.so-net.ne.jp>, 劉慶 <ryukeikun@gmail.com>
- Message-Id: <1469541666.3054986.677164057.47169676@webmail.messagingengine.com>
Hi Koji, Some editors of CJReq and I discussed about this question today. And it seems controversial about which should be mapped to which. We roughly have an agreement that, languages which are used in Taiwan and Hong Kong should be mapped to Tranditional Chinese by default even if they are also used in mainland China, and other languages which are used in mainland China should be mapped to Simplified Chinese by default. My reason is that, Taiwan and Hong Kong have some formal documents about the writing form of the languages they use from their governments, while mainland China doesn't. That said, people who actively write those languages in their "standard" form (rather than just speak) should be more likely from Taiwan or Hong Kong rather than mainland China. Based on that criterion, the mapping should be: Simplified Chinese: * Gan Chinese [gan] * Huizhou Chinese [czh] * Jinyu Chinese [cjy] * Min Bei Chinese [mnp] * Min Dong Chinese [cdo] * Min Zhong Chinese [czo] * Pu-Xian Chinese [cpx] * Wu Chinese [wuu] * Xiang Chinese [hsn] Traditional Chinese: * Hakka Chinese [hak] * Min Nan Chinese [nan] * Yue Chinese [yue] The only thing we currently have no idea is Literary Chinese (Classical Chinese, or kanbun in Japanese). In mainland China, Simplified Chinese characters are used for Literary Chinese, while in Taiwan, Tranditional Chinese characters are used. So basically the situation is similiar to Mandarin Chinese. - Xidorn On Tue, Jul 26, 2016, at 11:32 PM, Koji Ishii wrote: > Hi Bobby, > > 2016-07-26 14:25 GMT+09:00 董福興 <bobbytung@wanderer.tw>: >> Hi Koji, >> >> Yue(粵) follows hk standard that should include HKSCS set around 5,000 >> glyphs that is already in most TC system fonts. >> >> Min(閩)and Hakka(客家) do not have documented subset for now, >> still working on that[1]. But most of the glyphs is T-source, >> so point to TC. >> >> About Wu(吳), I believe that most users read SC, but some glyphs not >> included in PRC standards. > > Not having all glyphs is fine, browsers can find fallback fonts. The > question affects which font do you expect to see, when you set lang > and without explicit font-family (no font-family or generic family), > MS Yahei or MS JhengHei? > > As I replied to Ambrose, web authors can specify the script > explicitly, but the default is needed when the script is omitted. > > Off-topic but "zh-hans-yue" was wrong as I re-read BCP-47, it should > be "yue-hans" or "zh-yue-hans". That makes easier to parse for our > existing code. > >> CC Eric Liu for more information. >> >> [1]: http://bobbytung.github.io/TaigiHakkaIdeograph/ > Some of them must be obvious to you but I have to admit that I'm not > clear on any of the 14 languages, your support is appreciated. > > /koji
Received on Tuesday, 26 July 2016 14:01:35 UTC