- From: Martin Dürst via GitHub <sysbot+gh@w3.org>
- Date: Thu, 12 Nov 2015 06:11:30 +0000
- To: www-international@w3.org
I concur with @aphillips, but have some additional comments. First, it's not really about differences between Simplified and Traditional Chinese, because these are distinguished by using different code point when the difference is significant (e.g. 区 vs. 區). The differences in Han character (kanji) shapes between mainland China, Taiwan/HK, Japan, and so on are much finer, much closer to the level of font differences. Users in each country/region are used to read these characters in their preferred shapes, but basic readability isn't affected when another shape is used. As an example, I occasionally get email that displays Japanese text with a Chinese font; although this is somewhat suboptimal, it's nevertheless readable without problems. Depending on the person reading something, I can imagine different preferences by different people for cross-language content. I don't read Chinese, and so cannot speak from direct experience, but I could well imagine that a Japanese native person might want to see even Chinese content displayed using a Japanese font, and vice versa, because such a person is more familiar with the glyph shapes in their native fonts. The main problem would be cases where e.g. some Chinese characters are not available in the Japanese font; this might lead to a ransom note effect which of course would be undesirable. As for the different ways of showing U+005C, this is a hopeless remainder from a time where there were a lot of local code pages. I remember a time when on a German system, the {} frequently used in C were displayed as ö or some such; the rest was done by the viewer. Unfortunately, the yen/won sign effect won't disappear soon because it would require a Y2K-like effort without a deadline. I strongly suggest to recommend the following whenever appropriate: "To denote Japanese Yen, replace U+005C with U+00A5 (¥, Yen symbol in Latin-1), U+FFE5 (¥, full width Yen sign), 円 (Kanji for Yen),..., and to denote Korean Won, replace U+005C with U+FFE6 (₩, full width Won sign),... Only use U+005C for syntactic backslash (e.g. in programming languages). (readers of programming languages at least in Japan are used to see a Yen symbol were other readers would expect a backslash in a program)" Unfortunately, this is the best advice there's at the moment. Relying on language tagging for this issue is not appropriate and not safe enough. -- GitHub Notif of comment by duerst See https://github.com/w3c/presentation-api/issues/218#issuecomment-156012188
Received on Thursday, 12 November 2015 06:11:32 UTC