W3C home > Mailing lists > Public > www-international@w3.org > January to March 2002

Re: UTF-8 supporting Japanese characters

From: Martin Duerst <duerst@w3.org>
Date: Sat, 26 Jan 2002 20:48:13 +0900
Message-Id: <4.2.0.58.J.20020126204149.03d29af8@localhost>
To: Chris Lilley <chris@w3.org>, "souravm" <souravm@infy.com>
Cc: www-international@w3.org
At 20:19 02/01/25 +0100, Chris Lilley wrote:
>On Friday, January 25, 2002, 11:49:07 AM, souravm wrote:
>
>s> Hi All,
>
> >>From some source I came to know that UTF-8 does not support some
>s> Japanese charcaters. Just wanted to verify this point. Is it true ?
>
>Suppose you make up a new Japanese glyph to represent your name. That
>is perfectly legal - it happens for new emperors, for example.

Sorry, but that's not true. Japanese emperors get names composed
of more-or-less everyday Japanese characters, without any special
glyph forms.

For the current and the last two or three emperors, there are
also codepoints in Unicode that combine the two characters of
the name into a single display cell for compact printing with
older technology. But that doesn't mean that new characters are
created.

Regards,    Martin.



>Naturally, any version of unicode published before that date will not
>have this new character.
>
>But that is rather lie saying "I understand the Complete and
>Unabridged Oxford English Dictionary does not list all English words,
>fnowdwaffle" where fnordwaffle is a word whose first literary
>occurence was in this email.
>
>Its true, logically, but does not say very much.
>
>Turning once again to Japanese, Unicode 3.1.1 has quite a lot of
>japanese characters. Looking at the Unihan data file and searching for
>kIRG_JSource shows 13,119 occurences, drawn from JIS X 0208-1990, JIS
>X 0212-1990, JIS X 0213-2000, JIS X 0213-2000 and Unified Japanese IT
>Vendors Contemporary Ideographs, 1993.
>
>Plus of course Hiragana, Katakana, number forms, punctuation, etc.
>
>All of these can be represented in UTF-8 and in UTF-16.
>
>--
>  Chris                            mailto:chris@w3.org
Received on Saturday, 26 January 2002 16:02:30 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:58 GMT