W3C home > Mailing lists > Public > public-html-bugzilla@w3.org > March 2010

[Bug 7444] EUC-JP and ISO-2022-JP also need replacement encodings: CP51932 (or eucJP-ms) and CP50221.

From: <bugzilla@wiggum.w3.org>
Date: Thu, 18 Mar 2010 14:01:33 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1NsGI1-00023a-4l@wiggum.w3.org>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=7444


NARUSE, Yui <naruse@airemix.jp> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|VERIFIED                    |REOPENED
         Resolution|REMIND                      |




--- Comment #8 from NARUSE, Yui <naruse@airemix.jp>  2010-03-18 14:01:32 ---
First I described about EUC-JP.

See attached images begin with EUC-JP.
They are showing http://coq.no/X/charset5/test-EUC-JP.php?EUC-JP with
* Internet Explorer 6 on Windows XP
* Firefox 3.6 on Mac OS X 10.5
* Safari 4.0.5 on Mac OS X 10.5
* Google Chrome 5 on Mac OS X 10.5
* Opera 10.0 on Windows Vista

All of them can show
(0) ASCII (yen sign/back solidus is beyond this ticket)
(1) JIS X 0208 before 1990
(2) Half-width katakana
* NEC selected IBM extended characters
  (1st and 2nd character of labeled as `IBM')

IE, Firefox, Chrome and Opera can show
* NEC special characters (labeled as `KanjiTalk 6/7, NEC' and `NEC')

Firefox, Safari, Chrome and Opera can show
* JIS X 0212 derived from IBM extended character (3rd-6th of `IBM')

Firefox, Chrome and Opera can show
(3) JIS X 0212-1990

Safari and Chrome can show
* IBM extended chacater (last one of `IBM')

No one can show
(1) JIS X 0208 after 1990
* DEC Kanji and KanjiTalk

IANA defined EUC-JP as following but real implementations are above.

Name: Extended_UNIX_Code_Packed_Format_for_Japanese
MIBenum: 18
Source: Standardized by OSF, UNIX International, and UNIX Systems
        Laboratories Pacific.  Uses ISO 2022 rules to select
               code set 0: US-ASCII (a single 7-bit byte set)
               code set 1: JIS X0208-1990 (a double 8-bit byte set)
                           restricted to A0-FF in both bytes
               code set 2: Half Width Katakana (a single 7-bit byte set)
                           requiring SS2 as the character prefix
               code set 3: JIS X0212-1990 (a double 7-bit byte set)
                           restricted to A0-FF in both bytes
                           requiring SS3 as the character prefix
Alias: csEUCPkdFmtJapanese
Alias: EUC-JP  (preferred MIME name)

CP51932 is:
(0) ASCII (yen sign/back solidus is beyond this ticket)
(1) JIS X 0208-1983
    NEC special characters
    NEC selected IBM extended characters
(2) Half-width katakana
http://nkf.sourceforge.jp/ucm/cp51932.ucm

All browser without Safari can show this character set.

Safari cannnot show NEC special characters;
but Chrome, whose engine is the same of Safari: WebKit, can show,
so I think this is Safari's bug.


-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Thursday, 18 March 2010 14:01:34 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 18 March 2010 14:01:40 GMT