- From: Craig R. Cummings <Craig.Cummings@oracle.com>
- Date: Wed, 22 Aug 2001 11:21:40 -0700
- To: Markus Scherer <markus.scherer@jtcsv.com>, charsets <ietf-charsets@iana.org>
Markus, I believe you are not missing GBK -- IANA is. In the work I did to enable the Oracle XML Parser for a large list of charsets, I have also found the following charsets are missing from IANA. A brief check today indicates that these are still missing, but I could have missed something. EUC-TW ISO2022CN-CNS ISO2022CN-GB MS874 MS932 MS936 MS949 MS950 ISCII GB18030 Additionally, missing from IANA registry were a large number of IBM code pages (CPXXX, CPXXXX, CPXXXXX). I can send these to you or the IETF list upon request, but I'm hesitant to litter just yet. I apologize that I have not yet had time to file RFCs for the addition of all of these charsets. Just the same, should someone else like to file these as RFCs, please don't wait for me. Thanks, Craig R. Cummings Java NLS Architect Oracle Corporation ----- Original Message ----- From: Markus Scherer <markus.scherer@jtcsv.com> To: charsets <ietf-charsets@iana.org> Sent: Wednesday, August 22, 2001 10:30 AM Subject: q about gb 2312/gbk > Hello, I have two questions about GB* simplified-Chinese charsets: > > 1. There are two entries for GB 2312: > 1.a) > Name: GB_2312-80 [RFC1345,KXS2] > MIBenum: 57 > Source: ECMA registry > Alias: iso-ir-58 > Alias: chinese > Alias: csISO58GB231280 > > 1.b) > Name: GB2312 (preferred MIME name) > MIBenum: 2025 > Source: Chinese for People's Republic of China (PRC) mixed one byte, > two byte set: > 20-7E = one byte ASCII > A1-FE = two byte PRC Kanji > See GB 2312-80 > PCL Symbol Set Id: 18C > Alias: csGB2312 > > How are they different? > The second one is clearly the commonly used MBCS charset. > Is the first one the DBCS-only part for ISO 2022, or is it also MBCS? Please clarify. > > > 2. I cannot find a registration for GBK (Microsoft 936). > Am I just missing it? > > markus >
Received on Wednesday, 22 August 2001 14:21:59 UTC