Internet Explorer / MLang character encoding alias definitions

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Mon, 30 Jun 2008 21:58:30 +0200
To: www-archive@w3.org
Message-ID: <1qci64582jcfqclr4182b82f5kc1g91c2m@hive.bjoern.hoehrmann.de>

Hi,

  These are the code page alias names supported by my version of
mlang.dll. I used a simple Perl script to extract all the UTF-16
strings from the DLL and then passed each one of them to my resurrected
Win32::MultiLanguage module. This seems to be the most comprehensive
list of them on the web; the only other sources for the alias names are
the sscli20 sources, my tables in HTML Tidy, and some Microsoft Exchange
service pack. Oh, and Masao Goho has what seems to be a code excerpt on
his spaces space.
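
The extraction step described above can be sketched roughly as follows.
This is not the original Perl script, only a minimal Python illustration
of the same idea: scan a binary for runs of printable ASCII characters
stored as UTF-16LE, then keep the ones a charset lookup accepts. The
names `utf16le_strings` and `candidate_aliases` are made up for this
sketch, and `lookup` is a hypothetical stand-in for whatever queries
MLang (here, the Win32::MultiLanguage module).

```python
import re


def utf16le_strings(data: bytes, min_len: int = 2) -> list[str]:
    """Return runs of printable ASCII characters stored as UTF-16LE.

    Each character is expected as its ASCII byte followed by a zero byte.
    The scan does not check 2-byte alignment, so it can produce a few
    false positives; that is acceptable when every candidate is checked
    against a lookup afterwards.
    """
    pattern = re.compile(rb"(?:[\x20-\x7e]\x00){%d,}" % min_len)
    return [m.group().decode("utf-16-le") for m in pattern.finditer(data)]


def candidate_aliases(data: bytes, lookup) -> list[tuple[int, str]]:
    """Keep the strings that `lookup` resolves to a code page.

    `lookup` is a hypothetical callable taking a name and returning a
    code page number, or None when the name is not a known alias.
    """
    found = set()
    for name in utf16le_strings(data):
        code_page = lookup(name)
        if code_page is not None:
            found.add((code_page, name))
    # Sorted by code page, then by name, like the table in this message.
    return sorted(found)


if __name__ == "__main__":
    # Stand-in data: two aliases embedded in unrelated bytes.
    blob = (b"\x00\x01" + "utf-8".encode("utf-16-le") + b"\x00\x00\xff"
            + "cp437".encode("utf-16-le") + b"\x00\x00")
    known = {"utf-8": 65001, "cp437": 437}  # stand-in for the MLang query
    for code_page, name in candidate_aliases(blob, known.get):
        print(code_page, name)
```

To run this against a real file, one would read the DLL with
`open(path, "rb").read()` and supply a real lookup in place of the
dictionary.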

  +----------+-----------------------------------------------+
  | CodePage | Alias                                         |
  +----------+-----------------------------------------------+
  | 37       | IBM037                                        |
  | 37       | cp037                                         |
  | 37       | csIBM037                                      |
  | 37       | ebcdic-cp-ca                                  |
  | 37       | ebcdic-cp-nl                                  |
  | 37       | ebcdic-cp-us                                  |
  | 37       | ebcdic-cp-wt                                  |
  | 437      | 437                                           |
  | 437      | IBM437                                        |
  | 437      | cp437                                         |
  | 437      | csPC8CodePage437                              |
  | 500      | CP500                                         |
  | 500      | IBM500                                        |
  | 500      | csIBM500                                      |
  | 500      | ebcdic-cp-be                                  |
  | 500      | ebcdic-cp-ch                                  |
  | 708      | ASMO-708                                      |
  | 720      | DOS-720                                       |
  | 737      | ibm737                                        |
  | 775      | ibm775                                        |
  | 850      | IBM850                                        |
  | 850      | cp850                                         |
  | 850      | ibm850                                        |
  | 852      | IBM852                                        |
  | 852      | cp852                                         |
  | 852      | ibm852                                        |
  | 855      | IBM855                                        |
  | 855      | cp855                                         |
  | 857      | IBM857                                        |
  | 857      | cp857                                         |
  | 857      | ibm857                                        |
  | 858      | CCSID00858                                    |
  | 858      | CP00858                                       |
  | 858      | IBM00858                                      |
  | 858      | PC-Multilingual-850+euro                      |
  | 858      | cp858                                         |
  | 860      | IBM860                                        |
  | 860      | cp860                                         |
  | 861      | IBM861                                        |
  | 861      | cp861                                         |
  | 861      | ibm861                                        |
  | 862      | DOS-862                                       |
  | 862      | IBM862                                        |
  | 862      | cp862                                         |
  | 863      | IBM863                                        |
  | 863      | cp863                                         |
  | 864      | IBM864                                        |
  | 864      | cp864                                         |
  | 865      | IBM865                                        |
  | 865      | cp865                                         |
  | 866      | IBM866                                        |
  | 866      | cp866                                         |
  | 869      | IBM869                                        |
  | 869      | cp869                                         |
  | 869      | ibm869                                        |
  | 870      | CP870                                         |
  | 870      | IBM870                                        |
  | 870      | csIBM870                                      |
  | 870      | ebcdic-cp-roece                               |
  | 870      | ebcdic-cp-yu                                  |
  | 874      | DOS-874                                       |
  | 874      | TIS-620                                       |
  | 874      | iso-8859-11                                   |
  | 874      | windows-874                                   |
  | 875      | cp875                                         |
  | 932      | csShiftJIS                                    |
  | 932      | csWindows31J                                  |
  | 932      | ms_Kanji                                      |
  | 932      | shift-jis                                     |
  | 932      | shift_jis                                     |
  | 932      | sjis                                          |
  | 932      | x-ms-cp932                                    |
  | 932      | x-sjis                                        |
  | 936      | CN-GB                                         |
  | 936      | GB2312                                        |
  | 936      | GB231280                                      |
  | 936      | GBK                                           |
  | 936      | GB_2312-80                                    |
  | 936      | chinese                                       |
  | 936      | csGB2312                                      |
  | 936      | csGB231280                                    |
  | 936      | csISO58GB231280                               |
  | 936      | gb2312                                        |
  | 936      | iso-ir-58                                     |
  | 949      | KSC5601                                       |
  | 949      | KSC_5601                                      |
  | 949      | csKSC56011987                                 |
  | 949      | iso-ir-149                                    |
  | 949      | korean                                        |
  | 949      | ks-c-5601                                     |
  | 949      | ks-c5601                                      |
  | 949      | ks_c_5601                                     |
  | 949      | ks_c_5601-1987                                |
  | 949      | ks_c_5601-1989                                |
  | 949      | ks_c_5601_1987                                |
  | 950      | Big5                                          |
  | 950      | Big5-HKSCS                                    |
  | 950      | big5                                          |
  | 950      | cn-big5                                       |
  | 950      | csbig5                                        |
  | 950      | x-x-big5                                      |
  | 1026     | CP1026                                        |
  | 1026     | IBM1026                                       |
  | 1026     | csIBM1026                                     |
  | 1047     | IBM01047                                      |
  | 1140     | CCSID01140                                    |
  | 1140     | CP01140                                       |
  | 1140     | IBM01140                                      |
  | 1140     | ebcdic-us-37+euro                             |
  | 1141     | CCSID01141                                    |
  | 1141     | CP01141                                       |
  | 1141     | IBM01141                                      |
  | 1141     | ebcdic-de-273+euro                            |
  | 1142     | CCSID01142                                    |
  | 1142     | CP01142                                       |
  | 1142     | IBM01142                                      |
  | 1142     | ebcdic-dk-277+euro                            |
  | 1142     | ebcdic-no-277+euro                            |
  | 1143     | CCSID01143                                    |
  | 1143     | CP01143                                       |
  | 1143     | IBM01143                                      |
  | 1143     | ebcdic-fi-278+euro                            |
  | 1143     | ebcdic-se-278+euro                            |
  | 1144     | CCSID01144                                    |
  | 1144     | CP01144                                       |
  | 1144     | IBM01144                                      |
  | 1144     | ebcdic-it-280+euro                            |
  | 1145     | CCSID01145                                    |
  | 1145     | CP01145                                       |
  | 1145     | IBM01145                                      |
  | 1145     | ebcdic-es-284+euro                            |
  | 1146     | CCSID01146                                    |
  | 1146     | CP01146                                       |
  | 1146     | IBM01146                                      |
  | 1146     | ebcdic-gb-285+euro                            |
  | 1147     | CCSID01147                                    |
  | 1147     | CP01147                                       |
  | 1147     | IBM01147                                      |
  | 1147     | ebcdic-fr-297+euro                            |
  | 1148     | CCSID01148                                    |
  | 1148     | CP01148                                       |
  | 1148     | IBM01148                                      |
  | 1148     | ebcdic-international-500+euro                 |
  | 1149     | CCSID01149                                    |
  | 1149     | CP01149                                       |
  | 1149     | IBM01149                                      |
  | 1149     | ebcdic-is-871+euro                            |
  | 1200     | UTF-16LE                                      |
  | 1200     | unicode                                       |
  | 1200     | utf-16                                        |
  | 1201     | UTF-16BE                                      |
  | 1201     | unicodeFFFE                                   |
  | 1250     | windows-1250                                  |
  | 1250     | x-cp1250                                      |
  | 1251     | windows-1251                                  |
  | 1251     | x-cp1251                                      |
  | 1252     | Windows-1252                                  |
  | 1252     | windows-1252                                  |
  | 1252     | x-ansi                                        |
  | 1253     | windows-1253                                  |
  | 1254     | Windows-1254                                  |
  | 1254     | windows-1254                                  |
  | 1255     | windows-1255                                  |
  | 1256     | cp1256                                        |
  | 1256     | windows-1256                                  |
  | 1257     | windows-1257                                  |
  | 1258     | windows-1258                                  |
  | 1361     | Johab                                         |
  | 10000    | macintosh                                     |
  | 10001    | x-mac-japanese                                |
  | 10002    | x-mac-chinesetrad                             |
  | 10003    | x-mac-korean                                  |
  | 10004    | x-mac-arabic                                  |
  | 10005    | x-mac-hebrew                                  |
  | 10006    | x-mac-greek                                   |
  | 10007    | x-mac-cyrillic                                |
  | 10008    | x-mac-chinesesimp                             |
  | 10010    | x-mac-romanian                                |
  | 10017    | x-mac-ukrainian                               |
  | 10021    | x-mac-thai                                    |
  | 10029    | x-mac-ce                                      |
  | 10079    | x-mac-icelandic                               |
  | 10081    | x-mac-turkish                                 |
  | 10082    | x-mac-croatian                                |
  | 20000    | x-Chinese-CNS                                 |
  | 20001    | x-cp20001                                     |
  | 20002    | x-Chinese-Eten                                |
  | 20003    | x-cp20003                                     |
  | 20004    | x-cp20004                                     |
  | 20005    | x-cp20005                                     |
  | 20105    | irv                                           |
  | 20105    | x-IA5                                         |
  | 20106    | DIN_66003                                     |
  | 20106    | German                                        |
  | 20106    | x-IA5-German                                  |
  | 20107    | SEN_850200_B                                  |
  | 20107    | Swedish                                       |
  | 20107    | x-IA5-Swedish                                 |
  | 20108    | NS_4551-1                                     |
  | 20108    | Norwegian                                     |
  | 20108    | x-IA5-Norwegian                               |
  | 20127    | ANSI_X3.4-1968                                |
  | 20127    | ANSI_X3.4-1986                                |
  | 20127    | IBM367                                        |
  | 20127    | ISO646-US                                     |
  | 20127    | ISO_646.irv:1991                              |
  | 20127    | ascii                                         |
  | 20127    | cp367                                         |
  | 20127    | csASCII                                       |
  | 20127    | iso-ir-6                                      |
  | 20127    | us                                            |
  | 20127    | us-ascii                                      |
  | 20261    | x-cp20261                                     |
  | 20269    | x-cp20269                                     |
  | 20273    | CP273                                         |
  | 20273    | IBM273                                        |
  | 20273    | csIBM273                                      |
  | 20277    | EBCDIC-CP-DK                                  |
  | 20277    | EBCDIC-CP-NO                                  |
  | 20277    | IBM277                                        |
  | 20277    | csIBM277                                      |
  | 20278    | CP278                                         |
  | 20278    | IBM278                                        |
  | 20278    | csIBM278                                      |
  | 20278    | ebcdic-cp-fi                                  |
  | 20278    | ebcdic-cp-se                                  |
  | 20280    | CP280                                         |
  | 20280    | IBM280                                        |
  | 20280    | csIBM280                                      |
  | 20280    | ebcdic-cp-it                                  |
  | 20284    | CP284                                         |
  | 20284    | IBM284                                        |
  | 20284    | csIBM284                                      |
  | 20284    | ebcdic-cp-es                                  |
  | 20285    | CP285                                         |
  | 20285    | IBM285                                        |
  | 20285    | csIBM285                                      |
  | 20285    | ebcdic-cp-gb                                  |
  | 20290    | EBCDIC-JP-kana                                |
  | 20290    | IBM290                                        |
  | 20290    | cp290                                         |
  | 20290    | csIBM290                                      |
  | 20297    | IBM297                                        |
  | 20297    | cp297                                         |
  | 20297    | csIBM297                                      |
  | 20297    | ebcdic-cp-fr                                  |
  | 20420    | IBM420                                        |
  | 20420    | cp420                                         |
  | 20420    | csIBM420                                      |
  | 20420    | ebcdic-cp-ar1                                 |
  | 20423    | IBM423                                        |
  | 20423    | cp423                                         |
  | 20423    | csIBM423                                      |
  | 20423    | ebcdic-cp-gr                                  |
  | 20424    | IBM424                                        |
  | 20424    | cp424                                         |
  | 20424    | csIBM424                                      |
  | 20424    | ebcdic-cp-he                                  |
  | 20833    | X-EBCDIC-KoreanExtended                       |
  | 20833    | x-EBCDIC-KoreanExtended                       |
  | 20838    | IBM-Thai                                      |
  | 20838    | csIBMThai                                     |
  | 20866    | csKOI8R                                       |
  | 20866    | koi                                           |
  | 20866    | koi8                                          |
  | 20866    | koi8-r                                        |
  | 20866    | koi8r                                         |
  | 20871    | CP871                                         |
  | 20871    | IBM871                                        |
  | 20871    | csIBM871                                      |
  | 20871    | ebcdic-cp-is                                  |
  | 20880    | EBCDIC-Cyrillic                               |
  | 20880    | IBM880                                        |
  | 20880    | cp880                                         |
  | 20880    | csIBM880                                      |
  | 20905    | CP905                                         |
  | 20905    | IBM905                                        |
  | 20905    | csIBM905                                      |
  | 20905    | ebcdic-cp-tr                                  |
  | 20924    | CCSID00924                                    |
  | 20924    | CP00924                                       |
  | 20924    | IBM00924                                      |
  | 20924    | ebcdic-Latin9--euro                           |
  | 20936    | x-cp20936                                     |
  | 20949    | x-cp20949                                     |
  | 21025    | cp1025                                        |
  | 21027    | x-cp21027                                     |
  | 21866    | koi8-ru                                       |
  | 21866    | koi8-u                                        |
  | 28591    | cp819                                         |
  | 28591    | csISOLatin1                                   |
  | 28591    | ibm819                                        |
  | 28591    | iso-8859-1                                    |
  | 28591    | iso-ir-100                                    |
  | 28591    | iso8859-1                                     |
  | 28591    | iso_8859-1                                    |
  | 28591    | iso_8859-1:1987                               |
  | 28591    | l1                                            |
  | 28591    | latin1                                        |
  | 28592    | csISOLatin2                                   |
  | 28592    | iso-8859-2                                    |
  | 28592    | iso-ir-101                                    |
  | 28592    | iso8859-2                                     |
  | 28592    | iso_8859-2                                    |
  | 28592    | iso_8859-2:1987                               |
  | 28592    | l2                                            |
  | 28592    | latin2                                        |
  | 28593    | ISO_8859-3                                    |
  | 28593    | ISO_8859-3:1988                               |
  | 28593    | csISOLatin3                                   |
  | 28593    | iso-8859-3                                    |
  | 28593    | iso-ir-109                                    |
  | 28593    | l3                                            |
  | 28593    | latin3                                        |
  | 28594    | ISO_8859-4                                    |
  | 28594    | ISO_8859-4:1988                               |
  | 28594    | csISOLatin4                                   |
  | 28594    | iso-8859-4                                    |
  | 28594    | iso-ir-110                                    |
  | 28594    | l4                                            |
  | 28594    | latin4                                        |
  | 28595    | ISO_8859-5                                    |
  | 28595    | ISO_8859-5:1988                               |
  | 28595    | csISOLatinCyrillic                            |
  | 28595    | cyrillic                                      |
  | 28595    | iso-8859-5                                    |
  | 28595    | iso-ir-144                                    |
  | 28596    | ECMA-114                                      |
  | 28596    | ISO_8859-6                                    |
  | 28596    | ISO_8859-6:1987                               |
  | 28596    | arabic                                        |
  | 28596    | csISOLatinArabic                              |
  | 28596    | iso-8859-6                                    |
  | 28596    | iso-ir-127                                    |
  | 28597    | ECMA-118                                      |
  | 28597    | ELOT_928                                      |
  | 28597    | ISO_8859-7                                    |
  | 28597    | ISO_8859-7:1987                               |
  | 28597    | csISOLatinGreek                               |
  | 28597    | greek                                         |
  | 28597    | greek8                                        |
  | 28597    | iso-8859-7                                    |
  | 28597    | iso-ir-126                                    |
  | 28598    | ISO-8859-8 Visual                             |
  | 28598    | ISO_8859-8                                    |
  | 28598    | ISO_8859-8:1988                               |
  | 28598    | csISOLatinHebrew                              |
  | 28598    | hebrew                                        |
  | 28598    | iso-8859-8                                    |
  | 28598    | iso-ir-138                                    |
  | 28598    | logical                                       |
  | 28598    | visual                                        |
  | 28599    | ISO_8859-9                                    |
  | 28599    | ISO_8859-9:1989                               |
  | 28599    | csISOLatin5                                   |
  | 28599    | iso-8859-9                                    |
  | 28599    | iso-ir-148                                    |
  | 28599    | l5                                            |
  | 28599    | latin5                                        |
  | 28603    | iso-8859-13                                   |
  | 28605    | ISO_8859-15                                   |
  | 28605    | csISOLatin9                                   |
  | 28605    | iso-8859-15                                   |
  | 28605    | l9                                            |
  | 28605    | latin9                                        |
  | 29001    | x-Europa                                      |
  | 38598    | iso-8859-8-i                                  |
  | 50000    | x-user-defined                                |
  | 50001    | _autodetect_all                               |
  | 50220    | iso-2022-jp                                   |
  | 50221    | csISO2022JP                                   |
  | 50225    | csISO2022KR                                   |
  | 50225    | iso-2022-kr                                   |
  | 50225    | iso-2022-kr-7                                 |
  | 50225    | iso-2022-kr-7bit                              |
  | 50227    | x-cp50227                                     |
  | 50229    | x-cp50229                                     |
  | 50930    | cp930                                         |
  | 50931    | X-EBCDIC-JapaneseAndUSCanada                  |
  | 50931    | x-EBCDIC-JapaneseAndUSCanada                  |
  | 50932    | _autodetect                                   |
  | 50933    | cp933                                         |
  | 50935    | cp935                                         |
  | 50937    | cp937                                         |
  | 50939    | cp939                                         |
  | 50949    | _autodetect_kr                                |
  | 51932    | EUC-JP                                        |
  | 51932    | Extended_UNIX_Code_Packed_Format_for_Japanese |
  | 51932    | csEUCPkdFmtJapanese                           |
  | 51932    | euc-jp                                        |
  | 51932    | iso-2022-jpeuc                                |
  | 51932    | x-euc                                         |
  | 51932    | x-euc-jp                                      |
  | 51936    | EUC-CN                                        |
  | 51936    | euc-cn                                        |
  | 51936    | x-euc-cn                                      |
  | 51949    | csEUCKR                                       |
  | 51949    | euc-kr                                        |
  | 51949    | iso-2022-kr-8                                 |
  | 51949    | iso-2022-kr-8bit                              |
  | 52936    | hz-gb-2312                                    |
  | 54936    | GB18030                                       |
  | 57002    | x-iscii-de                                    |
  | 57003    | x-iscii-be                                    |
  | 57004    | x-iscii-ta                                    |
  | 57005    | x-iscii-te                                    |
  | 57006    | x-iscii-as                                    |
  | 57007    | x-iscii-or                                    |
  | 57008    | x-iscii-ka                                    |
  | 57009    | x-iscii-ma                                    |
  | 57010    | x-iscii-gu                                    |
  | 57011    | x-iscii-pa                                    |
  | 65000    | csUnicode11UTF7                               |
  | 65000    | unicode-1-1-utf-7                             |
  | 65000    | unicode-2-0-utf-7                             |
  | 65000    | utf-7                                         |
  | 65000    | x-unicode-1-1-utf-7                           |
  | 65000    | x-unicode-2-0-utf-7                           |
  | 65001    | unicode-1-1-utf-8                             |
  | 65001    | unicode-2-0-utf-8                             |
  | 65001    | utf-8                                         |
  | 65001    | x-unicode-1-1-utf-8                           |
  | 65001    | x-unicode-2-0-utf-8                           |
  +----------+-----------------------------------------------+
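
For anyone who wants to use the table above programmatically, here is a
minimal Python sketch that reads rows in this exact layout into a
dictionary. The function name `parse_alias_table` is made up for this
example. Keys keep the spelling shown in the table; whether MLang itself
compares names case-insensitively is not stated here, so the sketch does
not fold case.

```python
def parse_alias_table(text: str) -> dict[str, int]:
    """Map each alias in an ASCII table of '| CodePage | Alias |' rows
    to its code page number. Border and header lines are skipped."""
    aliases = {}
    for line in text.splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        # Data rows have exactly two cells, the first one numeric.
        if len(cells) == 2 and cells[0].isdigit():
            aliases[cells[1]] = int(cells[0])
    return aliases


if __name__ == "__main__":
    sample = """
      +----------+-------------------+
      | CodePage | Alias             |
      +----------+-------------------+
      | 437      | cp437             |
      | 28598    | ISO-8859-8 Visual |
      | 65001    | utf-8             |
      +----------+-------------------+
    """
    for alias, code_page in parse_alias_table(sample).items():
        print(alias, code_page)
```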

regards,
-- 
Björn Höhrmann · mailto:bjoern@hoehrmann.de · http://bjoern.hoehrmann.de
Weinh. Str. 22 · Telefon: +49(0)621/4309674 · http://www.bjoernsworld.de
68309 Mannheim · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/ 
Received on Monday, 30 June 2008 19:59:14 GMT
