Re: charsets vs codepages

Drazen Kacar (dave@fly.cc.fer.hr)
Wed, 10 Jul 1996 04:18:18 +0200 (MET DST)


From: Drazen Kacar <dave@fly.cc.fer.hr>
Message-Id: <199607100218.EAA03345@fly.cc.fer.hr>
Subject: Re: charsets vs codepages
To: boo@best.com (Walter Ian Kaye)
Date: Wed, 10 Jul 1996 04:18:18 +0200 (MET DST)
Cc: www-html@w3.org
In-Reply-To: <v03007809ae087a87e02d@[205.149.180.135]> from Walter Ian Kaye at "Jul 9, 96 02:35:46 pm"

Walter Ian Kaye wrote:
> 
>  The following table lists the code pages supported by FoxPro.
>  Code Page   Platform                   Code Page
>                                         Identifier
>  Code        Platform
>  437         U.S. MS-DOS                x01
>  850         International MS-DOS       x02
>  852         Eastern European MS-DOS    x64
>  857         Turkish MS-DOS             x6B
>  861         Icelandic MS-DOS           x67
>  865         Nordic MS-DOS              x66
>  866         Russian MS-DOS             x65

[3 code pages omitted]

Microsoft didn't make those up. IBM did. All of them are registered with IANA.
Names are IBMxxx, with several aliases. They are all defined in RFC 1345.
As an example here's the entry for 852:

Name: IBM852                                              [RFC1345,KXS2]
MIBenum: 2010
Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
Alias: cp852
Alias: 852
Alias: csPCp852

> 
>  1250        Eastern European Windows   xC8
>  1251        Russian Windows            xC9
>  1252        Windows ANSI               x03
>  1253        Greek Windows              xCB
>  1254        Turkish Windows            xCA
> 
These are Microsoft's inventions, I believe. Nothing with those numbers exists
in IANA registry, but there are two entries with this description:

   Name: ISO-8859-2-Windows-Latin-2                               [HP-PCL5]
   MIBenum: 2002
   Source: Extended ISO 8859-2.  Latin-2 for Windows 3.1.
           PCL Symbol Set id: 9E
   Alias: csWindows31Latin2
   
   Name: ISO-8859-5-Windows-Latin-5                               [HP-PCL5]
   MIBenum: 2003
   Source: Extended ISO 8859-5.  Latin-5 for Windows 3.1
           PCL Symbol Set id: 5T
   Alias: csWindows31Latin5

I think that 1250 is ISO-8859-2-Windows-Latin-2 and 1251 is
ISO-8859-5-Windows-Latin-5. 1250 is not only extended ISO 8859-2, though.
It's a nightmare.

-- 
Life is a sexually transmitted disease.

dave@fly.cc.fer.hr
dave@zemris.fer.hr