Re: HTML - i18n / NCR & charsets

Keld J|rn Simonsen (keld@dkuug.dk)
Wed, 27 Nov 1996 02:07:26 +0100


Message-Id: <199611270107.CAA29978@dkuug.dk>
From: keld@dkuug.dk (Keld J|rn Simonsen)
Date: Wed, 27 Nov 1996 02:07:26 +0100
In-Reply-To: Misha Wolf <MISHA.WOLF@reuters.com>
To: Misha Wolf <MISHA.WOLF@reuters.com>, www-html <www-html@w3.org>,
        www-international <www-international@w3.org>,
Subject: Re: HTML - i18n / NCR & charsets

Misha Wolf writes:

> If we are considering Web pages using Windows Code Pages, in which 
> illegal numeric character references have been used for characters 
> in the range 80-9F (decimal 128-159) then there will be no clash 
> with anything in Unicode as these values do not represent characters 
> in Unicode or, for that matter, in ISO 8859-X.  A permissive browser 
> will simply map these to the expected characters.

I just checked, the AMD 3 to 10646 says that C1 is reserved
for control characters, and thus it cannot be used for graphic
characters like in CP1251

keld