- From: Martin J. Duerst <mduerst@ifi.unizh.ch>
- Date: Thu, 28 Nov 1996 10:49:46 +0100 (MET)
- To: Keld J|rn Simonsen <keld@dkuug.dk>
- cc: Misha Wolf <MISHA.WOLF@reuters.com>, www-html <www-html@w3.org>, www-international <www-international@w3.org>, Unicode <unicode@unicode.org>
On Wed, 27 Nov 1996, Keld J|rn Simonsen wrote:

> Misha Wolf writes:
>
> > If we are considering Web pages using Windows Code Pages, in which
> > illegal numeric character references have been used for characters
> > in the range 80-9F (decimal 128-159) then there will be no clash
> > with anything in Unicode as these values do not represent characters
> > in Unicode or, for that matter, in ISO 8859-X. A permissive browser
> > will simply map these to the expected characters.
>
> I just checked, the AMD 3 to 10646 says that C1 is reserved
> for control characters, and thus it cannot be used for graphic
> characters like in CP1251

The HTML DTDs, at least since 2.0, officially disallow characters in
this range. So, for example, &#128; is illegal in HTML even though it
is defined in AMD 3. It is therefore possible for a *permissive*
browser to use some guessing to cope with these values, which are
illegal in HTML.

Regards, Martin.
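
As a rough illustration of the guessing a permissive browser might do with numeric character references in the range 128-159, here is a minimal sketch. It is not taken from any actual browser. The function name decode_ncr_permissive is hypothetical, the choice of Windows-1252 as the assumed code page is an assumption, and the mapping used is the present-day cp1252 table that ships with Python.

```python
import re

# Decimal numeric character references such as &#147;
_NCR = re.compile(r"&#(\d+);")


def decode_ncr_permissive(text: str) -> str:
    """Expand decimal NCRs, guessing Windows-1252 for values 128-159.

    Hypothetical helper: the cp1252 guess is an assumption, not
    something HTML prescribes.
    """

    def replace(match: re.Match) -> str:
        code = int(match.group(1))
        if 0x80 <= code <= 0x9F:
            # Illegal in HTML: reinterpret the value as a cp1252 byte.
            try:
                return bytes([code]).decode("cp1252")
            except UnicodeDecodeError:
                # Value unassigned in cp1252: leave the reference as-is.
                return match.group(0)
        if code > 0x10FFFF:
            # Beyond the Unicode code space: leave the reference as-is.
            return match.group(0)
        return chr(code)

    return _NCR.sub(replace, text)
```

For instance, decode_ncr_permissive("&#147;quoted&#148;") yields the text wrapped in curly quotation marks (U+201C and U+201D), while a reference that has no assignment in the assumed code page, such as &#129;, is passed through unchanged.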
Received on Thursday, 28 November 1996 05:07:11 UTC