W3C home > Mailing lists > Public > www-international@w3.org > October to December 1996

Re: HTML - i18n / NCR & charsets

From: Francois Yergeau <yergeau@alis.com>
Date: Tue, 26 Nov 1996 17:05:13 -0500
Message-Id: <2.2.32.19961126220513.007123a8@genstar.alis.ca>
To: Misha Wolf <MISHA.WOLF@reuters.com>
Cc: www-html <www-html@w3.org>, www-international <www-international@w3.org>, Unicode <unicode@unicode.org>
À 21:35 26-11-96 -0500, Misha Wolf a écrit :
>If we are considering Web pages using Windows Code Pages, in which 
>illegal numeric character references have been used for characters 
>in the range 80-9F (decimal 128-159) then there will be no clash 
>with anything in Unicode as these values do not represent characters 
>in Unicode or, for that matter, in ISO 8859-X.  A permissive browser 
>will simply map these to the expected characters.

Agreed, but this should not be construed as allowing such misuse of NCRs,
but only as being "liberal in what you accept", in good old Internet tradition.

Somebody else wrote:
>But *AGAIN* I acknowledge that there _should_ be no problems, people
>should not have relied on NCRs in the low top bit range; but they have 
>done so. And if you have easy ways of marking your pages such that you do
>not break excising practice, you should do so.
>
>Dw.

If being liberal is not enough, and special marking is required, then the
broken pages with illegal NCRs should be so marked.  Doing otherwise (à la
text/html.i18n) would seem to indicate that the correct, standard way is
special, whereas the incorrect, non-standard way is normal. Just the
opposite of what a standard means.

-- 
François Yergeau <yergeau@alis.com>
Alis Technologies Inc., Montréal
Tél : +1 (514) 747-2547
Fax : +1 (514) 747-2561
Received on Tuesday, 26 November 1996 17:10:23 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:46 GMT