Re: HTML - i18n / NCR & charsets from Dirk.vanGulik@jrc.it on 1996-11-26 (www-html@w3.org from November 1996)

From: <Dirk.vanGulik@jrc.it>
Date: Tue, 26 Nov 1996 21:16:21 +0100 (MET)
To: Misha Wolf <MISHA.WOLF@reuters.com>
Cc: www-html <www-html@w3.org>
Message-Id: <Pine.SOL.3.91.961126211119.8528A-100000@elect6.jrc.it>

On Tue, 26 Nov 1996, Misha Wolf wrote:

> The following extract from RFC 1866, "Hypertext Markup Language - 2.0" shows 
> that legal numeric character references have been based on Unicode for quite 
> some time and certainly prior to the I18N draft.
> 
I quite agree here, and I do acknowledge this; but I do insist on current
practice beeing the problem. Doing a quick scan over all reachable pages 
linked
in from the webdirectory (www.webdirectory.com) last night; I do find a
substancial number of pages which would be broken. About 7%/4K pages. OF
these about a fifth dates of before RFC1866.

But *AGAIN* I acknowledge that there _should_ be no problems, people
should not have relied on NCRs in the low top bit range; but they have 
done so. And if you have easy ways of marking your pages such that you do
not break excising practice, you should do so.

Dw.

Received on Tuesday, 26 November 1996 15:16:59 UTC