W3C home > Mailing lists > Public > www-html@w3.org > January 2001

Re: Character Entity Reference for Single Quote.

From: Tim Bagot <tsb-w3-html-0002@earth.li>
Date: Fri, 26 Jan 2001 01:08:29 +0000 (UTC)
To: W3C HTML <www-html@w3.org>
Message-ID: <Pine.LNX.4.30.0101260044070.560-100000@windle.hack>
At 2001-01-25T16:23-0800, Russell O'Connor wrote:-

> Actually, Numeric character references refering to ISO 10646 only began
> with HTML 4.0.  But who's counting.
>
> For HTML 2.0, and HTML 3.2, the first 128 characters (0-127) refered to
> ISO 646, and the last 128 characters (128-255) refered to ECMA-94 Right
> Part of Latin Alphabet Nr. 1. ... Well something like that.

Aha, yes. This is a case of the SGML declarations not quite matching the
prose. To be fair, RFC 1866's references to ISO 10646 are largely in the
context of extensions to and future versions of HTML, but it does
nevertheless clearly state that all character references should correspond
to ISO 10646. The (IMHO) rather vague prose of REC-html32 doesn't seem to
address the subject at all. Is this minor confusion a result of lack of
cooperation between SGML and Unicode back in 1995, or just a convenient
semi-arbitrary decision?

Tim Bagot
Received on Thursday, 25 January 2001 20:08:37 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 27 March 2012 18:15:45 GMT