W3C home > Mailing lists > Public > ietf-charsets@w3.org > July to September 2002

RE: Registration of new charset GB18030 (fwd)

From: Francois Yergeau <FYergeau@alis.com>
Date: Mon, 15 Jul 2002 13:18:19 -0400
To: ietf-charsets@iana.org
Message-id: <F7D4BDA0E5A1D14B99D32C022AEB7366549F20@alis-2k.alis.domain>

Lars Marius Garshol wrote:
> The character encoding identification code in the Opera web browser
> does precisely the same thing (it also ignores underscores). We had
> lots of trouble with people mixing up dashes and underscores, and
> inserting them in unexpected places, and did this to reduce our
> ever-increasing list of aliases.

We did the same in the Tango browser c. 1996.  IIRC, we had a three-step
matching procedure:

1) try an exact (but case-insensitive) match
2) try again, ignoring hyphens, underscores and periods
3) try again, ignoring any "x-" prefix

We never had any complaint.

-- 
François Yergeau
Received on Monday, 15 July 2002 13:20:44 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 5 June 2006 15:10:53 GMT