Re: Accept-Charset support from Martin J. Duerst on 1996-12-10 (ietf-http-wg@w3.org from October to December 1996)

From: Martin J. Duerst <mduerst@ifi.unizh.ch>
Date: Tue, 10 Dec 1996 15:39:51 +0100 (MET)
To: Keld J|rn Simonsen <keld@dkuug.dk>
Cc: Koen Holtman <koen@win.tue.nl>, Klaus Weide <kweide@tezcat.com>, www-international@w3.org, masinter@parc.xerox.com, http-wg%cuckoo.hpl.hp.com@hplb.hpl.hp.com
Message-Id: <Pine.SUN.3.95.961210153833.245F-100000@enoshima>

On Sun, 8 Dec 1996, Keld J|rn Simonsen wrote:

> Koen Holtman writes:
> 
> > But skimming the UTF-8 specification, I gather that UTF-8 is an encoding
> > mechanism, not a character set.
> 
> Well, no. UTF8 is an encoding of characters. It implies the character
> repertoire of ISO 10646. So it is a charset in MIME sense, including
> the specific character definitions of 10646. You cannot use UTF8
> to encode Japanese X0208 for example.

Exactly. Just see what UTF is standing for:
UCS Transfer Format. Now UCS is Universal Character Set, which
is the character set defined by ISO 10646.

Regards,	Martin.

Received on Tuesday, 10 December 1996 06:47:33 UTC