W3C home > Mailing lists > Public > ietf-charsets@w3.org > April to June 2003

Revision of UTF-8 history in draft-yergeau-rfc2279bis-05.txt

From: Markus Kuhn <Markus.Kuhn@cl.cam.ac.uk>
Date: Thu, 12 Jun 2003 19:06:50 +0100
To: ietf-charsets@iana.org
Message-id: <E19QWTL-0003i4-00@wisbech.cl.cam.ac.uk>

http://www.ietf.org/internet-drafts/draft-yergeau-rfc2279bis-05.txt says:
>   UTF-8 was originally a project of the X/Open Joint
>   Internationalization Group XOJIG with the objective to specify a File
>   System Safe UCS Transformation Format [FSS_UTF] that is compatible
>   with UNIX systems, supporting multilingual text in a single encoding.
>   The original authors were Gary Miller, Greger Leijonhufvud and John
>   Entenmann.  Later, Ken Thompson and Rob Pike did significant work for
>   the formal definition of UTF-8.

The currently ongoing revision of the UTF-8 RFC may be a good
opportunity to unrewrite the history of this encoding.
What we know today as UTF-8 was actually designed by Ken Thompson in the
presence of Rob Pike during the evening hours of 1992-09-02 in a New
Jersey diner. It was then taken on board the standards bandwagon by the
X/Open joint i18n group, who had drafted an earlier FSS-UTF shortly
before (the one you quote), which however had less useful
synchronization properties and was therefore quickly forgotten about.

See my short UTF-8 history on


for details and witness testimony.


Markus Kuhn, Computer Laboratory, University of Cambridge
http://www.cl.cam.ac.uk/~mgk25/ || CB3 0FD, Great Britain
Received on Thursday, 12 June 2003 14:14:26 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:52:20 UTC