Re: Using UTF-8 for non-ASCII Characters in URLs

Larry Masinter (masinter@parc.xerox.com)
Tue, 29 Apr 1997 21:09:42 PDT


Message-ID: <3366C606.786A@parc.xerox.com>
Date: Tue, 29 Apr 1997 21:09:42 PDT
From: Larry Masinter <masinter@parc.xerox.com>
To: "Michael Kung <MKUNG.US.ORACLE.COM>" <MKUNG@us.oracle.com>
CC: uri@bunyip.com
Subject: Re: Using UTF-8 for non-ASCII Characters in URLs

This isn't just a "small point", it's essential:

The only way to guarantee "round trip" is to stick to the smallest
repertoire of characters. Clearly you shouldn't enter "http" as
wide characters, and if you have "wide characters" that need
to be distinguished from ASCII characters, you should always encode
them as hex-encoded UTF-8.
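
To make the round-trip point concrete, here is a minimal sketch in Python.
It assumes the standard library's urllib.parse module; the helper names
encode_segment and decode_segment and the sample string are hypothetical,
chosen only for illustration. Each non-ASCII character is converted to its
UTF-8 bytes and each byte is written as %XX, so the result stays within
plain ASCII.

```python
from urllib.parse import quote, unquote


def encode_segment(text: str) -> str:
    """Hex-encode (percent-encode) the UTF-8 bytes of a URL segment.

    Hypothetical helper: ASCII letters, digits and "_.-~" pass through
    unchanged; every other character becomes one %XX per UTF-8 byte.
    """
    return quote(text, safe="", encoding="utf-8")


def decode_segment(encoded: str) -> str:
    """Reverse encode_segment: turn %XX escapes back into UTF-8 text."""
    return unquote(encoded, encoding="utf-8")


# Sample value (an assumption for illustration): "resume" with two
# accented e characters, written with escapes so the source is ASCII.
original = "r\u00e9sum\u00e9"
encoded = encode_segment(original)

print(encoded)  # r%C3%A9sum%C3%A9

# The encoded form uses only ASCII, and decoding restores the original.
assert encoded.isascii()
assert decode_segment(encoded) == original
```

Because the encoded form contains nothing outside ASCII, it can pass
through systems that know nothing about wider character sets and still
decode to the same string on the other end.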