W3C home > Mailing lists > Public > www-international@w3.org > April to June 2001

RE: Unicode in HTTP streams

From: Marco Cimarosti <marco.cimarosti@essetre.it>
Date: Wed, 16 May 2001 10:34:03 +0200
Message-ID: <27E7FB58F42CD5119C0D0002557C0CCA03F86C@XCHANGE>
To: "'duerst@w3.org'" <duerst@w3.org>, www-international@w3.org

Paul Deuter wrote:
> Rather it seems to me that what is needed is an new HTTP encoding that
> explicitly indicates a Unicode codepoint analogous to the 
> &#xHHHH; format that what invented for this very purpose for HTML.
> In my investigations, I have already seen that some user agents
> will encode Unicode using the %uHHHH format.

How about Unicode values having more than 4 digits?

One possibility is of course using UTF-16, so that high codes become a
couple of surrogates.

But, as you are proposing a new format, wouldn't it be the case of adding a
terminator character, similar to the ";" in HTML, so that the number of
digits may be variable?

_ Marco Cimarosti
Received on Wednesday, 16 May 2001 04:34:31 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 21 September 2016 22:37:20 UTC