Re: Inputting Unicode from Browser

From: Martin J. Duerst <duerst@w3.org>
Date: Fri, 18 Aug 2000 14:28:50 +0900
To: "Stephen Toner" <Stephen.Toner@virtualaccess.com>, www-international@w3.org
Stephen - Can you say which browser (and which server) you used?
Can you give some examples of your pages?

Regards,  Martin.

At 00/08/18 08:13 +0900, Stephen Toner wrote:
>I have been trying to input characters from various languages into a form
>in my browser.  I then want to store this text as Unicode in a
>database.  I have found that if I set the charset to a Western language, or
>if I leave it blank, ordinary ASCII characters are read in as ASCII
>and characters such as Japanese are converted to the &#xxxxx; form.  Is this
>When I set the charset to "UTF-8", characters are changed into combinations
>of strange boxes and symbols.  I thought that this was maybe the Unicode
>for multibyte characters simply being displayed as their single
>bytes.  However, some of these then aren't displayed correctly on output.
>I would appreciate any advice, as the articles that I have read seem to
>contradict each other: some say that &#xxxx; is the Unicode for that
>character and others say something else.  Also, most articles seem to
>ignore the inputting aspect.
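
For reference, the two behaviours described in the quoted message can be
reproduced with a minimal sketch. It assumes Python 3, a sample Japanese
string, and ISO-8859-1 standing in for "a Western charset"; none of these
come from the original thread, and the actual browser and server involved
are not known. In the &#N; form, N is the character's Unicode code point
written in decimal (or hexadecimal after "&#x"), whereas UTF-8 is a byte
encoding of that same code point.

```python
import html

# Hypothetical sample text, chosen only for illustration.
text = "日本"

# Case 1: the form's charset (assumed here: ISO-8859-1) cannot represent
# the characters, so they are replaced by numeric character references.
as_ncr = text.encode("iso-8859-1", errors="xmlcharrefreplace").decode("ascii")

# Case 2: the form is submitted as UTF-8, but the receiving side reads
# each byte as one ISO-8859-1 character, which shows up as odd symbols.
utf8_bytes = text.encode("utf-8")
misread = utf8_bytes.decode("iso-8859-1")

# Both forms can be converted back to the original text.
recovered_from_ncr = html.unescape(as_ncr)
recovered_from_bytes = misread.encode("iso-8859-1").decode("utf-8")

print(as_ncr)
print(len(text), "characters became", len(utf8_bytes), "bytes")
print(recovered_from_ncr == text, recovered_from_bytes == text)
```

Under these assumptions the "strange boxes and symbols" are the individual
UTF-8 bytes shown as single-byte characters, so storing the text correctly
is a matter of decoding the submitted bytes as UTF-8 before writing them
to the database.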
Received on Friday, 18 August 2000 02:15:52 UTC
