W3C home > Mailing lists > Public > www-international@w3.org > July to September 2000

Re: Inputting Unicode form Browser

From: Martin J. Duerst <duerst@w3.org>
Date: Fri, 18 Aug 2000 14:28:50 +0900
Message-Id: <4.2.0.58.J.20000818142756.03948730@sh.w3.mag.keio.ac.jp>
To: "Stephen Toner" <Stephen.Toner@virtualaccess.com>, www-international@w3.org
Stephen - Can you say which browser (and which server) you used?
Can you give some examples of your pages?

Regards,  Martin.

At 00/08/18 08:13 +0900, Stephen Toner wrote:
>Hi,
>I have been trying to input characters from various languages into a form 
>in my browser.  I want to then store this text as unicode in a 
>database.  I have found that if a set the charset to a western language or 
>if i leave it blank, that ordinary ascii characters are read in as ASCII 
>and characters such as Japanese are converted to &#xxxxx; form.  Is this 
>unicode?
>When I set the charset to "UTF-8" characters are chaged into combinations 
>of strange boxes and symbols.  I thought that this was maybe the unicode 
>for multibyte characters simply being displayed as their single 
>bytes.  However some of these then aren't displayed correctly on output.
>I would appreciate any advice, as article that I have read seem to have 
>contradictions in that some say that &#xxxx; is the unicde for that 
>character and others say something else.  Also most articles seem to 
>ignore the inputting aspect.
>
>Thanks,
>Steohen
>
Received on Friday, 18 August 2000 02:15:52 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:55 GMT