
Re: Inputting Unicode from Browser

From: Mark Davis <markdavis@ispchannel.com>
Date: Fri, 18 Aug 2000 08:25:38 -0700
Message-ID: <399D5572.E746F4C4@ispchannel.com>
To: "Stephen Toner (by way of Martin J. Duerst <duerst@w3.org>)" <Stephen.Toner@virtualaccess.com>
CC: www-international@w3.org
Try the Unicode FAQ, at

"Stephen Toner (by way of Martin J. Duerst )" wrote:
> Hi,
> I have been trying to input characters from various languages into a form
> in my browser.  I then want to store this text as Unicode in a database.  I
> have found that if I set the charset to a western language, or if I leave it
> blank, ordinary ASCII characters are read in as ASCII and characters
> such as Japanese are converted to &#xxxxx; form.  Is this Unicode?
> When I set the charset to "UTF-8", characters are changed into combinations
> of strange boxes and symbols.  I thought that this was maybe the Unicode
> for multibyte characters simply being displayed as their single
> bytes.  However, some of these then aren't displayed correctly on output.
> I would appreciate any advice, as the articles that I have read seem to
> contradict each other: some say that &#xxxx; is the Unicode for that
> character and others say something else.  Also, most articles seem to ignore
> the inputting aspect.
> Thanks,
> Stephen
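
For what it's worth, the two behaviours described above can be reproduced in a
few lines. The &#xxxxx; form is an HTML numeric character reference: the number
is the character's Unicode code point written out in ASCII, not an encoding
such as UTF-8. The "strange boxes and symbols" are consistent with UTF-8 bytes
being displayed one byte at a time under a single-byte charset. The sketch
below is only an illustration under those assumptions; the helper names
(to_ncr, from_ncr) and the sample string are made up here and do not come from
the thread, and Latin-1 merely stands in for "a western charset".

```python
import html

def to_ncr(text):
    """Replace every non-ASCII character with a decimal numeric character
    reference (&#NNNNN;), as a browser may do when the form's charset
    cannot represent the character."""
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")

def from_ncr(text):
    """Turn numeric character references back into real characters."""
    return html.unescape(text)

sample = "日本語"  # hypothetical form input

# Case 1: western or blank charset -> references to Unicode code points.
ncr = to_ncr(sample)

# Case 2: charset UTF-8 -> three bytes per character for this sample.
utf8_bytes = sample.encode("utf-8")

# Reading those bytes as Latin-1 shows each byte as its own character,
# which is one way to end up with boxes and odd symbols on screen.
misread = utf8_bytes.decode("latin-1")

# No data was lost: reversing the wrong step recovers the text.
repaired = misread.encode("latin-1").decode("utf-8")

print(ncr)
print(len(utf8_bytes), len(misread))
print(repaired == sample)
```

Before storing the text, a server would normally want to decode whichever form
it receives (from_ncr for references, a UTF-8 decode for raw bytes), so the
database holds real characters rather than their escaped or misread spellings.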
Received on Friday, 18 August 2000 11:22:29 UTC
