Re: Inputting Unicode form Browser from Mark Davis on 2000-08-18 (www-international@w3.org from July to September 2000)

From: Mark Davis <markdavis@ispchannel.com>
Date: Fri, 18 Aug 2000 08:25:38 -0700
To: "Stephen Toner (by way of Martin J. Duerst <duerst@w3.org>)" <Stephen.Toner@virtualaccess.com>
CC: www-international@w3.org
Message-ID: <399D5572.E746F4C4@ispchannel.com>

Try the Unicode FAQ, at
http://www.unicode.org/unicode/faq/unicode_web.html

"Stephen Toner (by way of Martin J. Duerst )" wrote:
> 
> Hi,
> I have been trying to input characters from various languages into a form
> in my browser.  I want to then store this text as unicode in a database.  I
> have found that if a set the charset to a western language or if i leave it
> blank, that ordinary ascii characters are read in as ASCII and characters
> such as Japanese are converted to &#xxxxx; form.  Is this unicode?
> When I set the charset to "UTF-8" characters are chaged into combinations
> of strange boxes and symbols.  I thought that this was maybe the unicode
> for multibyte characters simply being displayed as their single
> bytes.  However some of these then aren't displayed correctly on output.
> I would appreciate any advice, as article that I have read seem to have
> contradictions in that some say that &#xxxx; is the unicde for that
> character and others say something else.  Also most articles seem to ignore
> the inputting aspect.
> 
> Thanks,
> Steohen

Received on Friday, 18 August 2000 11:22:29 UTC