Inputting Unicode form Browser

Hi,
I have been trying to input characters from various languages into a form 
in my browser.  I want to then store this text as unicode in a database.  I 
have found that if a set the charset to a western language or if i leave it 
blank, that ordinary ascii characters are read in as ASCII and characters 
such as Japanese are converted to &#xxxxx; form.  Is this unicode?
When I set the charset to "UTF-8" characters are chaged into combinations 
of strange boxes and symbols.  I thought that this was maybe the unicode 
for multibyte characters simply being displayed as their single 
bytes.  However some of these then aren't displayed correctly on output.
I would appreciate any advice, as article that I have read seem to have 
contradictions in that some say that &#xxxx; is the unicde for that 
character and others say something else.  Also most articles seem to ignore 
the inputting aspect.

Thanks,
Steohen

Received on Thursday, 17 August 2000 23:28:55 UTC