- From: Martin J. Duerst <duerst@w3.org>
- Date: Fri, 18 Aug 2000 14:28:50 +0900
- To: "Stephen Toner" <Stephen.Toner@virtualaccess.com>, www-international@w3.org
Stephen - Can you say which browser (and which server) you used? Can you give some examples of your pages? Regards, Martin. At 00/08/18 08:13 +0900, Stephen Toner wrote: >Hi, >I have been trying to input characters from various languages into a form >in my browser. I want to then store this text as unicode in a >database. I have found that if a set the charset to a western language or >if i leave it blank, that ordinary ascii characters are read in as ASCII >and characters such as Japanese are converted to &#xxxxx; form. Is this >unicode? >When I set the charset to "UTF-8" characters are chaged into combinations >of strange boxes and symbols. I thought that this was maybe the unicode >for multibyte characters simply being displayed as their single >bytes. However some of these then aren't displayed correctly on output. >I would appreciate any advice, as article that I have read seem to have >contradictions in that some say that &#xxxx; is the unicde for that >character and others say something else. Also most articles seem to >ignore the inputting aspect. > >Thanks, >Steohen >
Received on Friday, 18 August 2000 02:15:52 UTC