- From: Mark Davis <markdavis@ispchannel.com>
- Date: Fri, 18 Aug 2000 08:25:38 -0700
- To: "Stephen Toner (by way of Martin J. Duerst <duerst@w3.org>)" <Stephen.Toner@virtualaccess.com>
- CC: www-international@w3.org
Try the Unicode FAQ at http://www.unicode.org/unicode/faq/unicode_web.html

"Stephen Toner (by way of Martin J. Duerst)" wrote:
>
> Hi,
> I have been trying to input characters from various languages into a form
> in my browser. I want to then store this text as Unicode in a database. I
> have found that if I set the charset to a western language, or if I leave it
> blank, ordinary ASCII characters are read in as ASCII and characters
> such as Japanese are converted to &#xxxxx; form. Is this Unicode?
> When I set the charset to "UTF-8", characters are changed into combinations
> of strange boxes and symbols. I thought that this was maybe the Unicode
> for multibyte characters simply being displayed as their single
> bytes. However, some of these then aren't displayed correctly on output.
> I would appreciate any advice, as the articles that I have read seem to
> contradict each other: some say that &#xxxx; is the Unicode for that
> character and others say something else. Also, most articles seem to ignore
> the inputting aspect.
>
> Thanks,
> Stephen
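
For illustration, the two forms described in the question can be compared side by side. The sketch below is not taken from the FAQ; it is a minimal Python example, and the helper name numeric_char_refs and the sample text are made up for this purpose. It assumes that the &#xxxxx; form is a decimal numeric character reference (the number is the character's Unicode code point), and that the "boxes and symbols" are the UTF-8 bytes of the same character being shown as if each byte were a single-byte (here Latin-1) character.

```python
import html


def numeric_char_refs(text: str) -> str:
    """Replace each non-ASCII character with a decimal &#N; reference.

    Hypothetical helper: it mimics what a browser may do when the form's
    charset cannot represent the character that was typed.
    """
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


sample = "日本"  # two Japanese characters, U+65E5 and U+672C

# Form 1: numeric character references. The number is the code point.
refs = numeric_char_refs(sample)   # '&#26085;&#26412;'
restored = html.unescape(refs)     # back to the original text

# Form 2: UTF-8, a byte encoding of the same code points.
utf8_bytes = sample.encode("utf-8")  # three bytes per character here

# Showing those bytes as if they were Latin-1 gives one odd symbol per byte.
misread = utf8_bytes.decode("latin-1")

# Decoding the same bytes as UTF-8 recovers the text unchanged.
recovered = misread.encode("latin-1").decode("utf-8")

print(refs)
print(utf8_bytes)
print(len(sample), len(misread))
print(restored == sample, recovered == sample)
```

Under these assumptions both forms carry the same Unicode characters. What matters for storing the text is decoding the submitted form data with the charset the page was actually sent in, and converting any numeric character references back to characters before writing to the database.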
Received on Friday, 18 August 2000 11:22:29 UTC