Try the Unicode FAQ, at http://www.unicode.org/unicode/faq/unicode_web.html "Stephen Toner (by way of Martin J. Duerst )" wrote: > > Hi, > I have been trying to input characters from various languages into a form > in my browser. I want to then store this text as unicode in a database. I > have found that if a set the charset to a western language or if i leave it > blank, that ordinary ascii characters are read in as ASCII and characters > such as Japanese are converted to &#xxxxx; form. Is this unicode? > When I set the charset to "UTF-8" characters are chaged into combinations > of strange boxes and symbols. I thought that this was maybe the unicode > for multibyte characters simply being displayed as their single > bytes. However some of these then aren't displayed correctly on output. > I would appreciate any advice, as article that I have read seem to have > contradictions in that some say that &#xxxx; is the unicde for that > character and others say something else. Also most articles seem to ignore > the inputting aspect. > > Thanks, > SteohenReceived on Friday, 18 August 2000 11:22:29 GMT
This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:55 GMT