- From: Chris Wendt <christw@microsoft.com>
- Date: Thu, 19 Apr 2001 14:11:24 -0700
- To: "Donatella Caridi" <carididonatella@tiscalinet.it>, <www-international@w3.org>
If the page containing the <form> is in UTF-8, your server will see the submitted text in UTF-8. Works in the 2 leading browsers as of version 4. Numeric character references are always Unicode code points. For better compatibility use the decimal version only. FYI as of Internet Explorer 5 you can also use a hidden filed "_charset_" to let the browser tell you the charset, just in case your form page is in English and you want to avoid tagging it. -----Original Message----- From: Donatella Caridi [mailto:carididonatella@tiscalinet.it] Sent: Thursday, April 19, 2001 1:13 PM To: www-international@w3.org Subject: Input multilingual text in a form... Hello all, If someone input characters from various languages into a form in a browser and form's encoding is in UTF-8, what is the encoding of text insert? Is UTF-8 or something else? I want to store this text as unicode (UTF-8) in a database, but if this text is not UTF-8, i need a converter from all other encoding to UTF-8? Numeric reference of characters in ISO-8859-1 is the same of Unicode, not only for ASCII character (0-127)? I think no but i'm not sure. Any link to further information would be greatly appreciated. Thank you! Donatella
Received on Monday, 23 April 2001 16:28:32 UTC