RE: Input multilingual text in a form...

If the page containing the <form> is in UTF-8, your server will see the
submitted text in UTF-8.
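
A minimal sketch of the server side, assuming the request body arrives as
an application/x-www-form-urlencoded string. The function name
parse_utf8_form and the field name "text" are made up for illustration;
parse_qs is from the Python standard library.

```python
from urllib.parse import parse_qs

def parse_utf8_form(body):
    """Decode a urlencoded form body as UTF-8.

    Returns a dict mapping field names to lists of values, or None if
    the percent-escaped bytes are not well-formed UTF-8.
    """
    try:
        # errors="strict" rejects malformed input instead of silently
        # substituting replacement characters.
        return parse_qs(body, keep_blank_values=True,
                        encoding="utf-8", errors="strict")
    except UnicodeDecodeError:
        return None

# "%C3%A9" is e-acute and "%E4%B8%AD" is a CJK character, both in UTF-8.
fields = parse_utf8_form("text=%C3%A9+%E4%B8%AD")
```

Rejecting malformed input up front is one way to make sure that only
valid UTF-8 reaches the database.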

This works in both leading browsers (Netscape Navigator and Internet
Explorer) from version 4 onward.

Numeric character references are always Unicode code points, whatever
the encoding of the page. For better compatibility, use only the decimal
form (&#233; rather than &#xE9;).
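
To illustrate, here is a small sketch using only the Python standard
library. The helper name to_decimal_ncr is hypothetical. It also shows
why the numbers look the same for ISO-8859-1: the first 256 Unicode code
points coincide with that character set.

```python
import html

def to_decimal_ncr(text):
    """Replace each non-ASCII character with a decimal reference."""
    # The "xmlcharrefreplace" error handler emits &#NNN; where NNN is
    # the decimal Unicode code point of the character.
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")

encoded = to_decimal_ncr("caf\u00e9 \u4e2d")

# Resolving the references gives back the original characters.
decoded = html.unescape(encoded)

# Every ISO-8859-1 byte value equals the Unicode code point of the
# character it represents.
latin1_matches_unicode = (
    bytes(range(256)).decode("latin-1") == "".join(map(chr, range(256)))
)
```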

FYI, as of Internet Explorer 5 you can also use a hidden field named
"_charset_" to let the browser tell you the charset, just in case your
form page is in English and you want to avoid tagging it.
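
A rough sketch of how a server might use that field, assuming the
browser has filled in "_charset_" with the name of the encoding it used.
The function name decode_form and the "utf-8" fallback are my own
choices for illustration, not part of any specification.

```python
import codecs
from urllib.parse import parse_qs

def decode_form(body, default="utf-8"):
    """Decode a urlencoded form body using its _charset_ field."""
    # First pass: Latin-1 maps every byte to a character, so this never
    # fails and the ASCII-only _charset_ value comes through intact.
    probe = parse_qs(body, keep_blank_values=True, encoding="latin-1")
    charset = probe.get("_charset_", [default])[0] or default

    # Fall back to the default if the reported name is not a codec
    # that Python knows about.
    try:
        codecs.lookup(charset)
    except LookupError:
        charset = default

    # Second pass: decode all fields with the reported charset.
    fields = parse_qs(body, keep_blank_values=True,
                      encoding=charset, errors="replace")
    return charset, fields

charset, fields = decode_form("name=caf%E9&_charset_=windows-1252")
```

Once decoded this way, the values are ordinary Unicode strings and can
be written out as UTF-8 regardless of what the browser sent.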


-----Original Message-----
From: Donatella Caridi [mailto:carididonatella@tiscalinet.it] 
Sent: Thursday, April 19, 2001 1:13 PM
To: www-international@w3.org
Subject: Input multilingual text in a form...


Hello all,

If someone inputs characters from various languages into a form in a
browser and the form's encoding is UTF-8, what is the encoding of the
inserted text? Is it UTF-8 or something else?
I want to store this text as Unicode (UTF-8) in a database, but if the
text is not UTF-8, do I need a converter from all other encodings to
UTF-8?

Are the numeric references for characters in ISO-8859-1 the same as in
Unicode, and not only for the ASCII characters (0-127)?
I think not, but I'm not sure.

Any link to further information would be greatly appreciated.
Thank you!

Donatella

Received on Monday, 23 April 2001 16:28:32 UTC