RE: Input multilingual text in a form...

From: Chris Wendt <christw@microsoft.com>
Date: Thu, 19 Apr 2001 14:11:24 -0700
Message-ID: <D524A0BD0DE5FF4E951B6EC5F919FD7A029AE492@red-msg-01.redmond.corp.microsoft.com>
To: "Donatella Caridi" <carididonatella@tiscalinet.it>, <www-international@w3.org>
If the page containing the <form> is in UTF-8, your server will see the
submitted text in UTF-8.
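
As a rough illustration of the point above, here is a minimal Python sketch of what the server side might do. The field name "comment" and the sample text are made up for the example, and the request body is simulated with urlencode rather than coming from a real browser.

```python
from urllib.parse import parse_qs, urlencode

# Simulate what a browser sends for a form on a UTF-8 page: each character's
# UTF-8 bytes, percent-encoded. The field name "comment" is hypothetical.
body = urlencode({"comment": "Grüße, Καλημέρα, 日本語"}, encoding="utf-8")

# Server side: decode the percent-escapes as UTF-8. errors="strict" makes
# input that is not valid UTF-8 raise an error instead of being silently replaced.
fields = parse_qs(body, encoding="utf-8", errors="strict")
comment = fields["comment"][0]
```

If the decoded value round-trips like this, the text can be stored as UTF-8 without any further conversion step.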

This works in the two leading browsers as of version 4.

Numeric character references are always Unicode code points. For better
compatibility use the decimal version only.
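
To make this concrete, here is a small Python sketch. The helper name to_decimal_ncr is invented for the example; html.unescape is the standard library function. It also shows why the numbers happen to coincide with ISO-8859-1 byte values for characters in that set: the first 256 Unicode code points are the same as ISO-8859-1.

```python
import html

def to_decimal_ncr(text):
    """Replace every non-ASCII character with a decimal numeric character reference."""
    return "".join(ch if ord(ch) < 128 else f"&#{ord(ch)};" for ch in text)

# The number in the reference is the Unicode code point of the character.
encoded = to_decimal_ncr("é €")

# Decoding the references gives the original text back.
decoded = html.unescape(encoded)

# For a character that exists in ISO-8859-1, the code point equals the
# ISO-8859-1 byte value. The euro sign is not in ISO-8859-1 at all.
latin1_byte = "é".encode("latin-1")[0]
```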

FYI, as of Internet Explorer 5 you can also use a hidden field named
"_charset_" to let the browser tell you the charset, just in case your
form page is in English and you want to avoid tagging it.
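
A server could use that field roughly as sketched below, assuming the browser fills "_charset_" with the name of the encoding it used for the submission. The function name decode_form, the field name "name", and the UTF-8 fallback are choices made for this example, not part of any particular server API.

```python
import codecs
from urllib.parse import parse_qs, urlencode

def decode_form(body, default="utf-8"):
    """Decode a urlencoded form body using the charset named in its _charset_ field."""
    # First pass: Latin-1 maps every byte to a character, so this never fails,
    # and the _charset_ value itself is plain ASCII.
    raw = parse_qs(body, encoding="latin-1")
    charset = raw.get("_charset_", [default])[0]
    try:
        codecs.lookup(charset)
    except LookupError:
        # Unknown or unsupported charset name: fall back to the default.
        charset = default
    # Second pass: decode every field with the charset the browser reported.
    return charset, parse_qs(body, encoding=charset, errors="replace")

# Simulated submission from a page the browser treated as windows-1252.
body = urlencode({"_charset_": "windows-1252", "name": "Zoë"}, encoding="windows-1252")
charset, fields = decode_form(body)
```

Parsing twice keeps the sketch simple. A real handler would more likely work on the raw bytes once.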

-----Original Message-----
From: Donatella Caridi [mailto:carididonatella@tiscalinet.it] 
Sent: Thursday, April 19, 2001 1:13 PM
To: www-international@w3.org
Subject: Input multilingual text in a form...

Hello all,

If someone inputs characters from various languages into a form in a
browser and the form's encoding is UTF-8, what is the encoding of the
inserted text? Is it UTF-8 or something else?
I want to store this text as Unicode (UTF-8) in a database, but if this
text is not UTF-8, do I need a converter from all other encodings to UTF-8?

Are numeric character references the same in ISO-8859-1 as in Unicode,
and not only for ASCII characters (0-127)?
I think not, but I'm not sure.

Any link to further information would be greatly appreciated.
Thank you!

Received on Monday, 23 April 2001 16:28:32 UTC
