W3C home > Mailing lists > Public > www-international@w3.org > July to September 2001

Java and Unicode

From: souravm <souravm@infy.com>
Date: Wed, 26 Sep 2001 11:43:31 +0530
Message-ID: <755FA95DB839D211856B0008C7287D930DE6A84E@kecmsg02.ad.infosys.com>
To: <www-international@w3.org>
Hi All,

Java supposed to store all strings internally in Unicode. In that case
what is the encoding form of Unicode (i.e. UTF-8/UTF-16/UTF-32) used ?

To be more specific -

Let us consider a string, strInput, which contains characters encoded
using Shift_JIS as encoding type.
Following code is supposed to convert it to Unicode.

String strConv = new String(strInput.getBytes(), "Shift_JIS");

My query is what would be the encoding type of the string strConv -
UTF-8/UTF-16 or UTF-32 ?

Regards,
Sourav
Received on Wednesday, 26 September 2001 02:19:37 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:57 GMT