W3C home > Mailing lists > Public > www-international@w3.org > July to September 2001

RE: Java and Unicode

From: Richard, Francois M <Francois.M.Richard@usa.xerox.com>
Date: Wed, 26 Sep 2001 06:10:54 -0400
To: "'souravm'" <souravm@infy.com>
Cc: "'www-international@w3.org'" <www-international@w3.org>
Message-id: <B08661D21F0FD311A21A00805FC7D65001EA34D2@usa0845ms1.svcdoc.mc.xerox.com>
UTF-16 definitely. And I believe if endianness  is an issue, it is going to
be utf-16BE.
Francois

> -----Original Message-----
> From: souravm [mailto:souravm@infy.com]
> Sent: Wednesday, September 26, 2001 2:14 AM
> To: 
> Subject: Java and Unicode
> 
> 
> Hi All,
> 
> Java supposed to store all strings internally in Unicode. In that case
> what is the encoding form of Unicode (i.e. UTF-8/UTF-16/UTF-32) used ?
> 
> To be more specific -
> 
> Let us consider a string, strInput, which contains characters encoded
> using Shift_JIS as encoding type.
> Following code is supposed to convert it to Unicode.
> 
> String strConv = new String(strInput.getBytes(), "Shift_JIS");
> 
> My query is what would be the encoding type of the string strConv -
> UTF-8/UTF-16 or UTF-32 ?
> 
> Regards,
> Sourav
> 
Received on Wednesday, 26 September 2001 06:11:17 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:57 GMT