question from bruce.wallman@us.pwcglobal.com on 2002-04-11 (www-international@w3.org from April to June 2002)

From: <bruce.wallman@us.pwcglobal.com>
Date: Thu, 11 Apr 2002 09:01:18 -0400
To: www-international@w3.org
Message-ID: <OFE1758C92.78FB8DBE-ON85256B98.0046DA19@nam.pwcinternal.com>

Hello. I need help in reading Korean (and Japanese) characters
arriving at the server via HTTP. The data is in response to text input
fields on an HTML form. I am receiving some characters that in the
HTTP input stream show as things like %2354466;

I have found that the %23 is a # sign and that subtracting 65536 from
the remaining 5 character number and then taking the ChrW of the
result gives me the right ideograph for the many that I have tested.
Is this all there is to it? Are there limits to the algorithm such that the
subtract 65536 algorithm only works for a certain range of these
characters and some other calculation is needed for others?
Is the same process that is working on a Korean machine likely
to work for a Japanese one. I am supplying the browser with
character set Meta tags and go through the same routines when
receiving DBCS languages.

Thanks ahead.

Regards

----------------------------------------------------------------
The information transmitted is intended only for the person or entity to
which it is addressed and may contain confidential and/or privileged
material.  Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipient is prohibited.   If you received
this in error, please contact the sender and delete the material from any
computer.

Received on Thursday, 11 April 2002 09:02:03 UTC