W3C home > Mailing lists > Public > www-international@w3.org > October to December 2001

RE: ISO-8859-1

From: Paul Deuter <Paul.Deuter@plumtree.com>
Date: Mon, 1 Oct 2001 08:47:01 -0700
Message-ID: <C7F00D7948B8E4468BB330152C6BA4E00AACE7@cstaex03.USIPLUMTREE.AD>
To: "souravm" <souravm@infy.com>, <www-international@w3.org>
That is a tempting challenge.  But of course it depends on specifics
that you do not mention such as the browser you are using, the web
server that is running and the underlying operating systems for both.
Here is why:
The browser will normally send back characters encoded in the charset
that was specified by the server in the HTTP content-type header, or in
the meta tag.  However in your case you are entering characters outside
that repetoire - so it is unclear how the browser will handle that case.
It also could be that the HTTP content type header is actually Shift-Jis
even though your meta tag is 8859-1.
Here are the steps to figure this out:
1.  Download the sniffer calle "Ethereal" from www.ethereal.com
<http://www.ethereal.com>  .
2.  Make sure your browser is running on a different machine from the
web server.
3.  Install Ethereal on your browser machine (it takes about 1 minute)
4.  Start a "Capture" in ethereal specifying "ip" as the filter.
5.  Run your test
6.  Stop the capture and find one of the HTTP packets between the
browser and the server and right click and choose "Follow TCP Stream"
This will show you a trace of your session.  You will be able to see the
HTTP content-type header and also determine the charset of the
characters send down from the server.  Finally you will be able to see
how the browser sends the response back to the server.  With this tool,
you will be able to see what is actually sent over the wire and it will
be a tool for debugging character set issues.

Paul Deuter 
Internationalization Manager 
Plumtree Software 

-----Original Message-----
From: souravm [mailto:souravm@infy.com]
Sent: Monday, October 01, 2001 5:42 AM
To: www-international@w3.org
Subject: ISO-8859-1

Hi ,
Here is a small jsp code which I used for proof of concept for a multi
lingual project.
The interesting observation is that even if I put ISO-8859-1 as charset
in the meta tag it works for all languages. I tested it for Japanese,
Korean, Arabic and French (using IME on Windows 2000).
As far as I know ISO-8859-1 is supposed to cover only western european
languages. I'm suprised to find that it even supports the Asian
Can anyone please explain me how can it support the Asian language ?
The jsp file name is i18na.jsp
<%@ page import="java.util.*"%>
<%@ page import="java.io.*"%>
    String ucStr = request.getParameter("jap");
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=ISO-8859-1">
<BODY topmargin="0" marginheight="0" leftmargin="0" marginwidth="0">
String  = <%= ucStr%>
<FORM name="frmText"
<TABLE  border="0" cellspacing="0" cellpadding="5" width="200">
                <TD><INPUT TYPE="text" NAME="jap" SIZE="30"
                <TD><INPUT TYPE="submit" NAME="Submit"

Received on Monday, 1 October 2001 11:45:55 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 22:40:45 UTC