W3C home > Mailing lists > Public > www-international@w3.org > July to September 2005

Encoding in the HTML/HTTP header

From: Michael Monaghan <Michael.Monaghan@Sun.COM>
Date: Tue, 06 Sep 2005 16:25:50 -0400
To: www-international@w3.org
Message-id: <431DFB4E.5020805@sun.com>

Hi,

I'm a little confused. I'm testing a form on a page on an internal 
server to see that it handles non-western text properly.

It seems to handle everything fine, - the View/Encoding option in my 
browser tells me that it's in UTF-8, as it should be.

However neither the HTML nor HTTP headers declare any encoding.

I snooped the HTTP header by telnetting the server on port 80:

telnet server.domain 80
Trying 10.51.15.50...
Connected to server.domain.sun.com.
Escape character is '^]'.
GET /file.html HTTP/1.0

HTTP/1.1 200 OK
Server: Netscape-Enterprise/4.1
Date: Tue, 06 Sep 2005 20:03:52 GMT
Content-type: text/html
Content-length: 1222
Connection: close

<html>
.....etc....etc.

I just don't understand how the browser knows to treat the page as 
UTF-8, when I can find where it's declared. My browser default encoding 
is set to iso-1.

Thoughts please.

thanks,

-mm
Received on Tuesday, 6 September 2005 20:25:59 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:05 GMT