- From: आशीष शुक्ला \ <wahjava@gmail.com>
- Date: Sat, 3 Jun 2006 11:58:44 +0530
- To: www-html@w3.org
On 6/3/06, Philip TAYLOR <P.Taylor@rhul.ac.uk> wrote:
>
>
> आशीष शुक्ल> "Wah Java !!" wrote:
>
> > If UA (user agent), finds a "Content-Type" in <meta> tag in HTML document,
> > it should use that to identify the document's character encoding,
> > because it is a part of the document. The server's reply should only
> > be considered when document doesn't explicitly states its character
> > encoding.
>
> Much as I think your argument has merit, I cannot see how you
> can resolve the following paradox : suppose, in some as-yet
> unknown encoding (say, ISO-9999-9), the character positions
> which in ISO-8859-1 correspond to the letters "M", "E", "T"
> and "A" correspond instead to the letters "B", "O", "D" and "Y".
> Now the server says that the document is in ISO-8859-1,
> so when the UA sees
>
> <META http-equiv="content-type" content="text/html; charset=iso-9999-9">
>
> it interprets the META directive as you would wish. But in so
> doing, it starts to parse the document on the basis of it being
> expressed in ISO-9999-9, whereupon it discovers that there wasn't
> a META directive at all, there was, rather, a(n ill-formed) BODY
> tag. But because it now knows there /was/ no META directive, it
> parses using ISO-8859-1. But that means there IS a META
> directive. And so on. I'm sure you see the problem ...
>
> Philip Taylor
>
-- begin excerpt --
To address server or configuration limitations, HTML documents may
include explicit information about the document's character encoding;
the META element can be used to provide user agents with this
information.
For example, to specify that the character encoding of the current
document is "EUC-JP", a document should include the following META
declaration:
<META http-equiv="Content-Type" content="text/html; charset=EUC-JP">
The META declaration must only be used when the character encoding is
organized such that ASCII-valued bytes stand for ASCII characters (at
least until the META element is parsed). META declarations should
appear as early as possible in the HEAD element
-- end excerpt --
Copied from: http://www.w3.org/TR/html4/charset.html#h-5.2.2
An excerpt from HTML 4.01 specification. So in other words you've to
organize your content such that your content till <META> tag is ASCII.
I think this is what this excerpt means.
Thanks
Ashish Shukla
--
Ashish Shukla "Wah Java !!"
आशीष शुक्ल>
,= ,-_-. =.
((_/)o o(\_))
`-'(. .)`-'
\_/
My blah, blah, blah at http://wahjava.blogspot.com/
My webpages at http://www.geocities.com/wah_java_dotnet/
My GPG Fingerprint: BBA9 AD7D BA71 61EB BE46 8CF5 E44A C663 A03F 4261
My GPG keys at
http://keyserv.nic-se.se:11371/pks/lookup?op=get&search=0xA03F4261
--
Supercomputers are for people too rich and too stupid to design
efficient algorithms -- Steven Skiena, Department of Computer Science,
SUNY Stony Brook.
Received on Saturday, 3 June 2006 06:28:55 UTC