- From: Claudio Pellegrino <cloaked01@claudio-pellegrino.de>
- Date: Wed, 1 Nov 2006 18:27:10 +0100
- To: www-html@w3.org
Hello,
I have a small problem with interpreting part of the HTML 4.01 specs.
It is about character encoding assumptions from the user agent point
of view.
Assume the following scenario:
1. Let doc.html be an HTML 4.01 document served as "text/html".
2. The content of doc.html is encoded with UTF-8, and
the meta element declaration correctly says so.
3. The document also contains a "script" element having
type="text/javascript" and src="ext.js", but no
"charset" attribute.
4. The HTTP server does not serve a "charset" header,
neither for "doc.html" nor for "ext.js".
Given the above scenario, here are my two questions:
According to the HTML 4.01 specs, is the user agent
a) *required* to assume a specific character encoding
for the content of "ext.js"? If so, which one and
why?
b) *allowed* to ignore the list of three encoding
priorities at [2] and use heuristics instead
for determining the content encoding of "ext.js"?
Some reasons why I'm clueless about it:
- The "script" element specification [1] doesn't
specify a default behaviour for the user agent.
- The page about encodings [2] cites a list of three
encodings which may be applicable for "doc.html" -
but not necessarily for "ext.js" since the page
only mentions the "document's character encoding".
- The "charset" attribute description [3] doesn't
specify any implied default value or behaviour.
- I see similar problems in XHTML 1.0 [4].
- I feel I have checked with the appropriate sources,
including all the links within [5] and, of course,
the list archive, but found little more than
authoring advice ...
Is there a part of the specs I have missed?
Any pointer would be appreciated. Thanks a lot!
Regards,
Claudio Pellegrino
[1] http://www.w3.org/TR/html4/interact/scripts.html#h-18.2.1
[2] http://www.w3.org/TR/html4/charset.html#h-5.2.2
[3] http://www.w3.org/TR/html4/struct/links.html#adef-charset
[4] http://www.w3.org/TR/xhtml1/#C_9
[5] http://www.w3.org/International/articles/#chars
Received on Thursday, 2 November 2006 02:36:59 UTC