W3C home > Mailing lists > Public > www-international@w3.org > October to December 2002

Re: Question about Euro character...

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Tue, 12 Nov 2002 06:53:25 +0100
To: "Paul Deuter" <Paul.Deuter@plumtree.com>
Cc: <www-international@w3.org>
Message-ID: <3ded96ca.210566037@smtp.bjoern.hoehrmann.de>

* Paul Deuter wrote:
>I have an HTML file which has no character set identification whatsoever
>which contains both a hex 0x80 byte as well as the following NCR &#x0080;
>
>Both are displayed in IE as the Euro symbol.  Why is this?
>
>Since the default encoding for HTML is 8859-1 and not Windows-1252, the
>0x80 should not be seen as a Euro symbol.

HTML 4 says user agents must not assume any default encoding, they may
use heuristics to determine the encoding of the document and if there is
a 0x80 in the stream, it cannot be ISO-8859-1, thus it is legal to
assume Windows-1252. Using &#x0080; is strictly speaking invalid and
thus rendering is undefined.
Received on Tuesday, 12 November 2002 00:53:16 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:59 GMT