Re: 8-bit chars in US-ASCII documents (was Re: Embarrassing typo!)

From: Terje Bless <link@tss.no>
Date: Fri, 27 Apr 2001 09:27:28 +0200
To: Liam Quinn <liam@htmlhelp.com>
Cc: "'gerald et al.'" <www-validator@w3.org>
Message-ID: <20010427100549-b01010701-19a0a694@>

On 21.04.01 at 13:21, Liam Quinn <liam@htmlhelp.com> wrote:

>The 8-bit character is an error, but it's an error in a similar way to
>including <a href="foo bar"> in an HTML document.  URIs can't contain
>spaces, but HTML validators don't complain.

I'm leaning towards Björn's interpretation here. An 8-bit character in a
7-bit US-ASCII document is an error: it should be reported, and it makes the
interpretation of the rest of the document entity suspect. Maybe this is
really a double-byte encoding in which most of the most significant bytes
just happen to have their most significant bit off.

If I can figure out some way to catch this I think I'll make it generate an
error or at the very least a warning.
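
One possible way to catch it, as a rough sketch only and not anything the
validator actually does: scan the raw bytes before any decoding and flag
every byte with its most significant bit set, since US-ASCII only defines
0x00-0x7F. The function name find_non_ascii_bytes and the limit parameter
are made up for illustration, and Python is used here purely as a
convenient notation.

```python
from typing import List, Tuple


def find_non_ascii_bytes(data: bytes, limit: int = 10) -> List[Tuple[int, int, int]]:
    """Return (line, column, byte value) for each byte with the high bit set.

    Hypothetical helper: `data` is the raw document entity as received,
    and `limit` caps how many offending bytes are collected.
    Lines and columns are 1-based and counted in bytes, not characters.
    """
    hits = []
    line, col = 1, 0
    for b in data:
        if b == 0x0A:          # LF starts a new line
            line += 1
            col = 0
            continue
        col += 1
        if b & 0x80:           # MSb set: not a valid US-ASCII byte
            hits.append((line, col, b))
            if len(hits) >= limit:
                break
    return hits


if __name__ == "__main__":
    sample = b"<title>Bj\xf6rn</title>\n"
    for line, col, value in find_non_ascii_bytes(sample):
        print(f"line {line}, column {col}: byte 0x{value:02X} is not US-ASCII")
```

Whether each hit becomes an error or only a warning is a separate policy
decision; the scan just reports where the offending bytes are.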
Received on Friday, 27 April 2001 04:05:58 UTC