W3C home > Mailing lists > Public > www-validator@w3.org > July 2001

Re: charset parameter

From: Martin Duerst <duerst@w3.org>
Date: Fri, 27 Jul 2001 11:37:27 +0900
Message-Id: <4.2.0.58.J.20010727113458.05da22d0@sh.w3.mag.keio.ac.jp>
To: Nick Kew <nick@webthing.com>, Terje Bless <link@pobox.com>
Cc: W3C Validator <www-validator@w3.org>
At 09:08 01/07/26 +0100, Nick Kew wrote:
> >       and "I'm sorry, but that Character Encoding is not in my
> > database. I cannot Validate this document."
>
>Hmmm ..
>
>Would it not be fair to say US-ASCII is a subset of every other encoding
>that might be considered as a sefault (certainly iso-8859-1 and utf-8)?
>so that a document that validates to it should always be fine?

That's some of the things we have thought about. Another is to go even
further, change all the non-ASCII bytes into '?', and run it through
the validator. The result would then say 'well, this looks like it
may validate if you make sure it comes with the correct character encoding.
We already have the code to do that for 'charset's set by hand.

Regards,  Martin.
Received on Friday, 27 July 2001 22:57:06 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 25 April 2012 12:13:59 GMT