Re: charset parameter

From: Martin Duerst (duerst@w3.org)
Date: Thu, Jul 26 2001

  • Next message: Bjoern Hoehrmann: "Re: charset parameter"

    Message-Id: <4.2.0.58.J.20010727113458.05da22d0@sh.w3.mag.keio.ac.jp>
    Date: Fri, 27 Jul 2001 11:37:27 +0900
    To: Nick Kew <nick@webthing.com>, Terje Bless <link@pobox.com>
    From: Martin Duerst <duerst@w3.org>
    Cc: W3C Validator <www-validator@w3.org>
    Subject: Re: charset parameter
    
    At 09:08 01/07/26 +0100, Nick Kew wrote:
    > >       and "I'm sorry, but that Character Encoding is not in my
    > > database. I cannot Validate this document."
    >
    >Hmmm ..
    >
    >Would it not be fair to say US-ASCII is a subset of every other encoding
    >that might be considered as a sefault (certainly iso-8859-1 and utf-8)?
    >so that a document that validates to it should always be fine?
    
    That's some of the things we have thought about. Another is to go even
    further, change all the non-ASCII bytes into '?', and run it through
    the validator. The result would then say 'well, this looks like it
    may validate if you make sure it comes with the correct character encoding.
    We already have the code to do that for 'charset's set by hand.
    
    Regards,  Martin.