- From: Martin Duerst <duerst@w3.org>
- Date: Sat, 26 Oct 2002 09:39:56 +0900
- To: Terje Bless <link@pobox.com>, W3C Validator <www-validator@w3.org>
- Cc: Chris Lilley <chris@w3.org>
Hello Terje, Many thanks for working out the details. Two suggestions: - Print the actual content type (unless we do that anyway, don't remember exactly) - Say something about override (I guess this is particularly useful for uploads). Regards, Martin. At 19:50 02/10/25 +0200, Terje Bless wrote: >Martin Duerst <duerst@w3.org> wrote: > > >The current message is confusing for somebody using file > >upload. There is probably a better way to explain what happens. > >The current code in CVS now reads: > > if ($File->{'Is Upload'}) { > my @_source = ('sent by your web browser', 'browser send'); > } else { > my @_source = ('returned by your web server', 'server return'); > } > &add_warning($File, <<".EOF."); > <em>Note:</em> > The HTTP Content-Type field $_source[0] did not contain > a "charset" attribute, but the Content-Type was one of the XML text/* > sub-types (<code>$File->{ContentType}</code>). The relevant > specification (RFC 3023) specifies a strong default of "us-ascii" for > such documents so we will use this value regardless of any encoding > you may have indicated elsewhere. If you would like to use a > different encoding, you should arrange to have your $_source[1] this > new encoding information. >.EOF. > >IOW, in the case you reported it would have returned: > > The HTTP Content-Type field sent by your web browser did not contain a > "charset" attribute, but the Content-Type was one of the XML text/* > sub-types (text/xml). The relevant specification (RFC 3023) specifies a > strong default of "us-ascii" for such documents so we will use this > value regardless of any encoding you may have indicated elsewhere. If > you would like to use a different encoding, you should arrange to have > your web browser send this new encoding information. > >Is that a little clearer? > > > >>MS IE 6 sends an incorrect, sniffed MIME type of text/xml when > >>uploading the same file presumably because it sees the xml declaration. > >>I have not tried the exhaustive tests (removal of xml declaration, > >>inclusion of the string '<html' or ',HTML' in the first 256 bytes, > >>perhaps inside a comment) etc to try and describe the sniffing > >>algorithm correctly. > >And this is precisely why sniffing and guessing should be avoided at all >costs. There is also the issue that as long as browsers accept sloppy >markup etc. there will never be an incentive for authors to clean up their >act. This may be debatable for the case of browsers, but for the Validator, >it's its _job_ to be strict abiout these things! The Validator isn't there >to give people a fuzzy good feeling; it's there to help people make >absolutely sure their documents satisfy the most basic level of quality. > > >-- >When I decide that the situation is unacceptable for me, I'll simply fork >the tree. I do _not_ appreciate being enlisted into anyone's holy wars, >so unless you _really_ want to go _way_ up in my personal shitlist don't >play politics in my vicinity. -- Alexander Viro on lkml
Received on Friday, 25 October 2002 20:45:09 UTC