- From: Michael(tm) Smith <mike@w3.org>
- Date: Fri, 24 Jul 2009 09:43:53 +0900
- To: Eric Bierman <bierman@annmanor.ca>
- Cc: www-validator@w3.org
Eric Bierman <bierman@annmanor.ca>, 2009-07-21 14:47 -0400:
> Validator complains about missing character encoding definition and says it
> used the default utf-8. But that was what was actually specified in a new
> <meta charset="utf-8" /> tag.
>
> I understand this is an experimental html 5 checker, but this is rather
> basic html 5.
>
> Document is at http://www.annmanor.ca/news.shtml
I raised a new bug:
http://www.w3.org/Bugs/Public/show_bug.cgi?id=7135
I attached there a patch with a proposed fix:
Index: ./httpd/cgi-bin/check
===================================================================
RCS file: /sources/public/validator/httpd/cgi-bin/check,v
retrieving revision 1.673
diff -u -r1.673 check
--- ./httpd/cgi-bin/check 30 Jun 2009 18:49:07 -0000 1.673
+++ ./httpd/cgi-bin/check 24 Jul 2009 00:37:16 -0000
@@ -534,14 +534,14 @@
my ($override, undef) = split(/\s/, $File->{Opt}->{Charset}, 2);
$File->{Charset}->{Override} = lc($override);
- if ($File->{Opt}->{FB}->{Charset}) { # charset fallback mode
+ if ($File->{Opt}->{FB}->{Charset} and $File->{DOCTYPE} ne "HTML5") { # charset fallback mode
unless ($File->{Charset}->{Use}) { # no charset detected, actual fallback
&add_warning('W02', {W02_charset => $File->{Charset}->{Override}});
$File->{Tentative} |= T_ERROR; # Tag it as Invalid.
$File->{Charset}->{Use} = $File->{Charset}->{Override};
}
} else { # charset "hard override" mode
- if (! $File->{Charset}->{Use}) { # overriding "nothing"
+ if (! $File->{Charset}->{Use} and $File->{DOCTYPE} ne "HTML5") { # overriding "nothing"
&add_warning('W04', {W04_charset => $File->{Charset}->{Override}, W04_override => TRUE});
$File->{Tentative} |= T_ERROR;
$File->{Charset}->{Use} = $File->{Charset}->{Override};
I haven't tested that and don't know whether it's correct Perl syntax,
but the idea is simply to skip the validator's own encoding check
for doctype=HTML5 pages, because the HTML5 checker does its own
encoding check.
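
For anyone reviewing the patch, here is a minimal standalone sketch of the two guarded conditions, assuming the hash layout seen in the diff. The helper name charset_warning and the 'HTML 4.01' doctype string are hypothetical, not part of the validator, and branches not shown in the diff are left out.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of ./httpd/cgi-bin/check: returns the warning
# code the patched conditions would raise, or '' when no warning is raised.
sub charset_warning {
    my ($File) = @_;
    my $is_html5 = ($File->{DOCTYPE} // '') eq 'HTML5';
    my $detected = $File->{Charset}->{Use};

    # Charset fallback mode, skipped for HTML5 documents.
    if ($File->{Opt}->{FB}->{Charset} and not $is_html5) {
        return $detected ? '' : 'W02';
    }

    # Hard override of "nothing", also skipped for HTML5 documents.
    if (!$detected and not $is_html5) {
        return 'W04';
    }

    # HTML5, or a case handled by code outside the diff (omitted here).
    return '';
}

# Placeholder inputs; only the 'HTML5' doctype string comes from the patch.
my $html5  = { DOCTYPE => 'HTML5',     Opt => { FB => { Charset => 1 } } };
my $legacy = { DOCTYPE => 'HTML 4.01', Opt => { FB => { Charset => 1 } } };

printf "HTML5:  '%s'\n", charset_warning($html5);
printf "legacy: '%s'\n", charset_warning($legacy);
```

Since "and" binds more loosely than both "!" and "ne" in Perl, the conditions in the patch should parse as intended without extra parentheses.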
--Mike
--
Michael(tm) Smith
http://people.w3.org/mike/
Received on Friday, 24 July 2009 00:44:12 UTC