- From: olivier Thereaux <ot@w3.org>
- Date: Fri, 23 Apr 2004 11:06:57 +0900
- To: Pete Prodoehl <pete@rasterweb.net>
- Cc: www-validator <www-validator@w3.org>
Received on Thursday, 22 April 2004 22:07:53 UTC
On Apr 22, 2004, at 22:13, Pete Prodoehl wrote:
> I wrote some code to do whole site validation
Cool. Thanks for doing this and thanks for putting under the GPL.
> If anyone finds it useful, or has suggestions, please let me know.
Hmm, instead of:
[[
# we implement screen scraping, which is just wrong, wrong, wrong...
# this is likely to break in the future if the validator changes it's
output...
$content =~ s/^(.*?)Source\sListing.*/$1/s;
my $result;
if ($content =~ /This\sPage\sIs\sValid/s) {
$result = 'OK';
$okct++;
}
]]
You could try what the Log Validator is doing, i.e use LWP::UserAgent
to do a HEAD and have it check our specific HTTP headers with code
like:
[[
$self->valid($response->header('X-W3C-Validator-Status'));
$self->valid_err_num($response->header('X-W3C-Validator-Errors'));
]]
More at:
http://dev.w3.org/cvsweb/perl/modules/W3C/LogValidator/lib/W3C/
LogValidator/HTMLValidator.pm
Note that X-W3C-Validator-Status and X-W3C-Validator-Errors aren't
officially documented, but they're most likely to stay.
--
olivier
Received on Thursday, 22 April 2004 22:07:53 UTC