Questions concerning LogValidator output

Olivier and the LogValidator team,

I have some questions regarding the output of the W3C LogValidator
modules. I've pasted in the output in question at the end of this note.
This is the output I received on my first run of logprocess.pl. My
question is a general one: in the HTMLValidator, CCSValidator and
LinkCheckout outputs, the causes of the errors or problems are not
flagged. It seems like the only way I can find out what caused the
problem, for instance, for the HTMLValidator for my most popular page is
to submit the URL for http://www.jhuccp.org/ to your page at
http://validator.w3.org/. Is this the way it's supposed to operate, or
am I overlooking something? I tried to get more information about the
causes of the errors by specifying the '-v' (verbose) flag, but that
didn't seem to help. Is there anyway to generate a report that indicates
exactly the cause of the problems and the location within the file to
find and fix the problem?

Thanks, again, for all your help.

-Kevin 

Kevin Zembower
Internet Services Group manager
Center for Communication Programs
Bloomberg School of Public Health
Johns Hopkins University
111 Market Place, Suite 310
Baltimore, Maryland  21202
410-659-6139 
==================================================
************************************************************************
Results for module HTMLValidator
************************************************************************
Here are the 10 most popular invalid document(s) that I could find in
the 
logs for www.jhuccp.org.

 Rank   Hits   #Error(s)                         Address

------ ------ -----------
----------------------------------------------------- 
1      546    18          http://www.jhuccp.org/

2      263    20          http://www.jhuccp.org/cpconference/

3      70     55
http://www.jhuccp.org/training/PHPTEST/guestbook2.php 
4      43     12          http://www.jhuccp.org/jobs/

5      37     61
http://www.jhuccp.org/training/PHPTEST/guestbook.php  
6      34     57          http://www.jhuccp.org/fpsuccess/

7      23     27          http://www.jhuccp.org/research/

8      22     85
http://www.jhuccp.org/training/Workshop/LSHC.shtml    
9      22     28          http://www.jhuccp.org/pubs/

10     20     45          http://www.jhuccp.org/programs/


Conclusion :
I had to check 10 document(s) in order to find 10 invalid HTML
documents.
This means that about 100% of your most popular documents were invalid.
************************************************************************


************************************************************************
Results for module CSSValidator
************************************************************************
Here are the 7 most popular invalid document(s) that I could find in the

logs for www.jhuccp.org.

 Rank   Hits   #Error(s)                               Address

------ ------ -----------
---------------------------------------------------------------- 
1      585    1           http://www.jhuccp.org/ccp_styles.css

5      9      17
http://www.jhuccp.org/fpsuccess/themes/light/style.css           
22     6      1
http://www.jhuccp.org/LSHC_India/theme/standard/styles_ie7.css   
27     5      3           http://www.jhuccp.org/print_styles.css

28     5      4
http://www.jhuccp.org/training/scope/nepal/CssFiles/theories.css 
29     5      8
http://www.jhuccp.org/africa/mali/clic/styles.css                
32     5      55
http://www.jhuccp.org/training/collaborate/flexstore/blue.css    

Conclusion :
You asked for 10 invalid stylesheet document(s) but I could only find 7 
by processing (all the) 33 document(s) in your logs. 
This means that about 21.21% of your most popular documents were
invalid.
************************************************************************


************************************************************************
Results for module Link Checker
************************************************************************
Here are the 10 most popular document(s) with broken links 
that I could find in the logs for www.jhuccp.org.

 Rank   Hits   #Error(s)                          Address

------ ------ -----------
------------------------------------------------------ 
4      546    1           http://www.jhuccp.org/

49     70     1
http://www.jhuccp.org/training/PHPTEST/guestbook2.php  
67     37     1
http://www.jhuccp.org/training/PHPTEST/guestbook.php   
132    22     1
http://www.jhuccp.org/training/Workshop/LSHC.shtml     
268    10     2           http://www.jhuccp.org/topics/avian_flu.shtml

287    9      1           http://www.jhuccp.org/research/journalsDB.php

300    9      6           http://www.jhuccp.org/training/

320    9      6
http://www.jhuccp.org/research/researchDB/files/       
335    8      3
http://www.jhuccp.org/training/PHPTEST/guestbook.php,v 
355    8      1           http://www.jhuccp.org/pr/


Conclusion :
I had to check 355 document(s) in order to find 10 HTML documents with
broken links.
This means that about 2.81% of your most popular documents needs fixing.
************************************************************************

Received on Thursday, 7 February 2008 19:51:10 UTC