Validation error frequencies in Alexa top 500 front pages

I ran the front pages of the Alexa global top 500 sites through the
HTML5 facet of Validator.nu:
http://hsivonen.iki.fi/test/moz/alexa500-error-counts.txt

The exact bad attribute values were replaced with "(consolidated)", as
were duplicate IDs and usemap targets. After these replacements, the
occurrences of each unique error message string were tallied--this time
*without* collapsing duplicates on a single page. Some requests timed
out, as seen in the message list. Support for old-style internal
encoding declarations and checks on the contents of obsolete
containers were added before running this study.
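
For readers who want to reproduce this kind of tally, here is a minimal
sketch of the consolidate-then-count step described above. It is not the
script used for this study. The message wording, the ASCII quoting, and
the names RULES, consolidate() and tally() are all assumptions made for
illustration; usemap targets would need a further rule of the same shape.

```python
import re
from collections import Counter

# Hypothetical rewrite rules. The message wording and the ASCII quotes are
# assumptions for illustration, not the validator's exact output.
RULES = [
    (re.compile(r'Bad value "[^"]*" for attribute'),
     'Bad value (consolidated) for attribute'),
    (re.compile(r'Duplicate ID "[^"]*"'),
     'Duplicate ID (consolidated)'),
]


def consolidate(message):
    """Replace page-specific values so similar messages compare equal."""
    for pattern, replacement in RULES:
        message = pattern.sub(replacement, message)
    return message


def tally(messages_per_page):
    """Count every occurrence of each consolidated message.

    Duplicates on a single page are NOT collapsed: each message counts.
    """
    counts = Counter()
    for messages in messages_per_page:
        counts.update(consolidate(m) for m in messages)
    return counts


# Made-up sample input: one list of error messages per page.
pages = [
    ['Bad value "foo" for attribute "align" on element "td".',
     'Bad value "bar" for attribute "align" on element "td".',
     'Duplicate ID "nav".'],
    ['Duplicate ID "top".'],
]

counts = tally(pages)
for message, count in counts.most_common():
    print(count, message)
```

Collapsing duplicates per page instead would only require passing
set(consolidate(m) for m in messages) to counts.update().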

I hope the WG finds this data useful.

P.S. Surprisingly, there's one application/xhtml+xml front page in the  
global top 500: http://www.iwiw.hu

-- 
Henri Sivonen
hsivonen@iki.fi
http://hsivonen.iki.fi/

Received on Sunday, 3 February 2008 17:45:37 UTC