W3C home > Mailing lists > Public > public-html@w3.org > February 2008

Validation error frequencies in Alexa top 500 front pages

From: Henri Sivonen <hsivonen@iki.fi>
Date: Sun, 3 Feb 2008 19:45:22 +0200
Message-Id: <6D63585B-2ADC-4F17-97C8-AD59CFA38D5D@iki.fi>
To: HTML Issue Tracking WG <public-html@w3.org>

I ran the front pages of Alexa global top 500 sites through the HTML5  
facet of Validator.nu:
http://hsivonen.iki.fi/test/moz/alexa500-error-counts.txt

The exact values of bad attribute values were replaced with  
"(consolidated)" as were duplicate IDs and usemap targets. After these  
replacements the occurrences of unique error message strings were  
tallied--this time *without* collapsing duplicates on a single page.  
Some requests timed out as seen in the message list. Support for old- 
style internal encoding declarations and checking for the contents of  
obsolete containers were added before running this study.

I hope the WG finds this data useful.

P.S. Surprisingly, there's one application/xhtml+xml front page in the  
global top 500: http://www.iwiw.hu

-- 
Henri Sivonen
hsivonen@iki.fi
http://hsivonen.iki.fi/
Received on Sunday, 3 February 2008 17:45:37 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 9 May 2012 00:16:12 GMT