W3C home > Mailing lists > Public > public-evangelist@w3.org > March 2006

An experimental study on validation problems with existing HTML webpages

From: Karl Dubost <karl@w3.org>
Date: Fri, 10 Mar 2006 09:43:04 +0900
Message-Id: <ADE1725D-44DE-42B9-BAEE-77764A248C47@w3.org>
To: "'public-evangelist@w3.org' w3. org" <public-evangelist@w3.org>

Hi,

Very interesting paper giving more background on the study recently  
published by Google.


PDF file available: http://repository.ust.hk/dspace/bitstream/ 
1783.1/2298/1/100.pdf
[[[
Title: 	An experimental study on validation problems with existing  
HTML webpages
Authors: Chen, Shan / Hong, Dan / Shen, Vincent Y.S.
Issue Date: 	Jun-2005
Citation: 	Proceedings of the 2005 International Conference on  
Internet Computing, June 27-30, Las Vegas, USA. (pre-published version)

Abstract: 	
	In this paper we report the results of an experimental study on the  
validation problem of existing HTML pages in the World Wide Web. We  
have found that only about 5% of webpages are %u201Cvalid%u201D  
according to the HTML standard. An %u201Cinvalid%u201D webpage may be  
rendered differently by different browsers; it may not be machine- 
processable; it might not be translated into other Web document  
formats correctly using some of the translation engines available on  
the Web; etc. Through sampling and analyzing the results, we have  
identified the most common problems in existing webpages that made  
them invalid. We hope our discovery can encourage standard bodies,  
browser developers, authoring tool developers, and webpage designers  
to work together so that the number of valid webpages continues to  
grow and the Web can indeed reach its full potential.

URI: 	http://hdl.handle.net/1783.1/2298
]]]

-- HKUST Institutional Repository: Item 1783.1/2298
http://repository.ust.hk/dspace/handle/1783.1/2298
Fri, 10 Mar 2006 00:37:27 GMT



-- 
Karl Dubost - http://www.w3.org/People/karl/
W3C Conformance Manager, QA Activity Lead
   QA Weblog - http://www.w3.org/QA/
      *** Be Strict To Be Cool ***
Received on Friday, 10 March 2006 00:43:20 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Friday, 15 July 2011 00:13:23 GMT