- From: Jukka K. Korpela <jkorpela@cs.tut.fi>
- Date: Mon, 20 Aug 2012 07:28:06 +0300
- To: "www-validator@w3.org" <www-validator@w3.org>
- CC: richard lebeau <lebrpl61@gmail.com>
2012-08-19 23:32, David Dorward wrote:
> On 19 Aug 2012, at 19:51, richard lebeau <lebrpl61@gmail.com> wrote:
>
>> Mark-up/clean-up, Is it possible to copy clean-up and markup fixed errors into my existing website so I can correct errors on validator
>
> The validator has an option marked "Clean up Markup with HTML-Tidy". Obviously this is imperfect as guessing what an author meant is difficult.
Moreover, the "clean-up" simply runs HTML Tidy, performing questionable
transformations. It does not actually clean things up.
(There's also the technical issue that automatic copying of "fixed"
markup into a website would require quite a lot new code and could not
work if the pages are actually generated by PHP, ASP, or other
server-side technologies.)
Here's a simple document with some errors:
<!doctype html>
<p align=center>Hello</p>
Here's what HTML-Tidy turns it to:
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN">
<html>
<head>
<meta name="generator" content="HTML Tidy for Linux (vers 25 March
2009), see www.w3.org">
<title></title>
<style type="text/css">
p.c1 {text-align: center}
</style>
</head>
<body>
<p class="c1">Hello</p>
</body>
</html>
So it changes the doctype to a historic one, HTML 3.2, so that the
"cleaned-up" document does not validate (due to code that HTML-Tidy has
added!). It adds a meta tag about generator, referring to www.w3.org, as
if HTML-Tidy were W3C software, contrary to the statement "HTML-Tidy is
a third-party software not developed at W3C, and its output is provided
without any guarantee." It inserts a title element with empty content -
hardly an improvement in practice. It replaces the attribute
align=center by the use of CSS, using a generated class name, without
checking whether that name is already used in the document. Oh, and it
adds some optional tags like <html> and <head> - not required for validity.
Yucca
Received on Monday, 20 August 2012 04:28:37 UTC