W3C home > Mailing lists > Public > www-validator@w3.org > November 2010

Re: HTML5 validation message

From: Michael(tm) Smith <mike@w3.org>
Date: Tue, 9 Nov 2010 13:33:44 +0900
To: Thomas Gambet <tgambet@w3.org>
Cc: www-validator <www-validator@w3.org>, philipj@opera.com
Message-ID: <20101109043340.GH12423@sideshowbarker>
Thomas Gambet <tgambet@w3.org>, 2010-11-08 08:53 -0500:

> I'm forwarding the following message that was posted on Unicorn's track:
> 
> [[
> When a HTML5 page validates[1] without errors there's a message that says
> "This means that the resource in question identified itself as "HTML5" and
> that we successfully performed a formal validation using an SGML, HTML5
> and/or XML Parser(s) (depending on the markup language used)."
> 
> Surely this cannot be true, as HTML5 has no SGML serialization, and the
> switch between HTML and XML parser depends on the MIME type, not "the markup
> language used".
> 
> In other words, either this text is wrong, or the validator is actually
> doing it wrong.
> 
> [1] http://validator.w3.org/check?uri=http%3A%2F%2Ffoolip.org%2Fmicrodatajs%2Flive%2F
> ]]

The validator.nu backend code that's used for HTML5 validation is
definitely not ever using an SGML parser.

But I don't know whether there are cases when the W3C Perl code is doing
SGML parsing on HTML5 documents for some reason.

I also wondered why it says "XML Parser(s)" instead of just "XML parser"
but then I remembered that there are some cases where the Perl frontend
runs a document through libxml2 to do a well-formedness check even when you
are serving a document as text/html.

So I think that should be changed to:

  ...we successfully performed a formal validation using an HTML5 and/or
  XML parser(s) (depending on the MIME type with which the document is
  being served)

-- 
Michael(tm) Smith
http://people.w3.org/mike
Received on Tuesday, 9 November 2010 04:33:50 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 25 April 2012 12:14:44 GMT