W3C home > Mailing lists > Public > www-dom@w3.org > July to September 2000

Re: Difference in DOM's

From: Joseph Kesselman/Watson/IBM <keshlam@us.ibm.com>
Date: Fri, 1 Sep 2000 12:15:11 -0400
To: Mark Brennan <Mark.Brennan@MediaServ.com>
Cc: "'www-dom@w3.org'" <www-dom@w3.org>
Message-ID: <OF52996C63.91BF4E0C-ON8525694D.0058BBA8@pok.ibm.com>

You may not like this answer:

The W3C DOM does not support malformed HTML.  It's up to the HTML processor
to either reject it as unparsable, or to repair it before the DOM is built.
The details of that repair are out of our scope.

A suggested, but not officially endorsed, repair algorithm appears in the
W3C's "tidy" tool.

Getting everyone to agree to yield the same results -- tidy's or any other
--  would be a task for the HTML Working Group. Given that they seem to
instead be moving toward an XML-based version of HTML, in which malformed
documents are not possible and consistant parsing can be guaranteed, I
don't know if they can enforce particular repairs or are interested in
doing so. Ask them?

Joe Kesselman  / IBM Research
Received on Friday, 1 September 2000 12:15:18 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 20 October 2015 10:46:07 UTC