W3C home > Mailing lists > Public > html-tidy@w3.org > January to March 2003

MSHTML as an alternative to Tidy

From: Lucas W. Fletcher <lucas@dealersinnotions.com>
Date: Thu, 13 Feb 2003 14:38:40 -0800
Message-ID: <005601c2d3b0$a893d280$0300a8c0@uncon>
To: <html-tidy@w3.org>
Is anyone aware of a publicly available program that uses the MSHTML API to convert an HTML file into XHTML? If it is performant, I think this would be a great alternative to Tidy.

If one assumes that the ultimate version of Tidy is one where it can parse pages in as fault-tolerant a manner as the popular browsers such as IE, then wouldn't it make sense to actually utilize the DOM exposed by the browser itself in order to create the XHTML?

A script-based version of this is at:

http://webfx.eae.net/dhtml/richedit/richedit2.html

Obviously this would be way too slow in a production environment, but via C++ and MSHTML probably not.

Even if Tidy were as fault-tolerant as IE, that wouldn't mean that its interpretations of coding errors would be the same. It seems a difficult task to try to second guess the popular browsers, so why not use the browser itself?

Lucas Fletcher
lucas@dealersinnotions.com
http://dealersinnotions.com
Received on Thursday, 13 February 2003 17:41:04 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:53 GMT