W3C home > Mailing lists > Public > whatwg@whatwg.org > March 2007

[whatwg] Configure Apache to send the right MIME type for XHTML

From: Elliotte Harold <elharo@metalab.unc.edu>
Date: Wed, 07 Mar 2007 13:14:25 -0500
Message-ID: <45EF0101.8030102@metalab.unc.edu>
Alexey Feldgendler wrote:

> If you are going to write XHTML documents whose DOM is not representable in HTML, then your documents won't be compatible with MSIE, and you won't need the described MIME type trick.
> 
> OTOH, if you are going to restrict yourself to XHTML documents whose DOM is representable in HTML, then you might as well use HTML. It has been said many times here that virtually every single advantage that XHTML has over HTML4 is now also inherent to HTML5 (predictable DOM, processing with XSLT, etc).


Neither is really my point. The problem with malformed HTML is that it 
has an inconsistent DOM. You get different DOMs in different browsers 
and tools. Making a document well-formed XHTML gives you a consistent, 
reproducible, predictable DOM.

If you go one step further and make the document valid strict XHTML, the 
DOM gets even simpler and cleaner. Many nodes are pruned from the tree. 
This makes it easier still to manage.

It's not about whether we can represent a DOM in HTML or not. It's about 
consistency and simplicity.

I'm not the first to point this out by any means, but I don't think it's 
been as widely recognized as it should. I suspect that's because many 
developers working with HTML aren't really using the DOM at all. They're 
hacking the text with IDs, document.write, innerHTML, and other tricks 
that don't really pay attention to the tree structure of the document 
they're working on. That may be a chicken-and-egg problem: as long as 
the document is malformed even figuring out what the tree is is nearly 
impossible and not cross-browser.

With HTML 5 parsers (which we don't have yet) the DOM of malformed 
documents may yet become predictable, but it won't ever be obvious. And 
it may well not be consistent across browsers unless Microsoft jumps on 
board. By contrast, valid strict XHTML (probably transitional too) has a 
reasonably consistent tree structure in all browsers today.

Once you achieve well-formedness you can begin looking at the document 
in  a new way. You get access to powerful tools like XPath and XSLT that 
you didn't have before. You can start doing more than merely tossing the 
page at a browser for rendering. And you don't have to wait for an HTML 
5 parser to do it either. You can do it today with a large tool chain.

-- 
?Elliotte Rusty Harold  elharo at metalab.unc.edu
Java I/O 2nd Edition Just Published!
http://www.cafeaulait.org/books/javaio2/
http://www.amazon.com/exec/obidos/ISBN=0596527500/ref=nosim/cafeaulaitA/
Received on Wednesday, 7 March 2007 10:14:25 UTC

This archive was generated by hypermail 2.4.0 : Wednesday, 22 January 2020 16:58:53 UTC