> I don't understand why this is an issue.  The authoritative metadata
> finding makes it clear that the media type determines how the document
> is to be interpreted, and nothing in RFC 2854 (text/html) or the HTML
> family of specifications suggests that running a text/html document
> through an XML parser would yield anything which meaningfully
> represents what the sender was trying to convey.

A document can be both valid HTML and well-formed XML.  The two
categories are not mutually exclusive.
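
As a rough illustration of the overlap (my own sketch, not anything
taken from the specs), the Python below feeds one string to both an XML
parser and an HTML parser from the standard library and compares the
element names each one reports.  The sample document and the helper
names (DOC, TagCollector, xml_tags, html_tags) are hypothetical.  Note
that neither parser validates against a DTD, so this only shows that
the same text is well-formed XML and is also read by an HTML tokenizer
with the same element structure.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical sample document, written to fit both syntaxes:
# every element is explicitly closed and names are lowercase.
DOC = (
    '<html xmlns="http://www.w3.org/1999/xhtml">'
    '<head><title>Example</title></head>'
    '<body><p>Hello, world.</p></body>'
    '</html>'
)


class TagCollector(HTMLParser):
    """Records the name of every start tag the HTML parser sees."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def xml_tags(text):
    """Parse text as XML; raises ParseError if it is not well-formed."""
    root = ET.fromstring(text)
    # ElementTree reports names as '{namespace}local'; keep the local part.
    return [el.tag.split('}')[-1] for el in root.iter()]


def html_tags(text):
    """Tokenize text as HTML and return start-tag names in document order."""
    collector = TagCollector()
    collector.feed(text)
    collector.close()
    return collector.tags


if __name__ == "__main__":
    # Both parsers should report the same element names in the same order.
    print(xml_tags(DOC) == html_tags(DOC))
```

A document written with HTML-only shortcuts (an unclosed <p>, say, or
<br> without the trailing slash) would make xml_tags raise a ParseError
while html_tags still returns a list, which is the other half of the
point: the overlap exists, but not every text/html document falls in it.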

