[whatwg] Distinguishing XML and HTML by content sniffing from Michael Day on 2007-03-04 (public-whatwg-archive@w3.org from March 2007)

From: Michael Day <mikeday@yeslogic.com>
Date: Sun, 04 Mar 2007 17:33:51 +1100
Message-ID: <45EA684F.5030202@yeslogic.com>

Hi all,

For user agents like Prince that support XML and HTML content it is 
sometimes necessary to distinguish whether a .html file is actually XML 
or HTML in order for it to be processed correctly.

I've written an article for XML.com explaining exactly how Prince 
performs content sniffing to distinguish XML and HTML documents:

     What Does XML Smell Like?
     http://www.xml.com/pub/a/2007/02/28/what-does-xml-smell-like.html

Any feedback would be greatly appreciated. No doubt at some point it 
will be necessary to revise our heuristics for HTML5 :)

Best regards,

Michael

-- 
Print XML with Prince!
http://www.princexml.com

Received on Saturday, 3 March 2007 22:33:51 UTC