W3C home > Mailing lists > Public > www-tag@w3.org > December 2012

Re: HTML/XML TF Report glosses over Polyglot Markup

From: Noah Mendelsohn <nrm@arcanedomain.com>
Date: Tue, 04 Dec 2012 00:02:50 -0500
Message-ID: <50BD83FA.7040405@arcanedomain.com>
To: "Eric J. Bowman" <eric@bisonsystems.net>
CC: Robin Berjon <robin@w3.org>, "Henry S. Thompson" <ht@inf.ed.ac.uk>, Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>, Henri Sivonen <hsivonen@iki.fi>, public-html WG <public-html@w3.org>, www-tag@w3.org
Robin Berjon wrote:

> If
> you want to process HTML using an XML toolchain, put an HTML parser
> in front of it.


On 12/3/2012 6:36 PM, Eric J. Bowman wrote:
>   I used to do it that way,
> with Tidy and TagSoup, but have found it's simpler to just use an XSLT
> engine capable of reading raw HTML,

A question because I'm honestly curious: those XSLT engines don't use an 
HTML parser to do that? I would have thought most did. Maybe I'm guessing 
wrong.

Noah
Received on Tuesday, 4 December 2012 05:03:20 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 4 December 2012 05:03:20 GMT