[whatwg] HTML 5 parsing - not just for browsers?

On Mon, 13 Feb 2006 13:50:57 +0600, dolphinling  
<dolphinling at myrealbox.com> wrote:

> So, will the HTML 5 parsing section be of use here? Will it be of use to  
> things other than browsers? Are there small differences needed because  
> what's being parsed is a document fragment instead of a document? And  
> when it's re-serialized, how closely will today's browsers interpret the  
> original and the new?

A HTML parser is defenitely a thing which isn't only used by browsers.  
Search engines, archivation and comparison tools, web page translators --  
they all need a parser.

About websites like forums and blogs, it can be a bit trickier: many of  
them introduce their own markup (BBcode, LiveJournal tags) in addition to  
allowing some HTML, so they'd need a modified version of HTML 5 parser.


-- Opera M2 9.0 TP2 on Debian Linux 2.6.12-1-k7
* Origin: X-Man's Station at SW-Soft, Inc. [ICQ: 115226275]  
<alexey at feldgendler.ru>

Received on Monday, 13 February 2006 01:12:24 UTC