
Re: HTML parser ( in c++ )

From: Daniel Glazman <glazman@netscape.com>
Date: Mon, 23 Dec 2002 05:42:16 +0100
Message-ID: <3E069428.2060806@netscape.com>
To: "Sundararajan Senthil" <Senthil.Sundararajan@Aspect.com>
CC: "'www-html@w3.org'" <www-html@w3.org>

Sundararajan, Senthil wrote:
> Hi,
>  
> I am looking for a pure HTML parser in C++. Until now I have been using a
> lex/yacc SGML parser, and it does not handle the latest tag syntax ("/",
> etc.) well. As a result, parsing throws a lot of errors, pages are not
> displayed properly, and functionality is often lost.
>  
> Though HTML is built on SGML grammar, it is very flexible. With the
> parsers available now (most of them SGML-based), it is very
> difficult to build a good application. I would appreciate it if you
> could shed some light on this issue.

Mozilla and Amaya use one:

   http://www.mozilla.org/
   http://www.w3.org/Amaya/

</Daniel>
-- 
Daniel Glazman  - Netscape Communications Corp. -  glazman@netscape.com
Netscape is Linux Journal Editors Choice Award for Best Web Client 2002
   Netscape 7.0 : 13.588.591 downloads so far... and it's not over !
Received on Sunday, 22 December 2002 23:40:36 GMT
