W3C home > Mailing lists > Public > www-html@w3.org > July 2000

Re: HTML parser

From: Dave Raggett <dsr@w3.org>
Date: Fri, 7 Jul 2000 19:22:27 +0100 (GMT Daylight Time)
To: Jan Roland Eriksson <jrexon@newsguy.com>
cc: Tech Lover <netuser@bikeracer.com>, www-html@w3.org
Message-ID: <Pine.WNT.4.10.10007071921310.-487035@hazel.hpl.hp.com>
On Thu, 6 Jul 2000, Jan Roland Eriksson wrote:

> On 5 Jul 2000 14:51:05 -0700, Tech Lover <netuser@bikeracer.com>
> wrote: I am looking for a HTML parser in c/c++ or Java. Can I
> get pointers for 'source code' of a parser from anywhere ?
> Several of us have been using James Clarks SP package for a number of
> years now, it's all free and only a http://www.raging.com/ away...

SP isn't fault tolerant, which is fine for SGML but not for real
world HTML. Take a look at HTML Tidy, which is also free, see the
W3C site or my home page.


-- Dave Raggett <dsr@w3.org> http://www.w3.org/People/Raggett
tel/fax: +44 122 578 3011 (or 2521) +44 778 532 0444 (mobile)
World Wide Web Consortium (on assignment from HP Labs)
Received on Friday, 7 July 2000 14:22:51 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 27 March 2012 18:15:43 GMT