- From: Nick Kew <nick@webthing.com>
- Date: Wed, 26 May 2004 10:46:34 +0100 (BST)
- To: QA Dev <public-qa-dev@w3.org>
- Cc: ht@inf.ed.ac.uk
On Wed, 26 May 2004, olivier Thereaux wrote: > On the topic of multiple SGML/XML parsers. > > Henry S. Thompson wrote: (this response Cc: Henry) > > It's really not a great idea to > > keep trying to make openSP be a fully-compliant XML validator when > > there is a _tiny_ user/developer community for it in that role, > > compared to the huge amount of effort which is put into the top 3 or 4 > > XML validators. Indeed, I don't think we're arguing with him about that. Although the W3C validator is still limited to OpenSP, various related projects such as my own Site Valet and Björn's MSIE validation bar use multiple parsers. It's the relatively narrow but crucial subject of HTML validation where we have no alternative in sight. > I would recommend rxp, since it's open source, fast, > > produces good error messages and is a Univ. of Edinburgh product owned > > and operated by Richard Tobin, member of the XML Core WG, but any of > > the validators which did well in Rusty Harold's conformance > > comparison, reported at XML Europe 2004, would do. I used Xerces in mod_validator, because at the time of writing it was the only available parser offering full validation (for values of "available" that precluded anything-Java). Nowadays I would choose libxml2 for new software. Indeed, I regard libxml2 as the way ahead for several reasons over and above the validation we have: (1) It's very fast (see the xmlbench results @sourceforge), widely-deployed, and easy to work with. (2) I've used it successfully in the past for related projects (AccessValet, and more recently mod_annot). (3) It has all the basics we need for extending it beyond validation to full conformance testing. (4) It's the basis for libcroco, so should integrate well with CSS analysis. (5) It has an active and responsive development community. Henry (or anyone) want to present arguments for AN Other option? -- Nick Kew
Received on Wednesday, 26 May 2004 05:52:24 UTC