
Re: student project idea: RDF/RDFa parser QA via automatic test-suite generation

From: Manu Sporny <msporny@digitalbazaar.com>
Date: Tue, 18 Nov 2008 10:11:40 -0500
Message-ID: <4922DB2C.7010400@digitalbazaar.com>
To: Dan Brickley <danbri@danbri.org>
CC: Semantic Web <semantic-web@w3.org>, RDFa <public-rdf-in-xhtml-tf@w3.org>

Dan Brickley wrote:
> I'd like to see an auto-generated repository of RDFa samples, most (but
> not all) of which are decent wellformed XHTML with RDFa, but also with a
> good number of poorly-marked up files. 

+1 - sounds like a worthwhile project.

There are two variants of this approach:

The first involves generating valid and invalid XHTML+RDFa to see if the
parsers can make it through the file. Did the parser dump core or did it
exit with a good status code?
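
A minimal sketch of that first check, assuming the parser under test can be
invoked as a command-line program that takes a file path as its last argument.
The function name, the outcome labels, and the timeout value are illustrative
only and are not part of the RDFa Test Harness:

```python
import subprocess
import sys


def smoke_test(parser_cmd, path, timeout=30):
    """Run a parser on one file and classify how the process ended.

    parser_cmd is a list such as ["my-rdfa-parser", "--quiet"] (hypothetical);
    the file path is appended as the final argument.
    """
    try:
        proc = subprocess.run(
            parser_cmd + [path], capture_output=True, timeout=timeout
        )
    except subprocess.TimeoutExpired:
        # The parser hung; subprocess.run kills the child before raising.
        return "timeout"
    if proc.returncode < 0:
        # On POSIX, a negative return code means the process was killed by
        # a signal (for example SIGSEGV), which is the "dumped core" case.
        return "crashed"
    return "ok" if proc.returncode == 0 else "error-exit"


if __name__ == "__main__":
    # Stand-in "parser": the Python interpreter itself, exiting cleanly.
    fake_parser = [sys.executable, "-c", "import sys; sys.exit(0)"]
    print(smoke_test(fake_parser, "generated-0001.xhtml"))
```

Whether a non-zero exit on an invalid file counts as a pass or a failure is a
policy decision for the test suite; the sketch only reports what happened.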

The second involves generating valid XHTML+RDFa as well as the
corresponding SPARQL files such that they can be hooked up to the RDFa
Test harness. Did the parser exit with a good status code AND did the
SPARQL evaluate to TRUE?
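
As a rough illustration of the second variant, the sketch below generates one
XHTML+RDFa document together with a SPARQL ASK query that should evaluate to
true against the triples extracted from it. It assumes a single Dublin Core
title triple, an absolute subject IRI with no characters that need escaping,
and a single-line title; `make_test_pair` is a hypothetical helper, not
something the harness provides:

```python
from xml.sax.saxutils import escape, quoteattr

XHTML_RDFA_DOCTYPE = (
    '<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" '
    '"http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">'
)
DC_NS = "http://purl.org/dc/elements/1.1/"


def make_test_pair(subject_iri, title):
    """Return (xhtml, sparql) describing the same dc:title triple."""
    xhtml = "\n".join([
        '<?xml version="1.0" encoding="UTF-8"?>',
        XHTML_RDFA_DOCTYPE,
        f'<html xmlns="http://www.w3.org/1999/xhtml" xmlns:dc={quoteattr(DC_NS)}>',
        "<head><title>Generated test</title></head>",
        "<body>",
        # @about sets the subject, @property the predicate, and the element's
        # text content becomes a plain literal object.
        f'<p about={quoteattr(subject_iri)} property="dc:title">{escape(title)}</p>',
        "</body>",
        "</html>",
    ])
    # Escape backslashes and double quotes for a SPARQL string literal.
    literal = title.replace("\\", "\\\\").replace('"', '\\"')
    sparql = f'ASK WHERE {{ <{subject_iri}> <{DC_NS}title> "{literal}" . }}'
    return xhtml, sparql


if __name__ == "__main__":
    doc, query = make_test_pair("http://example.org/doc", "A generated title")
    print(doc)
    print(query)
```

Generating the markup and the query from the same input triple is what keeps
the two files in agreement; a real generator would vary the markup patterns
far more than this.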

> Generating such a test set and then wiring it up to a set of RDFa
> parsers (via http://rdfa.digitalbazaar.com/rdfa-test-harness/ or
> something like it) shouldn't be a huge job

It would be fairly straightforward to do this - the RDFa Test Harness
is already set up for use cases like the one you are describing. We would need:

A manifest file[1], and a set of matching XHTML+RDFa files and their
corresponding SPARQL files[2].
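
Before a manifest is written, a generator would probably want to confirm that
every XHTML+RDFa file really has its SPARQL counterpart. The sketch below
assumes the two files of a test share a file name stem and use the `.xhtml`
and `.sparql` extensions; that naming scheme is an assumption made for
illustration, and the actual manifest format is not reproduced here:

```python
from pathlib import Path


def pair_test_files(directory):
    """Match each .xhtml file with the .sparql file sharing its stem.

    Returns (paired, unmatched): a list of (xhtml_path, sparql_path) tuples
    sorted by stem, and a sorted list of stems missing one of the two files.
    """
    root = Path(directory)
    xhtml = {p.stem: p for p in root.glob("*.xhtml")}
    sparql = {p.stem: p for p in root.glob("*.sparql")}
    paired = [(xhtml[s], sparql[s]) for s in sorted(xhtml.keys() & sparql.keys())]
    # Symmetric difference: stems present on one side only.
    unmatched = sorted(xhtml.keys() ^ sparql.keys())
    return paired, unmatched


if __name__ == "__main__":
    pairs, orphans = pair_test_files(".")
    print(f"{len(pairs)} complete tests, {len(orphans)} incomplete")
```

Only the complete pairs would then be listed in the manifest.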

> (c) whether the spec gurus agree on what ought to be generated.

I don't suggest getting the spec gurus involved in most of the 1000 test
cases. On the RDFa telecons, it takes us roughly 5-10 minutes to get
through the simple, straightforward test cases... and that's after
we've reviewed them offline. I'd lean on the spec gurus only when there
is a disagreement between the parser writers on what should happen.
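
One way to find those disagreements automatically, sketched under the
assumption that each parser's results are available as a mapping from test
identifier to an outcome string (the data layout and the name
`disputed_tests` are hypothetical):

```python
def disputed_tests(results):
    """Return the sorted test ids on which the parsers do not all agree.

    results maps parser name -> {test id -> outcome}, where an outcome is any
    comparable value such as "pass" or "fail".
    """
    outcomes_by_test = {}
    for outcomes in results.values():
        for test_id, outcome in outcomes.items():
            outcomes_by_test.setdefault(test_id, set()).add(outcome)
    # More than one distinct outcome means at least two parsers disagree.
    return sorted(t for t, seen in outcomes_by_test.items() if len(seen) > 1)


if __name__ == "__main__":
    example = {
        "parser-a": {"0001": "pass", "0002": "pass"},
        "parser-b": {"0001": "pass", "0002": "fail"},
    }
    print(disputed_tests(example))
```

Only the tests returned by such a check would need to go to the telecon; the
rest can be accepted on the strength of parser agreement.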

This would be a great summer project for a student. I'd be willing to
lend advice and help integrating with the RDFa Test Harness.

-- manu


Manu Sporny
President/CEO - Digital Bazaar, Inc.

blog: POSIX Threads Don't Scale Past 100K Concurrent Web Requests

blog: Fibers are the Future: Scaling Past 100K Concurrent Web Requests

Received on Tuesday, 18 November 2008 15:12:40 UTC
