W3C home > Mailing lists > Public > public-xml-schema-testsuite@w3.org > November 2006

RE: Bug tracking

From: Michael Kay <mike@saxonica.com>
Date: Thu, 16 Nov 2006 15:51:26 -0000
To: "'Henry S. Thompson'" <ht@inf.ed.ac.uk>
Cc: <public-xml-schema-testsuite@w3.org>
Message-ID: <00bf01c70997$132bfcb0$6401a8c0@turtle>

Fine, I'll keep my powder dry.

FWIW, about 600 of my 1500 discrepancies are in the regex area which is why
I'm tackling that first. All but 24 are cases where the expected result is
"valid" and Saxon says "invalid". Some of these are due to the continued
lack of clarity in the rules for handling hyphens. Many of them are because
the test results are just plain wrong, for example I've found the following
being classed as valid:

x{,2}
[\u0554-\u0557]+
\p{Nd}{4}-\[{Nd}{2}
[^a-f-[\x00-\x60\u007B-\uFFFF]]+
\p{klsak

To be honest, I'm wondering what the best way of tackling these is. Going
through 600 cases by hand to check whether they conform to the regex grammar
doesn't sound like much fun. There must be a better way, like writing a
JavaCC parser to automate the checking. 
 
Michael Kay


> -----Original Message-----
> From: Henry S. Thompson [mailto:ht@inf.ed.ac.uk] 
> Sent: 16 November 2006 10:33
> To: Michael Kay
> Cc: public-xml-schema-testsuite@w3.org
> Subject: Re: Bug tracking
> 
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Michael Kay writes:
> 
> > Could I suggest that you consider using the W3C bugzilla 
> database for 
> > tracking bugs against the Schema test suite? This is how XQTS have 
> > operated and it's hard to see how they would have managed 
> without it. 
> > I'm going to be reporting a large number of bugs, and it's 
> going to be 
> > very difficult to track progress in resolving them if it's 
> all done by informal email.
> 
> Sounds sensible to me.  I'll get a definitive ruling tomorrow.
> Perhaps you can hold off sending your planned bug reports 
> until I've got that set up.
> 
> > A CVS repository that allows fixed tests to be downloaded 
> as soon as 
> > they are available would also be a great boost.
> 
> That's in place, just waiting for agreed corrections.
> 
> > In reporting test results, the only test results allowed by 
> the schema 
> > appear to be "valid" and "invalid". It would also be useful 
> to allow 
> > "not run" for cases where the processor was unable to run the test 
> > (e.g. because it ran out of memory), and "test bug reported" to 
> > indicate that a bug has been reported against the test in question.
> 
> Yeah. . .
> 
> ht
> - --
>  Henry S. Thompson, HCRC Language Technology Group, 
> University of Edinburgh
>                      Half-time member of W3C Team
>     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 
> 131 650-4440
>             Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk
>                    URL: http://www.ltg.ed.ac.uk/~ht/ [mail 
> really from me _always_ has this .sig -- mail without it is 
> forged spam] -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.2.6 (GNU/Linux)
> 
> iD8DBQFFXD5ikjnJixAXWBoRApIBAJ0S4Uo55zKKzxwpA6VFqxdro/nsSgCggmyw
> 3ADTylpecHqA9fvHyM+QFjA=
> =qqVM
> -----END PGP SIGNATURE-----
Received on Thursday, 16 November 2006 16:47:08 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 8 January 2008 14:21:55 GMT