- From: John Lumley <john@saxonica.com>
- Date: Mon, 30 May 2022 19:04:52 +0100
- To: "C. M. Sperberg-McQueen" <cmsmcq@blackmesatech.com>
- Cc: public-ixml@w3.org
Try working on insertion alternates in repetition separators… (which I have test sets for) - it’s proving horrendous!
Sent from my iPad
> On 30 May 2022, at 18:49, C. M. Sperberg-McQueen <cmsmcq@blackmesatech.com> wrote:
>
>
> C. M. Sperberg-McQueen writes:
>
>> For what it's worth, I have been working on tools to perform a simple
>> measurement of the coverage of our test set. The short version:
>>
>> 1 There are (predictably) no occurrences of the nonterminals 'version'
>> and 'prolog', or the choice that involves them, in any of the
>> grammars of the test set.
>>
>> 2 There are no hexadecimal numbers with length > 2.
>>
>> 3 There is only one empty set (an empty inclusion); there are no empty
>> exclusions.
>
> A slightly cleaner coverage report based on the spec grammar of 28 May
> confirms the above and appears to show some additional points worth
> attention. Some of these involve whitespace and are thus a bit
> error-prone to detect (and may thus be false alarms):
>
> - If the nonterminal 'whitespace' is ever realized as a tab character,
> I cannot find it.
>
> - If the nonterminal 'whitespace' is ever realized as a carriage
> return character, I cannot find it.
>
> - If there are empty comments (i.e. {}) in any input grammars in the
> test set, I am not finding them.
>
> - There are (unsurprisingly) no insertions in the test suite.
>
> - My tools currently have trouble with these constructs, but I believe
> there are ten occurrences of "" as instantiations of dchar but no
> occurrences of '' as instantiations of schar.
>
> The automated tools do not look to see whether every character in a
> character set gets used (that would pose problems for a nonterminal like
> cchar, defined as ~["{}"]), but manual checks show:
>
> - There are no grammar rules in the test set that use '=' to separate
> LHS and RHS.
>
> - All three marks are instantiated ('-' 306 times, '@' 106 times, '^'
> three times).
>
> - Tmark is instantiated 307 times as '-' and once as '^'.
>
>
> Michael
>
> --
> C. M. Sperberg-McQueen
> Black Mesa Technologies LLC
> http://blackmesatech.com
>
Received on Monday, 30 May 2022 18:05:08 UTC