W3C home > Mailing lists > Public > public-ixml@w3.org > December 2022

Re: Character classes tests

From: Norm Tovey-Walsh <norm@saxonica.com>
Date: Sat, 17 Dec 2022 16:30:26 +0000
To: Steven Pemberton <steven.pemberton@cwi.nl>
Cc: public-ixml@w3.org
Message-ID: <m2tu1u2dru.fsf@saxonica.com>
Steven Pemberton <steven.pemberton@cwi.nl> writes:
> I just realised that this grammar is ambiguous, because LC is a subset of L, so should be
> replaced by two tests:
>
> ================
> input: char*. -char: C; L; M; N; P; S; Z; Unassigned. Unassigned: ~[C; L; M; N; P; S;
> Z]. C: [C]. L: [L]. M: [M]. N: [N]. P: [P]. S: [S]. Z: [Z].

Okay.

> ===================
> input: char*. -char: C; L; LC; Lm ; Lo; M; N; P; S; Z; Unassigned.
> Unassigned: ~[C; L; LC; Lm ; Lo; M; N; P; S; Z].
> C: [C]. L: [L]. LC: [LC].
> Lm: [Lm].
> Lo: [Lo]. M: [M]. N: [N]. P: [P]. S: [S]. Z: [Z].

Doesn’t this test exhibit the same ambiguity since it still has L and LC?

                                        Be seeing you,
                                          norm

--
Norm Tovey-Walsh
Saxonica

Received on Saturday, 17 December 2022 16:31:23 UTC

This archive was generated by hypermail 2.4.0 : Saturday, 17 December 2022 16:31:24 UTC