- From: Dave Pawson <dave.pawson@gmail.com>
- Date: Thu, 16 Dec 2021 17:31:06 +0000
- To: "C. M. Sperberg-McQueen" <cmsmcq@blackmesatech.com>
- Cc: Norm Tovey-Walsh <norm@saxonica.com>, ixml <public-ixml@w3.org>
Thanks Michael (and Norm) for your patience. On Thu, 16 Dec 2021 at 16:47, C. M. Sperberg-McQueen <cmsmcq@blackmesatech.com> wrote: > > > > > On 16,Dec2021, at 8:30 AM, Dave Pawson <dave.pawson@gmail.com> wrote: > I can try to explain. Please bear with me. > Why does the expression []? denote the language consisting > of the empty sequence? > > Empty brackets are a character inclusion with no members. > The meaning of a character inclusion is that at least one of the > members of the inclusion must match the current input > character (I hope people know what I mean by ’the current > input character', because I am not in a position to offer a simple > clear definition). An empty inclusion cannot satisfy that condition, > so no sequence of input symbols can match an empty inclusion. Grates, but accepted. > So the language recognized by [] is the empty set. In the > set notation I learned in school: {} or ∅. > > Adding a question mark to [] so as to form []? gives us an > expression recognizing a different and larger language. In general, > for any expression E, the expression E? recognizes a language > which contains (a) all the sentences of the language of E, > plus (b) the empty sequence of symbols. As an operation on > sets of sentences, the operation ‘?’ adds the empty sequence > (often written ε) to the set. Which is quite different from DTD's, regex etc (my prior exposure). > > Since [] denotes the set ∅, the expression []? denotes the > union of {} and {ε}. Now, it’s a theorem of set theory that > the union of any set S with {} is S itself. The union of {} and > {ε} is {ε}, the set containing only the empty sequence. Accepted. > > That is why I say that []? is a way of writing the empty sequence. > It recognizes a set of input sequences that contains the empty > sequence and no other sequences. How to avoid having this discussion a thousand times over in the future? Is it feasible to extract 'implementer' notes (with this inclusion) and a simpler (glossed) term for users? Perhaps 'glossed over' more accurately. Estimate: For each MSM there are 10^n DP's | those not exposed to set theory. <grin/> regards -- Dave Pawson XSLT XSL-FO FAQ. Docbook FAQ.
Received on Thursday, 16 December 2021 17:31:29 UTC