- From: Graydon <graydonish@gmail.com>
- Date: Sun, 27 Aug 2023 12:34:32 -0400
- To: "C. M. Sperberg-McQueen" <cmsmcq@blackmesatech.com>
- Cc: Norm Tovey-Walsh <norm@saxonica.com>, public-ixml@w3.org
On Sun, Aug 27, 2023 at 09:43:40AM -0600, C. M. Sperberg-McQueen scripsit: > Graydon <graydonish@gmail.com> writes: > > ∅ (U+2205 EMPTY SET) might be another character possibility. > > Please no. In some discussions of formal languages (at least), ∅ is > used to denote the empty language -- that is, the language that has no > sentences. It is not the same as the language consisting of the empty > string '', which is often denoted ε (U+0385 GREEK SMALL LETTER EPSILON), > as in Norm's proposal. I was unaware! That would indeed make things confusing; suggestion withdrawn. [snip] > For me, part of the appeal of ixml is its relative simplicity. Adding > keywords or symbols that do not change the expressive power of the > language is a slippery slope. I won't say never, but I'm skeptical. I think "as simple as possible, but no simpler" is a supportable principle. Having meaningful implicit nothingness is simple in a "language rules" sense but not simple in a "using the language" sense. It creates a category of syntax errors which can't be reported by machine parsing because there's nothing mechanically wrong with the syntax even when there is a significant error in meaning. A rule for "No meaningful implicit nothingness" (which is how I understand Norm's proposal) strikes me as a large gain in utility. Such a rule need not include a standard representation of nothingness, fundamentally it just has to require the parser to throw an error. I would prefer a standard representation of nothingness, but I need the error. I use ixml in production to test that generated text conforms to agreed patterns of natural language. I don't think the second time I do this the grammar will be more effort than the code that generates the text, but it isn't going to become trivial, either. "Making ixml easier to use" might not be a sensible goal—grammars are not on the list of things easy to use—but I think introducing this particular error condition is a large gain for small cost. -- Graydon -- Graydon Saunders | graydonish@fastmail.com Þæs oferéode, ðisses swá mæg. -- Deor ("That passed, so may this.")
Received on Sunday, 27 August 2023 16:34:41 UTC