- From: Steven Pemberton <steven.pemberton@cwi.nl>
- Date: Thu, 04 Nov 2021 10:39:18 +0000
- To: Dave Pawson <dave.pawson@gmail.com>
- Cc: "C. M. Sperberg-McQueen" <cmsmcq@blackmesatech.com>, ixml <public-ixml@w3.org>
I'm not sure if I exactly understand your comment, but ixml doesn't tokenise at all. It only recognises characters. Steven On Thursday 04 November 2021 09:45:32 (+01:00), Dave Pawson wrote: > A common XSLT processing sequence, plain text to XML (e.g. CSV) is to > tokenize by eol first, then > within line? > > I'd hope you might support this form of processing. > > regards > > On Thu, 4 Nov 2021 at 08:31, Steven Pemberton <steven.pemberton@cwi.nl> wrote: > > > > Good points. > > > > ABC completely denies the existence of end-of-line characters. It delivers input as an array of lines, where the line terminators have been elided. This is because different operating systems use different line end conventions, and the language hides these differences. So there is no way to get a LF delivered. > > > > Steven > > > > On Thursday 04 November 2021 01:32:13 (+01:00), C. M. Sperberg-McQueen wrote: > > > > > I like most of these changes. > > > > > > But having > > > > > > ixml: s, rule+. > > > rule: (mark, s)?, name, s, -[“:=“], s, -alts, -“.”, s. > > > > > > instead of > > > > > > ixml: s, rule+s. > > > rule: (mark, s)?, name, s, -[“:=“], s, -alts, -“.”. > > > > > > has the unfortunate effect that a grammar like > > > > > > { Section 1: …} > > > a: … . > > > b: … . > > > > > > > > > { Section 2: …} > > > z: … . > > > y: … . > > > > > > produces XML in which the comment ‘ Section 2: … ‘ turns up not > > > between the last rule of section 1 and the first rule of section 2, but > > > within the last rule of section 1. > > > > > > Also, I’m curious what the bug involving lf was. > > > > > > Michael > > > > > > > On 3,Nov2021, at 5:02 PM, Steven Pemberton <steven.pemberton@cwi.nl> wrote: > > > > > > > > In an idle moment, I refactored the grammar. Comments gladly received. > > > > Changes: * I hid all nonessential terminals. I know above all Tom was asking for this. > > > > * I moved the spaces from the rule for ixml into the rule for rule. Tidier and more consistent. > > > > * I renamed S to s. > > > > * I simplified 'namestart', since I realised class L covered all the cases. > > > > > > > > I think that's all. > > > > > > > > See attachment. > > > > > > > > Steven<ixml-new.ixml> > > > > > > > > > > > >
Received on Thursday, 4 November 2021 10:39:41 UTC