Re: ixml must produce well-formed XML

ixampl doesn't bother to check on any of the errors that the XML parser will pick up. I think it'll happily produce duplicate attributes for instance.

Also worth noting in the example below that if the output text contains "<", it will appear as &lt;

I use similar techniques in my paper on roundtripping, where I note a possible simplification of my serialisation routine by transforming the grammar twice (roundtripping the roundtripping), and only producing textual output (that happens to be XML).

https://cwi.nl/~steven/Talks/2024/06-07-prague/paper.html

Steven

On Monday 03 March 2025 09:46:17 (+01:00), Norm Tovey-Walsh wrote:

> Hello,
> 
> I think this is additional fodder for our discussion of “serialization”.
> 
> I wonder if we should allow an ixml processor to produce a single text node as a result? According to a recent bug report[1], both ixampl and xmq fail to detect that this grammar violates D01 and D06:
> 
> -number = -~[N]*, [N]+, ending.
> -ending = | -~[N], -~[]*.
> 
> Input: abc123yyy
> 
> The user expected the output “123”, but that’s not a well-formed XML document.
> 
>                                         Be seeing you,
>                                           norm
> 
> [1] https://github.com/nineml/nineml/issues/67
> 
> --
> Norm Tovey-Walsh
> Saxonica
> 
> 

Received on Monday, 3 March 2025 12:31:48 UTC