Re: Error recovery

In general I'm in favour of having an error recovery option, but I think
that we should revisit the data model section of the micro-xml spec if
that leads to a different set of element names, or any other differences.

For anything like xpath (or potentially schema languages) it is really
the data model instances that matter not the constraints on the lexical
form of a (micro-)xml document.

If the "error recovery" produces a data model that can not be queried or
constrained by conforming (micro-)xml tools then the recovery aspect is
a bit of a false promise.

That is, I think that the tokenisation stage should guarantee to return
valid name tokens, either by adding a fixup stage there, or (more
likely?) by redefining valid name tokens in the data model to be
unconstrained arbitrary strings.

If any xml character data is allowed as an element name in the data
model then xpath can still query those elements with something like
*[name()='.....'] but if the data model is opened up even further to
allow any Unicode string as a name, a suitably extended xpath-like query
language would probably need some extended quote mechanism to refer to
non-xml characters like U+0001 Or perhaps it wouldn't and xpath embedded
in a micro-xml document using an error recovery parser could refer to
those characters directly.

There are various possibilities, which don't have to decided now, but
which I do think have a possible impact on the main micro-xml draft
(especially if there is consideration for setting that off on REC track
at this point).


David

________________________________________________________________________
The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.

This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs. 
________________________________________________________________________

Received on Tuesday, 27 November 2012 12:54:19 UTC