- From: David Carlisle <davidc@nag.co.uk>
- Date: Tue, 27 Nov 2012 12:53:55 +0000
- To: public-microxml@w3.org
In general I'm in favour of having an error recovery option, but I think that we should revisit the data model section of the micro-xml spec if that leads to a different set of element names, or any other differences. For anything like xpath (or potentially schema languages) it is really the data model instances that matter not the constraints on the lexical form of a (micro-)xml document. If the "error recovery" produces a data model that can not be queried or constrained by conforming (micro-)xml tools then the recovery aspect is a bit of a false promise. That is, I think that the tokenisation stage should guarantee to return valid name tokens, either by adding a fixup stage there, or (more likely?) by redefining valid name tokens in the data model to be unconstrained arbitrary strings. If any xml character data is allowed as an element name in the data model then xpath can still query those elements with something like *[name()='.....'] but if the data model is opened up even further to allow any Unicode string as a name, a suitably extended xpath-like query language would probably need some extended quote mechanism to refer to non-xml characters like U+0001 Or perhaps it wouldn't and xpath embedded in a micro-xml document using an error recovery parser could refer to those characters directly. There are various possibilities, which don't have to decided now, but which I do think have a possible impact on the main micro-xml draft (especially if there is consideration for setting that off on REC track at this point). David ________________________________________________________________________ The Numerical Algorithms Group Ltd is a company registered in England and Wales with company number 1249803. The registered office is: Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom. This e-mail has been scanned for all viruses by Star. The service is powered by MessageLabs. ________________________________________________________________________
Received on Tuesday, 27 November 2012 12:54:19 UTC