At 01:43 PM 9/12/96 GMT, Gavin Nicol wrote:
>>One thing that's hard is the rules about how REs adjacent to PIs are
>David Durand and I independently came up with a good way of dealing
>with such things
I've heard about this but never seen it. Could you or David or
someone please post it to the group? In our informal discussions
before the advent of the WG, figuring out what to do about RS/RE, without
busting our 8879 compliance, was one of the most worrying things.
My "MGML" parsing system does the following:
At any point in the document, the parser knows from the content model
whether data can appear. If so, all bytes that are not markup are data.
If not, all white space is ignored, and non-whitespace non-markup is an
This sacrifices some flexibility, but has the immense advantage that you
can explain it completely in 40 words to both users and programmers. Which
is not the case for the SGML rules. Unfortunately, I don't think this can
ever be made 8879-compliant.
Our goal for XML is something that is crisp and clear as the above, but is also
8879-legal. If someone on the group could give us a flash of creativity on
this, that would be wonderful.
Cheers, Tim Bray
email@example.com http://www.textuality.com/ +1-604-488-1167