- From: Joe English <jenglish@crl.com>
- Date: Mon, 23 Sep 1996 10:52:55 -0700
- To: w3c-sgml-wg@w3.org
(The discussion so far has focussed on RS/RE handling, but there is also a problem with separator characters _other_ than record-ends in mixed vs. element content.) Paul Prescod <papresco@calum.csclub.uwaterloo.ca> wrote: > Joe seems to be proposing that if we > > * restrict PIs and comments to element content > * restrict mixed-content-models to "|" [ actually, "restrict mixed content to OR groups with a REP occurrence indicator and which only contain primitive content tokens", or something equivalent; IOW, "no pernicious mixed content" ] > * disallow inclusion exceptions There's one more rule (which is the important one): * disallow separator characters in element content This is because things like: <a> <b>blah</b> <b>blah</b> </a> have different meanings depending on whether A has mixed content or element content. The record-end after the first </b> end-tag is significant in the former case, and is ignored in the latter. If A has element content, the above would have to be written like: <a><b>blah</b><b>blah</b></a> or <a ><b>blah</b ><b>blah</b ></a> instead. > then we can reduce the RS/RE handling rules to "Robert's Rules" ( =) ) of > > In data content: > 1. If an element begins or ends with a newline [not entirely > accurate, but this is what people see], the newline is ignored. > 2. Newlines inside markup are ignored. > 3. All other newlines are passed on. Yes, as far as I can tell. Charles' proposal is similar: * restrict PIs and comment declarations to element content * disallow mixed content * disallow inclusion exceptions (or perhaps, disallow included subelements in pseudoelement content). * require data content to be delimited The chief difference is in the second and fourth rules. With these restrictions the RS/RE/separator character rules are even simpler: 1. Delimited separator characters are data. 2. Undelimited separator characters are ignored. --Joe English jenglish@crl.com
Received on Monday, 23 September 1996 13:52:39 UTC