W3C home > Mailing lists > Public > html-tidy@w3.org > July to September 2000

Re: Tidy of Script contents desc += "</td></table>";

From: Daniel Biddle <deltab@osian.net>
Date: Mon, 4 Sep 2000 08:11:55 +0000 (UTC)
To: "Rzepa, Henry" <h.rzepa@ic.ac.uk>
cc: html-tidy@w3.org
Message-ID: <Pine.LNX.4.21.0009040753520.21984-100000@charizard.blazingfast.net>
On Mon, 4 Sep 2000, Rzepa, Henry wrote:

> The August  7 Tidy does the following
> <script type="text/javascript" language="JavaScript">
> desc += "</td></table>";
> </script>
> with the warning
>  line 7 column 16 - Warning: '<' + '/' + letter not allowed here
> and writes out ie
> desc += "<\/td><\/table>";
> Because of course it does not detect eg the <td>, formally its correct,
> but in practice of course the  <td> is written out elsewhere.
> Can one suppress this behaviour?  Is it in fact correct? 

The relevant section in the HTML specification (4.01) is at

| Although the STYLE and SCRIPT elements use CDATA for their data model, for
| these elements, CDATA must be handled differently by user agents. Markup and
| entities must be treated as raw text and passed to the application as is. The
| first occurrence of the character sequence "</" (end-tag open delimiter) is
| treated as terminating the end of the element's content. In valid documents,
| this would be the end tag for the element.

It is the sequence </ (which some programs will treat as the end of the
element, regardless of whether it's actually the right name or not) that
is the problem here -- not specifically an end tag apparently without
a matching start tag.

Tidy's behaviour is therefore correct for HTML4.01 documents. (Though I
expect XHTML is stricter, and requires CDATA element content to use &lt;/

Daniel Biddle <deltab@osian.net>
Received on Monday, 4 September 2000 04:12:18 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:38:48 UTC