- From: James Aylett <sja20@hermes.cam.ac.uk>
 - Date: Sun, 17 Nov 1996 17:45:01 +0000 (GMT)
 - To: jon@oaktree.co.uk
 - cc: www-html@w3.org
 
> >     <p> <!- not a comment, just regular old data characters ->
> 
> While we're here, what does this example mean? Are the
> 'not a comment' characters supposed to be ignored?
> Displayed as text on the user's screen? How about the
> '<!-' characters? If '<!-' is displayable text, how
> about '<*'? '<title*'? The above example opens
> a whole can of worms which the text then simply fails
> to discuss. I've never seen a web browser which will
> display the above example as text.
(The following is a little simplified, but gets the point across - I
hope!)
The opening '<' starts a tag/entity/whatever. If the browser doesn't
recognise it, it shouldn't do anything with it - just find the end of the
tag '>', and skip everything in between. Something starting <! is slightly
different - SGML defines a number of these to mean particular things,
most of which aren't (currently) relevant to HTML.
Therefore the text '<title*' on its own is meaningless; it opens a tag,
but doesn't close it. I imagine most browsers will then skip everything up
to the next '>', assuming that title* is some tag it doesn't understand.
The reason the 'text' (by which I assume you mean the RFC for 2.0, or
perhaps the 3.2 docs, or whatever) doesn't define what to do with
<FLIBBLE> or whatever is that it's already covered by the fact that HTML
is an SGML application. (Technically, of course, putting <FLIBBLE> in
stops it from being HTML, but in terms of a web browser unknown tags
should simply be skipped, to stop newer tags from breaking older
browsers.)
Hope this helps,
James
-- 
/-----------------------------------------------------------------------------\
  James Aylett - Crystal Services (crystal.clare.cam.ac.uk): BBS, Ftp and Web
     Clare College, Cambridge, CB2 1TL -- sja20@cam.ac.uk -- (0976) 212023
Received on Sunday, 17 November 1996 12:44:28 UTC