HTML question from SINGH SATWINDER (516) 434-6274 on 1997-05-08 (www-html@w3.org from May 1997)

From: SINGH SATWINDER (516) 434-6274 <satwinder.singh@reuters.com>
Date: Thu, 08 May 1997 21:04:59 +0000 (GMT)
To: www-html@w3.org
Cc: David Ryder <david.ryder@reuters.com>, David Schleifer <david.schleifer@reuters.com>, Joe Koplik <joe.koplik@reuters.com>, Eugene Olafsen <OLAFSEN%AM%RIT@MR.RIT.REUTERS.COM>
Message-Id: <9259041608051997/A51001/RIT1/11B544041700*@MHS>

Hi,

I am in process of writing an HTML parser. After writing my grammar, I
found out some HTML facts which may effect the sanity of my grammer. I
used HTML reference library to do most of my stuff. I happened to
encounter an HTML document which had some constructs in it which
doesn't follow the HTML syntax as described but still the standard
browsers like IE and Netscape are able to load the HTML without any
problem.
Here is the example

The construct is in following format in the document

<DL>
This is test1
<DL>
This is test2
<\DL>
<\DL>

but if I refer to the HTML refernce library it should be 

<DL>
(<DT>/<DD>)*
</DL>

The above mentioned example is one instance which I have encountered,
who knows there are how many other undocumented features.

Well now the question is, Is HTML grammer rigid or all these browsers
are using the undocummented grammer. 

I would appreciate if somebody can answer my question.

thanks in advance

sat


===

Received on Thursday, 8 May 1997 16:05:30 UTC