- From: SINGH SATWINDER (516) 434-6274 <satwinder.singh@reuters.com>
- Date: Thu, 08 May 1997 21:04:59 +0000 (GMT)
- To: www-html@w3.org
- Cc: David Ryder <david.ryder@reuters.com>, David Schleifer <david.schleifer@reuters.com>, Joe Koplik <joe.koplik@reuters.com>, Eugene Olafsen <OLAFSEN%AM%RIT@MR.RIT.REUTERS.COM>
Hi, I am in process of writing an HTML parser. After writing my grammar, I found out some HTML facts which may effect the sanity of my grammer. I used HTML reference library to do most of my stuff. I happened to encounter an HTML document which had some constructs in it which doesn't follow the HTML syntax as described but still the standard browsers like IE and Netscape are able to load the HTML without any problem. Here is the example The construct is in following format in the document <DL> This is test1 <DL> This is test2 <\DL> <\DL> but if I refer to the HTML refernce library it should be <DL> (<DT>/<DD>)* </DL> The above mentioned example is one instance which I have encountered, who knows there are how many other undocumented features. Well now the question is, Is HTML grammer rigid or all these browsers are using the undocummented grammer. I would appreciate if somebody can answer my question. thanks in advance sat ===
Received on Thursday, 8 May 1997 16:05:30 UTC