Re: HTML token

On Tue, 27 Feb 2001, Irawan Tanudirdjo wrote:

> Shallom,
>
> I'm an undergraduate student of Computer Science
> from Surabaya, Indonesia.
>
> Right now, I'm having a compiler class project to
> create a HTML interpreter and viewer. So, I would
> like to ask about HTML tokens, lexemes, regular
> expression and grammar.
>
> Could anyone help me point out in the web, the
> documentation that contains the above specification?

There is no such documentation on the web that I know of.  I suggested
getting a copy of "The SGML handbook" by Charles F. Goldfarb.  It should
contain everything you need to know to write an SGML, and hence HTML 4.0
parser.

I should point out that there is a good chance that HTML 4.0 is not
defined by a context free grammer.

For XHTML, you can read "The Annotated XML Specification" at
<http://www.xml.com/pub/a/axml/axmlintro.html>.

-- 
Russell O'Connor                        roconnor@alumni.uwaterloo.ca
           <http://www.math.berkeley.edu/~roconnor/>
``Paradoxically, a refusal to `put a monetary value on life' means that
life is often undervalued.'' -- Artificial Intelligence: A Modern Approach

Received on Tuesday, 27 February 2001 15:32:40 UTC