- From: Markku Savela <msa@msa.tte.vtt.fi>
- Date: Wed, 14 Aug 1996 17:28:34 +0300 (EET DST)
- To: frystyk@w3.org
- CC: www-lib@w3.org, Vincent.Quint@imag.fr, connolly@w3.org
>Having the SGML parser being simply a tokenizer is a good idea, I
>know that Dan has been working on that for some time. You can find
>documentation on this at
> http://www.w3.org/pub/WWW/MarkUp/SGML/sgml-lex/sgml-lex

Yes, but as of now SGML.c tokenizes quite okay(?), yet fails on
higher-level checks. Do we need to mix lex into this as well?

All that is needed in SGML.c is to strip away all the extra stuff. It
does not need to stack elements, does not need to worry about matching
begin/end tags, etc.

However, as some may depend on the current workings of SGML.c, perhaps
the library should just include the stripped-down SGML.c under some
other name, which those of us who have our own higher-level
"normalizers" on top of the old SGML.c could activate instead?

(I could do this myself, but I hate having too many modified modules
that I have to replace and check every time a new library version
comes out.)

-- 
Markku Savela (msa@hemuli.tte.vtt.fi), Technical Research Centre of Finland
Multimedia Systems, P.O.Box 1203,FIN-02044 VTT,http://www.vtt.fi/tte/staff/msa/
Received on Wednesday, 14 August 1996 10:29:00 UTC