- From: karthik bala guru <bluekarthik@yahoo.com>
- Date: Wed, 30 Jun 2004 21:38:41 -0700 (PDT)
- To: olczyk@interaccess.com, www-amaya-dev@w3.org
- Cc: www-amaya-dev@w3.org
Hi, I would like to have an Amaya Setup with just the XHTML Parser alone. Can Someone kindly tell some clear steps for extracting the 'Amaya XHTML parser' Alone ? The files to consider and other relevant details. Thanks & Regards, karthik bala guru --- "Thaddeus L. Olczyk" <olczyk@interaccess.com> wrote: > > On Wed, 30 Jun 2004 15:16:50 +0200, Laurent Carcone > <laurent@w3.org> > wrote: > > > > >Hello Thaddeus, > > > >In fact, Amaya uses 2 different parsers, expat for > XHTML documents (and for > >XML documents in general) > Which is of minor interest because this is something > I can already do > quite easily. > > >nd an ad'hoc parser for other HTML documents. > >This parser is specific to Amaya and has no > well-defined API. Nevertheless, > >you can have a look on it in the module > 'amaya/html2thot.c', and particularly > >on the definition of the automaton. > > > Ok. So you've basically answered my last question, > but > the first two are still left unanswered. > > Is the parser relatively bullet proof? I find the > combination of > Tidy+expat simply unusable. Tidy chokes on some > rather > simple From previous experience if there are > problems with > with simple input a system is going to have lots > more problems > when the input scales up. I don't want to be dealing > with tons of > special cases that Tidy can't handle. That's the way > to disaster. > > Is the parser code in Amaya easily extractable? I > once tried > to do the same thing with the pile of -- they call > mozilla, and it was > a disaster. Now that was mozilla, and using anything > from there > is asking for trouble. The question is what about > Amaya? > > > Thaddeus L. Olczyk > ----------------------- > Think twice, code once. > > __________________________________ Do you Yahoo!? Yahoo! Mail is new and improved - Check it out! http://promotions.yahoo.com/new_mail
Received on Thursday, 1 July 2004 00:39:12 UTC