
Re: [whatwg] Tokenizor PseudoCode

From: Mohammad Al Houssami (Alumni) <mha53@mail.aub.edu>
Date: Fri, 15 Mar 2013 23:39:48 +0000
To: Bjoern Hoehrmann <derhoermi@gmx.net>
Message-ID: <0F8BA5A0576A5F44B0D188C2628265DA406A7866@DBXPRD0611MB559.eurprd06.prod.outlook.com>
Cc: "whatwg@lists.whatwg.org" <whatwg@lists.whatwg.org>
I'm trying to build an HTML5 parser in Smalltalk. As a first step I'm implementing the tokenizer, so for now everything happens there. I think the tree construction stage only affects the tokenizer state when scripts add characters to the HTML document, which is outside the scope of the project I am working on at the moment. Is that correct?

-----Original Message-----
From: Bjoern Hoehrmann [mailto:derhoermi@gmx.net] 
Sent: Friday, March 15, 2013 11:30 PM
To: Mohammad Al Houssami (Alumni)
Cc: whatwg@lists.whatwg.org
Subject: Re: [whatwg] Tokenizor PseudoCode

* Mohammad Al Houssami (Alumni) wrote:
>I just want to make sure that in places where no state change is called 
>it means we stay in the same state right?

You missed "When a token is emitted, it must immediately be handled by the tree construction stage. The tree construction stage can affect the state of the tokenization stage ..." but if that does not result in a change of state either, then yes, as far as I am aware.
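To make the quoted sentence concrete, here is a minimal sketch in Python (not Smalltalk, and not the specification's algorithm). It is a toy tokenizer with only four states, and all names in it (`State`, `tree_construction`, `tokenize`) are made up for illustration. It shows the two cases: a step that names no state change leaves the state alone, while the handler called on emit may switch it. The switch to RCDATA after a `title` or `textarea` start tag is what the tree construction stage does in the specification; everything else is heavily simplified (no end tags, attributes, or character references).

```python
from enum import Enum, auto


class State(Enum):
    DATA = auto()
    TAG_OPEN = auto()
    TAG_NAME = auto()
    RCDATA = auto()


def tree_construction(token):
    """Hypothetical stand-in for the tree construction stage.

    Returns a new tokenizer state, or None to leave the state unchanged.
    """
    kind, value = token
    if kind == "start_tag" and value in ("title", "textarea"):
        return State.RCDATA
    return None


def tokenize(text, handle=tree_construction):
    state = State.DATA
    tokens = []
    name = ""

    def emit(token):
        # The token is handled immediately; the handler may change the state.
        nonlocal state
        tokens.append(token)
        new_state = handle(token)
        if new_state is not None:
            state = new_state

    for ch in text:
        if state is State.DATA:
            if ch == "<":
                state = State.TAG_OPEN
            else:
                emit(("char", ch))  # no switch named: stay in DATA
        elif state is State.TAG_OPEN:
            if ch.isalpha():
                name = ch.lower()
                state = State.TAG_NAME
            else:
                # Toy simplification: treat "<" plus this character as text.
                state = State.DATA
                emit(("char", "<"))
                emit(("char", ch))
        elif state is State.TAG_NAME:
            if ch == ">":
                state = State.DATA          # switch first ...
                emit(("start_tag", name))   # ... then the handler may override
            else:
                name += ch.lower()          # no switch named: stay in TAG_NAME
        elif state is State.RCDATA:
            # Toy simplification: everything is text; end tags are not handled.
            emit(("char", ch))
    return tokens
```

With the default handler, `tokenize("<p><b>")` yields two start-tag tokens, whereas in `tokenize("<title><b>")` the `<b>` comes out as three character tokens, because the handler moved the tokenizer to RCDATA when the `title` start tag was emitted. No script is involved in that switch.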
Björn Höhrmann · mailto:bjoern@hoehrmann.de · http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/
Received on Friday, 15 March 2013 23:40:21 UTC
