[whatwg] Tokenizer Pseudocode

From: Mohammad Al Houssami (Alumni) <mha53@mail.aub.edu>
Date: Fri, 15 Mar 2013 19:12:43 +0000
To: "whatwg@lists.whatwg.org" <whatwg@lists.whatwg.org>
Message-ID: <0F8BA5A0576A5F44B0D188C2628265DA406A780C@DBXPRD0611MB559.eurprd06.prod.outlook.com>
Hello Everyone,

I just want to confirm: where a branch does not mention a state change, it means the tokenizer stays in the same state, right?
Take the RCDATA state below. In the "Anything else" branch we emit a character token, then go back to consume the next input character and check it against all the cases of this same state.
This is the only reading that makes sense to me, but I just want to make sure :)

Thanks


12.2.4.3 RCDATA state
Consume the next input character<http://www.whatwg.org/specs/web-apps/current-work/multipage/parsing.html#next-input-character>:

U+0026 AMPERSAND (&)
    Switch to the character reference in RCDATA state<http://www.whatwg.org/specs/web-apps/current-work/multipage/tokenization.html#character-reference-in-rcdata-state>.
U+003C LESS-THAN SIGN (<)
    Switch to the RCDATA less-than sign state<http://www.whatwg.org/specs/web-apps/current-work/multipage/tokenization.html#rcdata-less-than-sign-state>.
U+0000 NULL
    Parse error<http://www.whatwg.org/specs/web-apps/current-work/multipage/parsing.html#parse-error>. Emit a U+FFFD REPLACEMENT CHARACTER character token.
EOF
    Emit an end-of-file token.
Anything else
    Emit the current input character<http://www.whatwg.org/specs/web-apps/current-work/multipage/parsing.html#current-input-character> as a character token.
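
The reading asked about above can be sketched as a loop in which the state only changes when a branch says "Switch to". This is a rough Python sketch of that interpretation, not the spec's own pseudocode: the names State, rcdata_step and run_rcdata, the tuple-shaped tokens, and the separate errors list are all hypothetical, and only the three states named in the excerpt are modeled.

```python
from enum import Enum, auto


class State(Enum):
    # Only the states named in the quoted excerpt; a real tokenizer has many more.
    RCDATA = auto()
    CHARACTER_REFERENCE_IN_RCDATA = auto()
    RCDATA_LESS_THAN_SIGN = auto()


EOF = None  # hypothetical sentinel standing in for "no more input"


def rcdata_step(ch, tokens, errors):
    """Handle one consumed character in the RCDATA state; return the next state."""
    if ch is EOF:
        tokens.append(("eof", None))
        return State.RCDATA  # no "Switch to" in this branch
    if ch == "&":
        return State.CHARACTER_REFERENCE_IN_RCDATA
    if ch == "<":
        return State.RCDATA_LESS_THAN_SIGN
    if ch == "\u0000":
        errors.append("unexpected U+0000 in RCDATA")
        tokens.append(("character", "\ufffd"))
        return State.RCDATA  # no "Switch to": stay in the same state
    # Anything else: emit the character and stay in RCDATA.
    tokens.append(("character", ch))
    return State.RCDATA


def run_rcdata(text):
    """Consume characters while the tokenizer remains in RCDATA (or until EOF)."""
    tokens, errors = [], []
    state = State.RCDATA
    pos = 0
    while state is State.RCDATA:
        ch = text[pos] if pos < len(text) else EOF
        pos += 1
        state = rcdata_step(ch, tokens, errors)
        if ch is EOF:
            break  # assumption of this sketch: stop once the end-of-file token is emitted
    return tokens, errors, state
```

With this sketch, run_rcdata("ab<") emits character tokens for "a" and "b" without ever leaving RCDATA, and only the "<" moves it to the RCDATA less-than sign state.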
Received on Friday, 15 March 2013 19:13:14 GMT
