[whatwg] Parsing entities from Simon Pieters on 2006-08-14 (public-whatwg-archive@w3.org from August 2006)

From: Simon Pieters <zcorpan@hotmail.com>
Date: Mon, 14 Aug 2006 21:18:28 +0000
Message-ID: <BAY109-F11EC5B1B0957042D17EE36B44E0@phx.gbl>

Hi,

From: Ian Hickson <ian@hixie.ch>
>On Mon, 14 Aug 2006, Simon Pieters wrote:
> >
> > I guess that for compat with IE and the Web[1] we have to treat
> > "R&eacutesum&eacute" as if it were "R&eacute;sum&eacute;". So how do we
> > handle "&noti;"? When the parser has come as far as "&not" it can't
> > return U+00AC yet because it could well be "&notin;". But when it has
> > reached "&noti;" then it can't be "&notin;", thus it returns U+00AC, but
> > then you also have to reparse the "i;", right? Unless I'm mistaken the
> > spec doesn't say anything about that.
>
>Section 8.2.3.1 "Tokenising entities", under "Anything else", covers this:
>"Consume the maximum number of characters possible, with the consumed
>characters case-sensitively matching one of the identifiers in the first
>column of the entities table".

I've read that several times. But it doesn't say that "i;" in "&noti;" is 
going to be reparsed. ("noti" isn't an entity, but "not" and "notin" are.)

Regards,
Simon Pieters

Received on Monday, 14 August 2006 14:18:28 UTC