W3C home > Mailing lists > Public > whatwg@whatwg.org > August 2006

[whatwg] Parsing entities

From: Ian Hickson <ian@hixie.ch>
Date: Mon, 14 Aug 2006 22:19:08 +0000 (UTC)
Message-ID: <Pine.LNX.4.62.0608142217490.5340@dhalsim.dreamhost.com>
On Mon, 14 Aug 2006, Simon Pieters wrote:
> > 
> > Section 8.2.3.1 "Tokenising entities", under "Anything else", covers this:
> > "Consume the maximum number of characters possible, with the consumed
> > characters case-sensitively matching one of the identifiers in the first
> > column of the entities table".
> 
> I've read that several times. But it doesn't say that "i;" in "&noti;" 
> is going to be reparsed. ("noti" isn't an entity, but "not" and "notin" 
> are.)

In "&noti;", the "maximum number of characters possible" that 
"case-sensitively [match] one of the identifiers in the first column of 
the entities table" is "not".

Thus you consume the "&", the three characters "not", and emit the 
character for "&not;".

-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'
Received on Monday, 14 August 2006 15:19:08 UTC

This archive was generated by hypermail 2.3.1 : Monday, 13 April 2015 23:08:28 UTC