W3C home > Mailing lists > Public > whatwg@whatwg.org > August 2006

[whatwg] Parsing entities

From: Simon Pieters <zcorpan@hotmail.com>
Date: Mon, 14 Aug 2006 15:01:43 +0000
Message-ID: <BAY109-F23025A7BBDC187706FA904B44E0@phx.gbl>

How are these entities handled?


Each of these have other other entities whose names are subsets of the 


I guess that for compat with IE and the Web[1] we have to treat 
"R&eacutesum&eacute" as if it were "R&eacute;sum&eacute;". So how do we 
handle "&noti;"? When the parser has come as far as "&not" it can't return 
U+00AC yet because it could well be "&notin;". But when it has reached 
"&noti;" then it can't be "&notin;", thus it returns U+00AC, but then you 
also have to reparse the "i;", right? Unless I'm mistaken the spec doesn't 
say anything about that.

[1] http://www.google.com/search?q=R%26eacutesum%C3%A9

Simon Pieters
Received on Monday, 14 August 2006 08:01:43 UTC

This archive was generated by hypermail 2.3.1 : Monday, 13 April 2015 23:08:28 UTC