[whatwg] Parsing entities from Simon Pieters on 2006-08-14 (public-whatwg-archive@w3.org from August 2006)

From: Simon Pieters <zcorpan@hotmail.com>
Date: Mon, 14 Aug 2006 15:01:43 +0000
Message-ID: <BAY109-F23025A7BBDC187706FA904B44E0@phx.gbl>

Hi,

How are these entities handled?

   &notin;
   &ordf;
   &ordm;
   &piv;
   &sugmaf;
   &sube;
   &sup1;
   &sup2;
   &sup3;
   &supe;
   &thetasym;

Each of these have other other entities whose names are subsets of the 
above:

   &not;
   &or;
   &pi;
   &sigma;
   &sub;
   &sup;
   &theta;

I guess that for compat with IE and the Web[1] we have to treat 
"R&eacutesum&eacute" as if it were "R&eacute;sum&eacute;". So how do we 
handle "&noti;"? When the parser has come as far as "&not" it can't return 
U+00AC yet because it could well be "&notin;". But when it has reached 
"&noti;" then it can't be "&notin;", thus it returns U+00AC, but then you 
also have to reparse the "i;", right? Unless I'm mistaken the spec doesn't 
say anything about that.

[1] http://www.google.com/search?q=R%26eacutesum%C3%A9

Regards,
Simon Pieters

Received on Monday, 14 August 2006 08:01:43 UTC