[whatwg] Ampersands not followed by ASCII letters or #

On Wed, 27 Dec 2006, Henri Sivonen wrote:
>
> I noticed that the Web Apps spec itself contains script samples with 
> unescaped JavaScript && operators in <pre> blocks.
> 
> Considering that this is not an error in HTML 4.01 as SGML and 
> considering that it is harmless in browsers, I think the top-level 
> "Anything else" case under "8.2.3.1. Tokenising entities" should be 
> split in two so that there is also an error-free case for the ASCII 
> characters that aren't '#', aren't ASCII letters and that weren't in 
> error in SGML-based HTML. I don't have The Handbook at my disposal right 
> now, but the error-free case should cover at least '&', '<' and space 
> characters.

I've allowed:

   U+0009 CHARACTER TABULATION
   U+000A LINE FEED (LF)
   U+000B LINE TABULATION
   U+000C FORM FEED (FF)
   U+0020 SPACE
   U+003C LESS-THAN SIGN
   U+0026 AMPERSAND
   EOF

Let me know if you want any more added to the list.

-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'

Received on Monday, 18 June 2007 17:55:20 UTC