C0 control characters in HTML 5

HTML 5 seems to now allow the entire U+0001..U+001F range, whereas HTML 4.x, 
3.2, and I think 2.0, as defined by their "document character set" and SGML 
profile, have long forbidden all of that range except for tab, LF, CR, and, 
inexplicably, FF.

Why is HTML 5 different, and what are the expectations for the processing of 
the now-allowed BEL, BS, VT, DEL, and so on? If it was deliberate, why not put 
a note of explanation in the spec?

Received on Monday, 18 June 2007 22:09:48 UTC