W3C home > Mailing lists > Public > public-html@w3.org > June 2007

C0 control characters in HTML 5

From: Mike Brown <mike@skew.org>
Date: Mon, 18 Jun 2007 16:09:24 -0600 (MDT)
Message-Id: <200706182209.l5IM9OuQ005779@chilled.skew.org>
To: public-html@w3.org

HTML 5 seems to now allow the entire U+0001..U+001F range, whereas HTML 4.x, 
3.2, and I think 2.0, as defined by their "document character set" and SGML 
profile, have long forbidden all of that range except for tab, LF, CR, and, 
inexplicably, FF.

Why is HTML 5 different, and what are the expectations for the processing of 
the now-allowed BEL, BS, VT, DEL, and so on? If it was deliberate, why not put 
a note of explanation in the spec?
Received on Monday, 18 June 2007 22:09:48 UTC

This archive was generated by hypermail 2.3.1 : Thursday, 29 October 2015 10:15:22 UTC