W3C home > Mailing lists > Public > public-html@w3.org > June 2007

Re: C0 control characters in HTML 5

From: Ian Hickson <ian@hixie.ch>
Date: Fri, 22 Jun 2007 04:50:14 +0000 (UTC)
To: Mike Brown <mike@skew.org>
Cc: public-html@w3.org
Message-ID: <Pine.LNX.4.64.0706220448331.31033@dhalsim.dreamhost.com>

On Mon, 18 Jun 2007, Mike Brown wrote:
> HTML 5 seems to now allow the entire U+0001..U+001F range, whereas HTML 
> 4.x, 3.2, and I think 2.0, as defined by their "document character set" 
> and SGML profile, have long forbidden all of that range except for tab, 
> LF, CR, and, inexplicably, FF.
> Why is HTML 5 different, and what are the expectations for the 
> processing of the now-allowed BEL, BS, VT, DEL, and so on? If it was 
> deliberate, why not put a note of explanation in the spec?

It was deliberate only insofar as I didn't come across any reason to 
disallow them. The expectations for their processing are unaffected by 
whether they are allowed or not.

What would the note explain?

Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'
Received on Friday, 22 June 2007 04:52:44 UTC

This archive was generated by hypermail 2.3.1 : Thursday, 29 October 2015 10:15:22 UTC