W3C home > Mailing lists > Public > whatwg@whatwg.org > February 2008

[whatwg] [WA1] Insignificant white space

From: Ian Hickson <ian@hixie.ch>
Date: Wed, 13 Feb 2008 01:40:20 +0000 (UTC)
Message-ID: <Pine.LNX.4.62.0802130137490.20115@hixie.dreamhostps.com>
On Fri, 8 Jul 2005, Robin Berjon wrote:
> fantasai wrote:
> > # The whitespace characters U+0020 SPACE, U+000A LINE FEED, and U+000D
> > CARRIAGE # RETURN are always allowed between elements.
> > 
> > What about U+0009 TAB?
> 
> And NEL and VERTICAL TAB?

All the above except NEL have now been space characters for a while.

NEL isn't, mostly because in practice nobody uses it, and adding new space 
characters is moderately expensive. UAs with different sets of space 
characters will end up with different behaviour, e.g. in processing the 
class attribute. It's also desireable for us to have the raw syntax be 
a pure subset of ASCII, so that you can safely code HTML parsers and be 
certain that they won't parse documents syntactically differently based 
just on whether the encoding was correctly guessed or not (so long as 
you're within a subset of ASCII).

Cheers,
-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'
Received on Tuesday, 12 February 2008 17:40:20 UTC

This archive was generated by hypermail 2.3.1 : Monday, 13 April 2015 23:08:39 UTC