[whatwg] Space characters

From: Ian Hickson <ian@hixie.ch>
Date: Mon, 6 Nov 2006 05:34:36 +0000 (UTC)
Message-ID: <Pine.LNX.4.62.0611060531290.18838@dhalsim.dreamhost.com>
On Sun, 5 Nov 2006, Henri Sivonen wrote:
> Is there a reason why the definition of space characters does not match 
> the XML 1.0 and RELAX NG definition of white space (space, tab, CR, LF) 
> but also includes line tabulation and form feed? Is the deviation from 
> XML 1.0 needed for backwards compatibility with text/html UAs?

I made the parser consider VT and FF as being whitespace based on, as I 
recall, a complete examination of every Unicode character's behaviour in 
the parsers I was testing. The definition of "space characters" matches 
the parser's behaviour for consistency.

The definition of "space characters" doesn't affect the XML parser stage 
as far as I can recall, only attribute parsing and DOM conformance.
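
To make the difference between the two definitions concrete, here is a minimal
Python sketch. It assumes only the character lists given in this thread; the
names XML_WHITESPACE, SPACE_CHARACTERS and split_on_space_characters are
hypothetical and used for illustration, not taken from the spec or any parser.

```python
# XML 1.0 / RELAX NG white space, as listed in the quoted question:
# space, tab, CR, LF.
XML_WHITESPACE = frozenset("\u0020\u0009\u000D\u000A")

# "Space characters" as described in this thread: the XML set plus
# line tabulation (VT, U+000B) and form feed (FF, U+000C).
SPACE_CHARACTERS = XML_WHITESPACE | frozenset("\u000B\u000C")


def split_on_space_characters(value):
    """Split a string on runs of space characters.

    Hypothetical helper: a rough stand-in for splitting a
    space-separated attribute value, not the spec's algorithm.
    """
    tokens = []
    current = []
    for ch in value:
        if ch in SPACE_CHARACTERS:
            # A space character ends the current token, if any.
            if current:
                tokens.append("".join(current))
                current = []
        else:
            current.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens
```

Under these assumptions, a value such as "a\x0bb\x0cc" splits into three tokens
with the broader definition, whereas a splitter limited to the XML white space
set would leave it as a single token.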

Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'
Received on Sunday, 5 November 2006 21:34:36 UTC
