W3C home > Mailing lists > Public > www-html@w3.org > January 2010

Re: Line separator and Paragraph separator in HTML 5

From: Andrey V. Lukyanov <land@long.yar.ru>
Date: Mon, 18 Jan 2010 15:56:34 +0300 (MSK)
To: www-html@w3.org
Message-ID: <alpine.LFD.2.00.1001181530170.6709@long.yar.ru>
On Mon, 18 Jan 2010, Kent Karlsson wrote:

> So I don't think one should blindly reuse this bidi category for other
> purposes. For HTML5's purposes, I think TAB, VT, LF, CR, NEL, and PS
> should also be considered to be "white space"; i.e. a slightly more
> general sense than the bidi category White_Space/WS. Further, in addition
> to LF and CR, also VT, FF, NEL, LS, and PS should be considered line
> break characters.
>
> I don't see much logic in having both "[HTML5]space" and "White_Space"
> in HTML5. A single set (as described above) would suffice it seems to me...
> (out of which a subset are also line break characters, as above).


"[HTML5]space" has a very clear meaning: these are characters used for 
HTML source formatting; any sequence of "[HTML5]space" is equivalent to 
a single space. Surely "[HTML5]space" should include TAB, VT, LF, FF, 
CR, Space, NEL, LS and PS.

As for the "[HTML5]White_Space" category, its purpose is really unclear. 
The rendering of characters in this category (those that are not 
included in "[HTML5]space") should be defined in the Unicode standard, 
not in the HTML standard.
Received on Monday, 18 January 2010 12:57:55 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 27 March 2012 18:16:17 GMT