- From: Matitiahu Allouche <matial@il.ibm.com>
- Date: Sun, 7 Nov 2010 12:44:55 +0200
- To: Amit Aronovitch <aronovitch@gmail.com>
- Cc: public-i18n-bidi@w3.org
- Message-ID: <OFE86076E7.1ACC315A-ONC22577D4.003A6336-C22577D4.003B1877@il.ibm.com>
Entities like &ls; and &ps; seem nice to have. Even without those, we can always use numeric character references like 
 and 
 The really important thing is to convince browser developers to support these characters properly, in whatever form they appear in the text. Shalom (Regards), Mati Bidi Architect Globalization Center Of Competency - Bidirectional Scripts IBM Israel Fax: +972 2 5870333 Mobile: +972 52 2554160 From: Amit Aronovitch <aronovitch@gmail.com> To: public-i18n-bidi@w3.org Date: 05/11/2010 10:04 Subject: Should <br> be removed from the HTML5 spec? Sent by: public-i18n-bidi-request@w3.org Following the correspondence on HTML5 bugs #10828 and #11211, I started to think that maybe a more radical approach is the way to go. Quick summary: 1) The issue at hand is the current problems and incompatibility issues with <br>: * HTML4 spec (implemented by Gecko and Opera) says that <br> should be "soft" (not introduce a bidi paragraph break). * Implementation by IE and Webkit (as well as common usage on websites) assume "hard" <br> (considered bidi paragraph separator). 2) Our suggestion, originally in #10828, was split into two bugs: * #10828 suggests that <br> should be "hard" by default (contrary to HTML4). * #11211 suggests that some way should be provided to produce a "soft <br>". 3) While #10828 seems to be on the road to acceptance, we are having trouble with use cases for #11211: * The "natural" use case (I assume, based on old HTML tutorials, that this was the original purpose of <br> ) is poems. However, while there are *a lot* of websites with poems, I could not find an RTL poem that contains numbers and LTR words (I am sure there exist such sites - would appreciate a link if you find one). Same goes for mail addresses (they do include numbers and Latin words, but the <br>'s are normally positioned in non-sensitive places. * A use case brought up by Adil (comment 18 on #10828), a site displaying a newspaper page, in a way that should match the actual printed page. This was criticized by Ian Hickson for being "a bit of an abuse of HTML". Now, considering the last point, the editor does have a point. These linebreaks are a matter of display. However, the thing that troubles me is that *the same argument applies equally well to the "hard" <br> * (bug #102828). As opposed to <p>, which reflects a logical partition of the text into DOM nodes, <br> is a matter of textual display. Since HTML5 aims to represent the pure logical structure of document, maybe the right place for such things is in the interaction between the contents (text) and CSS. Hence, the new suggestion: (1) Add mandatory entities (temporary names): &ps; for U+2029 (paragraph separator, replacement for "hard" <br>) &ls; for U+2028 (line separator, replacement for "soft" <br>) (2) Remove <br> from HTML5 spec. Add a comment saying that <br> was deprecated, stating explicitly that when upgrading from ÑýHTML4, <br> should be replaced with &ps; if the intended use was a hard break, or with &ls; if the intended use was compliant with the HTML4 spec (i.e. soft break). In a private discussion with Aharon, he said that this approach might, in practice, work against our goal of reaching compatibility. I'm sure he can explain this better than me. The purpose of this post is trying to reach some concensus before posting the idea in bugzilla. thanks for taking the time to read, Amit A.
Received on Sunday, 7 November 2010 10:46:05 UTC