Re: [CSS3 Text] word-break

On Fri, 19 Jan 2007 00:35:14 +1300
fantasai <fantasai.lists@inkedblade.net> wrote:

> 
> MURAKAMI Shinyu wrote:
> > Hi fantasai,
> > thank you very much for replying to my comments.
> > 
> > On Wed, 10 Jan 2007 08:18:53 +1300
> > fantasai <fantasai.lists@inkedblade.net> wrote:
> > 
> >> MURAKAMI Shinyu wrote:
> >>> My proposal is to separate "strict" from "normal". The new definition is 
> >>> the following.
> >>>
> >>> Name: word-break
> >>> Value: normal | strict | keep-all | loose | break-all | break-strict
> >>> Initial: normal 
> >>>
> >>> normal
> >>>     The UA determines which set of line-breaking restrictions to use.
> >>>     It may be equivalent to either 'strict' or 'loose'.
> >> The problem with this is that it's not interoperable. (We want CSS
> >> implementations to be consistent, which is why 'normal' means one
> >> thing only, not a choice of two things.)
> > 
> > But the CSS spec already has many UA-dependent things, such as
> > font-family (initial value: depends on user agent).
> 
> The UA-dependent things tend to be those that are system-dependent.
> Fonts, for example, are system-dependent: what font is used depends
> on what fonts are available on your system. CSS can't specify the
> default value for 'font-family' because there is no font that is
> guaranteed to be available on all systems that run a CSS processor.
> 
>  > Allowing or disallowing kana at the beginning of a line is a very
>  > trivial thing for most people, and the font-family, serif or sans-serif
>  > (mincho or gothic, in Japanese typefaces), is much more important.
> 
> This setting is too trivial to justify a user preference, and it is not
> system-dependent. As far as web authors are concerned, any differences
> between implementations are a pain (they want to rely on defaults being
> consistent), so we want to minimize the allowance for such differences.
> What benefit is there to allowing UAs to have different default values
> for word-break?

"Strict" line breaking is not suitable for small-screen devices or
narrow columns such as those in newspapers. In such media, "loose" line
breaking is more readable. If the default is UA-dependent, the UA can
choose the line-breaking strictness best suited to its target medium.

> 
> >> While we're on this topic, if you could specify what breaks are disallowed
> >> in strict that are allowed in loose, that would be very helpful information
> >> to put in the spec.
> > 
> > Antenna House XSL Formatter (yes, I am a developer of it) is an
> > implementation of the XSL-FO spec but it has extension properties 
> > that come from CSS3 draft specs (including the obsolete CSS3-text CR).
> > 
> > Please see the description of the axf:line-break property:
> > http://www.antennahouse.com/xslfo/axf4-extension.htm#line-break
> 
> Thank you for the reference. Is it ok with Antenna House if I copy
> information from it to CSS3 Text?

Yes, but note that the Unicode LineBreak data it refers to is not the
latest version. (We have to update it...)

For the latest version (5.0.0):

UAX#14: Line Breaking Properties
http://www.unicode.org/reports/tr14/

Line Break property data file
http://www.unicode.org/Public/UNIDATA/LineBreak.txt

The Nonstarter characters in LineBreak.txt (version 5.0.0):
([JIS] indicates Nonstarter Japanese characters in JIS X 4051)
------------------------------------------------------------------------
17D6;NS # KHMER SIGN CAMNUC PII KUUH                        ៖
203C;NS # DOUBLE EXCLAMATION MARK                           ‼
203D;NS # INTERROBANG                                       ‽
2047;NS # DOUBLE QUESTION MARK                              ⁇
2048;NS # QUESTION EXCLAMATION MARK                         ⁈
2049;NS # EXCLAMATION QUESTION MARK                         ⁉
3005;NS # IDEOGRAPHIC ITERATION MARK                        々    [JIS]
301C;NS # WAVE DASH                                         〜
303B;NS # VERTICAL IDEOGRAPHIC ITERATION MARK               〻
303C;NS # MASU MARK                                         〼
3041;NS # HIRAGANA LETTER SMALL A                           ぁ    [JIS]
3043;NS # HIRAGANA LETTER SMALL I                           ぃ    [JIS]
3045;NS # HIRAGANA LETTER SMALL U                           ぅ    [JIS]
3047;NS # HIRAGANA LETTER SMALL E                           ぇ    [JIS]
3049;NS # HIRAGANA LETTER SMALL O                           ぉ    [JIS]
3063;NS # HIRAGANA LETTER SMALL TU                          っ    [JIS]
3083;NS # HIRAGANA LETTER SMALL YA                          ゃ    [JIS]
3085;NS # HIRAGANA LETTER SMALL YU                          ゅ    [JIS]
3087;NS # HIRAGANA LETTER SMALL YO                          ょ    [JIS]
308E;NS # HIRAGANA LETTER SMALL WA                          ゎ    [JIS]
3095;NS # HIRAGANA LETTER SMALL KA                          ゕ
3096;NS # HIRAGANA LETTER SMALL KE                          ゖ
309B;NS # KATAKANA-HIRAGANA VOICED SOUND MARK               ゛
309C;NS # KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK          ゜
309D;NS # HIRAGANA ITERATION MARK                           ゝ    [JIS]
309E;NS # HIRAGANA VOICED ITERATION MARK                    ゞ    [JIS]
30A0;NS # KATAKANA-HIRAGANA DOUBLE HYPHEN                   ゠
30A1;NS # KATAKANA LETTER SMALL A                           ァ    [JIS]
30A3;NS # KATAKANA LETTER SMALL I                           ィ    [JIS]
30A5;NS # KATAKANA LETTER SMALL U                           ゥ    [JIS]
30A7;NS # KATAKANA LETTER SMALL E                           ェ    [JIS]
30A9;NS # KATAKANA LETTER SMALL O                           ォ    [JIS]
30C3;NS # KATAKANA LETTER SMALL TU                          ッ    [JIS]
30E3;NS # KATAKANA LETTER SMALL YA                          ャ    [JIS]
30E5;NS # KATAKANA LETTER SMALL YU                          ュ    [JIS]
30E7;NS # KATAKANA LETTER SMALL YO                          ョ    [JIS]
30EE;NS # KATAKANA LETTER SMALL WA                          ヮ    [JIS]
30F5;NS # KATAKANA LETTER SMALL KA                          ヵ    [JIS]
30F6;NS # KATAKANA LETTER SMALL KE                          ヶ    [JIS]
30FB;NS # KATAKANA MIDDLE DOT                               ・
30FC;NS # KATAKANA-HIRAGANA PROLONGED SOUND MARK            ー    [JIS]
30FD;NS # KATAKANA ITERATION MARK                           ヽ    [JIS]
30FE;NS # KATAKANA VOICED ITERATION MARK                    ヾ    [JIS]
31F0;NS # KATAKANA LETTER SMALL KU                          ㇰ
31F1;NS # KATAKANA LETTER SMALL SI                          ㇱ
31F2;NS # KATAKANA LETTER SMALL SU                          ㇲ
31F3;NS # KATAKANA LETTER SMALL TO                          ㇳ
31F4;NS # KATAKANA LETTER SMALL NU                          ㇴ
31F5;NS # KATAKANA LETTER SMALL HA                          ㇵ
31F6;NS # KATAKANA LETTER SMALL HI                          ㇶ
31F7;NS # KATAKANA LETTER SMALL HU                          ㇷ
31F8;NS # KATAKANA LETTER SMALL HE                          ㇸ
31F9;NS # KATAKANA LETTER SMALL HO                          ㇹ
31FA;NS # KATAKANA LETTER SMALL MU                          ㇺ
31FB;NS # KATAKANA LETTER SMALL RA                          ㇻ
31FC;NS # KATAKANA LETTER SMALL RI                          ㇼ
31FD;NS # KATAKANA LETTER SMALL RU                          ㇽ
31FE;NS # KATAKANA LETTER SMALL RE                          ㇾ
31FF;NS # KATAKANA LETTER SMALL RO                          ㇿ
A015;NS # YI SYLLABLE WU                                    ꀕ
FE54;NS # SMALL SEMICOLON                                   ﹔
FE55;NS # SMALL COLON                                       ﹕
FF1A;NS # FULLWIDTH COLON                                   ：
FF1B;NS # FULLWIDTH SEMICOLON                               ；
FF65;NS # HALFWIDTH KATAKANA MIDDLE DOT                     ･
FF67;NS # HALFWIDTH KATAKANA LETTER SMALL A                 ｧ    [JIS]
FF68;NS # HALFWIDTH KATAKANA LETTER SMALL I                 ｨ    [JIS]
FF69;NS # HALFWIDTH KATAKANA LETTER SMALL U                 ｩ    [JIS]
FF6A;NS # HALFWIDTH KATAKANA LETTER SMALL E                 ｪ    [JIS]
FF6B;NS # HALFWIDTH KATAKANA LETTER SMALL O                 ｫ    [JIS]
FF6C;NS # HALFWIDTH KATAKANA LETTER SMALL YA                ｬ    [JIS]
FF6D;NS # HALFWIDTH KATAKANA LETTER SMALL YU                ｭ    [JIS]
FF6E;NS # HALFWIDTH KATAKANA LETTER SMALL YO                ｮ    [JIS]
FF6F;NS # HALFWIDTH KATAKANA LETTER SMALL TU                ｯ    [JIS]
FF70;NS # HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK  ｰ    [JIS]
FF9E;NS # HALFWIDTH KATAKANA VOICED SOUND MARK              ﾞ
FF9F;NS # HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK         ﾟ
------------------------------------------------------------------------
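
For anyone who wants to regenerate a list like the one above from a
newer LineBreak.txt, here is a minimal Python sketch. It is only an
illustration, not how XSL Formatter does it. The function names
(nonstarters, break_allowed_before) and the SAMPLE lines are made up
for this example, and the strict/loose test is a deliberate
simplification: it assumes that 'strict' forbids a break before any NS
character and 'loose' allows it, and it ignores every other UAX#14 rule.

```python
import re

# A few illustrative lines in the LineBreak.txt format (code;class # name).
# The real file also uses "first..last" ranges, so one is included here.
SAMPLE = """\
3041;NS # HIRAGANA LETTER SMALL A
3042;ID # HIRAGANA LETTER A
30FC;NS # KATAKANA-HIRAGANA PROLONGED SOUND MARK
31F0..31FF;NS # KATAKANA LETTER SMALL KU..KATAKANA LETTER SMALL RO
"""

# Matches "XXXX;CLASS" or "XXXX..YYYY;CLASS" at the start of a line.
LINE_RE = re.compile(r"^([0-9A-F]{4,6})(?:\.\.([0-9A-F]{4,6}))?;(\w+)")


def nonstarters(text):
    """Return the set of code points whose line break class is NS."""
    result = set()
    for line in text.splitlines():
        m = LINE_RE.match(line.strip())
        if not m or m.group(3) != "NS":
            continue  # comment, blank line, or another class
        first = int(m.group(1), 16)
        last = int(m.group(2) or m.group(1), 16)
        result.update(range(first, last + 1))
    return result


def break_allowed_before(ch, ns_set, strictness="strict"):
    """Simplified check: may a line start with ch?

    Assumption for this sketch: only the NS class is considered.
    'strict' forbids a break before an NS character, 'loose' allows it.
    """
    if strictness == "strict":
        return ord(ch) not in ns_set
    return True


NS = nonstarters(SAMPLE)
```

With the real data file, you would read its contents and pass the text
to nonstarters() instead of SAMPLE.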

Best regards,

-- 
Shinyu Murakami
Antenna House XSL Formatter team
http://www.antennahouse.com

Received on Thursday, 18 January 2007 19:27:01 UTC