W3C home > Mailing lists > Public > www-style@w3.org > January 2007

Re: [CSS3 Text] word-break

From: MURAKAMI Shinyu <murakami@antenna.co.jp>
Date: Fri, 19 Jan 2007 19:04:47 +0900
To: www-style@w3.org
Message-Id: <20070119183218.3760.MURAKAMI@antenna.co.jp>

On Fri, 19 Jan 2007 04:27:03 +0900
MURAKAMI Shinyu <murakami@antenna.co.jp> wrote:

> > > Antenna House XSL Formatter (yes, I am a developper of it) is an
> > > implementation of the XSL-FO spec but it has extension properties 
> > > that come from CSS3 draft specs (including the obsolete CSS3-text CR).
> > > 
> > > Please see the description of the axf:line-break property:
> > > http://www.antennahouse.com/xslfo/axf4-extension.htm#line-break
> > 
> > Thank you for the reference. Is it ok with Antenna House if I copy
> > information from it to CSS3 Text?
> 
> Yes, but note that the referred Unicode LineBreak data was not recent
> version. (we have to update...)
> 
> For the latest version (5.0.0):
> 
> UAX#14: Line Breaking Properties
> http://www.unicode.org/reports/tr14/
> 
> Line Break property data file
> http://www.unicode.org/Public/UNIDATA/LineBreak.txt
> 
> The Nonstarter characters in LineBreak.txt (version 5.0.0):
> ([JIS] indicates Nonstarter Japanese characters in JIS X 4051)
...

Sorry, I found a problem in the description of the axf:line-break
property of Antenna House XSL Formatter.
http://www.antennahouse.com/xslfo/axf4-extension.htm#line-break
| normal
|    Nonstarter characters in JIS X 4051 and other small Kana letters in
|    Japanese (U+3095, U+3096, U+31F0 to U+31FF) are not treated as
|    Nonstarter characters. ...

I checked JIS X 4051:2004 (latest version of JIS X 4051) and found that
the U+3095, U+3096, U+31F0 to U+31FF were also included in the list of
Nonstarter Japanese characters. It should be corrected to:

    Nonstarter Japanese characters (prolonged sound mark, small hiragana
    letters, small katakana letters, and iteration marks) defined in 
    JIS X 4051 are not treated as Nonstarter characters. ...

The Nonstarter characters in LineBreak.txt (version 5.0.0):
([JIS] indicates Nonstarter Japanese characters in JIS X 4051)
------------------------------------------------------------------------
17D6;NS # KHMER SIGN CAMNUC PII KUUH                        ៖
203C;NS # DOUBLE EXCLAMATION MARK                           ‼
203D;NS # INTERROBANG                                       ‽
2047;NS # DOUBLE QUESTION MARK                              ⁇
2048;NS # QUESTION EXCLAMATION MARK                         ⁈
2049;NS # EXCLAMATION QUESTION MARK                         ⁉
3005;NS # IDEOGRAPHIC ITERATION MARK                        々    [JIS]
301C;NS # WAVE DASH                                         〜
303B;NS # VERTICAL IDEOGRAPHIC ITERATION MARK               〻    [JIS]
303C;NS # MASU MARK                                         〼
3041;NS # HIRAGANA LETTER SMALL A                           ぁ    [JIS]
3043;NS # HIRAGANA LETTER SMALL I                           ぃ    [JIS]
3045;NS # HIRAGANA LETTER SMALL U                           ぅ    [JIS]
3047;NS # HIRAGANA LETTER SMALL E                           ぇ    [JIS]
3049;NS # HIRAGANA LETTER SMALL O                           ぉ    [JIS]
3063;NS # HIRAGANA LETTER SMALL TU                          っ    [JIS]
3083;NS # HIRAGANA LETTER SMALL YA                          ゃ    [JIS]
3085;NS # HIRAGANA LETTER SMALL YU                          ゅ    [JIS]
3087;NS # HIRAGANA LETTER SMALL YO                          ょ    [JIS]
308E;NS # HIRAGANA LETTER SMALL WA                          ゎ    [JIS]
3095;NS # HIRAGANA LETTER SMALL KA                          ゕ    [JIS]
3096;NS # HIRAGANA LETTER SMALL KE                          ゖ    [JIS]
309B;NS # KATAKANA-HIRAGANA VOICED SOUND MARK               ゛
309C;NS # KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK          ゜
309D;NS # HIRAGANA ITERATION MARK                           ゝ    [JIS]
309E;NS # HIRAGANA VOICED ITERATION MARK                    ゞ    [JIS]
30A0;NS # KATAKANA-HIRAGANA DOUBLE HYPHEN                   ゠
30A1;NS # KATAKANA LETTER SMALL A                           ァ    [JIS]
30A3;NS # KATAKANA LETTER SMALL I                           ィ    [JIS]
30A5;NS # KATAKANA LETTER SMALL U                           ゥ    [JIS]
30A7;NS # KATAKANA LETTER SMALL E                           ェ    [JIS]
30A9;NS # KATAKANA LETTER SMALL O                           ォ    [JIS]
30C3;NS # KATAKANA LETTER SMALL TU                          ッ    [JIS]
30E3;NS # KATAKANA LETTER SMALL YA                          ャ    [JIS]
30E5;NS # KATAKANA LETTER SMALL YU                          ュ    [JIS]
30E7;NS # KATAKANA LETTER SMALL YO                          ョ    [JIS]
30EE;NS # KATAKANA LETTER SMALL WA                          ヮ    [JIS]
30F5;NS # KATAKANA LETTER SMALL KA                          ヵ    [JIS]
30F6;NS # KATAKANA LETTER SMALL KE                          ヶ    [JIS]
30FB;NS # KATAKANA MIDDLE DOT                               ・
30FC;NS # KATAKANA-HIRAGANA PROLONGED SOUND MARK            ー    [JIS]
30FD;NS # KATAKANA ITERATION MARK                           ヽ    [JIS]
30FE;NS # KATAKANA VOICED ITERATION MARK                    ヾ    [JIS]
31F0;NS # KATAKANA LETTER SMALL KU                          ㇰ    [JIS]
31F1;NS # KATAKANA LETTER SMALL SI                          ㇱ    [JIS]
31F2;NS # KATAKANA LETTER SMALL SU                          ㇲ    [JIS]
31F3;NS # KATAKANA LETTER SMALL TO                          ㇳ    [JIS]
31F4;NS # KATAKANA LETTER SMALL NU                          ㇴ    [JIS]
31F5;NS # KATAKANA LETTER SMALL HA                          ㇵ    [JIS]
31F6;NS # KATAKANA LETTER SMALL HI                          ㇶ    [JIS]
31F7;NS # KATAKANA LETTER SMALL HU                          ㇷ    [JIS]
31F8;NS # KATAKANA LETTER SMALL HE                          ㇸ    [JIS]
31F9;NS # KATAKANA LETTER SMALL HO                          ㇹ    [JIS]
31FA;NS # KATAKANA LETTER SMALL MU                          ㇺ    [JIS]
31FB;NS # KATAKANA LETTER SMALL RA                          ㇻ    [JIS]
31FC;NS # KATAKANA LETTER SMALL RI                          ㇼ    [JIS]
31FD;NS # KATAKANA LETTER SMALL RU                          ㇽ    [JIS]
31FE;NS # KATAKANA LETTER SMALL RE                          ㇾ    [JIS]
31FF;NS # KATAKANA LETTER SMALL RO                          ㇿ    [JIS]
A015;NS # YI SYLLABLE WU                                    ꀕ
FE54;NS # SMALL SEMICOLON                                   ﹔
FE55;NS # SMALL COLON                                       ﹕
FF1A;NS # FULLWIDTH COLON                                   :
FF1B;NS # FULLWIDTH SEMICOLON                               ;
FF65;NS # HALFWIDTH KATAKANA MIDDLE DOT                     ・
FF67;NS # HALFWIDTH KATAKANA LETTER SMALL A                 ァ    [JIS]
FF68;NS # HALFWIDTH KATAKANA LETTER SMALL I                 ィ    [JIS]
FF69;NS # HALFWIDTH KATAKANA LETTER SMALL U                 ゥ    [JIS]
FF6A;NS # HALFWIDTH KATAKANA LETTER SMALL E                 ェ    [JIS]
FF6B;NS # HALFWIDTH KATAKANA LETTER SMALL O                 ォ    [JIS]
FF6C;NS # HALFWIDTH KATAKANA LETTER SMALL YA                ャ    [JIS]
FF6D;NS # HALFWIDTH KATAKANA LETTER SMALL YU                ュ    [JIS]
FF6E;NS # HALFWIDTH KATAKANA LETTER SMALL YO                ョ    [JIS]
FF6F;NS # HALFWIDTH KATAKANA LETTER SMALL TU                ッ    [JIS]
FF70;NS # HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK  ー    [JIS]
FF9E;NS # HALFWIDTH KATAKANA VOICED SOUND MARK              ゙
FF9F;NS # HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK         ゚
------------------------------------------------------------------------

Best regards,

-- 
Shinyu Murakami
Antenna House XSL Formatter team
http://www.antennahouse.com
Received on Friday, 19 January 2007 10:05:04 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 27 April 2009 13:54:49 GMT