Re: New tests: Line breaks in CSS3 Text

On Fri, Aug 31, 2012 at 12:53 AM, Richard Ishida <ishida@w3.org> wrote:

> Several sets of tests are now available related to line breaks, word
> breaks and hyphenation in CSS3 Text and Unicode Standard Annex #14 (Line
> Breaking Properties). Some of the tests are rewrites of previous tests in
> the Internationalization Activity test suite. There are also many new tests.
>

(1) all of the Conditional Japanese Starter (CJ) test sets are missing
U+FF70, recently added in the latest ED [1][2];

[1] http://dev.w3.org/csswg/css3-text/#line-break
[2] http://lists.w3.org/Archives/Public/www-style/2012Aug/0860.html

(2) the tests listed under Conditional Japanese Starter (CJ) for
line-break: normal [3] are incorrect since CSS3 Text states that for *normal
* and *loose*, that breaks before small kana and prolonged sound marks are
permitted (not forbidden); these tests currently treat according to
*strict* semantics
(forbidden), which is wrong (according to spec);

[3]
http://www.w3.org/International/tests/html-css/line-break-ja-zh/results-ja#normal


(3) the tests for U+301C (WAVE DASH) and U+30A0 (KATAKANA-HIRAGABNA DOUBLE
HYPHEN) listed under (Non-starter) NS for line-break: normal [3] are
incorrect since CSS3 Text states that for *normal* and *loose*, that breaks
before these characters are permitted (not forbidden); these tests
currently treat according to *strict* semantics (forbidden), which is wrong
(according to spec);

(4) the tests for iteration marks {U+3005, U+303B, U+309D, U+309E, U+30FD,
U+30FE} and also exclamation/question marks {U+203C, U+2047, U+2048,
U+2049} listed under Non-starter (NS) for line-break: normal [3] are
incorrect since CSS3 Text states that for *normal* and *strict*, that
breaks before these characters are forbidden (not permitted); these tests
currently treat according to *loose* semantics (permitted), which is wrong
(according to spec);

(5) the tests for U+2010 (HYPHEN) and U+2013 (EN DASH) listed under under
Break After (BA) for line-break: normal [3] are incorrect since CSS3 Text
states that for *normal* and *loose*, that breaks before these characters
are permitted (not forbidden); these tests currently treat according to *
strict* semantics (forbidden), which is wrong (according to spec);

(6) the tests for exclamation/question marks {U+0021, U+003F, U+FF1F}
listed under Exclamation (EX) for line-break: normal [3] are incorrect
since CSS3 Text states that for *normal* and *strict*, that breaks before
these characters are forbidden (not permitted); these tests currently treat
according to *loose* semantics (permitted), which is wrong (according to
spec);

(7) the tests for U+003A (COLON) and U+003B (SEMICOLON) listed under under
Infix Numeric Separator (IS) for line-break: normal [3] are incorrect since
CSS3 Text states that for *normal* and *strict*, that breaks before these
characters are forbidden (not permitted); these tests currently treat
according to *loose* semantics (permitted), which is wrong (according to
spec);

(8) the tests for postfix characters {U+0025, U+00A2, U+00B0, U+2030,
U+2032, U+2033, U+2103, U+FF05, U+FFE0} listed under Postfix Numeric (XB)
for line-break: normal [3] are incorrect since CSS3 Text states that for *
normal* and *strict*, that breaks before these characters are forbidden
(not permitted); these tests currently treat according to *loose* semantics
(permitted), which is wrong (according to spec);

(9) for each of the strict, normal, loose test sets, there seems to be a
redundant test for U+0025 PERCENT SIGN, cf.,
line-break-ja-zh-{050,053}, line-break-ja-zh-{150,153},
line-break-ja-zh-{250,253};

(10) for each of the strict, normal, loose test sets, there are no tests
for the prefix characters {U+0024, U+00A3, U+00A5, U+20AC, U+2116, U+FF04,
U+FFE1, U+FFE5} which are assigned specific break semantics by CSS3 Text;

(11) some of the CSS3 Text line-break rules are not sensitive to content
language, e.g., small kana, prolonged sound marks, iteration marks, and
inseparables; the current tests do not test that the specified rules are
applied when content language is not ja or zh for these rules;

(12) similarly, the other CSS3 Text line-break rules are applied only if
content language is ja or zh; there are no tests that these rules are not
applied when content language is not ja or zh;

As an FYI, I'm in the processing of addressing a WebKit bug against the
line-break feature [4]. To this end, I've prepared some preliminary wiki
documentation on this subject that may be of interest [5][6]. I would be
interested in receiving any comments or corrections to this material.

[4] http://bugs.webkit.org/show_bug.cgi?id=89235
[5] http://trac.webkit.org/wiki/LineBreaking
[6] http://trac.webkit.org/wiki/LineBreakingCSS3Mapping

Regards,
Glenn

Received on Saturday, 1 September 2012 03:28:55 UTC