Re: [csswg-drafts] [css-text-3] Discarding Line Breaks Adjacent to Ambiguous Characters (#5017)

Which aspect of ambiguity are we talking about here?
a. a space may be added where not wanted, eg. 支持W3C实现“\尽展
b. a space may be removed when it should stay, eg. 空格字符“ \”不可见

Whichever applies, i suspect that my comments at https://github.com/w3c/csswg-drafts/issues/4992#issuecomment-621265490 about using common-sense or internationalised apps for line breaking may apply.

But Chinese & Japanese use a quite a lot of characters that are not in the discard blocks, not just quote marks.  I've actually been trying to make lists of what characters are used by what language, and fwiw currently i have the following from non-discard blocks:

**Chinese**
Basic Latin | 21 | !​"​#​%​&​(​)​*​-​.​/​:​;​?​@​[​\​]​_​{​}
General Punctuation | 21 | ‐​‑​–​—​―​‖​‘​’​“​”​†​‡​‥​…​‧​‰​′​″​‵​※​‾
Latin-1 Supplement | 2 | §​·

**Japanese**
Basic Latin | 20 | !​"​#​%​&​(​)​*​-​.​/​:​;​?​@​[​\​]​{​}
General Punctuation | 16 | ‐​—​―​‖​‘​’​“​”​†​‡​‥​…​‰​′​″​※
Latin-1 Supplement | 2 | §​¶

-- 
GitHub Notification of comment by r12a
Please view or discuss this issue at https://github.com/w3c/csswg-drafts/issues/5017#issuecomment-621351902 using your GitHub account

Received on Wednesday, 29 April 2020 17:24:24 UTC