Re: [csswg-drafts] [css-text-4] Add support for content-detection, phrase-based line breaking (#6730)

@r12a 
> I'm finding it hard to see the Thai case as similar.

I think the title and the original description were misleading, sorry about that. The actual intention of this issue is to support "natural" line breaking. From our point of view, handling particles as part of a phrase is just one example, taken from Japanese, of what it wants to achieve. It may also include handling compound nouns in Japanese and Chinese.

> i assume that that would need the user agent to start applying linguistic analysis on a language-specific basis to a large number of languages. Is that feasible?

One possible way to implement this is to use ML, as done in [BudouX](https://github.com/google/budoux) (you can try it with an [extension](https://github.com/google/budoux-extension)). It currently supports Japanese only, but the basic idea is applicable to any language.

> if you want to do this kind of thing manually then U+2060 WORD JOINER is your friend.

Thanks, that is indeed helpful. I hope CSS can offer a better way than inserting WORD JOINER at every break opportunity, but until then we can use it as a workaround.
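
For reference, a minimal sketch of that workaround, assuming the phrase boundaries are already known (for example picked by hand or produced by some segmenter). The helper name `joinWithinPhrases` and the sample phrases are hypothetical and not part of CSS or BudouX:

```typescript
// Hypothetical helper for illustration; not part of any CSS or BudouX API.
const WORD_JOINER = "\u2060";

// Glue the characters inside each phrase together with U+2060 so that
// no line break is allowed inside a phrase. Nothing is inserted between
// phrases, so the default break opportunities there are left untouched.
function joinWithinPhrases(phrases: string[]): string {
  return phrases
    // Array.from splits by code point, so surrogate pairs stay intact.
    .map((phrase) => Array.from(phrase).join(WORD_JOINER))
    .join("");
}

// Phrase boundaries here are hand-picked for illustration only.
const joined = joinWithinPhrases(["今日は", "良い", "天気です。"]);
```

The result can be used as ordinary text content. The cost is one extra character at every suppressed break, which is the overhead a CSS-level feature would avoid.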

-- 
GitHub Notification of comment by kojiishi
Please view or discuss this issue at https://github.com/w3c/csswg-drafts/issues/6730#issuecomment-1040629172 using your GitHub account


-- 
Sent via github-notify-ml as configured in https://github.com/w3c/github-notify-ml-config

Received on Tuesday, 15 February 2022 18:29:22 UTC