
Re: [i18n-discuss] Languages / writing systems with 2 line breaking conventions in common use? (#11)

From: Florian Rivoal via GitHub <sysbot+gh@w3.org>
Date: Tue, 21 Jan 2020 18:02:37 +0000
To: public-i18n-archive@w3.org
Message-ID: <issue_comment.created-576805043-1579629756-sysbot+gh@w3.org>
As far as I can tell, browsers do that because Unicode tells them to: https://www.unicode.org/Public/UCD/latest/ucd/LineBreak.txt classifies Ethiopic syllables as AL, which by UAX14 prohibits breaks between pairs of such letters.

But given the explanation in elreq, that actually makes sense: when Ethiopic was primarily written with word separators, using a break-all style of line breaking was fine, but with the advent of using spaces, line breaking anywhere becomes somewhat ambiguous.
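
To make the two behaviors concrete, here is a minimal Python sketch. It is not the UAX14 algorithm, only a toy model under stated assumptions: every character other than SPACE and ETHIOPIC WORDSPACE (U+1361) is treated as AL, the wordspace is treated as a break-after (BA) character, and the function name and `break_all` flag are hypothetical names made up for this illustration.

```python
# Toy model of the two conventions discussed above. NOT a full UAX14
# implementation; the classification below is a simplifying assumption.
ETHIOPIC_WORDSPACE = "\u1361"


def line_break_class(ch):
    """Very rough line break class for this sketch only."""
    if ch == " ":
        return "SP"
    if ch == ETHIOPIC_WORDSPACE:
        return "BA"  # assumption: break allowed after the wordspace
    return "AL"      # assumption: everything else is an ordinary letter


def break_opportunities(text, break_all=False):
    """Return the indices i where a line break is allowed before text[i]."""
    result = []
    for i in range(1, len(text)):
        prev = line_break_class(text[i - 1])
        cur = line_break_class(text[i])
        if cur == "SP":
            continue              # never break before a space
        if prev == "SP":
            result.append(i)      # break after a space
        elif cur == "BA":
            continue              # never break before the wordspace
        elif prev == "BA":
            result.append(i)      # break after the wordspace
        elif break_all:
            result.append(i)      # break-all style: any letter pair
        # otherwise AL followed by AL: no break
    return result


# Two three-syllable words joined by the Ethiopic wordspace.
with_wordspace = "\u1230\u120b\u121d\u1361\u12d3\u1208\u121d"
# The same two words joined by an ordinary space.
with_space = "\u1230\u120b\u121d \u12d3\u1208\u121d"

print(break_opportunities(with_wordspace))                  # [4]
print(break_opportunities(with_wordspace, break_all=True))  # [1, 2, 4, 5, 6]
print(break_opportunities(with_space))                      # [4]
```

With the AL-style rule, the only opportunity is after the separator; with the break-all style, every letter boundary is an opportunity as well.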

So, what elreq currently describes seems to be the historic reality that breaking between all letters was the common practice. What it doesn't say is whether there's a continued desire for this behavior.

-- 
GitHub Notification of comment by frivoal
Please view or discuss this issue at https://github.com/w3c/i18n-discuss/issues/11#issuecomment-576805043 using your GitHub account
Received on Tuesday, 21 January 2020 18:02:39 UTC
