- From: Zack Weinberg <zackw@panix.com>
- Date: Tue, 22 Jul 2014 14:16:41 -0400
- To: Christoph Päper <christoph.paeper@crissov.de>
- Cc: "www-style@w3.org" <www-style@w3.org>, Unicode <unicode@unicode.org>
On Tue, Jul 22, 2014 at 12:14 PM, Christoph Päper
<christoph.paeper@crissov.de> wrote:
> fantasai <fantasai.lists@inkedblade.net>:
>
>>> The problem is that the hyphenation system in itself can't decide how
>>> to change the spelling, without any "dictionary"   functionality. It
>>> can't know if I meant "mat-tjuv" ("food thief" in Swedish) or "matt-tjuv"
>>> ("carpet thief") when I wrote "mat­tjuv". So there has to be a way
>>> to tell the hyphenation system that.
...
>   “mattjuv, mat͏tjuv”
>
> Possible Unicode solution with a new combining character that makes the preceding character or grapheme – I’m not sure which – invisible except at the end of a line:
>
>   “mattjuv, matt⁥tjuv”
>
>   U+2065 – Combining Collapse or Reduplicating Soft Hyphen or so
I think I'd prefer new tags to new magic entities.  In TeX this would be
    mat\discretionary{t-}{}{}tjuv
so maybe in HTML
    mat<dbr before="t-">tjuv
also accepting after= and nobreak= attributes.  It's verbose but it's
easier to remember, I think.
I'd also support a "hyphenation" CSS property with the same semantics
as TeX's \hyphenation{}, i.e.
    hyphenation: "un-break-able" "mom-ent";
overrides the built-in hyphenation dictionary for the words
"unbreakable" and "moment" (within the selected elements; normally one
would put this on <body>).
For bonus points,
    hyphenation: "mat[t-//]tjuv"
precise syntax to be bikeshedded.
> All solutions require author education.
Yah.
zw
Received on Tuesday, 22 July 2014 18:17:05 UTC