- From: <BruceLeban@akimbo.com>
- Date: Fri, 11 Apr 1997 21:59:35 -0400 (EDT)
- To: www-html@w3.org
>> A suggested tag: something like <w>
>> for word split
>A soft hyphen entity, ­, was proposed in HTML 2.0, though I don't
>think many (if any) browsers actually support it. Some show a
>hyphen, but I'm not aware of any that actually treat it as a soft
>hyphen.
My favorite feature of ­ is how unreadable it makes
doc­u­ments that use them in browsers that don't recognize the
entity. *If* this is something that was desireable to use as markup, then
it should be a tag, not an entity. I tend to think that this is something
the renderer of the document (i.e., browser) could deal with. There
doesn't need to be a shared dictionary or anything. It uses my dictionary
on my machine; I don't care what hyphens you see. :-)
>With the <w> solution there is the problem of determining word begin/end.
> Com<w>
> mu<w>
> ni<w>
> ca<w>
> tor
>could not easily be determined as a single word. So to make it bullet proof
>word ending/beginnings would have to be tagged, too. E.g.
> <word>Com<w>mu<w>ni<w>ca<w>tor</word>
Not really. Just because the word can be broken on display, doesn't mean
it's allowed to be broken in the source. You could prohibit spaces in the
middle of a word (a fairly common rule :-).
But you don't need to. "Foo <w> bar" can simply be defined to be a single
word since there's no reason you would stick an optional hyphen at the
beginning or end of a word.
Of course nothing is as simple as it seems at first. This might seem like
a simple idea, but there are complications. If you're really doing this,
you need something slightly more sophisticated to handle all hyphenation:
ba<wb break="k-">c</wb>ken
which is a German word "backen" that hyphenates as bak- ken and
republican/<wb break=""></wb>democrat
which is "republican/democrat" and broken as "republican/" and "democrat".
<w> is equivalent to
<wb break="-"></wb>
Then you might also want specify precedence of breaks:
hy<w rank=3>phen<w rank=1>a<w rank=2>tion
There may be other complexities I haven't thought of. Of course we don't
need to worry about markup at all if the browser just handles it. Plus I
get hyphenation on all those web sites that the authors didn't bother to
markup.
--- Bruce Leban
Akimbo Systems
http://www.akimbo.com/globetrotter
Publish on the web without learning HTML! (Really.)
Received on Friday, 11 April 1997 21:59:37 UTC