- From: Martin J. Duerst <duerst@w3.org>
- Date: Wed, 15 Dec 1999 14:17:33 +0900
- To: "McDonald, Ira" <imcdonald@sharplabs.com>
- Cc: "'Al Gilman'" <asgilman@iamdigex.net>, "'Web Accessibility Initiative'"<w3c-wai-ig@w3.org>, "McDonald, Ira"<imcdonald@sharplabs.com>
Of course the non-breaking hyphen is the correct thing to use. For quotes, never use “ and ”, this is not correct HTML at all. The numbers in &#...; constructs refer to Unicode, and Unicode does not have any characters at these positions. Please see http://www.w3.org/TR/html40/sgml/entities.html#h-24.4.1 and use any of: <!ENTITY lsquo CDATA "‘" -- left single quotation mark, U+2018 ISOnum --> <!ENTITY rsquo CDATA "’" -- right single quotation mark, U+2019 ISOnum --> <!ENTITY sbquo CDATA "‚" -- single low-9 quotation mark, U+201A NEW --> <!ENTITY ldquo CDATA "“" -- left double quotation mark, U+201C ISOnum --> <!ENTITY rdquo CDATA "”" -- right double quotation mark, U+201D ISOnum --> As for 'implied', the comment in http://www.w3.org/TR/html40/sgml/entities.html#h-24.3.1 just is a note pointing out the assumption of the authors of the HTML spec that this was the same as an entity in some ISO standard with a different name, and that this assumption was supported by some other ISO document. Neither are the characters missing from Unicode/iso10646, and they are even available as *named* character entities in HTML 4.0. In case somebody really has a new character to propose (which I don't think will happen very frequently), please have a look at http://www.unicode.org/pending/proposals.html. Regards, Martin. At 11:33 1999/12/13 -0800, McDonald, Ira wrote: > Hi Al and Bruce, > > In Unicode 2.0 (1996), in Chapter 7 'Code Charts', on page 7-155 > 'General punctuation', the following character is defined: > > U+2011 NON-BREAKING HYPHEN > (cross reference) U+002D - HYPHEN-MINUS > > I believe this is what you're looking for. I've copied Martin > Duerst (W3C Internationalization leader) to verify. > > Cheers, > - Ira McDonald (outside consultant at Sharp Labs America) > High North Inc > > -----Original Message----- > From: Al Gilman [mailto:asgilman@iamdigex.net] > Sent: Friday, December 10, 1999 6:37 PM > To: 'Web Accessibility Initiative' > Subject: Re: Off Topic -- Hard hyphen? > > > 1. There is the whitespace=nowrap formatting property that in theory should > accomplish what you are after if you wrap the whole <span > class='my-nowrap'>hyphenated-expression</span> and then style this class or > ID with the property whitespace=nowrap. > > 2. Missing characters should go to the Unicode Consortium. It would be > good to get in touch with the internationalization, XML InfoSet, and/or > math groups depending on just what the character is, to find out if there > are others in W3C that care about this character one way or another. > > Al > > At 05:43 PM 12/10/99 -0500, Bruce Bailey wrote: > >Is there a character for a "non-breaking hyphen"? I want a dash that is > >treated like any other non-space alphanumeric character: i.e., one that, > >if near the end of a line, causes the line to wrap at the space before the > >text that precedes the hyphen rather than just after the hyphen. An > >equivalent Navigator-ism would be <nobr>-</nobr> (not valid html). − > > >is logical, but is kind of an abuse (and not compatible with the 3.x > >browsers). I know about ­ why not a &hhy; ? A proper "en dash" > >(– or – ) is a little to large and is not compatible with 3.x > >browsers. I even went so far as to try – -- it's also too big (but it > > >IS compatible with the 3.x browsers) but wraps the same as a regular > >dash/hyphen. (Yes, I know this "illegal" character gets the hackles up of > >the Unix crowd -- but I still blame _them_ for leaving fundamental > >typographical characters out of the 3.2 character set!) > > > >I can't find anything that works. I am tempted to create a IMG graphic, > >but I am sure that it will mess up my line height, this technique is not > >scaleable, and I just can't believe I am the only person who needs this. > > > >On a related, but perhaps equally off topic question, is there a > >straightforward HTTP-EQUIV="Content-Type" statement that would make my use > >of "windows" characters between  and Ÿ legal? There was a whole > > >discussion of this earlier, and people generously sent me URLs to academic > >discussions of characters sets. I learned a great deal, but there was a > >lot I could not follow. In particular, I could not discern a truly > >cross-platform backwards-compatible way to generate typographical quotation > > >marks. I am still using “ and ” and plan to do so until > >Navigator and IE support the <Q> ... </Q> construct. > > > >What is the correct channel to go through to suggest missing characters? > > The official W3C specs themselves point out that the basic mathematical > >symbols "implies" and "is implied by" (as well as the more obscure "not a > >super set of") are omitted. URL: > >http://www.w3.org/TR/html40/sgml/entities.html#h-24.3.1 > > > >Thank you. > >-- Bruce Bailey > > > > #-#-# Martin J. Du"rst, World Wide Web Consortium #-#-# mailto:duerst@w3.org http://www.w3.org
Received on Wednesday, 15 December 1999 00:45:29 UTC