RE: m-dashes

> Al Gilman wrote: 
> How we know that a given character reference is an em-dash is in
> the area of internationalization, because internationalization is
> the area where we have to deal with the fact that there are lots
> of characters that aren't in ASCII.
> 
> 
	[Pawson, David]  
	The inclusion or exclusion of 'funny marks' is determined by the

	'Latin' ISO set of entities - included below. If the one you
want
	is not in that one, it is no hardship to find the right entity
set that
	does and look at including that one.
	Note that none of the marks are included, this set is primarily 
	the one to cover the variants on the base alphabet.

	for example, mdash is in ISOPUB.ENT

	regards, DaveP




	     <!--  (C)  International   Organization   for
Standardization   1986
	     Permission to copy in any form is granted for use with
conforming SGML
	     systems and applications as defined in ISO 8879, provided
this  notice
	     is included in all copies.  -->

	     <!-- Character entity set. Typical invocation:

	     <!ENTITY % ISOlat1 PUBLIC "ISO 8879:1986//ENTITIES Added
Latin 1//EN">
	     %ISOlat1;
	     -->

	     <!ENTITY aacute SDATA "[aacute]"--=small a, acute accent-->
	     <!ENTITY Aacute SDATA "[Aacute]"--=capital A, acute
accent-->
	     <!ENTITY acirc  SDATA "[acirc ]"--=small a, circumflex
accent-->
	     <!ENTITY Acirc  SDATA "[Acirc ]"--=capital A, circumflex
accent-->
	     <!ENTITY agrave SDATA "[agrave]"--=small a, grave accent-->
	     <!ENTITY Agrave SDATA "[Agrave]"--=capital A, grave
accent-->
	     <!ENTITY aring  SDATA "[aring ]"--=small a, ring-->
	     <!ENTITY Aring  SDATA "[Aring ]"--=capital A, ring-->
	     <!ENTITY atilde SDATA "[atilde]"--=small a, tilde-->
	     <!ENTITY Atilde SDATA "[Atilde]"--=capital A, tilde-->
	     <!ENTITY auml   SDATA "[auml  ]"--=small a, dieresis or
umlaut mark-->
	     <!ENTITY Auml   SDATA "[Auml  ]"--=capital A, dieresis or
umlaut mark-->
	     <!ENTITY aelig  SDATA "[aelig ]"--=small ae diphthong
(ligature)-->
	     <!ENTITY AElig  SDATA "[AElig ]"--=capital AE diphthong
(ligature)-->
	     <!ENTITY ccedil SDATA "[ccedil]"--=small c, cedilla-->
	     <!ENTITY Ccedil SDATA "[Ccedil]"--=capital C, cedilla-->
	     <!ENTITY eth    SDATA "[eth   ]"--=small eth, Icelandic-->
	     <!ENTITY ETH    SDATA "[ETH   ]"--=capital Eth,
Icelandic-->
	     <!ENTITY eacute SDATA "[eacute]"--=small e, acute accent-->
	     <!ENTITY Eacute SDATA "[Eacute]"--=capital E, acute
accent-->
	     <!ENTITY ecirc  SDATA "[ecirc ]"--=small e, circumflex
accent-->
	     <!ENTITY Ecirc  SDATA "[Ecirc ]"--=capital E, circumflex
accent-->
	     <!ENTITY egrave SDATA "[egrave]"--=small e, grave accent-->
	     <!ENTITY Egrave SDATA "[Egrave]"--=capital E, grave
accent-->
	     <!ENTITY euml   SDATA "[euml  ]"--=small e, dieresis or
umlaut mark-->
	     <!ENTITY Euml   SDATA "[Euml  ]"--=capital E, dieresis or
umlaut mark-->
	     <!ENTITY iacute SDATA "[iacute]"--=small i, acute accent-->
	     <!ENTITY Iacute SDATA "[Iacute]"--=capital I, acute
accent-->
	     <!ENTITY icirc  SDATA "[icirc ]"--=small i, circumflex
accent-->
	     <!ENTITY Icirc  SDATA "[Icirc ]"--=capital I, circumflex
accent-->
	     <!ENTITY igrave SDATA "[igrave]"--=small i, grave accent-->
	     <!ENTITY Igrave SDATA "[Igrave]"--=capital I, grave
accent-->
	     <!ENTITY iuml   SDATA "[iuml  ]"--=small i, dieresis or
umlaut mark-->
	     <!ENTITY Iuml   SDATA "[Iuml  ]"--=capital I, dieresis or
umlaut mark-->
	     <!ENTITY ntilde SDATA "[ntilde]"--=small n, tilde-->
	     <!ENTITY Ntilde SDATA "[Ntilde]"--=capital N, tilde-->
	     <!ENTITY oacute SDATA "[oacute]"--=small o, acute accent-->
	     <!ENTITY Oacute SDATA "[Oacute]"--=capital O, acute
accent-->
	     <!ENTITY ocirc  SDATA "[ocirc ]"--=small o, circumflex
accent-->
	     <!ENTITY Ocirc  SDATA "[Ocirc ]"--=capital O, circumflex
accent-->
	     <!ENTITY ograve SDATA "[ograve]"--=small o, grave accent-->
	     <!ENTITY Ograve SDATA "[Ograve]"--=capital O, grave
accent-->
	     <!ENTITY oslash SDATA "[oslash]"--=small o, slash-->
	     <!ENTITY Oslash SDATA "[Oslash]"--=capital O, slash-->
	     <!ENTITY otilde SDATA "[otilde]"--=small o, tilde-->
	     <!ENTITY Otilde SDATA "[Otilde]"--=capital O, tilde-->
	     <!ENTITY ouml   SDATA "[ouml  ]"--=small o, dieresis or
umlaut mark-->
	     <!ENTITY Ouml   SDATA "[Ouml  ]"--=capital O, dieresis or
umlaut mark-->
	     <!ENTITY szlig  SDATA "[szlig ]"--=small sharp s, German
(sz ligature)-->
	     <!ENTITY thorn  SDATA "[thorn ]"--=small thorn,
Icelandic-->
	     <!ENTITY THORN  SDATA "[THORN ]"--=capital THORN,
Icelandic-->
	     <!ENTITY uacute SDATA "[uacute]"--=small u, acute accent-->
	     <!ENTITY Uacute SDATA "[Uacute]"--=capital U, acute
accent-->
	     <!ENTITY ucirc  SDATA "[ucirc ]"--=small u, circumflex
accent-->
	     <!ENTITY Ucirc  SDATA "[Ucirc ]"--=capital U, circumflex
accent-->
	     <!ENTITY ugrave SDATA "[ugrave]"--=small u, grave accent-->
	     <!ENTITY Ugrave SDATA "[Ugrave]"--=capital U, grave
accent-->
	     <!ENTITY uuml   SDATA "[uuml  ]"--=small u, dieresis or
umlaut mark-->
	     <!ENTITY Uuml   SDATA "[Uuml  ]"--=capital U, dieresis or
umlaut mark-->
	     <!ENTITY yacute SDATA "[yacute]"--=small y, acute accent-->
	     <!ENTITY Yacute SDATA "[Yacute]"--=capital Y, acute
accent-->
	     <!ENTITY yuml   SDATA "[yuml  ]"--=small y, dieresis or
umlaut mark-->

Received on Tuesday, 3 February 1998 04:01:10 UTC