ISO 8879 diacritical marks as HTML character entities

Chung-Chieh Shan (t-chungs@microsoft.com)
Thu, 25 Jul 1996 01:04:54 -0700


Message-ID: <c=US%a=_%p=msft%l=RED-22-MSG-960725080454Z-5841@tide21.microsoft.com>
From: Chung-Chieh Shan <t-chungs@microsoft.com>
To: "'www-html@w3.org'" <www-html@w3.org>
Cc: "'Kai-hsu Tai'" <kaihsu@ugcs.caltech.edu>
Subject: ISO 8879 diacritical marks as HTML character entities
Date: Thu, 25 Jul 1996 01:04:54 -0700

Hello,

This is my first post on this list, so please excuse me if it is in any
way inappropriate.

I am interested in the list of character entities that are/will be
included in HTML 3.2.  In particular, I am working on computerization of
several Taiwanese languages, the romanization of which requires
diacritics to be placed over letters such as "m" and "n".  Entities like
&acute; and &grave; are already defined in
ftp://ftp.ifi.uio.no/pub/SGML/ENTITIES/ISOdia, so I suppose the only
questions are whether these entities will be included in HTML 3.2 and,
if so, whether HTML is to specify any rendering behavior for them.  (I
am not actually sure that they were left out of previous versions; I
would be very happy if they are already included.)  If
it is HTML's responsibility to specify rendering behavior for these
entities, I think the logical way to proceed is to follow Unicode's
placement of non-spacing marks, i.e., use m&acute; (rather than
&acute;m) for m with acute above, and so on.
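
As a rough sketch of the ordering meant here, the following Python
snippet builds "m with acute above" the way Unicode orders it: the base
letter first, then the non-spacing mark.  The use of U+0301 and U+0300
as stand-ins for the marks is an assumption made for illustration only;
nothing above settles which code points &acute; and &grave; would map
to.  The helper name with_mark is likewise hypothetical.

```python
import unicodedata

# Assumed stand-ins for the marks (not taken from any HTML draft).
COMBINING_GRAVE = "\u0300"  # COMBINING GRAVE ACCENT
COMBINING_ACUTE = "\u0301"  # COMBINING ACUTE ACCENT


def with_mark(base: str, mark: str) -> str:
    """Return the base letter followed by a non-spacing mark (Unicode order)."""
    if unicodedata.category(mark) != "Mn":
        raise ValueError("mark must be a non-spacing mark (category Mn)")
    return base + mark


m_acute = with_mark("m", COMBINING_ACUTE)  # analogous to m&acute;
n_grave = with_mark("n", COMBINING_GRAVE)  # analogous to n&grave;

# Canonical decomposition of a precomposed letter uses the same order:
# the base letter comes first, the mark second.
e_acute_decomposed = unicodedata.normalize("NFD", "\u00e9")
```

Here m_acute is two code points long, and decomposing a precomposed
letter such as U+00E9 yields the same base-then-mark order.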

If the inclusion of these character entities has not been proposed for
HTML 3.2, I would also appreciate advice on how I can make such a
proposal.

Thanks in advance.

-- Ken

"The Converter Group in Building 17, a notoriously glum Campus locale"
-- Daniel Underwood, in Microserfs by Douglas Coupland