Re: HTML - i18n / NCR & charsets

Jim Taylor (JHTaylor@videodiscovery.com)
Fri, 20 Dec 1996 22:30:55 -0800


Message-Id: <s2bb13c6.020@videodiscovery.com>
Date: Fri, 20 Dec 1996 22:30:55 -0800
From: Jim Taylor <JHTaylor@videodiscovery.com>
To: www-html@www10.w3.org
Subject: Re: HTML - i18n / NCR & charsets

It figures! I disappear for a few weeks and people start talking about
something I actually know about. Sorry to join the discussion after it's
over.  :-)

>>> "David Perrell" <davidp@earthlink.net> 11/29/96 11:04pm >>>
>ISO8879 names for Windows CP 1252 80-9F (128-160) entities:

>83 (131) --   ?   -- florin

This is generally mapped to &fnof; (U0192)

>88 (136) -- &circ; -- circumflex
>98 (152) -- &tilde; -- tilde

These are not correct. &circ; is character 94, &tilde; is char 126.
Character 136 is "circumflex modifier" (MODIFIER LETTER CIRCUMFLEX
ACCENT in Unicode) and character 152 is "small tilde".

This is supported by the Unicode Consortium mapping tables:
<ftp://unicode.org/pub/UNIX/MAPPINGS>

A while ago I created a table (in MSAccess) mapping Unicode to ISO
8879 entities (and sources) to HTML version to Mac (Roman) to
Windows (ANSI, cp1252) to Adobe Symbol (Mac and Win) to DOS (OEM,
cp437).

Anybody want a copy?

______________________________________________
Jim "The Frog" Taylor, Director of Information Technology
<mailto:jhtaylor@videodiscovery.com>
Videodiscovery, Inc. - Multimedia Education for Science and Math
Seattle, WA, 206-285-5400 <http://www.videodiscovery.com/vdyweb>