W3C home > Mailing lists > Public > www-html@w3.org > December 1999

RE: accented characters, etc.

From: Jelks Cabaniss <jelks@jelks.nu>
Date: Thu, 2 Dec 1999 23:12:02 -0500
To: <www-html@w3.org>
Message-ID: <NBBBICMNIPCICMKJECCBEEAADDAA.jelks@jelks.nu>
John Delacour wrote:

> Without going so far as using Unicode, you can also declare some
> other character set such as iso-8858-2 and use decimal or hexadecimal
> character entities, eg. &#131;  &x83;

Hmmm.  That &#131; falls in the same category as using &#151; for em dashes --
avoid it.  Those are Windows codepage characters and undefined in Markup:

	<?xml version="1.0"?>
	<doc>
	  <p>&#131; is an undefined entity.</p>
	</doc>

nsgmls -wxml gives you a warning of "reference to non-SGML character", and it
displays as a square placeholder box in IE5 (even with other declared encodings
like iso-8859-x instead of the default UTF-8).

Avoid anything in the 128 - 159 range, and -- for completeness -- the 0 - 31
range (except for tabs (9), linefeeds (10), and carriage returns (13)).

/Jelks
Received on Thursday, 2 December 1999 23:11:29 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 27 March 2012 18:15:40 GMT