RE: non-sgml characters

From: Nick Kew <nick@webthing.com>
Date: Mon, 15 Jul 2002 18:18:58 +0100 (BST)
To: Jon Hanna <jon@spin.ie>
cc: <w3c-wai-ig@w3.org>
Message-ID: <20020715181045.R1406-100000@fenris.webthing.com>

On Mon, 15 Jul 2002, Jon Hanna wrote:

> > >       (&#153;)
>
> This is a valid way of constructing a character set, as long as one doesn't
> claim to be using Latin-1 etc.

Not quite.

A byte having the value 153 is OK[1], if you have declared your document
as having a proprietary charset that assigns meaning to it.

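The sequence &#153; is incorrect, regardless. A numeric character
reference names a code position in the document character set
(ISO 10646 for HTML), not a byte in whatever charset you declared,
and HTML's SGML declaration marks positions 128-159 as unused.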
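
To illustrate the difference, here is a rough Python sketch. It is
only an illustration: windows-1252 stands in for "a proprietary
charset that assigns meaning to byte 153", and the variable names are
mine, not anything from the thread.

```python
import unicodedata

raw = bytes([153])  # a single byte with value 153 (0x99)

# Read through a charset that assigns a meaning to the byte
# (windows-1252 is an assumption, chosen only as an example),
# the byte becomes a real character.
as_cp1252 = raw.decode("cp1252")

# A numeric character reference ignores the declared charset:
# &#153; names code position 153 itself, which is a C1 control
# code rather than a printable character.
as_reference = chr(153)

print(hex(ord(as_cp1252)), unicodedata.name(as_cp1252))
print(hex(ord(as_reference)), unicodedata.category(as_reference))
```

If the intended character is the one windows-1252 puts at byte 153,
the reference that names it portably would be &#8482;.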

As a data point, at least two operating systems have "native" charsets
that assign characters to bytes in the range 128-159. Browsers running
on those systems may display them if they fall back to the native
charset for error recovery, or if they don't deal with i18n at all.
But the assignments of these characters are totally different on
RISC OS and Windows, so the same byte comes out as a different
character on each.
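
A small Python sketch of that effect, under stated assumptions: as far
as I know Python's standard codecs do not include the RISC OS charset,
so mac_roman is used here purely as a stand-in for "some other native
charset", and decode_with is a hypothetical helper name.

```python
def decode_with(byte_value, charsets):
    """Decode one byte under each named charset and return the results."""
    raw = bytes([byte_value])
    return {name: raw.decode(name) for name in charsets}

# cp1252 is the Windows charset; mac_roman is only a stand-in for a
# second platform's native charset (an assumption for illustration).
results = decode_with(153, ["cp1252", "mac_roman"])

for name, char in results.items():
    print(name, hex(ord(char)), char)
```

The same byte decodes to two unrelated characters, which is all a
fallback to the native charset can give you.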

-- 
Nick Kew

Available for contract work - Programming, Unix, Networking, Markup, etc.
Received on Monday, 15 July 2002 13:19:19 GMT
