W3C home > Mailing lists > Public > www-html@w3.org > December 1999

Re: accented characters, etc.

From: Russell Steven Shawn O'Connor <roconnor@uwaterloo.ca>
Date: Thu, 2 Dec 1999 18:44:42 -0500 (EST)
To: W3C HTML <www-html@w3.org>
Message-ID: <Pine.SOL.3.95q.991202184140.17720A-100000@wronski.math.uwaterloo.ca>
On Thu, 2 Dec 1999, John Delacour wrote:

> No such thing is needed; the big step towards internationalization is
> already taken and it is Unicode <http:www.unicode.org/>.  All you need
> to do is declare UTF-8 as your character set and use unicode entities
> such as &x03EE;

To be more precice, you don't need to declare UTF-8 as your character
encoding (and probably shouldn't), to use these entities.  No matter what
your character encoding is, &#xxxx; will refer to the Unicode character
number xxxx. 

(Aside: Note the subtle difference between character encoding and
character set.)

-- 
Russell O'Connor                           roconnor@uwaterloo.ca
       <http://www.undergrad.math.uwaterloo.ca/~roconnor/>
``And truth irreversibly destroys the meaning of its own message''
-- Anindita Dutta, ``The Paradox of Truth, the Truth of Entropy''
Received on Thursday, 2 December 1999 18:44:45 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 27 March 2012 18:15:40 GMT