RE: ISO-8859-1

In this case what goes over the wire is ASCII but the ASCII represents
the "HTML encoded" character in UCS-2.  The wonderful thing about
HTML-encoded characters is that the encoding is unique and therefore web
servers always know how to unencode each character.

-Paul

Paul Deuter
Internationalization Manager
Plumtree Software
paul.deuter@plumtree.com 
 


-----Original Message-----
From: Timothy Greenwood [mailto:tgreenwood@openmarket.com]
Sent: Monday, October 01, 2001 1:41 PM
To: Paul Deuter; Timothy Greenwood; souravm; www-international@w3.org
Subject: RE: ISO-8859-1


> Numeric character references?  What are those? 

From the HTML specification

5.3.1 Numeric character references
  
   Numeric character references specify the code position of a character
   in the document character set. Numeric character references may take
   two forms:
     * The syntax "&#D;", where D is a decimal number, refers to the ISO
       10646 decimal character number D.
     * The syntax "&#xH;" or "&#XH;", where H is a hexadecimal number,
       refers to the ISO 10646 hexadecimal character number H.
       Hexadecimal numbers in numeric character references are
       case-insensitive.
       

Received on Monday, 1 October 2001 16:47:51 UTC