
RE: XHTML and charset's [was: Re: XHTML questions]

From: Ian Graham <igraham@ic-unix.ic.utoronto.ca>
Date: Thu, 29 Jun 2000 16:35:42 -0400 (EDT)
To: Bertilo Wennergren <bertilow@hem.passagen.se>
cc: Chris Croome <chris@webarchitects.co.uk>, Ian Graham <ian.graham@utoronto.ca>, Christian Smith <csmith@barebones.com>, www-html@w3.org
Message-ID: <Pine.SO4.4.05.10006291620430.5043-100000@ic-unix.ic.utoronto.ca>

Bertilo is correct -- things are fine if your document only
contains ASCII characters, as they map onto the same byte
sequence in UTF-8.
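
As a rough illustration of that point, here is a small Python sketch.
The sample strings and variable names are made up for the example and
are not from any page discussed in this thread. It checks that pure
ASCII text yields the same bytes under ASCII and UTF-8, while a
non-ASCII string yields different bytes under UTF-8 and Latin-1.

```python
# Hypothetical sample strings, chosen only for illustration.
ascii_text = "Hello, XHTML"
non_ascii_text = "Grüße"

# Pure ASCII text encodes to the same bytes in ASCII and in UTF-8.
ascii_same = ascii_text.encode("ascii") == ascii_text.encode("utf-8")

# Non-ASCII characters take two or more bytes each in UTF-8,
# but only one byte each in Latin-1.
utf8_bytes = non_ascii_text.encode("utf-8")
latin1_bytes = non_ascii_text.encode("latin-1")

# A browser that assumes Latin-1 for UTF-8 bytes shows the wrong characters.
misread = utf8_bytes.decode("latin-1")

print(ascii_same)
print(len(utf8_bytes), len(latin1_bytes))
print(misread != non_ascii_text)
```

A browser that ignores the declared charset behaves roughly like the
"misread" step: the bytes are fine, but they are interpreted wrongly.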

However, things go wrong if you have non-ASCII characters
in the document. They also fail (on Navigator 4 and earlier)
if you have character references in the document that
reference non-Latin-1 characters. For example, a character
reference like

&#3124;

(this is a made-up number, I'm afraid), which references the
3124th character in Unicode, will only work if you explicitly
set UTF-8 using a META element.
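
To make the character-reference case concrete, here is a minimal
Python sketch using the standard html module. It only shows what the
reference resolves to and why that character cannot be represented in
Latin-1; it does not reproduce Navigator's behaviour, and the variable
names are my own.

```python
import html

reference = "&#3124;"

# Resolve the numeric character reference to the character it names.
char = html.unescape(reference)
code_point = ord(char)

# Latin-1 only covers code points 0 through 255.
fits_latin1 = code_point <= 0xFF

# Encoding the character as Latin-1 therefore fails.
try:
    char.encode("latin-1")
    latin1_ok = True
except UnicodeEncodeError:
    latin1_ok = False

# UTF-8 can represent it, using three bytes: b'\xe0\xb0\xb4'
utf8_bytes = char.encode("utf-8")

print(code_point, fits_latin1, latin1_ok, utf8_bytes)
```

So the reference itself is plain ASCII on the wire, but the character
it names lies outside Latin-1.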

Ian



On Thu, 29 Jun 2000, Bertilo Wennergren wrote:

> Chris Croome:
> 
> > I have just tried installing Netscape 1, 2 and 3 on a windoze box and
> > all 3 seem to be fine with UTF-8!
> 
> That might be true ... if you look at pages that happen to use
> only ASCII characters. ASCII is a subset of UTF-8.
>  
> Try a UTF-8 page in e.g. Russian or German or French.
> 
> #####################################################################
>                           Bertilo Wennergren
>                   <http://purl.oclc.org/net/bertilo>
>                       <bertilow@hem.passagen.se>
> #####################################################################
> 
> 
> 
> 
Received on Thursday, 29 June 2000 16:41:16 GMT
