W3C home > Mailing lists > Public > www-international@w3.org > January to March 2010

Re: For review: Character encodings in HTML and CSS

From: John Cowan <cowan@ccil.org>
Date: Fri, 12 Feb 2010 02:54:21 -0500
To: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>
Cc: CE Whitehead <cewcathar@hotmail.com>, ishida@w3.org, www-international@w3.org
Message-ID: <20100212075421.GC20401@mercury.ccil.org>
Leif Halvard Silli scripsit:

> > Elsewhere, I've seen it recommended elsewhere that I encode my 
> > documents as ansi, and then just use the Latin-1 char set (ISO 
> > 8859-1) with no escapes (assuming I can do this),
> 
> ISO-8859-1 apparently is synonymous with ANSI/Windows-1252 on the Web - 
> but I don't know the details.

Windows-1252 is a superset of ISO 8859-1, containing 16 characters that
aren't assigned in 8859-1.

> If your document actually only contains ASCII characters - or if all 
> non-ASCII characters are escaped, then it should work. But I don't see 
> why it should work if it has unescaped non-ASCII characters, unless 
> there were a mislabeling going on ...

It wouldn't, except that some browsers ignore the encoding declarations if
sniffing the document tells them otherwise.

-- 
Schlingt dreifach einen Kreis vom dies!    John Cowan <cowan@ccil.org>
Schliesst euer Aug vor heiliger Schau,     http://www.ccil.org/~cowan
Denn er genoss vom Honig-Tau,
Und trank die Milch vom Paradies.            --Coleridge (tr. Politzer)
Received on Friday, 12 February 2010 07:54:52 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Friday, 12 February 2010 07:54:53 GMT