Getting it right: Latin-1

Chris Lilley (Chris.Lilley@sophia.inria.fr)
Fri, 8 Aug 1997 16:08:18 +0200 (MET)


Date: Fri, 8 Aug 1997 16:08:18 +0200 (MET)
From: Chris Lilley <Chris.Lilley@sophia.inria.fr>
Message-Id: <9708081608.ZM25352@grommit.inria.fr>
To: www-html@w3.org
Subject: Getting it right: Latin-1

Here is a useful resource containing screenshots of various browsers
displaying 8859-1 characters, numeric character references (the &#234;
stuff), and named entities (the &times; stuff).

It identifies bugs in browsers that do not distinguish between

a) the charset (document encoding), which can be almost anything and
   is indicated by the charset parameter of the Content-Type header

b) the document character set, which is always ISO 10646, which is
   code for code the same as Unicode (this is where character
   processing happens). NCRs always refer to the document character set.

c) the font encoding vector.
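
To make the difference between (a) and (b) concrete, here is a minimal
sketch in Python. It is not taken from the page linked below; the helper
name resolve() and the sample text are made up for illustration. It
assumes the bytes have already been labelled with the right charset, and
it leaves (c), the font encoding, out entirely.

```python
import html


def resolve(wire_bytes: bytes, charset: str) -> str:
    """Hypothetical helper: bytes on the wire -> document characters."""
    # (a) The charset only says how to turn bytes into characters.
    text = wire_bytes.decode(charset)
    # (b) NCRs and named entities are resolved against the document
    # character set (ISO 10646 / Unicode), never against the charset.
    return html.unescape(text)


# Made-up sample: a literal e-acute, an NCR, and a named entity.
markup = "\u00e9 &#234; &times;"

as_latin1 = markup.encode("iso-8859-1")
as_utf8 = markup.encode("utf-8")

# The byte streams differ, because the encodings differ...
print(as_latin1 != as_utf8)

# ...but the resolved characters are the same in both cases.
for charset, data in (("iso-8859-1", as_latin1), ("utf-8", as_utf8)):
    print(charset, [f"U+{ord(ch):04X}" for ch in resolve(data, charset)])
```

In both cases &#234; comes out as U+00EA, because 234 is a position in
the document character set and not a byte value in whatever encoding the
document happened to be sent in.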

Lots of Mac browsers get this wrong, for example. It also includes a
screenshot of a browser that does it correctly ;-)

   http://ppewww.ph.gla.ac.uk/~flavell/iso8859/internat.html

I didn't write it, I just came across it and it seemed well written and
generally useful. Share and enjoy.

-- 
Chris Lilley, W3C                          [ http://www.w3.org/ ]
Graphics and Fonts Guy            The World Wide Web Consortium
http://www.w3.org/people/chris/              INRIA,  Projet W3C
chris@w3.org                       2004 Rt des Lucioles / BP 93
+33 (0)4 93 65 79 87       06902 Sophia Antipolis Cedex, France