[Prev][Next][Index][Thread]

Re: Accept-Charset support



On Dec 18, 12:03am, Keld J|rn Simonsen wrote:

[in amongst generally good stuff ]

> Of cause it complicates matters
> with yet another parameter, but it could help in chosing an
> appropiate font, and then it is the right concept.

Sequences of bytes, sequences of characters, and sequences of glyphs
are not the same thing. The charset does not, necessarily, map 1:1 to
the font encoding vector. The characters might not even be displayed
visually - they might be spoken for example.

> I note that a MIME charset identifies a repertoire,

Not necessarily. It identifies a mapping from  (one or more)
bytes to characters. In the particular case of HTML it does not identify
a repertoire at all. It is possible to write a document that contains the
entire character repertoire of 10646, and have it correctly labelled as
US-ASCII - just using numeric character references.





-- 
Chris Lilley, W3C                          [ http://www.w3.org/ ]
Graphics and Fonts Guy            The World Wide Web Consortium
http://www.w3.org/people/chris/              INRIA,  Projet W3C
chris@w3.org                       2004 Rt des Lucioles / BP 93
+33 (0)4 93 65 79 87       06902 Sophia Antipolis Cedex, France


References: