W3C home > Mailing lists > Public > www-international@w3.org > October to December 1996

Re: charset issues

From: Peter O.B. Mikes <pom@llnl.gov>
Date: Fri, 06 Dec 1996 16:38:56 -0800
Message-ID: <32A8BCA0.33A8@llnl.gov>
To: "J.Larmouth" <J.Larmouth@iti.salford.ac.uk>
CC: www-international@w3.org
To:     www_international@w3.org
 Subject:      charset issues

J.Larmouth @ ITI.SALFORD.AC.UK  wrote:

> ....  We really only want ONE de facto standard,  not even
> two,  and UTF-8 scores on most criteria.
>.......  But delivery should be in a "sensible" charset.
> >(3) All the special variants, alternative designations, and garbage
> >        "charset" parameters.
> >
> >To help keep the net clean of an uncontrolled proliferation of
> >encodings, things in the last category definitely should not
> >be sent in Accept-Charset.
> >So in practice, I could see the following solutions for
> >Accept-Charset:
> >- Send UTF-8 if you can accept it, and nothing else.
> Yes.  And then the browser worries about the font problems.
> John L

        Yes. Yes. If I understand this suggestion
        (and please explain it to me gently if I do not) 
        browser could handle charsets as follows:

   Browser would not treat charsets as static:
      "You pick one, you are stuck with it".
  Lets consider the following  example  problem:
 Say, I want to have a text which will contain  the following fragment
on one line:
      .. The German word ko"se becomes froma`ge  in French,  but  sy'r
in Czech yet ... in  ....

    With current system of static character sets I would need a charset
which combines all
 Latin-1 and Latin-2 and ...
  But, if you reserve ONE special character or tag or even just an
attribute for this, I can write this one line  like this:
       .. The German word <font charset=German > ko"se </font> becomes
<font charset=French> froma`ge </font>
         but <font charset=CZ > sy'r </font> ...

  ,where the default charset is  always ASCII, and I can switch
     to any other charset whenever I want to.
 If the browser supports charset chosen in a tag,
    it will render the mnemonic ASCII bigrams 
   ( o", y', s^, ..) in that set, e.g
 o"  as  o with umlaut for German, and
 a`  as a with accent for French, or
 y' with proper image of y with diacritics for Czech etc
   which is na improvement over the current system when I often get
something unreadable (this is the fail-safe feature)
   I could elaborate - if this is too telegraphic --
    as long as this  makes some sense to someone else. too..

  Would this require that the draft:
  has to be ammended?

   p.mikes@ieee.org  (Peter Mikes)
Received on Friday, 6 December 1996 19:37:44 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 21 September 2016 22:37:16 UTC