- From: Peter O.B. Mikes <pom@llnl.gov>
- Date: Fri, 06 Dec 1996 16:38:56 -0800
- To: "J.Larmouth" <J.Larmouth@iti.salford.ac.uk>
- CC: www-international@w3.org
To: www_international@w3.org
Subject: charset issues
J.Larmouth @ ITI.SALFORD.AC.UK wrote:
> .... We really only want ONE de facto standard, not even
> two, and UTF-8 scores on most criteria.
>....... But delivery should be in a "sensible" charset.
>
> >(3) All the special variants, alternative designations, and garbage
> > "charset" parameters.
> >
> >To help keep the net clean of an uncontrolled proliferation of
> >encodings, things in the last category definitely should not
> >be sent in Accept-Charset.
>
> >So in practice, I could see the following solutions for
> >Accept-Charset:
>
> >- Send UTF-8 if you can accept it, and nothing else.
>
> Yes. And then the browser worries about the font problems.
>
> John L
Yes. Yes. If I understand this suggestion
(and please explain it to me gently if I do not)
browser could handle charsets as follows:
Browser would not treat charsets as static:
"You pick one, you are stuck with it".
Lets consider the following example problem:
Say, I want to have a text which will contain the following fragment
on one line:
----------------------------------------------------------
.. The German word ko"se becomes froma`ge in French, but sy'r
in Czech yet ... in ....
---------------------------------------------------------------------
With current system of static character sets I would need a charset
which combines all
Latin-1 and Latin-2 and ...
But, if you reserve ONE special character or tag or even just an
attribute for this, I can write this one line like this:
--------------------------------------
.. The German word <font charset=German > ko"se </font> becomes
<font charset=French> froma`ge </font>
but <font charset=CZ > sy'r </font> ...
-----------------------------------------------------
,where the default charset is always ASCII, and I can switch
to any other charset whenever I want to.
If the browser supports charset chosen in a tag,
it will render the mnemonic ASCII bigrams
( o", y', s^, ..) in that set, e.g
.
o" as o with umlaut for German, and
a` as a with accent for French, or
y' with proper image of y with diacritics for Czech etc
which is na improvement over the current system when I often get
something unreadable (this is the fail-safe feature)
e)
...
...
I could elaborate - if this is too telegraphic --
as long as this makes some sense to someone else. too..
Would this require that the draft:
ftp://ftp.isi.edu/internet-drafts/draft-freed-charset-reg-01.txt
has to be ammended?
p.mikes@ieee.org (Peter Mikes)
Received on Friday, 6 December 1996 19:37:44 UTC