Re: Unicode <-> CJKV national encoding; supporting multi-lingual web content

From: Thierry Sourbier <webmaster@i18ngurus.com>
Date: Wed, 15 Aug 2001 14:53:34 +0200
Message-ID: <02ff01c12589$4b952f60$5e53fea9@dell400>
To: "Michael Gorelik" <mgorelik@Novarra.com>, <www-international@w3.org>
> 1) Has anyone seen any information on the number of pages, or % of content,
> available in different charsets, such as ISO-8859-1, UTF-8, UTF-16, EUC-JP,
> ISO-2022-JP, Shift_JIS, etc. (apart from the Babel study)? I am trying to get
> an idea of the number of users of each particular charset.
>

Character set usage surveys are indeed hard to come across. I found the two
below; they are unfortunately fairly old (1999), but the method is interesting.
I'm curious to know whether the tool used to generate those pages is available
somewhere or whether it was home-grown.

http://people.netscape.com/erik/yahoo-jp.html
http://people.netscape.com/erik/yahoo-de.html

If anyone knows of better character encoding surveys, let me know and I'll
add a link to i18ngurus.

Cheers,
Thierry.

--------------
www.i18ngurus.com - Open i18n resources directory.
Received on Wednesday, 15 August 2001 08:47:21 GMT
