a few problems in O-charset-lang.html

A recent email on Windows-31J led me to take a look at the

http://www.w3.org/International/O-charset-lang.html

There are a few problems with the document.

It lists a 7-year-old statistics (probably taken with a not-so-good 
sample even then) of the frequency of character encodings used on the 
web. The web and the internet have changed a lot since 1997 and I'm 
afraid the statistics gives a  misleading impression to some people that 
Windows-1252 can cover the vast majority of web pages. It'd be nice to 
replace that stat. with a recent one. If it's not easy to find a new 
statistics, I think either that part has to be removed or a prominent 
disclaimer should be added.

Another problem is that it uses 'kr' (the country code for Republic of 
Korea/South Korea) in place of 'ko' (the language code for Korean). 

I also found that Chinese (both zh-TW and zh-CN) is not listed (it's a 
partial list, but still not listing Chinese seems a bit strange.)

Jungshik

Received on Thursday, 29 July 2004 22:48:36 UTC