Re: Unicode conference papers

In 2006, around 3.3% of crawlable Web documents use the xml encoding,
while in 2001 that number was 0.39%.

<?xml version="1.0" encoding="..."?>

Also, the hreflang attribute was used in 0.0057% of the documents in
2001, and 0.13% in 2006.

<a href="..." hreflang="...">

In 2001, the top 5 meta charsets were:

iso-8859-1
windows-1252
gb2312
shift_jis
utf-8

In 2006, the top 5 meta charsets are:

iso-8859-1
utf-8
gb2312
windows-1252
windows-1251

UTF-8 is gaining... :-)

Erik

On 11/21/06, Erik van der Poel <erik@vanderpoel.org> wrote:
> The html meta charset numbers do not include xml encodings; I will
> count those next time.

Received on Thursday, 23 November 2006 05:52:15 UTC