W3C home > Mailing lists > Public > www-international@w3.org > October to December 2006

Re: Unicode conference papers

From: Erik van der Poel <erik@vanderpoel.org>
Date: Wed, 22 Nov 2006 21:51:54 -0800
Message-ID: <638fb7f40611222151x4f4bbc5fwd6e4c2d50acb7838@mail.gmail.com>
To: "Richard Ishida" <ishida@w3.org>
Cc: "Mark Davis" <mark.davis@icu-project.org>, Unicode <unicode@unicode.org>, www-international@w3.org

In 2006, around 3.3% of crawlable Web documents use the xml encoding,
while in 2001 that number was 0.39%.

<?xml version="1.0" encoding="..."?>

Also, the hreflang attribute was used in 0.0057% of the documents in
2001, and 0.13% in 2006.

<a href="..." hreflang="...">

In 2001, the top 5 meta charsets were:

iso-8859-1
windows-1252
gb2312
shift_jis
utf-8

In 2006, the top 5 meta charsets are:

iso-8859-1
utf-8
gb2312
windows-1252
windows-1251

UTF-8 is gaining... :-)

Erik

On 11/21/06, Erik van der Poel <erik@vanderpoel.org> wrote:
> The html meta charset numbers do not include xml encodings; I will
> count those next time.
Received on Thursday, 23 November 2006 05:52:15 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:09 GMT