W3C home > Mailing lists > Public > public-vocabs@w3.org > April 2012

Yet more metadata statistics out - from Sindice

From: Giovanni Tummarello <giovanni.tummarello@deri.org>
Date: Tue, 17 Apr 2012 16:59:04 +0100
Message-ID: <CAHHRs7j6jXv_dd3tJ2NgiLVYFZKF_1RwUwrn6F+=7OTqZUx+xQ@mail.gmail.com>
To: Peter Mika <pmika@yahoo-inc.com>
Cc: "public-vocabs@w3.org" <public-vocabs@w3.org>, "public-lod@w3.org" <public-lod@w3.org>
HI Peter, all

to add (a probably small element of discussion) to this

i am happy to say that last week we released on the frontpage some
analytics stats which are fresh updated every week.

At the moment they come from  500million+ web URLS. Maybe not much but
pls notice we ONLY retain web urld which return RDF, RDFa, Microdata,
Microforamts etc (and trow away trivial markup).

Next week we hope to release the detailed per domain stats.

General analytics (classes) :

http://sindice.com/stats/direct/basic-class-stats?settings=%7B%22iCreate%22%3A1334676375502%2C%22iStart%22%3A0%2C%22iEnd%22%3A50%2C%22iLength%22%3A50%2C%22sFilter%22%3A%22%22%2C%22sFilterEsc%22%3Atrue%2C%22aaSorting%22%3A%5B%5B4%2C%22desc%22%5D%5D%2C%22aaSearchCols%22%3A%5B%5B%22%22%2Ctrue%5D%2C%5B%22%22%2Ctrue%5D%2C%5B%22%22%2Ctrue%5D%2C%5B%22%22%2Ctrue%5D%2C%5B%22%22%2Ctrue%5D%5D%2C%22abVisCols%22%3A%5Btrue%2Ctrue%2Ctrue%2Ctrue%2Ctrue%5D%2C%22ssDelta%22%3A%22%22%7D

Schema specific analytics:

http://sindice.com/stats/direct/basic-class-stats?settings=%7B%22iCreate%22%3A1334676375502%2C%22iStart%22%3A0%2C%22iEnd%22%3A50%2C%22iLength%22%3A50%2C%22sFilter%22%3A%22%22%2C%22sFilterEsc%22%3Atrue%2C%22aaSorting%22%3A%5B%5B4%2C%22desc%22%5D%5D%2C%22aaSearchCols%22%3A%5B%5B%22%22%2Ctrue%5D%2C%5B%22%22%2Ctrue%5D%2C%5B%22%22%2Ctrue%5D%2C%5B%22%22%2Ctrue%5D%2C%5B%22%22%2Ctrue%5D%5D%2C%22abVisCols%22%3A%5Btrue%2Ctrue%2Ctrue%2Ctrue%2Ctrue%5D%2C%22ssDelta%22%3A%22%22%7D


Its all on the homepage on http://sindice.com (see analytics tab)

Note: sindice is NOT at this point wildly crawling the web but rather
is accepting (and acting immediately) submissions of sitemaps, pings
and RDF datasets. Please submit yours to see them indexed (and
refreshed) at a reasonable rate nowadays


cheers
Gio

On Tue, Apr 17, 2012 at 4:06 PM, Peter Mika <pmika@yahoo-inc.com> wrote:
> Hi All,
>
> To add one more data point to the previous discussion about
> webdatacommons.org, we have recently presented a short position paper at the
> LDOW 2012 workshop at WWW 2012. Online at
>
> http://events.linkeddata.org/ldow2012/papers/ldow2012-inv-paper-1.pdf
>
> Please compare this carefully with the results of Bizer et al.:
>
> http://events.linkeddata.org/ldow2012/papers/ldow2012-inv-paper-2.pdf
>
> As it always the case with statistics, it matters what you count on and how
> you count ;) For example, Chris and his co-authors did not consider most of
> OGP data on the Web, which results in large discrepancies in the counts for
> RDFa, as well as overall counts.
>
> Nevertheless, both studies confirm that the Semantic Web, and in particular
> metadata in HTML, is taking on in major ways thanks to the efforts of
> Facebook, the sponsors of schema.org and many other individuals and
> organizations. Comparing to our previous numbers, for example we see a
> five-fold increase in RDFa usage with 25% of webpages containing RDFa data
> (including OGP), and over 7% of web pages containing microdata. These are
> incredibly impressive numbers, which illustrate that this part of the
> Semantic Web has gone mainstream.
>
> Cheers,
> Peter
>
Received on Tuesday, 17 April 2012 16:00:00 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 22 May 2012 06:49:02 GMT