- From: Kingsley Idehen <kidehen@openlinksw.com>
- Date: Fri, 25 Feb 2011 18:00:03 -0500
- To: Bernard Vatant <bernard.vatant@mondeca.com>
- CC: Annika Flemming <annika.flemming@gmx.de>, public-lod@w3.org, Bob Ferris <zazi@elbklang.net>
- Message-ID: <4D683473.70700@openlinksw.com>
> Hi Annika > > - "A vocabulary is said to be established, if it is one of the > 100 most popular vocabularies stated on pre x.cc" - uhm, as > the results from Richard's evaluation have, this is quite arguable > > It's a practical way to determine it (which I can use for the > implementation of the formalism). Another way would be to compare > many documents from many data sources and to find out, which > vocabularies are most popular. > > > I'm particularly interested in this aspect of vocabulary selection. > Regarding popularity, I fully go along with Bob regarding prefix.cc in > which all sorts of biases can be introduced. I think the popularity is > better measured by the use of vocabularies in CKAN datasets, as > indicated by "format-*" tags. See http://ckan.net/tag/?page=F and for > example http://ckan.net/tag/format-bibo or > http://ckan.net/tag/format-foaf. Why not actual link coefficient from an LOD Cloud cache instance ? That a least shows what's being used :-) Kingsley > > Another approach I'm currently working on is the one you can find at > http://labs.mondeca.com/dataset/lov. The description of interlinked > vocabularies (using VOAF vocabulary) provide indication of popularity > at the vocabulary level itself. From this dataset (still far from > exhaustive of course) you can see which vocabularies are reused, > extended, used for annotation by other ones. I think the density of > links to and from a vocabulary to other ones gives a good indicator of > its "establishment", in combination with the number of datasets > actually using it. > > Best > > Bernard > > > -- > Bernard Vatant > Senior Consultant > Vocabulary & Data Engineering > Tel: +33 (0) 971 488 459 > Mail: bernard.vatant@mondeca.com <mailto:bernard.vatant@mondeca.com> > ---------------------------------------------------- > Mondeca > 3, cité Nollez 75018 Paris France > Web: http://www.mondeca.com > Blog: http://mondeca.wordpress.com > ---------------------------------------------------- -- Regards, Kingsley Idehen President& CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Received on Friday, 25 February 2011 23:00:43 UTC