
Re: [foaf-dev] [foaf-protocols] FOAF sites offline during cleanup

From: Kingsley Idehen <kidehen@openlinksw.com>
Date: Tue, 28 Apr 2009 08:10:54 -0400
Message-ID: <49F6F24E.2020005@openlinksw.com>
To: Steve Harris <steve.harris@garlik.com>
CC: Semantic Web <semantic-web@w3.org>, foaf-dev Friend of a <foaf-dev@lists.foaf-project.org>
Steve Harris wrote:
> On 27 Apr 2009, at 20:10, Kingsley Idehen wrote:
>> Steve Harris wrote:
>>> On 27 Apr 2009, at 14:26, Kingsley Idehen wrote:
>>>>>> I would safely say re. LOD Cloud somewhere north of 80% :-) And that's
>>>>>> primarily due to the content coming from PingTheSemanticWeb, otherwise
>>>>>> I would say 90% and higher. The "Linked Data" meme has always
>>>>>> encouraged URIs for everything.
>>>>> I guess it depends whether you count your population by triples or
>>>>> graphs, but that seems quite high to me. The vast majority of FOAF
>>>>> data (Hi5 and LiveJournal, for example) has bnodes in it, and FOAF
>>>>> makes up the bulk of LinkedData as far as I've been able to tell.
>>>> No, the FOAF data with bnodes in the LOD cloud come from the places
>>>> you've just mentioned via PingTheSemanticWeb (PTSW) and other crawlers
>>>> built from PTSW, or those that performed similar RDF crawling.
>>> My reading of your sentence above was that you were including PTSW, 
>>> and in any case, if you don't crawl, how can you ever get to see a
>>> reasonable slice of the LOD?
>> I was referring to the data sets in the LOD cloud bubble that we've 
>> loaded into the instance at: http://lod.openlinksw.com  (which does 
>> include stuff from PTSW but placed into its own Named Graph Group). 
>> Think warehouse just for this conversation.
> Sure, but that cloud diagram includes FOAF, and something like 99% of 
> FOAF files include bNodes. I don't know what proportion of the LOD
> web is FOAF, but it must be around 50%.
There is a frustrating aspect to this LOD cloud pictorial which
ultimately leads to these conversations. Hopefully, we will soon be
talking about a real graph based on real linked data instead of a marketing
comms pictorial :-)

As for the % re. FOAF, I think that can be determined from our actual 
VoiD graph for LOD cloud [1]. I don't know off the top of my head if 
FOAF is up to 50%.
> The "Linked" part of the name implies that crawling is a valid tactic 
> to gather the data to me.
Not disputing that, just describing what we have in the instance :-)  
Remember, we've sponged (crawled and RDFized) data since inception of 
our participation in this space.


1. http://lod.openlinksw.com/void/Dataset

> - Steve



Kingsley Idehen	      Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO 
OpenLink Software     Web: http://www.openlinksw.com
Received on Tuesday, 28 April 2009 12:11:37 UTC
