Re: LOD Cloud Cache Stats

From: Kingsley Idehen <kidehen@openlinksw.com>
Date: Mon, 04 Apr 2011 10:16:32 -0400
Message-ID: <4D99D2C0.7040101@openlinksw.com>
To: nathan@webr3.org
CC: "public-lod@w3.org" <public-lod@w3.org>, "semantic-web@w3.org" <semantic-web@w3.org>
On 4/4/11 10:06 AM, Nathan wrote:
> Kingsley Idehen wrote:
>> On 4/3/11 11:41 PM, Nathan wrote:
>>> Hi Kingsley, All,
>>>
>>> Incoming open request, could anybody provide similar statistics for 
>>> the usage of each datatype in the wild (e.g. the xsd types, 
>>> xmlliteral and rdf plain literal)?
>>>
>>> Ideally Kingsley, could you provide a breakdown from the lod cloud 
>>> cache? would be very very useful to know.
>>>
>>> Best & TIA,
>>>
>>> Nathan
>>>
>>> Kingsley Idehen wrote:
>>>> I've knocked up a Google spreadsheet that contains stats about our 
>>>> 21 Billion Triples+ LOD cloud cache.
>>> ...
>>>> https://spreadsheets.google.com/ccc?key=0AihbIyhlsQSxdHViMFdIYWZxWE85enNkRHJwZXV4cXc&hl=en 
>>>> -- LOD Cloud Cache SPARQL stats queries and results
>>>
>>
>> Nathan,
>>
>> The typed literals used in > 10k triples:
>>
>> count      datatype IRI
>> 11308      xsd:anyURI
>> 12553      http://dbpedia.org/datatype/day
>> 12788      http://dbpedia.org/ontology/day
>> 15875      http://dbpedia.org/ontology/usDollar
>> 18228      http://dbpedia.org/datatype/usDollar
>> 20828      http://europeanaconnect.eu/voc/fondazione/sgti#fondazioneNot
>> 22934      http://statistics.data.gov.uk/def/administrative-geography/StandardCode
>> 23368      http://www.w3.org/2001/XMLSchema#date
>> 30695      http://dbpedia.org/datatype/inhabitantsPerSquareKilometre
>> 31662      http://dbpedia.org/datatype/second
>> 35506      http://dbpedia.org/datatype/kilometre
>> 57409      http://www.w3.org/2001/XMLSchema#int
>> 160117     http://stitch.cs.vu.nl/vocabularies/rameau/RecordNumber
>> 632256     http://www.w3.org/2001/XMLSchema#anyURI
>> 1175435    xsd:string
>> 1696035    http://data.ordnancesurvey.co.uk/ontology/postcode/Postcode
>> 70194534   http://www.openlinksw.com/schemas/virtrdf#Geometry
>> 120147725  http://www.w3.org/2001/XMLSchema#string
>>
>> Spreadsheet will be updated too.
>>
>
> Thanks Kingsley, very much appreciated! :)
>
> I have to admit I'm surprised by the lack of xsd:double and 
> xsd:decimal in the two stats sets, and also the inclusion of some 
> datatypes I'd never even heard of!
>
> Are there any Virtuoso-specific nuances which do some conversion, or 
> are all of these as found in the serialized RDF?
>
> also is xsd:string automatically set for all plain literals (with / 
> without langs?)
>
> Cheers,
>
> Nathan
>
>

The data comes from an internal table in Virtuoso. Note that a threshold 
has been set, so what you are seeing is a picture relative to the total 
amount of data (21 Billion+ triples).
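
To illustrate what a thresholded datatype tally of this kind involves, here is a minimal Python sketch. It is not the Virtuoso internal table or the query behind the spreadsheet; it assumes the data is available as N-Triples lines, and the function name `datatype_counts`, the regular expression, and the sample triples are all hypothetical. It counts only explicitly typed literals, so plain and language-tagged literals are not folded into xsd:string here.

```python
import re
from collections import Counter

# Matches the datatype IRI of a typed literal that ends an N-Triples statement,
# e.g.  "42"^^<http://www.w3.org/2001/XMLSchema#int> .
# Assumption: one statement per line, no trailing comments.
TYPED_LITERAL = re.compile(r'"\^\^<([^>]+)>\s*\.\s*$')


def datatype_counts(ntriples_lines, threshold=0):
    """Count typed-literal datatypes, keeping those used in more than `threshold` triples."""
    counts = Counter()
    for line in ntriples_lines:
        match = TYPED_LITERAL.search(line)
        if match:
            counts[match.group(1)] += 1
    return {iri: n for iri, n in counts.items() if n > threshold}


# Hypothetical sample data: three typed literals, one plain, one language-tagged.
sample = [
    '<http://example.org/a> <http://example.org/p> "1"^^<http://www.w3.org/2001/XMLSchema#int> .',
    '<http://example.org/b> <http://example.org/p> "2"^^<http://www.w3.org/2001/XMLSchema#int> .',
    '<http://example.org/c> <http://example.org/p> "x"^^<http://www.w3.org/2001/XMLSchema#string> .',
    '<http://example.org/d> <http://example.org/p> "plain" .',
    '<http://example.org/e> <http://example.org/p> "bonjour"@fr .',
]

result = datatype_counts(sample, threshold=1)
```

With the threshold raised (10,000 in the list above), any datatype used in fewer triples drops out of the result entirely, which is one way a datatype that is present in the data can still be absent from the figures.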


-- 

Regards,

Kingsley Idehen
President & CEO
OpenLink Software
Web: http://www.openlinksw.com
Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca: kidehen
Received on Monday, 4 April 2011 14:17:03 GMT
