W3C home > Mailing lists > Public > semantic-web@w3.org > April 2011

Re: LOD Cloud Cache Stats

From: Nathan <nathan@webr3.org>
Date: Mon, 04 Apr 2011 15:06:58 +0100
Message-ID: <4D99D082.9090207@webr3.org>
To: Kingsley Idehen <kidehen@openlinksw.com>
CC: "public-lod@w3.org" <public-lod@w3.org>, "semantic-web@w3.org" <semantic-web@w3.org>
Kingsley Idehen wrote:
> On 4/3/11 11:41 PM, Nathan wrote:
>> Hi Kinglsey, All,
>>
>> Incoming open request, could anybody provide similar statistics for 
>> the usage of each datatype in the wild (e.g. the xsd types, xmlliteral 
>> and rdf plain literal)?
>>
>> Ideally Kingsley, could you provide a breakdown from the lod cloud 
>> cache? would be very very useful to know.
>>
>> Best & TIA,
>>
>> Nathan
>>
>> Kingsley Idehen wrote:
>>> I've knocked up a Google spreadsheet that contains stats about our 21 
>>> Billion Triples+ LOD cloud cache.
>> ...
>>> https://spreadsheets.google.com/ccc?key=0AihbIyhlsQSxdHViMFdIYWZxWE85enNkRHJwZXV4cXc&hl=en 
>>> -- LOD Cloud Cache SPARQL stats queries and results
>>
> 
> Nathan,
> 
> The typed literals used in>  10k triples:
> 
> count    datatype IRI
> 11308    xsd:anyURI
> 12553http://dbpedia.org/datatype/day
> 12788http://dbpedia.org/ontology/day
> 15875http://dbpedia.org/ontology/usDollar
> 18228http://dbpedia.org/datatype/usDollar
> 20828http://europeanaconnect.eu/voc/fondazione/sgti#fondazioneNot
> 22934http://statistics.data.gov.uk/def/administrative-geography/StandardCode 
> 
> 23368http://www.w3.org/2001/XMLSchema#date
> 30695http://dbpedia.org/datatype/inhabitantsPerSquareKilometre
> 31662http://dbpedia.org/datatype/second
> 35506http://dbpedia.org/datatype/kilometre
> 57409http://www.w3.org/2001/XMLSchema#int
> 160117http://stitch.cs.vu.nl/vocabularies/rameau/RecordNumber
> 632256http://www.w3.org/2001/XMLSchema#anyURI
> 1175435  xsd:string
> 1696035http://data.ordnancesurvey.co.uk/ontology/postcode/Postcode
> 70194534http://www.openlinksw.com/schemas/virtrdf#Geometry
> 120147725http://www.w3.org/2001/XMLSchema#string
> 
> Spreadsheet will be updated too.
> 

Thanks Kingsley, very much appreciated! :)

I have to admit I'm surprised by the lack of xsd:double and xsd:decimal 
in the two stats sets, and also the inclusion of some datatypes I'd 
never even heard of!

Are there any virtuozo specific nuances which do some conversion, or are 
all of these as found in the serialized RDF?

also is xsd:string automatically set for all plain literals (with / 
without langs?)

Cheers,

Nathan
Received on Monday, 4 April 2011 14:07:52 GMT

This archive was generated by hypermail 2.3.1 : Tuesday, 26 March 2013 21:45:42 GMT