- From: Nathan <nathan@webr3.org>
- Date: Mon, 04 Apr 2011 15:06:58 +0100
- To: Kingsley Idehen <kidehen@openlinksw.com>
- CC: "public-lod@w3.org" <public-lod@w3.org>, "semantic-web@w3.org" <semantic-web@w3.org>
Kingsley Idehen wrote: > On 4/3/11 11:41 PM, Nathan wrote: >> Hi Kinglsey, All, >> >> Incoming open request, could anybody provide similar statistics for >> the usage of each datatype in the wild (e.g. the xsd types, xmlliteral >> and rdf plain literal)? >> >> Ideally Kingsley, could you provide a breakdown from the lod cloud >> cache? would be very very useful to know. >> >> Best & TIA, >> >> Nathan >> >> Kingsley Idehen wrote: >>> I've knocked up a Google spreadsheet that contains stats about our 21 >>> Billion Triples+ LOD cloud cache. >> ... >>> https://spreadsheets.google.com/ccc?key=0AihbIyhlsQSxdHViMFdIYWZxWE85enNkRHJwZXV4cXc&hl=en >>> -- LOD Cloud Cache SPARQL stats queries and results >> > > Nathan, > > The typed literals used in> 10k triples: > > count datatype IRI > 11308 xsd:anyURI > 12553http://dbpedia.org/datatype/day > 12788http://dbpedia.org/ontology/day > 15875http://dbpedia.org/ontology/usDollar > 18228http://dbpedia.org/datatype/usDollar > 20828http://europeanaconnect.eu/voc/fondazione/sgti#fondazioneNot > 22934http://statistics.data.gov.uk/def/administrative-geography/StandardCode > > 23368http://www.w3.org/2001/XMLSchema#date > 30695http://dbpedia.org/datatype/inhabitantsPerSquareKilometre > 31662http://dbpedia.org/datatype/second > 35506http://dbpedia.org/datatype/kilometre > 57409http://www.w3.org/2001/XMLSchema#int > 160117http://stitch.cs.vu.nl/vocabularies/rameau/RecordNumber > 632256http://www.w3.org/2001/XMLSchema#anyURI > 1175435 xsd:string > 1696035http://data.ordnancesurvey.co.uk/ontology/postcode/Postcode > 70194534http://www.openlinksw.com/schemas/virtrdf#Geometry > 120147725http://www.w3.org/2001/XMLSchema#string > > Spreadsheet will be updated too. > Thanks Kingsley, very much appreciated! :) I have to admit I'm surprised by the lack of xsd:double and xsd:decimal in the two stats sets, and also the inclusion of some datatypes I'd never even heard of! Are there any virtuozo specific nuances which do some conversion, or are all of these as found in the serialized RDF? also is xsd:string automatically set for all plain literals (with / without langs?) Cheers, Nathan
Received on Monday, 4 April 2011 14:07:52 UTC