W3C home > Mailing lists > Public > public-lod@w3.org > April 2008

Re: U.S. corporate ownership RDF data

From: Joshua Tauberer <jt@occams.info>
Date: Wed, 23 Apr 2008 16:12:06 -0400
Message-ID: <480F9816.8010507@occams.info>
To: Richard Cyganiak <richard@cyganiak.de>
CC: public-lod@w3.org

Richard Cyganiak wrote:
>>> I tried to get such a list using "SELECT DISTINCT ?p WHERE { ?s ?p ?o 
>>> }" but this seems to exceed the endpoint's execution time limit.
>>
>> Yeah. That's a difficult one to execute rapidly... Also note that it's 
>> the same end point that serves the Census data set and everything else 
>> I have, so that's potentially scanning a billion statements.
> 
> Yes, I appreciate the difficulty of making this kind of very general 
> query run fast on the amount of data you serve. Having some vocabulary 
> documentation around (such as an RDFS document) lessens the need for 
> such queries. Without either one (documentation or ability to run the 
> queries) it's fairly hard to get a feel for the data.

I know. :) But I can't spend more time on that data now.

>>> 5. I see you have a Semantic Sitemap at rdfabout.com -- can you add 
>>> this dataset to it? (This helps us index the dataset into Sindice.)
> 
>>
>> Ooops, yes.
> 
> I notice that there are still a bunch of example.org URIs in the 
> dataset, would be great if you could fix/remove them.

They're only in XML comments.

Josh
Received on Wednesday, 23 April 2008 20:12:38 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:16 UTC