Re: U.S. corporate ownership RDF data

From: Richard Cyganiak <richard@cyganiak.de>
Date: Tue, 22 Apr 2008 17:18:34 +0100
Cc: public-lod@w3.org, get-theinfo@googlegroups.com
Message-Id: <7A83F10C-02BF-4324-8334-6D05C69A1C72@cyganiak.de>
To: Joshua Tauberer <jt@occams.info>


A very cool dataset! Some questions:

1. I tried to open the example URIs in the Tabulator data browser, but
for some reason Tabulator doesn't kick in; I just see Firefox's usual
RDF source view. The “Get Info” window in Firefox says that the RDF is
served as text/xml, which would explain the effect. But when I try to
confirm this with curl, the results appear to be served correctly as
application/rdf+xml. Any idea what's going on?

2. You use 302 redirects to get from the identifiers for people and  
companies to their RDF descriptions. Is this an oversight? Shouldn't  
it be 303, because the redirect goes from one resource (e.g. a person)  
to a different resource (an RDF document about the person)?

3. Is the schema (especially the ussec namespace) documented anywhere?  
Is there a list of all the available properties? I tried to get such a  
list using "SELECT DISTINCT ?p WHERE { ?s ?p ?o }" but this seems to  
exceed the endpoint's execution time limit.

4. As I'm not familiar with the U.S. environment: What is the coverage  
of this data? Is it all publicly traded U.S. companies?

5. I see you have a Semantic Sitemap at rdfabout.com -- can you add  
this dataset to it? (This helps us index the dataset into Sindice.)
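
For points 1 and 2, here is a minimal Python sketch of the kind of check I mean: issue a single GET without following redirects, then look at the status code and the Content-Type header. The helper names (probe, media_type, review) are made up for illustration, and the wording of the findings is mine, not something the server reports.

```python
import http.client
from urllib.parse import urlsplit

RDF_XML = "application/rdf+xml"


def probe(url, accept=RDF_XML, timeout=10):
    """Send one GET and return (status, headers) without following redirects."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.hostname, parts.port, timeout=timeout)
    else:
        conn = http.client.HTTPConnection(parts.hostname, parts.port, timeout=timeout)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("GET", path, headers={"Accept": accept})
        resp = conn.getresponse()
        # Lower-case the header names so lookups are case-insensitive.
        return resp.status, {name.lower(): value for name, value in resp.getheaders()}
    finally:
        conn.close()


def media_type(content_type):
    """Drop parameters such as '; charset=utf-8' and normalise the case."""
    return content_type.split(";", 1)[0].strip().lower()


def review(status, headers):
    """Summarise one response with respect to questions 1 and 2."""
    if status in (301, 302, 307):
        return f"{status} redirect to {headers.get('location')}: consider 303 See Other"
    if status == 303:
        return f"303 redirect to {headers.get('location')}: fine for a non-document resource"
    if status == 200:
        mt = media_type(headers.get("content-type", ""))
        if mt == RDF_XML:
            return "200 served as application/rdf+xml"
        return f"200 served as {mt or 'unknown type'}: RDF-aware clients may not recognise it"
    return f"{status}: not covered by this sketch"
```

Calling probe() on one of the example URIs and passing the result to review(), then repeating the call on the URL from the Location header, should show both the redirect code and the media type the RDF document is finally served with.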

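For point 3, one possible workaround for the time limit is to ask only for the properties used on a single known resource instead of scanning the whole store. The sketch below just builds such a query and the corresponding GET URL with a "query" parameter, as in the SPARQL protocol. The function names are hypothetical, the endpoint address has to be supplied by the caller (I don't assume one here), and I am assuming the store uses the identifier URI itself as the subject.

```python
from urllib.parse import urlencode

# Characters that must not appear inside a SPARQL IRI reference (<...>).
_FORBIDDEN = set('<>"{}|^`\\ ')


def predicates_of(subject_uri):
    """Build a query listing the distinct properties used on one subject."""
    if any(ch in _FORBIDDEN for ch in subject_uri):
        raise ValueError("not a plain IRI: " + repr(subject_uri))
    return f"SELECT DISTINCT ?p WHERE {{ <{subject_uri}> ?p ?o }}"


def request_url(endpoint, query):
    """Encode the query as a GET request URL for the given endpoint."""
    return endpoint + "?" + urlencode({"query": query})
```

For example, predicates_of("http://www.rdfabout.com/rdf/usgov/sec/id/cik0001308161") yields a query scoped to the News Corp resource. This only lists the properties of that one resource, so it is no substitute for schema documentation.
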
Cheers, and keep up the great work,

On 19 Apr 2008, at 13:15, Joshua Tauberer wrote:
> (cross-posted to LOD and get-theinfo...)
> In response to a thread on Aaron Swartz's get-theinfo list, I
> resurrected my RDF data for U.S. corporate ownership derived from
> publicly filed records to the U.S. Securities and Exchange
> Commission's EDGAR database.
> It's 1 million triples, HTTP and SPARQL-accessible. More here  
> (including source code, data dump, and examples):
> http://rdfabout.com/demo/sec/
> The records establish board membership, officer positions, and
> 10%-or-more ownership relations. Note that people can enter into any
> of those relations with corporations, but additionally corporations
> can be 10% owners of other corporations. The records exist at time
> points when the interest (i.e. stock ownership) of an individual or
> corporation that is in one of the relations above with a corporation
> changes. It is thus possible (and likely) that individuals who are
> no longer in such a relation with a corporation are still listed as
> such in this data.
> Here are some starting points:
> News Corp (owner of FOX, WSJ, and other media things):
> http://www.rdfabout.com/rdf/usgov/sec/id/cik0001308161
> Rupert Murdoch (media mogul behind News Corp):
> http://www.rdfabout.com/rdf/usgov/sec/id/cik0001024835
> There are no links to other data sets.
> -- 
> - Josh Tauberer
> http://razor.occams.info
> "Yields falsehood when preceded by its quotation!  Yields
> falsehood when preceded by its quotation!" Achilles to
> Tortoise (in "Godel, Escher, Bach" by Douglas Hofstadter)
Received on Tuesday, 22 April 2008 16:19:21 UTC
