Re: Improving Organization of Govt. based Linked Data Projects

2010/3/21 Hugh Glaser <hg@ecs.soton.ac.uk>

>
> On 21/03/2010 13:00, "Dan Brickley" <danbri@danbri.org> wrote:
>
> > On 21 Mar 2010, at 12:47, Hugh Glaser <hg@ecs.soton.ac.uk> wrote:
> >
> >> Hi Kingsley, I am right with you - finding stuff is hard.
> >> But I do think we could make it easier for all of us.
> >> Just the esw wiki alone requires me to put every set I create into a
> >> bunch of places.
> >
> > 10 years ago, looking for RDF on the public Web was like looking for a
> > needle in a haystack. There wasn't much out there and it was poorly
> > linked. So a big part of the thinking that led to the foaf/rdfweb
> > design was to make discovery easier: if you find one rdf doc, you
> > should be able to find most of the rest by following seeAlso and other
> > kinds of links.
> >
> > Why isn't this enough? Perhaps because many of the datasets are huge
> > db exports, so crawlers are often overwhelmed and disappear into
> > depth-first holes? Or because we don't publish triples about doc- and
> > dataset-types in a crawler-discoverable way?
> Yes, sort of.
> I think the problem is now with metadata for the datasets, which is great.
> Actually if everyone published semantic sitemaps and voiD descriptions etc.,
> and we had the tools to re-present the data, we would be well along the road.
> At worst, I might register my site somewhere (as I do with Sindice), say "go
> figure". Pages such as the esw ones should then appear magically.
> >
> > A wiki page is ok for initial bootstrap but we ought to outgrow that
> > soon...
> But I think we may be pleased to say that "soon" has arrived?
> And perhaps if it was easier we would discover that there is so much more
> out there that the wiki page hasn't actually been enough for a while. I can
> think of 10 interesting datasets that aren't there (that aren't mine).
>
> I am tempted to say that we spend all our time persuading others to take
> things like those tables and republish as RDF, but... :-)
>
> And yes, I know this has been a topic before, but we really should be
> feeling increasingly embarrassed by this.
>

Well I got a bit carried away with some regular expressions (which you
should never do on a Sunday) and came up with:

2>/dev/null curl http://esw.w3.org/SparqlEndpoints | grep -A6 '^<tr>$' |
awk '{ i++ ; if (i==3) print $0 " ." ; if (i==1) print "\n<#endpoint" p++ ">\n" $0 " ;" ; if ( $1 == "<tr>" ) i = 0; }' |
sed 's/<td> <b>/<dct:description> /' |
sed 's/<.td><td>/<void:sparqlEndpoint> /' |
sed 's/<void.*href="\([^"]*\).*/<void:sparqlEndpoint> <\1> ./' |
sed 's/<dct:description>.*">\(.*\)<.a.*/<dct:description> "\1" ;/'


<#endpoint1>
<dct:description> Project</b> ;
</td><dct:description> SPARQL endpoint</b> .

<#endpoint2>
<dct:description> "BBC Programmes and Music" ;
<void:sparqlEndpoint> <http://bbc.openlinksw.com/sparql/> .

<#endpoint3>
<dct:description> "Bio2RDF" ;
<void:sparqlEndpoint> <http://www.freebase.com/view/user/bio2rdf/public/sparql> .

<#endpoint4>
<dct:description> "BioGateway" ;
<void:sparqlEndpoint> <http://www.semantic-systems-biology.org/biogateway/endpoint> .

<#endpoint5>
<dct:description> "BBC Backstage" ;
<void:sparqlEndpoint> <http://jena.hpl.hp.com:3040/backstage> .

<#endpoint6>
<dct:description> "DBTune" ;
<void:sparqlEndpoint> <http://dbtune.org/bbc/peel/sparql> .

<#endpoint7>
<dct:description> "DBTune" ;
<void:sparqlEndpoint> <http://dbtune.org/bbc/playcount/sparql> .

<#endpoint8>
<dct:description> "DailyMed" ;
<void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/dailymed/sparql> .

<#endpoint9>
<dct:description> "data.gov.uk" ;
<void:sparqlEndpoint> <http://data.gov.uk/sparql> .

<#endpoint10>
<dct:description> "D2R Server" ;
<void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/dblp/sparql> .

<#endpoint11>
<dct:description> "OpenLink Software" ;
<void:sparqlEndpoint> <http://dbpedia.org/sparql> .

<#endpoint12>
<dct:description> "OpenLink Software" ;
<void:sparqlEndpoint> <http://dbpedia-live.openlinksw.com/sparql/> .

<#endpoint13>
<dct:description> "Diseasome" ;
<void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/diseasome/sparql> .

<#endpoint14>
<dct:description> "DoapSpace" ;
<void:sparqlEndpoint> <http://doapspace.org/sparql> .

<#endpoint15>
<dct:description> "DrugBank" ;
<void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/drugbank/sparql> .

<#endpoint16>
<td>  <b><a href="http://code.google.com/p/openflydata/wiki/Flyatlas" class="external text" title="http://code.google.com/p/openflydata/wiki/Flyatlas">FlyAtlas</a></b> ;
<void:sparqlEndpoint> <http://openflydata.org/query/flyatlas_20080916> .

<#endpoint17>
<dct:description> "Fly-TED" ;
<void:sparqlEndpoint> <http://openflydata.org/query/flyted_20081203> .

<#endpoint18>
<dct:description> "Gene Expression In-situ Images of fruitfly embryogenesis" ;
<void:sparqlEndpoint> <http://spade.lbl.gov:2021/sparql> .

<#endpoint19>
<dct:description> "Gene Ontology Database" ;
<void:sparqlEndpoint> <http://spade.lbl.gov:2020/sparql> .

<#endpoint20>
<dct:description> "DBTune" ;
<void:sparqlEndpoint> <http://dbtune.org/henry/sparql/> .

<#endpoint21>
<dct:description> "IBM ATG (Advanced Technologies Group)" ;
<void:sparqlEndpoint> <http://abdera.watson.ibm.com:8080/sparql> .

<#endpoint22>
<dct:description> "DBTune" ;
<void:sparqlEndpoint> <http://dbtune.org/jamendo/sparql> .

<#endpoint23>
<dct:description> "LinkedCT" ;
<void:sparqlEndpoint> <http://data.linkedct.org/sparql> .

<#endpoint24>
<dct:description> "Linked Movie Data Base" ;
<void:sparqlEndpoint> <http://data.linkedmdb.org/sparql> .

<#endpoint25>
<dct:description> "LOD Cloud Cache" ;
<void:sparqlEndpoint> <http://lod.openlinksw.com/sparql/> .

<#endpoint26>
<dct:description> "DBTune" ;
<void:sparqlEndpoint> <http://dbtune.org/magnatune/sparql> .

<#endpoint27>
<dct:description> "DBTune" ;
<void:sparqlEndpoint> <http://dbtune.org/musicbrainz/sparql> .

<#endpoint28>
<dct:description> "myExperiment" ;
<void:sparqlEndpoint> <http://rdf.myexperiment.org/sparql> .

<#endpoint29>
<dct:description> "Neurocommons" ;
<void:sparqlEndpoint> <http://sparql.neurocommons.org/sparql?> .

<#endpoint30>
<dct:description> "OpenLink Data Spaces" ;
<void:sparqlEndpoint> <http://myopenlink.net:8890/sparql/> .

<#endpoint31>
<dct:description> "OpenLink Virtuoso" ;
<void:sparqlEndpoint> <http://demo.openlinksw.com/sparql/> .

<#endpoint32>
<dct:description> "Project Gutenberg Metadata" ;
<void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/gutendata/sparql> .

<#endpoint33>
<dct:description> "rdfabout.com" ;
<void:sparqlEndpoint> <http://www.rdfabout.com/sparql> .

<#endpoint34>
<dct:description> "Revyu" ;
<void:sparqlEndpoint> <http://revyu.com/sparql> .

<#endpoint35>
<dct:description> "RKBExplorer" ;
<void:sparqlEndpoint> <http://*.rkbexplorer.com/sparql/> .

<#endpoint36>
<dct:description> "SemanticWeb.org Dog Food" ;
<void:sparqlEndpoint> <http://data.semanticweb.org/sparql> .

<#endpoint37>
<dct:description> "SemanticWebSchool - Vienna" ;
<void:sparqlEndpoint> <http://sparql.semantic-web.at/sparql> .

<#endpoint38>
<dct:description> SPARQLer</b> ;
<void:sparqlEndpoint> <http://www.sparql.org/sparql> .

<#endpoint39>
<dct:description> Sparqlette</b> ;
<void:sparqlEndpoint> <http://www.wasab.dk/morten/2005/04/sparqlette/> .

<#endpoint40>
<dct:description> "STW Thesaurus for Economics" ;
<void:sparqlEndpoint> <http://zbw.eu/beta/sparql> .

<#endpoint41>
<dct:description> "Information about the Web-based Systems Group" ;
<void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/is-group/sparql> .

<#endpoint42>
<dct:description> "URIBurner.com" ;
<void:sparqlEndpoint> <http://uriburner.com/sparql/> .

<#endpoint43>
<dct:description> "voiD Data" ;
<void:sparqlEndpoint> <http://void.rkbexplorer.com/sparql/> .

<#endpoint44>
<dct:description> W3C Spanish Office SPARQL Demo</b> ;
<void:sparqlEndpoint> <http://www.w3c.es/Prensa/sparql/> .

<#endpoint45>
<dct:description> Wiki.Ontoworld.org</b> ;
<void:sparqlEndpoint> <http://dannyayers.com:8888/ontoworld/> .

<#endpoint46>
<dct:description> "World Factbook" ;
<void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/factbook/sparql> .

<#endpoint47>
<dct:description> "Linked Periodicals" ;
<void:sparqlEndpoint> <http://api.talis.com/stores/periodicals/services/sparql> .

<#endpoint48>
<dct:description> Project</b> ;
</td><dct:description> SPARQL endpoint</b> .

<#endpoint49>
<dct:description> "Mindswap" ;
<void:sparqlEndpoint> <http://www.mindswap.org/cgi-bin/2003/pellet/pelletPost.cgi> .

<#endpoint50>
<dct:description> "UniProt" ;
<void:sparqlEndpoint> <http://labs.intellidimension.com/uniprot/query2.rsp?&amp;exec=Execute&amp;q=> .

<#endpoint51>
<dct:description> Project</b> ;
</td><dct:description> SPARQL endpoint</b> .

<#endpoint52>
<dct:description> "AIFB" ;
<void:sparqlEndpoint> <http://km.aifb.uni-karlsruhe.de/services/sparql> .

<#endpoint53>
<dct:description> "Opera Community" ;
<void:sparqlEndpoint> <http://my.opera.com/community/sparql/sparql> .

<#endpoint54>
<dct:description> "SparqlSphere" ;
<void:sparqlEndpoint>  [SPARQL endpoint] .
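
The output above shows a few rows the regular expressions did not fully handle: entries whose project name is bold but not a link keep a stray "</b>", and the FlyAtlas row is left as raw HTML. A minimal sketch of a more tolerant variant follows, assuming each table row keeps its project cell and its endpoint cell on one line each. The function name rows_to_turtle, the sample rows, and the example.org/example.net URLs are made up for illustration; the real wiki markup may differ, and the output deliberately mirrors the shape used above rather than strict Turtle.

```shell
#!/bin/sh
# Hypothetical helper: read wiki table rows on stdin and print one
# description/endpoint pair per row, in the same shape as the output above.
rows_to_turtle() {
  awk '
    /<tr>/ { cell = 0; next }            # a new row resets the cell counter
    /<td>/ {
      cell++
      line = $0
      gsub(/<[^>]*>/, "", line)          # drop all tags, keep the visible text
      gsub(/^[ \t]+|[ \t]+$/, "", line)  # trim surrounding whitespace
      if (cell == 1) name = line         # first cell: project name
      if (cell == 2 && match($0, /href="[^"]*"/)) {
        # second cell: take the link target between the quotes
        url = substr($0, RSTART + 6, RLENGTH - 7)
        n++
        printf "<#endpoint%d>\n<dct:description> \"%s\" ;\n<void:sparqlEndpoint> <%s> .\n\n", n, name, url
      }
    }
  '
}

# Made-up sample input (not the real wiki page), covering a linked name
# and a bold-only name with extra spacing.
sample='<tr>
<td> <b><a href="http://example.org/project">Example Project</a></b>
</td><td><a href="http://example.org/sparql">SPARQL endpoint</a>
</td></tr>
<tr>
<td>  <b>No Link Project</b>
</td><td><a href="http://example.net/sparql">SPARQL endpoint</a>
</td></tr>'

printf '%s\n' "$sample" | rows_to_turtle
```

Stripping every tag instead of matching one exact tag sequence is what lets the bold-only and double-space rows through; rows without a link in the second cell are skipped rather than printed half-converted.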


>
> Best
> Hugh
> >
> > Dan
>
>
>

Received on Sunday, 21 March 2010 14:57:05 UTC