- From: Melvin Carvalho <melvincarvalho@gmail.com>
- Date: Sun, 21 Mar 2010 15:56:27 +0100
- To: Hugh Glaser <hg@ecs.soton.ac.uk>
- Cc: Dan Brickley <danbri@danbri.org>, Kingsley Idehen <kidehen@openlinksw.com>, Linked Data community <public-lod@w3.org>
- Message-ID: <9178f78c1003210756g24c35e73xe06ec22d0ffaf864@mail.gmail.com>
2010/3/21 Hugh Glaser <hg@ecs.soton.ac.uk> > > On 21/03/2010 13:00, "Dan Brickley" <danbri@danbri.org> wrote: > > > On 21 Mar 2010, at 12:47, Hugh Glaser <hg@ecs.soton.ac.uk> wrote: > > > >> Hi Kingsley, I am right with you - finding stuff is hard. > >> But I do think we could make it easier for all of us. > >> Just the esw wiki alone requires me to put every set I create into a > >> bunch of places > > > > 10 years ago, looking for RDF on the public Web was like looking for a > > needle in a haystack. There wasnt much out there and it was poorly > > linked. So a big part of the thinking that led to the foaf/rdfweb > > design was to make discovery easier: if you find one rdf doc, you > > should be able to find most of the rest by following seeAlso and other > > kinds of links. > > > > Why isn't this enough? Perhaps because many of the datasets are huge > > db exports, crawlers are often overwhelmed and dissapear into depth- > > first holes? Or because we don't publish triples about doc- and > > dataset-types in a crawler-discoverable way? > Yes, sort of. > I think the problem is now with metadata for the datasets, which is great. > Actually if everyone published semantic sitemaps and voiD descriptions > etc., > and we had the tools to re-present the data, we would be well along the > road. > At worst, I might register my site somewhere (as I do with Sindice), say > "go > figure". Pages such as the esw ones should then appear magically. > > > > A wiki page is ok for initial bootstrap but we ought to outgrow that > > soon... > But I think we may be pleased to say that "soon" has arrived? > And perhaps if it was easier we would discover that there is so much more > out there that the wiki page hasn't actually been enough for a while. I can > think of 10 interesting datasets that aren't there (that aren't mine). > > I am tempted to say that we spend all our time persuading others to take > things like those tables and republish as RDF, but... :-) > > And yes, I know this has been a topic before, but we really should be > feeling increasingly embarrassed by this. > Well I got a bit carried away with some regular expressions (which you should never do on a Sunday) and came up with: 2>/dev/null curl http://esw.w3.org/SparqlEndpoints | grep -A6 '^<tr>$' | awk '{ i++ ; if (i==3 ) print $0 " ." ; if (i==1) print "\n<#endpoint" p++ ">\n" $0 " ;" ; if ( $1 == "<tr>" ) i = 0; }' | sed 's/<td> <b>/<dct:description> /' | sed 's/<.td><td>/<void:sparqlEndpoint> /' | sed 's/<void.*href="\([^"]*\).*/<void:sparqlEndpoint> <\1> ./' | sed 's/<dct:description>.*">\(.*\)<.a.*/<dct:description> "\1" ;/' <#endpoint1> <dct:description> Project</b> ; </td><dct:description> SPARQL endpoint</b> . <#endpoint2> <dct:description> "BBC Programmes and Music" ; <void:sparqlEndpoint> <http://bbc.openlinksw.com/sparql/> . <#endpoint3> <dct:description> "Bio2RDF" ; <void:sparqlEndpoint> < http://www.freebase.com/view/user/bio2rdf/public/sparql> . <#endpoint4> <dct:description> "BioGateway" ; <void:sparqlEndpoint> < http://www.semantic-systems-biology.org/biogateway/endpoint> . <#endpoint5> <dct:description> "BBC Backstage" ; <void:sparqlEndpoint> <http://jena.hpl.hp.com:3040/backstage> . <#endpoint6> <dct:description> "DBTune" ; <void:sparqlEndpoint> <http://dbtune.org/bbc/peel/sparql> . <#endpoint7> <dct:description> "DBTune" ; <void:sparqlEndpoint> <http://dbtune.org/bbc/playcount/sparql> . <#endpoint8> <dct:description> "DailyMed" ; <void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/dailymed/sparql> . <#endpoint9> <dct:description> "data.gov.uk" ; <void:sparqlEndpoint> <http://data.gov.uk/sparql> . <#endpoint10> <dct:description> "D2R Server" ; <void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/dblp/sparql> . <#endpoint11> <dct:description> "OpenLink Software" ; <void:sparqlEndpoint> <http://dbpedia.org/sparql> . <#endpoint12> <dct:description> "OpenLink Software" ; <void:sparqlEndpoint> <http://dbpedia-live.openlinksw.com/sparql/> . <#endpoint13> <dct:description> "Diseasome" ; <void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/diseasome/sparql> . <#endpoint14> <dct:description> "DoapSpace" ; <void:sparqlEndpoint> <http://doapspace.org/sparql> . <#endpoint15> <dct:description> "DrugBank" ; <void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/drugbank/sparql> . <#endpoint16> <td> <b><a href="http://code.google.com/p/openflydata/wiki/Flyatlas" class="external text" title=" http://code.google.com/p/openflydata/wiki/Flyatlas">FlyAtlas</a></b> ; <void:sparqlEndpoint> <http://openflydata.org/query/flyatlas_20080916> . <#endpoint17> <dct:description> "Fly-TED" ; <void:sparqlEndpoint> <http://openflydata.org/query/flyted_20081203> . <#endpoint18> <dct:description> "Gene Expression In-situ Images of fruitfly embryogenesis" ; <void:sparqlEndpoint> <http://spade.lbl.gov:2021/sparql> . <#endpoint19> <dct:description> "Gene Ontology Database" ; <void:sparqlEndpoint> <http://spade.lbl.gov:2020/sparql> . <#endpoint20> <dct:description> "DBTune" ; <void:sparqlEndpoint> <http://dbtune.org/henry/sparql/> . <#endpoint21> <dct:description> "IBM ATG (Advanced Technologies Group)" ; <void:sparqlEndpoint> <http://abdera.watson.ibm.com:8080/sparql> . <#endpoint22> <dct:description> "DBTune" ; <void:sparqlEndpoint> <http://dbtune.org/jamendo/sparql> . <#endpoint23> <dct:description> "LinkedCT" ; <void:sparqlEndpoint> <http://data.linkedct.org/sparql> . <#endpoint24> <dct:description> "Linked Movie Data Base" ; <void:sparqlEndpoint> <http://data.linkedmdb.org/sparql> . <#endpoint25> <dct:description> "LOD Cloud Cache" ; <void:sparqlEndpoint> <http://lod.openlinksw.com/sparql/> . <#endpoint26> <dct:description> "DBTune" ; <void:sparqlEndpoint> <http://dbtune.org/magnatune/sparql> . <#endpoint27> <dct:description> "DBTune" ; <void:sparqlEndpoint> <http://dbtune.org/musicbrainz/sparql> . <#endpoint28> <dct:description> "myExperiment" ; <void:sparqlEndpoint> <http://rdf.myexperiment.org/sparql> . <#endpoint29> <dct:description> "Neurocommons" ; <void:sparqlEndpoint> <http://sparql.neurocommons.org/sparql?> . <#endpoint30> <dct:description> "OpenLink Data Spaces" ; <void:sparqlEndpoint> <http://myopenlink.net:8890/sparql/> . <#endpoint31> <dct:description> "OpenLink Virtuoso" ; <void:sparqlEndpoint> <http://demo.openlinksw.com/sparql/> . <#endpoint32> <dct:description> "Project Gutenberg Metadata" ; <void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/gutendata/sparql> . <#endpoint33> <dct:description> "rdfabout.com" ; <void:sparqlEndpoint> <http://www.rdfabout.com/sparql> . <#endpoint34> <dct:description> "Revyu" ; <void:sparqlEndpoint> <http://revyu.com/sparql> . <#endpoint35> <dct:description> "RKBExplorer" ; <void:sparqlEndpoint> <http://*.rkbexplorer.com/sparql/> . <#endpoint36> <dct:description> "SemanticWeb.org Dog Food" ; <void:sparqlEndpoint> <http://data.semanticweb.org/sparql> . <#endpoint37> <dct:description> "SemanticWebSchool - Vienna" ; <void:sparqlEndpoint> <http://sparql.semantic-web.at/sparql> . <#endpoint38> <dct:description> SPARQLer</b> ; <void:sparqlEndpoint> <http://www.sparql.org/sparql> . <#endpoint39> <dct:description> Sparqlette</b> ; <void:sparqlEndpoint> <http://www.wasab.dk/morten/2005/04/sparqlette/> . <#endpoint40> <dct:description> "STW Thesaurus for Economics" ; <void:sparqlEndpoint> <http://zbw.eu/beta/sparql> . <#endpoint41> <dct:description> "Information about the Web-based Systems Group" ; <void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/is-group/sparql> . <#endpoint42> <dct:description> "URIBurner.com" ; <void:sparqlEndpoint> <http://uriburner.com/sparql/> . <#endpoint43> <dct:description> "voiD Data" ; <void:sparqlEndpoint> <http://void.rkbexplorer.com/sparql/> . <#endpoint44> <dct:description> W3C Spanish Office SPARQL Demo</b> ; <void:sparqlEndpoint> <http://www.w3c.es/Prensa/sparql/> . <#endpoint45> <dct:description> Wiki.Ontoworld.org</b> ; <void:sparqlEndpoint> <http://dannyayers.com:8888/ontoworld/> . <#endpoint46> <dct:description> "World Factbook" ; <void:sparqlEndpoint> <http://www4.wiwiss.fu-berlin.de/factbook/sparql> . <#endpoint47> <dct:description> "Linked Periodicals" ; <void:sparqlEndpoint> < http://api.talis.com/stores/periodicals/services/sparql> . <#endpoint48> <dct:description> Project</b> ; </td><dct:description> SPARQL endpoint</b> . <#endpoint49> <dct:description> "Mindswap" ; <void:sparqlEndpoint> < http://www.mindswap.org/cgi-bin/2003/pellet/pelletPost.cgi> . <#endpoint50> <dct:description> "UniProt" ; <void:sparqlEndpoint> < http://labs.intellidimension.com/uniprot/query2.rsp?&exec=Execute&q=> . <#endpoint51> <dct:description> Project</b> ; </td><dct:description> SPARQL endpoint</b> . <#endpoint52> <dct:description> "AIFB" ; <void:sparqlEndpoint> <http://km.aifb.uni-karlsruhe.de/services/sparql> . <#endpoint53> <dct:description> "Opera Community" ; <void:sparqlEndpoint> <http://my.opera.com/community/sparql/sparql> . <#endpoint54> <dct:description> "SparqlSphere" ; <void:sparqlEndpoint> [SPARQL endpoint] . > > Best > Hugh > > > > Dan > > >
Received on Sunday, 21 March 2010 14:57:05 UTC