Metadata for Bio SPARQL endpoints?

Hi,

I've recently seen a post on the W3C mailing list that points to a very
useful resource:

http://labs.mondeca.com/sparqlEndpointsStatus/index.html

(It's a bit incomplete on the bio side... perhaps not many people in this
area have used CKAN?)

It occurred to me that, for a resource like that to be really useful in
research, we would need one extra piece of information: how fresh the data
is.

Do you know if there is any standard metadata to indicate when the content
of an endpoint was last refreshed?
Technically speaking, this kind of information should be associated with the
data as provenance. In practice, however, 90% of the utility can be reached
by having some state information for each big graph in the endpoint,
corresponding to the major data sources.

In practice it would be nice to have a standard dictionary so that we can
ask the triplestore for the list of graphs/datasets and,

for each of these (or for the endpoint itself, if it holds information that
is "coherent" source-wise):
- update frequency
- last update
- data source (type and, where available, a link).
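
To make the request above concrete, here is a rough sketch of what such a
query could look like. It is only an illustration under assumptions that
nothing in this message confirms: that each named graph is described in the
default graph using Dublin Core terms (dcterms:modified,
dcterms:accrualPeriodicity, dcterms:source), and that the endpoint returns
results in the SPARQL JSON results format. The names GRAPH_METADATA_QUERY,
parse_graph_metadata and age_in_days are made up for this example.

```python
import json
from datetime import datetime, timezone

# Assumption: graph-level metadata lives in the default graph and uses
# Dublin Core terms. A real endpoint may use another vocabulary or none.
GRAPH_METADATA_QUERY = """
PREFIX dcterms: <http://purl.org/dc/terms/>
SELECT DISTINCT ?graph ?modified ?frequency ?source
WHERE {
  GRAPH ?graph { ?s ?p ?o }
  OPTIONAL { ?graph dcterms:modified ?modified }
  OPTIONAL { ?graph dcterms:accrualPeriodicity ?frequency }
  OPTIONAL { ?graph dcterms:source ?source }
}
"""


def parse_graph_metadata(response_text):
    """Turn a SPARQL JSON results document into one dict per result row.

    Variables left unbound by an OPTIONAL clause are simply absent
    from the corresponding dict.
    """
    bindings = json.loads(response_text)["results"]["bindings"]
    return [
        {var: cell["value"] for var, cell in row.items()}
        for row in bindings
    ]


def age_in_days(modified, now):
    """Whole days between a 'modified' value and a timezone-aware 'now'.

    Only the leading YYYY-MM-DD part of the value is used, and it is
    interpreted as midnight UTC.
    """
    last = datetime.strptime(modified[:10], "%Y-%m-%d")
    last = last.replace(tzinfo=timezone.utc)
    return (now - last).days
```

The query string would be sent to the endpoint with whatever HTTP client is
at hand. The response body then goes through parse_graph_metadata, and
age_in_days gives a crude freshness figure for every graph that actually
carries a dcterms:modified value.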

Does anybody have this already? Opinions?

best,
Andrea

Andrea Splendiani
Senior Bioinformatics Scientist
Centre for Mathematical and Computational Biology
+44(0)1582 763133 ext 2004
andrea.splendiani@bbsrc.ac.uk

Received on Friday, 4 March 2011 17:18:01 UTC