Re: data quality

From: Peter Ansell <ansell.peter@gmail.com>
Date: Sun, 16 Oct 2011 15:10:53 +1000
Message-ID: <CAGYFOCSJXsVpYMZ1gzwmB5T06SDEhcLkd87PVqMETh-359DKBA@mail.gmail.com>
To: w3c semweb hcls <public-semweb-lifesci@w3.org>
The SPIN syntax is quite abstract at first glance, but the ability to
extend SPARQL is very interesting. There are other open source tools,
such as R2R [1] and QueryAll [2], the Bio2RDF library which I maintain,
that IMO may make it a little easier to model data quality
normalisation rules, especially using SPARQL 1.1. QueryAll also models
some of the other data quality considerations, such as the query
replication part of provenance, and dataset trust. Having said that,
all three provide RDF-serialised rules, so they are easy to transport
in that respect.

The basic model for QueryAll is published at [4] and anyone is free to
fork the software on GitHub and send back pull requests with
improvements. In particular, new rule implementations would be
appreciated to extend the repertoire.

The Bio2RDF webapp normalisation rules give some examples of the way
QueryAll can be used. In Bio2RDF we have been mostly concerned with
URI transformations to get consistent URIs. The 3
base-bio2rdf-rdfrules-* config files in [3] may be useful for
examples, particularly the SPARQL rules that are similar to what R2R
and SPIN provide.
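
To give a rough idea of what a URI transformation rule does, here is a
minimal Python sketch. It is not QueryAll code and does not use the
QueryAll API: the RULES table, the normalise_uri function and the
example.org pattern are hypothetical, and the regexes are assumptions
for illustration rather than rules copied from the config files in [3].
Only the general idea, rewriting provider URIs to one consistent
namespace:identifier form, comes from the description above.

```python
import re

# Hypothetical rule table: (regex for a provider URI, replacement template).
# These patterns are illustrative assumptions, not taken from the Bio2RDF config.
RULES = [
    (re.compile(r"^http://purl\.uniprot\.org/uniprot/(.+)$"),
     r"http://bio2rdf.org/uniprot:\1"),
    (re.compile(r"^http://example\.org/geneid/(\d+)$"),
     r"http://bio2rdf.org/geneid:\1"),
]


def normalise_uri(uri):
    """Return the normalised form of uri, or uri unchanged if no rule matches."""
    for pattern, template in RULES:
        if pattern.match(uri):
            # The pattern is anchored at both ends, so sub() rewrites the whole URI.
            return pattern.sub(template, uri)
    return uri


if __name__ == "__main__":
    print(normalise_uri("http://purl.uniprot.org/uniprot/P12345"))
```

The first matching rule wins and URIs that match nothing pass through
untouched, so a rule set like this can be applied to a whole result set
without damaging URIs it does not know about.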

Peter

[1] http://www4.wiwiss.fu-berlin.de/bizer/r2r/spec/
[2] https://github.com/ansell/queryall/tree/develop/queryall-api/src/main/java/org/queryall/api/rdfrule
[3] https://github.com/bio2rdf/bio2rdf-webapp/tree/develop/src/main/resources/config
[4] http://dx.doi.org/10.1016/j.future.2010.08.016


On 16 October 2011 13:15, Mead, Charlie (NIH/NCI) [C]
<meadch@mail.nih.gov> wrote:
> Shouldn’t rules – and tools/standards to support them, e.g. SPIN?? – be part
> of this discussion as well?
>
>
>
> From: Bob Futrelle [mailto:bob.futrelle@gmail.com]
> Sent: Friday, October 14, 2011 16:04
> To: Michel Dumontier
> Cc: w3c semweb hcls
> Subject: Re: data quality
>
>
>
> Provenance doesn't appear to get much attention at SemWebQuality.org.
>
> It's more complex than just some notes about origins.
>
>
>
> http://openprovenance.org/
>
>
>
> - Bob Futrelle
>
>
>
> On Fri, Oct 14, 2011 at 9:38 AM, Michel Dumontier
> <michel.dumontier@gmail.com> wrote:
>
> Ran across this site on semantic web data quality (even includes an
> ontology):
>
>
>
> http://semwebquality.org/mediawiki/index.php?title=SemWebQuality.org
>
>
>
> Cheers!
>
>
>
> m.
>
>
>
> --
> Michel Dumontier
> Associate Professor of Bioinformatics
> Carleton University
> http://dumontierlab.com
>
>
Received on Sunday, 16 October 2011 05:11:32 GMT