Re: Reminder: BioRDF telcon on Monday at 11AM EDT (4PM UK, 5PM CET)

Dear all,

Today's BioRDF call with focus on aligning multiple genelist representations
(and their experimental context) derived from microarray experimental
results to improve integration of linked data represented using different
standards.
There are several things that need to be aligned before such integration is
possible

1) Classes and properties; 2) Different URI used for identifying genes,
diseases, anatomical part, etc ; 3) Methods to transform the data when
statement values are not of the same datatype

We are currently dealing with three different represetantions of similar
datasets:

a. RDF representation of genelists from the gene experession atlas
SPARQL endpoint at http://www.open-biomed.org.uk/sparql/endpoint/gxa

b. RDF representation of Provenance of Microarray Experiments by the bioRDF
group
SPARQL endpoint at
http://hcls.deri.org:8080/openrdf-sesame/repositories/biordfmicroarray

c. RDF representation of microarrays and the raw .CEL files derived from
those experiments
SPARQL endpoint at http://bloodprogram.hsci.harvard.edu/sparql



To answer the same competency questions from each of the endpoints, the
syntax of the SPARQL queries are necessarily different.
As an example for 1), consider the question "list all experiments".

in a) the question should be formulated as "?experiment        rdf:type
    <http://www.ebi.ac.uk/efo/EFO_0004033> . "
in b) the question should be formulated as "?experiment        rdf:type
     <http://purl.org/net/biordfmicroarray/ns#microarray_experiment> . "
in c) the quesion should be formulate as "?experiment        rdf:type
   <http://mged.sourceforge.net/ontologies/MGEDontology.php#BioAssay> ."

To solve and align these three we have prepared a table with the equivalent
classes and properties between the three representations (
https://spreadsheets.google.com/spreadsheet/ccc?key=0AlyJy66aJ50tdGktd1BRUGN4aEFUMC15aURtQ3VnRXc&hl=en_US#gid=0
)

For an  example of 2), we will have to link different gene identifiers in a)
and b) (NCBI gene id vs Ensembl gene id) and for an example of 3) data
transformation will be required before the datasets can be integrated since
in a. differential expression is encoded as "UP" or "DOWN" whereas in b.
differential expression encoded as fold change values (e.g. foldChange>0 <=>
"UP").

We would be delighted if those of you who are interested in such matters
could join the call today with the purpose of helping us solve these
integration issues in microarray result representation.

Best
Lena



On Mon, Jul 11, 2011 at 8:05 AM, M. Scott Marshall <mscottmarshall@gmail.com
> wrote:

> Here is the reminder for today's BioRDF call:
>
> http://www.w3.org/wiki/HCLSIG_BioRDF_Subgroup/Meetings/2011/07-04_Conference_Call
>
> Conference Details
>
> Date of Call: Monday, July 11, 2011
> Time of Call: 11:00 am Eastern Time, 4 pm UK, 5 pm CET
> Dial-In #: +1.617.761.6200 (Cambridge, MA)
> [Note: limited access to European dial in numbers below]
> Dial-In #: +33.4.26.46.79.03 (Nice, France)
> Dial-In #: +44.203.318.0479 (Bristol, UK)
> Participant Access Code: 4257 ("HCLS")
> IRC Channel: irc.w3.org port 6665 channel #HCLS (see W3C IRC page for
> details, or see Web IRC), Quick Start: Use
> http://www.mibbit.com/chat/?server=irc.w3.org:6665&channel=%23hcls for
> IRC access.
> Duration: ~1 hour
> Convener: Helena Deus
> Scribe: TBD
>
> Agenda
>
> Discuss JBI article - Helena
> Microarray, Expression RDF, Progress on the W3C note - All
> Feedback to the new Gene Expression Atlas SPARQL endpoint - All
> AOB
>
>
> --
> M. Scott Marshall, W3C HCLS IG co-chair, http://www.w3.org/blog/hcls
> http://staff.science.uva.nl/~marshall
>
>


-- 
Helena F. Deus
Post-Doctoral Researcher at DERI/NUIG
http://lenadeus.info/

Received on Monday, 11 July 2011 10:50:50 UTC