RE: magetab2magerdf from Michael Miller on 2010-01-05 (public-semweb-lifesci@w3.org from January 2010)

From: Michael Miller <mmiller@teranode.com>
Date: Tue, 5 Jan 2010 18:53:10 -0500
To: "Jim McCusker" <james.mccusker@yale.edu>, "mdmiller" <mdmiller53@comcast.net>, "w3c semweb HCLS" <public-semweb-lifesci@w3.org>
Message-ID: <6401DB16544A5B4AA279B921F43547EC066FC59E@MI8NYCMAIL16.Mi8.com>
hi jim,

 

from section 2.2.3, figure 6 (i hope the figure comes out, otherwise
reference the MAGE-TAB specification [1] directly):

 

2.2.3 Example: Iterated Design dual channel

This is shown in Figure 6.

(a) Investigation design graph

  

(b) SDRF representation

Figure 6: Iterated design, dual channel. LabeledExtract-Dye associations
can be added as a separate "Label"

 

Note that each source is repeated twice on a separate line so there is
no ambiguity as to the different processing.  i highly recommend reading
the spec, tim rayner did a great job of describing how different DAGs
are represented by a spreadsheet.  also, you'll need to take into
account that there can be multiple Protocol REF columns between each set
of Name nodes.

 

cheers,

michael

 

[1] http://www.mged.org/mage-tab/MAGE-TABv1.0.pdf

 

 

 

From: public-semweb-lifesci-request@w3.org
[mailto:public-semweb-lifesci-request@w3.org] On Behalf Of Jim McCusker
Sent: Tuesday, January 05, 2010 3:10 PM
To: mdmiller; w3c semweb HCLS
Subject: Re: magetab2magerdf

 

For what it's worth, I'm generating a DAG using ProtocolApplications as
edges. 

Related to that, I have a limpopo question:

In a case where there are 2 channel microarrays, different protocols are
used to create labeled extracts. If there is a technical dye swap
replicate (where the same extract is split and labeled differently), how
does the list of ProtocolNodes (where one is a Cy5 labeling and the
other is a Cy3 labeling) relate each LabeledExtract to the source
Extract? From what I can tell, this information is currently lost, as
the Extract (say ext1) would have 2 protocol nodes (Cy5 and Cy3), and
then two child nodes (labext1 and labext2). I need to be able to relate
which is which. 

Thanks,
Jim

On Sat, Jan 2, 2010 at 10:50 PM, Kei Cheung <kei.cheung@yale.edu> wrote:

Hi Michael et al,

The question is what is the appropriate structure of the DAG for
answering the semantic queries for our microarray use case.



mdmiller wrote:

hi kei,

mage-tab and its extension isa-tab is designed from the principal of a
DAG, in essence it is a flattening of the dag into a spreadsheet which
is describe in the MAGE-TAB 1.0 spec [1] in great detail.  i believe the
MAGE-TAB parser stores the nodes as a DAG [2].  EBI has also developed a
suite of tools around the MAGE standard [3] including a DAG
visualization. ArrayExpress for each experiment also has a visualized
view of the experiment as a DAG that can be downloaded.  in MAGE-ML the
'_ref' elements are used to describe the DAG in a MAGE document.

is the one mentioned below editable?  that's the one thing about the EBI
visualization, it is not editable.

 

I don't think the one mentioned in the paper below is editable.

Cheers,

-Kei

	 

	
	by the by, MAGE-TAB is also being used to report next-gen
sequencing experiments in ArrayExpress.
	
	cheers,
	michael
	
	[1] www.mged.org
	[2] https://sourceforge.net/projects/limpopo/
	[3] 
http://bioinformatics.oxfordjournals.org/cgi/content/full/25/2/279
	
	----- Original Message ----- From: "Kei Cheung" <
kei.cheung@yale.edu>
	To: "mdmiller" <mdmiller53@comcast.net>
	Cc: "Jim McCusker" <james.mccusker@yale.edu>; "w3c semweb HCLS"
<public-semweb-lifesci@w3.org>
	Sent: Sunday, December 13, 2009 6:34 PM
	Subject: Re: magetab2magerdf
	
	

	Hi Jim, Michael,
	
	The following paper describes how to convert mage-tab and
isa-tab (how does this differ from mage-tab?) into DAG for visualization
purposes.
	
	http://www.biomedcentral.com/1471-2105/10/133
	
	Why not DAG for machine readability as well?
	
	Cheers,
	
	-Kei
	
	mdmiller wrote:

	hi jim,
	
	looks like you're making great progress.  i have a few comments
in-line below.
	
	cheers,
	michael
	
	----- Original Message ----- From: "Jim McCusker" <
james.mccusker@yale.edu>
	To: "w3c semweb HCLS" <public-semweb-lifesci@w3.org>
	Sent: Tuesday, December 08, 2009 6:05 AM
	Subject: magetab2magerdf
	
	

	I'm distinguishing between magetab2rdf (raw conversion of
magetab into
	an RDF structure) and magetab2magerdf (conversion of magetab
into an
	RDF-based MAGE-OM structure) here. My purposes and goals require
a
	magetab2magerdf approach, so that's what I've been working on.
	
	I have checked in code for magetab2magerdf at the googlecode
project
	http://magetab2rdf.googlecode.com. The code can be checked out
from:
	
	http://magetab2rdf.googlecode.com/svn/trunk/magetab2magerdf/
	
	and example RDF is in:
	
	
http://magetab2rdf.googlecode.com/svn/trunk/magetab2magerdf/examples/E-M
EXP-986/ 
	
	I currently load the IDF-related entities into the RDF. I'm
beginning
	work on SDRF next.
	
	
http://magetab2rdf.googlecode.com/svn/trunk/ontologies/mage-om.owl
	contains the additional properties and classes needed to support
an
	RDF-based MAGE-OM on top of the MGED Ontology.
	
	A few notes on E-MEXP-986:
	
	The URI for the MGED Ontology is
	http://mged.sourceforge.net/ontologies/MGEDontology.owl, but has
been
	set to http://mged.sourceforge.net/ontologies/MGEDontology.php
in the
	IDF. The actual Term Source name is "The MGED Ontology".
	A common practice seems to be to refer to "MGED Ontology"
without
	reference to its URI.

	
	
	
	as you probably noticed, 
http://mged.sourceforge.net/ontologies/MGEDontology.php allows appending
"#{class name}" to go directly to the definition of the term, so in a
sense it is indeed a valid URI, that is a URL.  it also came before the
owl format.  can th epowl format be reached into over he net to extract
simply the class definition or does it need to be downloaded and
processed locally? my understanding is that a site would have to have
some sort of query, hopefully sparql, mechanism on top to enable this.

	
	Since I have to import the MGED ontology already for it's
classes and
	properties, I have already imported it under the correct URI. I
have
	added a kludge where if the term source name contains the string
"MGED
	Ontology", the code assumes you mean the MGED Ontology, and sets
the
	URI appropriately. However, this is a one-off solution.

	
	
	
	think of it as same as

	
	I went back and forth about importing the Term Source
ontologies.
	However, this particular experiment has used the "ArrayExpress"
term
	source using the URI "http://www.ebi.ac.uk/arrayexpress/" which
	doesn't correspond to an available ontology, but is technically
a term
	source.

	
	
	
	yes, and it does support a query mechinism, albeit a one off for
that site. i believe they plan on adding support for a sparql endpoint
but aren't sure if or when.

	
	I'm considering attempting to import the ontology if it's
available
	and validate if it is, but if it fails to resolve to a document
the
	validation will not happen against that term source.
	
	A note on Limpopo:
	
	The IDF Comment didn't seem to import on this experiment. I'm
not sure
	if it's a format problem or something else.

	
	
	
	i ran into this also, the implementation assumes
"Comment[type]\ttext\ttext..." to coresspond to the format of the other
fields in the IDF.  the MAGE-TAB 1.0 spec doesn't address, my assumption
was that it was simply "Comment[type]text" but that's not what the
parser expects.  we'll be discussing this for the MAGE-TAB 1.1 spec to
clarify it one way or another, possibly updating the parser before that.

	
	Thoughts and feedback are greatly appreciated.
	
	Jim
	-- 
	Jim McCusker
	Programmer Analyst
	Krauthammer Lab, Pathology Informatics
	Yale School of Medicine
	james.mccusker@yale.edu | (203) 785-6330
	http://krauthammerlab.med.yale.edu
	
	PhD Student
	Tetherless World Constellation
	Rensselaer Polytechnic Institute
	mccusj@cs.rpi.edu
	http://tw.rpi.edu
	
	

	
	
	

	 

	 

 




-- 
Jim
--
Jim McCusker
Programmer Analyst
Krauthammer Lab, Pathology Informatics
Yale School of Medicine
james.mccusker@yale.edu | (203) 785-6330
http://krauthammerlab.med.yale.edu

PhD Student
Tetherless World Constellation
Rensselaer Polytechnic Institute
mccusj@cs.rpi.edu
http://tw.rpi.edu
Attachments

image/png attachment: image003.png
Received on Wednesday, 6 January 2010 16:11:44 UTC