W3C home > Mailing lists > Public > public-semweb-lifesci@w3.org > September 2011

RE: How much does data integration cost ?

From: Michael Miller <Michael.Miller@systemsbiology.org>
Date: Fri, 16 Sep 2011 08:17:14 -0700
Message-ID: <4d8380ace4d13d67938a1158765f0284@mail.gmail.com>
To: "Mork, Peter D.S." <pmork@mitre.org>, HCLS IG <public-semweb-lifesci@w3.org>
hi all,

peter, nice article, it matches well my experience.

one thing to note is that, in this context, that the mapping is
(hopefully) a one shot deal that then can be used into the future without
much change, e.g. the bio* efforts that map to sequence database records.
also if one has a standard target that everything is mapped to, this also
helps.  my experience was mapping third party gene expression experiments
(data and annotation) to MAGE-ML.  then there was a standard mapping that
didn't have to change from MAGE-ML to our Rosetta Resolver application
which provided the UI.

cheers,
michael

> -----Original Message-----
> From: public-semweb-lifesci-request@w3.org [mailto:public-semweb-
> lifesci-request@w3.org] On Behalf Of Mork, Peter D.S.
> Sent: Wednesday, September 14, 2011 9:29 AM
> To: HCLS IG
> Subject: RE: How much does data integration cost ?
>
> This article
> (http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.5.6098&rep=re
> p1&type=pdf) doesn't give absolute numbers, but it does describe what
> portions of a data integration task eat up the most time.
>
> Peter Mork
>
>
> -----Original Message-----
> From: public-semweb-lifesci-request@w3.org [mailto:public-semweb-
> lifesci-request@w3.org] On Behalf Of Andrea Splendiani
> Sent: Wednesday, September 14, 2011 12:25 PM
> To: HCLS IG
> Subject: How much does data integration cost ?
>
> Hi,
>
> I was wondering if anybody on this list has some figures on how much
> time/resources are spent in data integration, as a percentage of the
> overall
> 'task' performed.
> I often got the impression that 'data integration' is an obscure entity
> for
> many final users. For instance people concerned about getting results
> out of
> data usually only refer to the overall process as 'analysis', and often
> data
> integration is an ill defined entity shadowed by a better defined
> statistical analysis.
> I know this varies across organizations/tasks and that the distinction
> between 'data integration' and the rest is a bit fuzzy, however, in a
> first
> approximation, which is the size of the problem that the Semantic Web
> is
> trying to tackle ?
> Obviously, I would be interested in the Life Sciences and Health Care
> context.
>
> best,
> Andrea Splendiani
>
>
> Andrea Splendiani
> Senior Bioinformatics Scientist
> Centre for Mathematical and Computational Biology
> +44(0)1582 763133 ext 2004
> andrea.splendiani@bbsrc.ac.uk
>
>
>
>
>
Received on Friday, 16 September 2011 15:42:40 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 17:21:00 UTC