RE: Ontology editor + why RDF? from Miller, Michael D (Rosetta) on 2006-03-31 (public-semweb-lifesci@w3.org from March 2006)

From: Miller, Michael D (Rosetta) <Michael_Miller@Rosettabio.com>
Date: Fri, 31 Mar 2006 10:21:39 -0800
To: "Jim Hendler" <hendler@cs.umd.edu>, "Kashyap, Vipul" <VKASHYAP1@PARTNERS.ORG>
cc: public-semweb-lifesci@w3.org
Message-ID: <E1FPOFu-0007wE-DU@maggie.w3.org>
Hi Jim and Vipul,
 
"If all three of us published to the Web, and used common URIs (or a
third party expressed equivalences) then the system as a whole would
have the information..."
 
Just to echo what Jim is saying, the Life Science Identifier (LSID), a
type of URI, basically came to be based on the experience and need in
the MAGE specification for such a common identifier.  This has helped
tremendously already in linking information between gene expression
experiments, particularly in referencing the MGED Ontology and GO.
 
cheers,
Michael
 
Michael Miller 
Lead Software Developer 
Rosetta Biosoftware Business Unit 
www.rosettabio.com 

	-----Original Message-----
	From: public-semweb-lifesci-request@w3.org
[mailto:public-semweb-lifesci-request@w3.org] On Behalf Of Jim Hendler
	Sent: Friday, March 31, 2006 9:46 AM
	To: Kashyap, Vipul; Danny Ayers
	Cc: public-semweb-lifesci@w3.org
	Subject: RE: Ontology editor + why RDF?
	
	
	Vipul - not sure this is best thread for this whole discussion,
but here's a quick answer and if you want longer, I can point you to
various things starting from the Scientific American article [1] and
also an article on integrating applications on the Web that Tim, Eric
and I did, originally published in Japanese but available in English at
[2].  More detailed technical stuff is also available if you want, but
you should be familiar with that literature...
	 The point your missing, which I've been making for years, is
that on the Web "a little semantics goes a long way" -- here's a simple
example.  If you have a Database that says you live in Boston, I have
one that says I live near BWI airport, and Danny has one that has tables
of distances between airports and cities, but these all use their own
terminology and live in their own boxes, then none of the three of us
(nor any third party) would know how far apart you and I live.  If all
three of us published to the Web, and used common URIs (or a third party
expressed equivalences) then the system as a whole would have the
information - so there would be Semantics available on the Web that
would not be available before -- the "sameas" type information
(expressed through same URI names) is very powerful, even before you
start worrying about the next levels of semantics.
	 I'm not arguing that expressive semantics is bad, it is very
valuable in some applications, but the traditional AI community often
ignores the importance of breadth, despite Google rubbing our noses in
it every day.   Even more important, once the data is on the Web in RDF,
it can be INCREMENTALLY extended, by the original provider or by third
parties, in ways that do add the expressivity - something not doable
when the datasources are not Web accessible.
	 Here's another way to think about it - on the Web my documents
can point to your documents.  However, my databases (or their schemas)
cannot point at elements in your databases. my thesaurus cannot point to
words in your thesaurus, etc.   The Web showed us that the network
effect is unbelievably powerful, and we need to be able to use that
power for data, terminologies, ontologies and the rest. 
	 -Jim H.
	p.s. You might also want to check out my article "Knowledge is
power: the view from the Semantic Web" which appeared in AI Magazine in
the January 06 issue.  It's not available on line free to non-AAAI
members, but I can send you a preprint version if you'd like - it's
aimed at explaining the value of the linking stuff to the applied AI
audience.

	[1]
http://www.sciam.com/article.cfm?articleID=00048144-10D2-1C70-84A9809EC5
88EF21
	[2] http://www.w3.org/2002/07/swint


	At 8:08 -0500 3/31/06, Kashyap, Vipul wrote:
	>> I saw a quote not long ago, not sure of the source (recognise
this
	>> Jim?), approximately: "what's new about the Semantic Web
isn't the
	>> semantics but the web".
	>
	>[VK] This is a great quote and expresses clearly that the value
proposition
	>     in representing and linking vocabularies using URIs stems
from the
	>     Web more than "semantics"
	>
	>> I take VK's point that this in itself isn't going to convince
many IT
	>> folks. I think the big persuader there is data integration,
even on a
	>> sub-enterprise kind of scale.
	>
	>[VK] Agreed, one of the clearer value propositions is data
integration.
	>
	>> Being able to use ontologies to infer new information is a
massive
	>> plus (I imagine especially in the lifesciences). Bigger still
are the
	>> (anticipated) benefits of the Semantic Web when the network
effect
	>> kicks in. But the ability to use RDF to simply merge data
from
	>> multiple sources consistently (and query across it), without
needing
	>> complete up-front schema design is a very immediate, tangible
gain.
	>>
	>> The work done around SKOS (and specific tasks like expressing
WordNet
	>> in RDF) does suggest RDF/OWL is a particularly good
technology choice
	>> for thesauri.
	>
	>[VK] Danny, has articulated some potential benefits:
	>     - Network effects
	>     - Schema-less linking based data integration
	>
	>I would argue that both these benefits stem from the web
infrastructure and have
	>nothing to do with the "semantics" of anything per-se.
	>
	>Also, one could argue that having a standardized markup
language whether it
	>be even HTML or XML enables the above to a significant extent.
	>
	>So the value proposition question could be:
	>
	>What is it about RDF that enables network effects and schema
less data linking
	>better than HTML, relational tables or XML in a more
significant manner?
	>
	>Is the improvement enabled v/s the cost required to achieve it
an attractive
	>trade off?
	>
	>Look forward to yours and the groups responses to these
questions.
	>
	>Cheers,
	>
	>---Vipul

	-- 
	Professor James Hendler                   Director
	Joint Institute for Knowledge Discovery
301-405-2696
	UMIACS, Univ of Maryland                    301-314-9734 (Fax)
	College Park, MD 20742
http://www.cs.umd.edu/~hendler 
	Web Log: http://www.mindswap.org/blog/author/hendler
Received on Friday, 31 March 2006 18:22:00 UTC