- From: Ted Thibodeau Jr <tthibodeau@openlinksw.com>
- Date: Tue, 27 Apr 2010 10:31:31 -0400
- To: Juan Sequeda <juanfederico@gmail.com>
- Cc: "Eric Prud'hommeaux" <eric@w3.org>, public-rdb2rdf-wg@w3.org
On Apr 27, 2010, at 09:42 AM, Juan Sequeda wrote: > On Tue, Apr 27, 2010 at 7:36 AM, Eric Prud'hommeaux <eric@w3.org> wrote: > * Juan Sequeda <juanfederico@gmail.com> [2010-04-26 20:27-0400] > > This week I commented on the role of the ontology [1] where we should > > consider that we are mapping to the following: > > > > - existing domain ontology (FOAF, SIOC, etc) > > - a putative ontology (automatically generating the ontology from the > > schema/DDL) > > - a federated ontology: combining different ontologies > > > > > > Eric initially manifested the need to talk about the expressivity [2]. If I > > understand correctly the issue of the expressivity is being presented as > > "DIRECT - Recapulating Relational Structure" [3] and TRANSFORM - > > Non-isomorphic transformation [4]. > > > > I'm honestly a bit confused with both of these requirements, but this is my > > take on them. Eric, please correct me if I'm wrong: > > > > DIRECT: What I understand is that you could essentially automatically > > generate RDF triples (a graph) from the relational data. The top image shows > > a graph of relational data and the bottom image shows how the previous graph > > can be exposed as RDF (a graph) and it is equivalent (isomorphic) to the > > relational data graph. > > > > TRANSFORM: In this section you have taken the previous relational data graph > > and put it into the graph structure of two different ontologies. You show > ^^^^^^^^^^ > > that if you compare the relational data graph with the two graph structures > > of the two different ontologies, then the graphs are different > ^^^^^^^^^^ > > (non-isomorphic). > > I believe that you understand the expressivity points DIRECT and > TRANSFORM, but I don't believe we can use the word "ontology" to > describe graph shape. For example, an attribute diabetes_history > could be represented in the SNOMED ontology as > # likely isomorphic to relational graph. > _:p sn:160303001 true . # family history of diabetes > or as > # likely non-isomorphic, but better model. > _:p sn:57177007 _:h . # family history > _:h sn:24609004 sn:73211009 . # with finding of diabetes. > As a sense of scale, the most popular medical ontology, SNOMED, > has around 370k analogous representation choices (pre-coordinated > terms). > > Pre-existing ontologies also demand term synthesis. A ball bearing > database with hardening treatment T5 would likely be expressed in a > pre-existing RDF graph as a URI in an ontology of eutectic hardening. > > In this, pre-existing ontologies are a motivation for a set of > expressivities. I think the expressivity you're seeking is tied to *reasoning*, not tied to exposing RDB data as RDF data. Bringing RDB into RDF requires only that the schema of that RDB be mapped to a "direct" or "putative" ontology -- which *is* the correct term. This ontology serves only to unambiguously identify a single cell (table, column, row) within that schema. I suggest that then mapping that RDF into a "domain" ontology (e.g., SNOMED) is a separate concern -- which may be addressed in a couple of ways -- 1. replication with transformation 2. mapping ontologies The first means that you decide *once* how SNOMED corresponds to a given RDB schema -- and if that correspondence changes, you have to somehow discard all the triples that resulted from the original conversion and then re-convert the RDB data. The second means that you decide how you think SNOMED corresponds to your putative ontology, and create a "mapping" ontology -- which does little more than declare broaderClass, narrowerClass, equivalentClass, sameAs, and such. If you realize later that one of your mappings is wrong, you change this ontology -- everything else remains as it is. Note that #2 does not mandate either forward- or backward-chaining. You *can* work from #2 and replicate & transform, if you find that works better for your deployment scenario. You *can* use reasoning engines to work entirely dynamically, if that works better for you. Note that #1 *does* mandate forward-chaining. You *cannot* use a reasoning engine to revise the putative-to-domain mapping once replication & transformation has been done. For this simple reason, I strongly advise that we *not* combine putative-to-domain ontology mapping into the rdb2rdf scenario -- because it makes a decision which we haven't been chartered for, and which I believe we are perilously close to deciding in the worst possible way. Be seeing you, Ted -- A: Yes. http://www.guckes.net/faq/attribution.html | Q: Are you sure? | | A: Because it reverses the logical flow of conversation. | | | Q: Why is top posting frowned upon? Ted Thibodeau, Jr. // voice +1-781-273-0900 x32 Evangelism & Support // mailto:tthibodeau@openlinksw.com // http://twitter.com/TallTed OpenLink Software, Inc. // http://www.openlinksw.com/ 10 Burlington Mall Road, Suite 265, Burlington MA 01803 http://www.openlinksw.com/weblogs/uda/ OpenLink Blogs http://www.openlinksw.com/weblogs/virtuoso/ http://www.openlinksw.com/blog/~kidehen/ Universal Data Access and Virtual Database Technology Providers
Received on Tuesday, 27 April 2010 14:41:59 UTC