- From: Achille Fokoue <achille@us.ibm.com>
- Date: Wed, 20 Feb 2008 12:52:49 -0500
- To: Alan Ruttenberg <alanruttenberg@gmail.com>
- Cc: "Web Ontology Language ((OWL)) Working Group WG" <public-owl-wg@w3.org>, public-owl-wg-request@w3.org
- Message-ID: <OF8A909D25.D5959144-ON852573F5.0062248F-852573F5.00623812@us.ibm.com>
Hi, SHER [1] has indeed focused on Alan?s point 3) ?scalability to large numbers of instances?. However, our approach has been to attempt scalability over ?practical? large knowledge bases without restricting their expressivity. It has successfully been applied to various real world problems (e.g. Automated Clinical trials matching using ontologies [2], Scalable Cleanup of Information Extraction Data Using Ontologies [3], and a work-in-progress application of semantic search against MEDLINE biomedical literature databases). Although our initial focus was not to restrict the expressivity, we have come to appreciate the additional improvement in performance and scalability obtained by adding some restrictions. Fragments were one of the most important reasons for IBM?s involvement in OWL 1.1 WG. We are interested in fragments that will provide a clear and simple guidance to users (non-dl experts) as to how to control in a standard way the computational complexity of reasoning on their knowledge bases. From our perspective, it will, on the one hand, reduce resistance to the adoption of the technology as our customers could start with annotating their data with concepts and relationships defined, for example, in a tractable subset without impact on performance. On the other hand, since the data is already annotated with semantic information ?alas inexpressive one-, more expressive semantic views could be defined without changing the data. So far, I am satisfied by the efforts towards simplicity in the WG - in particular, the use of rule-style semantics. We also see OWL 1.1 as an opportunity to bridge the gap between industry and academia. In particular, we consider as an important goal the standardization of a tractable subset of OWL that would work well with traditional DBMS. Something in the flavor of DL-Lite family could fit that requirement. We like the rule-style semantics of OWL Prime, but we are not convinced yet that its set of constructs will enable it to scale over large instances. In a nutshell, 1) we are not proposing a new fragment, 2) we would likely support an EL like fragment in REC, and 3) we agree with Alan on the need for a fragment gears towards scalability to large numbers of instances (DL-Lite could be a good starting point, but we might need some additions to it: for example transitivity, even though queries will no longer be transformed into SQL queries). Best regards, Achille. [1] http://domino.research.ibm.com/comm/research_projects.nsf/pages/iaa.index.html [2] http://www.springerlink.com/content/k582jur3k03l5m03/ [3] http://www.springerlink.com/content/q7500h6gv65078k7/ Alan Ruttenberg <alanruttenberg@gmail.com> Sent by: public-owl-wg-request@w3.org 02/19/2008 10:27 AM To "Web Ontology Language ((OWL)) Working Group WG" <public-owl-wg@w3.org> cc Subject Fragments discussion, continued It seems (I hope) we are getting closer to some sort of mutual understanding of our needs for fragments. I'd like to sketch out the shape of what I'd be happy to see as an outcome of tomorrow's meeting. I think we've heard convincing arguments that having a large number of fragments in Rec track, absent strong justification for each, isn't desirable. On the other hand, I'd like to put forward that there are compelling reasons to acknowledge OWL Lite, in the manner I've proposed, and to put 3 fragments on rec track, and proceed in subsequent meetings to nail down details of specification and documentation. The fragments: 1) OWL Prime (details of exactly what is in or out of OWL Prime remain to be worked out). Justified by specific industry interest from Oracle and HP, and to address the constituency that wishes to have a workable and more easily understandable rule-based OWL. 2) EL++. Justified by existing academic and commercial implementations, useful computational properties (polytime) and demonstrated use for working with important ontologies for biomedicine, a field which has been at the leading edge of Semantic Web adoption. 3) A fragment characterized by scalability to large numbers of instances (not necessarily scalable tbox) , but with strong guarantees with respect to completeness and consistency detection. This is probably DL-Lite, but I want to leave the door open to input from IBM, who's SHER implementation might also fit the bill. We haven't discussed this fragment much, so I'll give my view of why it is justified. Such a fragment fills a hole that neither of two other fragments fill, as It is likely that OWL Prime will not allow existentials in a rule head (following pD*), and EL++ is not as scalable. In addition DL-Lite is implementable in relational databases with queries translatable to SQL. I have heard of two academic implementations ("the italians" & Jeff Pan) and a commercial implementation - Clark and Parsia's, and the nature of the fragment is such that it would be easily adoptable by relational database providers. Finally, it is my judgement, as a user, that strong guarantees of the ability to detect inconsistency and give complete answers bring high value to science applications. To recap the OWL Lite proposal, the suggestion is to keep OWL Lite, unchanged, to support existing users, verify that it behaves as before using OWL 1.1 DL semantics, and write a Note to explain its status. Remaining fragments in the current Fragments document, would be described in one or more WG Notes. My hope would be to not necessarily work out all the details tomorrow, but instead to come to agreement on this number of fragments (3) and their character, so that subsequent meetings can be focused on implementation rather than policy. Regards, Alan
Received on Wednesday, 20 February 2008 17:53:09 UTC