• Aucun résultat trouvé

From Datalog Reasoning to Modular Structure of an Ontology

N/A
N/A
Protected

Academic year: 2022

Partager "From Datalog Reasoning to Modular Structure of an Ontology"

Copied!
4
0
0

Texte intégral

(1)

From Datalog Reasoning to Modular Structure of an Ontology

Changlong Wang1,2, Zhiyong Feng1,Guozheng Rao1, Xin Wang1, Xiaowang Zhang1

1School of Computer Science and Technology, Tianjin University,Tianjin 300072, China,

2School of Computer Science and Engineer Technology, NWNU,Lanzhou 730070, China,

Abstract. We propose a novel approach to modularly structuring an ontology.

First, a ontology TBox is transformed into a datalog program. Second, a tree, which represents forward-chaining proof and contains information about mod- ules, is mapped to a digraph, which allows us to extract intended ontology module via breadth-first search. Finally, we develop an algorithm to compute the modu- lar structure of an ontology and take an example to illustrate how our strategy might work in practice.

Keywords: Ontology; Modular structure; Datalog; Digraph

1 Introduction

Ontologies, often treated as monolithic objects, suffer from an inherent lack of structure [1]. To address this problem, atomic decomposition(AD) [2] is proposed to represent the modular structure of an ontology. This approach exploits locality-based module [3]

to induce a modular structure and allows users to explore the entire ontology in a sensi- ble manner so that we can find appropriate module. Atomic decomposition, however, is intimately tied to locality-based modules and thus suffers from their limitations [1]: it is bound by axiom structure and the extracted modules are often big in size. The recently proposed module extraction technique [4], in which module extraction is reduced to reasoning problem in datalog, allows us to extract smaller modules. Furthermore, those trees used in representing forward-chaining proofs in datalog can bring some infor- mation about ontology structure. Those distinguishing features stimulate us to further observe modular structure of an ontology by applying datalog reasoning.

2 Preliminaries

Ontologies and Datalog Reasoning The ontology axioms are formalised as rules:

function-free sentences of the form∀x.[φ(x)→ ∃y.[∨i=n

i=1ψi(x, y)]], whereφ,ψare con- junctions of distinct atoms. A signatureΣis a set of predicates andS ig(F) denotes the signature of a formulaF. A TBoxT Bis a finite set of rules. A factγis a function-free ground atom. An ABoxABis a finite set of facts. An ABox with predicates from the signatureΣis denoted byΣ-ABox. Given a datalog programPand ABoxAB, the ma- terialisation ofP

ABcan be computed by forward-chaining. A proof ofγinPAB is pairρ=(T, λ) whereT is a tree,λis a mapping from nodes inT to facts, and from

(2)

2 C. Wang, Z. Feng, G. Rao, X. Wang, X. Zhang

edges inT to rules inP, such that for each nodevthe followings hold: (1)λ(v)=γif vis the root ofT; (2)λ(v) ∈ ABifvis a leaf; and (3) ifvhas childrenw1, ..., wnthen each edge fromvtowiis labelled by the same rulerandλ(v) is a consequence ofrand λ(w1), ..., λ(wn). In this paper, we callT aproof tree.

Module Extraction via Datalog ReasoningThe module extraction approach pro- posed in [4] consists of three steps: (1) Pick a substitutionθmapping all existentially quantified variables inT Bto constants and then transformT Binto a datalog program P. (2) Pick aΣ-ABoxAB0 and materialise P

AB0. (3) Pick a setABr of “relevant facts” in the materialisation and compute the supporting rules inPfor each such fact.

Consequently, the moduleMconsists of all rules inT Bthat yield some supporting rule inP and is fully determined by the substitutionθand the ABoxAB0 andABr. Intu- itively,AB0determines the module ”topic” andABrdetermines which rules should be contained inM. Formally, this approach is defined as follows:

Definition 1[4]. A module setting forT BandΣis a tupleχ=(θ,AB0,ABr) withθ a substitution from existentially quantified variables inT Bto constants,AB0aΣ-ABox, ABraS ig(T B)-ABox, and such that no constant inχoccurs inT B.

The program ofχ is the smallest programPχ containing, for eachr = φ(x) →

∃y.[∨i=n

i=1ψi(x, y)] in T B, the rule φ → ⊥ if n = 0 and all rules φ → γθ for each 1≤inand each atomγinψi. The support ofχis the set of rulesrPχthat support a fact fromABrinPχAB0. The moduleMχofχis the set of rules inT Bthat have a

corresponding datalog rule in the support ofχ. ♢

Definition 1 shows that there exists some relationship between rules inPχwhich make it feasible in characterizing the modular structure of an ontology by datalog rea- soning:

Proposition 1.Given two datalog rulesdr1anddr2in a proof treeT, if the the head ofdr2is the body ofdr1, then, for ever moduleMχ,dr2Mχimpliesdr1Mχ

3 From Proof Tree to Modular Structure of an Ontology

For the materialisation of AB0 andP, we need to construct proof trees for each fact entailed byAB0

P. Consequently, the module Mχforχ =(θ,AB0,ABr) has one or more corresponding proof trees.

Definition 2.Givenχ =(θ,AB0,ABr), a module Mχis called single tree module (STM) if it has one corresponding proof tree, A module is calledmultiple trees module (MTM) if it has more than one corresponding proof trees.

In this paper, we mainly discuss STM. We useTMto denote the proof treeT for a moduleM, and useMT to denote the moduleMgenerated from a proof treeT. Clearly, a moduleMχforχ=(θ,AB0,ABr) is STM, if there exists only one factγ∈ABrwhose support contains supporting rules of any other factβ∈ABr.

Definition 3[2]. A moduleMis calledfakeif there exist two incomparable (w.r.t.

set inclusion) modulesM1 ,M2, such thatM1

M2 =M; a module is calledgenuine if it is not fake.

Proposition 2.A STM is a genuine module.

Definition 4.LetT be a tree, a subtreeS T inT is called breadth-first search tree (BFST), ifS T is generated by breadth-first search starting from the root inT.

(3)

From Datalog Reasoning to Modular Structure of an Ontology 3

Theorem 1.LetTMχbe a proof tree for the genuine moduleMχandS Tbe a BFST inTMχ, the set of rules inT Bthat have a corresponding datalog rule inS Tis aχ−module forχ=(θ,AB0,ABr), whereAB0consists of the facts that have corresponding leaves in S T andABrconsists of the single fact that corresponds to the root ofS T.

Proof.From Definition 1 and the procedure of module extraction in [4], AB0 can be picked as the initial ABoxAB0, andABr as the relevant fact setABr, the only fact in ABr is the root of S T and its support contains supporting rules of other inner n- ode(corresponding to a fact inPχ

AB0) inS T. Hence, the set of rules inT Bthat have

a corresponding datalog rule inS Tis aχ−module.

Theorem 1 shows that a module can be extracted from a proof treeTby breadth-first search. In order to represent the modular structure of original ontology, we map each proof treeT to a digraphDGby: (1) If rulerinT Bhas a corresponding datalog rule drinT,DGcontains noder; (2) For two noder1andr2inDGand their corresponding datalog rulesdr1 anddr2inT, if the body ofdr1is the head ofdr2, there exist a edge from noder1tor2inDG.

The mapping from datalog proof tree to digraph has no impact on the dependent relationship between rules. In other words, we can obtain the same module from datalog proof tree and its corresponding digraph. Those obtained digraphs exactly capture the modular structure of an ontology.

4 Computing the Modular Structure of an Ontology

From previous discussion, it is feasible to design an algorithm to compute the mod- ular structure of an ontology TBoxT B. In the algorithm, we need construct some proof trees, such that each rule inT Bhave a corresponding datalog rule in those trees. Al- gorithm 1 outlines our approach to computing the modular structure of an ontology.

In Algorithm 1, we can pick a general substitutionθand transformT Binto a datalog program P, and pickS ig(T B)-ABox as the initial ABox. From Definition 2, Defini- tion 4, and Definition 8 in [4], the modular structure in this paper can be induced on Σ-implication inseparable,Σ-fact inseparable, andΣ-model inseparable module.

To conclude, we illustrate our approach by an ontology TBoxT Bcontaining 6 ax- ioms:α1 :A⊑ ∃R.B,α2:A⊑ ∃R.C,α3 :BCD,α4:D⊑ ∃S.E,α5:D⊑ ∀S.F, α6:∃S(E⊓F)G. Those axioms are formalised as rules as follows, where the three rules r1,r2, and r4 have two corresponding datalog rules, respectively. The mapping from proof trees to modular structure is shown in Figure 1.

Algorithm 1Computing modular structure of an ontology 1: Input: a TBoxT B

2: Output: the modular structure ofT B— a set of digraphs 3: mapT Bto a datalog programP

4: use aS ig(T B)-ABox as AB0 and build proof trees for each fact in the materialisation of AB0

P

5: map each proof tree to a digraphDG;

6: return the set ofDG

(4)

4 C. Wang, Z. Feng, G. Rao, X. Wang, X. Zhang

Fig. 1.(a) proof tree (b) modular structure

r1:A(x)→ ∃y1[R(x, y1)∧B(y1) ⇒ dr1 :A(x)R(x,f(x)), dr1′′:A(x)B(c) r2:A(x)→ ∃y2[R(x, y2)∧C(y2) ⇒ dr2 :A(x)R(x,f(x)), dr2′′:A(x)C(c) r3:B(x)C(x)D(x)

r4:D(x)→ ∃y3[S(x, y3)∧E(y3) ⇒ dr4:D(x)S(x,f(x)), dr4′′ :D(x)E(c) r5:D(x)S(x, y)→F(y)

r6:S(x, y)∧E(y)∧F(y)→G(x)

5 Conclusion and Future Work

We have proposed an approach to modular structure of ontology by mapping datalog proof tree to digraph. In this modular structure, ontology module can be computed by the standard algorithm of breadth-first search. Our current work is preliminary, we will implement the proposed approach and evaluate it on real ontologies in the future.

Acknowledgments. This work is supported by the NSFC (61100049, 61373165) and the 863 Program of China (2013AA013204)

References

1. Chiara Del Vescovo.: The Modular Structure of an Ontology: Atomic Decomposition and its applications. PhD thesis, The University of Manchester, 2013.

2. Chiara Del Vescovo, Bijan Parsia, Ulrike Sattler, Thomas Schneider: The modular structure of an ontology: Atomic decomposition. In IJCAI, 2232-2237, 2011.

3. Bernardo Cuenca Grau, Ian Horrocks, Yevgeny Kazakov, Ulrike Sattler: Modular reuse of ontologies: Theory and practice. J. of Artificial Intelligence Research. 31, 273-318, 2008.

4. Ana Armas Romero, Mark Kaminski,Bernardo Cuenca Grau, Ian Horrocks: Ontology Module Extraction via Datalog Reasoning. In AAAI, 1410-1416, 2015.

Références

Documents relatifs

Many of the classes in the chemical entity ontology are not disjoint, because chemical classification is compositional (Hastings et al., 2012).. Annotating those pairs of classes

We examine commonalities between ontology modules and soft- ware aspects and propose an approach to applying the latter to the problem of a priori construction of modular ontologies

Given an input ontology O, MORe identifies a part of the classification of O that can be computed using the EL reasoner and limits the use of the OWL reasoner to a fragment of O

While there are ex- tensive ontologies, such as gene ontology and sequence ontology that have been de- veloped to model genome information, none is aimed at capturing the 3D

To combine pattern-based modular ontology engineering with the graphical modelling paradigm, we have developed the Comprehensive Modular Ontology IDE (CoModIDE, pronounced

As such, this PhD project explores the development of a comprehensive, modular ontology integrated development environment (IDE) in order to address the needs for ontology

Armas Romero, A., Kaminski, M., Cuenca Grau, B., Horrocks, I.: Module extrac- tion in expressive ontology languages via datalog reasoning.. Cuenca Grau, B., Halaschek-Wiener,

The database schema of our Phenomis database considered for mapping contains plant phenotyping information and environmental information.. The ontologies considered for