• Aucun résultat trouvé

Understanding Inexpressibility of Model-Based ABox Evolution in DL-Lite

N/A
N/A
Protected

Academic year: 2022

Partager "Understanding Inexpressibility of Model-Based ABox Evolution in DL-Lite"

Copied!
13
0
0

Texte intégral

(1)

of Model-Based ABox Evolution in DL-Lite

Evgeny Kharlamov, and Dmitriy Zheleznyakov KRDB Research Centre, Free University of Bozen-Bolzano, Italy

last_name@inf.unibz.it

Abstract. Evolution of Knowledge Bases (KBs) expressed in Description Logics (DLs) proved its importance. Recent studies of evolution in DLs mostly focussed on model-based approaches. They showed that evolution of KBs in tractable DLs, such asDL-Lite, suffers from inexpressibility, i.e., the result of evolution cannot be captured inDL-Lite. What is missing in these studies isunderstanding: in which DL-Litefragments evolution can be captured, what causes the inexpressibility, which logics is sufficient to express evolution, and whether one can approximate it inDL-Lite. This paper provides some understanding of these issues for both update and revision. We found whatDL-Liteformulas make evolution inexpressible and how to capture evolution in their absence. We introduce the notion of prototypes that gives an understanding of how to capture evolution for a richDL-Litefragment inFO[2]. Decidability ofFO[2]gives possibility for approximations.

1 Introduction

Description Logics (DLs) provide excellent mechanisms for representing structured knowledge by means of Knowledge Bases (KBs)Kthat are composed of two compo- nents: TBox (describes intensional or general knowledge about an application domain) and ABox (describes facts about individual objects). DLs constitute the foundations for various dialects of OWL, the Semantic Web ontology language.

Traditionally DLs have been used for modelingstatic and structural aspects of application domains [1]. Recently, however, the scope of KBs has broadened, and they are now used also for providing support in the maintenance andevolutionphase of information systems. This makes it necessary to studyevolution of Knowledge Bases[2], where the goal is to incorporate new knowledgeN into an existing KBKso as to take into account changes that occur in the underlying application domain. In general,N is represented by a set of formulas denoting those properties that should be true after K has evolved, and the result of evolution, denotedK N, is also intended to be a set of formulas. In the case whereN interacts withKin an undesirable way, e.g., by causing the KB or relevant parts of it to become unsatisfiable,N cannot simply be added to the KB. Instead, suitable changes need to be made inKso as to avoid this undesirable interaction. Different choices for changes are possible, corresponding to different approaches to semantics for KB evolution [3,4,5].

An important group of approaches to evolution semantics, that we focus in this paper, is calledmodel-based(MBAs). Under MBAs the result of evolutionK N is aset of

(2)

modelsofN that are minimally distanciated from models ofK. Depending on what the distance between models is and how to measure it, eight different MBAs were introduced (see Section 3 for details). SinceK N is a set of models, whileKandN are logical theories, it is desirable to representK N as a logical theory using the same language as forKandN. Thus, looking for representations ofK Nis the main challenge in studies of evolution under MBAs. WhenKandN are propositional theories, representingK N is well understood [5], while it becomes dramatically more complicated as soon asK andN are first-order, e.g., DL KBs [6].

Model based evolution of KBs whereK andN are written in a language of the DL-Litefamily [7] has been recently extensively studied [6,8,9]. The focus onDL-Liteis not surprising sinceDL-Liteis the basis of OWL 2 QL, a tractable OWL 2 profile. It has been shown that for every of the eight MBAs one can findDL-LiteKandN such that K N cannot be expressed inDL-Lite[10,11], i.e., DL-Liteisnot closedunder MBA evolution. This phenomenon was also noted in [6,10] for some of the eight semantics.

What is missing in all these studies of evolution forDL-Liteisunderstandingof:

(1) What fragments ofDL-Liteare closed under model-based evolutions? WhatDL-Lite formulas are responsible for inexpressibility of model-based evolutions?

(2) What are sufficient extensions ofDL-Liteto capture model-based evolutions of DL-LiteKBs? How to capture the evolutions ofDL-LiteKBs in these extensions?

(3) ForDL-LiteKandN, is it possible and how to do “good” approximations ofK N? In this paper we study the problems(1)-(3)for so-calledABox evolution, i.e.,N is a new ABox and the TBox ofKshould remain the same after the evolution. ABox evolution is important for areas, e.g., bioinformatics, where the structural knowledge TBox is well crafted and stable, while ABox facts about individuals may get changed.

These ABox changes should be reflected in KBs in a way that the TBox is not affected.

Our study covers both the case of ABox updates and ABox revision [4].

In Sections 2 and 3 we defineDL-LiteRand ABox evolution. In Section 4 we study relationships between different MBAs. In Section 5 we introduceDL-Lite+R, a restriction on DL-LiteR where disjointness of concepts with role projections is forbidden. We show thatDL-Lite+R is closed under most of MBA evolutions and provide tractable algorithms computingK N. In Section 6 we study two important MBAs where distance between models is based on atoms and prove thatDL-Lite+Ris a sufficient borderline of expressibility for these semantics: the class of KBs that are inDL-LiteRbut not in DL-Lite+Ris not closed under these semantics. We also introduce and studyDL-LiteRI that restrictsDL-LiteR, extendsDL-Lite+R and not closed under MBAs. In order to capture evolution ofDL-LiteRI KBs inFO, we introduce prototypes, based on which we show that the evolution can be expressed inFO[2](restriction of first-order logics that only makes use of two variables). Finally, we argue that decidability ofFO[2]gives possibilities for approximations ofK N. Proofs can be found in [12].

2 DL-Lite

R

We introduce some basic notions of DLs, (see [13] for more details). We consider a logic DL-LiteRofDL-Litefamily of DLs [7,14].DL-LiteRhas the following constructs for (complex)conceptsandroles:(i)B::=A| ∃R,(ii)C::=B| ¬B,(iii)R::=P |P,

(3)

where A andP stand for anatomic concept and role, respectively, which are just names. Aknowledge base(KB)K = (T,A)is compound of two sets ofassertions:

TBoxT, andABoxA.DL-LiteRTBox assertions areconcept inclusion assertionsof the formB vCandrole inclusion assertionsR1 vR2, while ABox assertions are membership assertionsof the formA(a),¬A(a), andR(a, b). Theactive domainofK, denotedadom(K), is the set of all constants occurring inK. TheDL-Litefamily has nice computational properties, for example, KB satisfiability has polynomial-time complexity in the size of the TBox and logarithmic-space in the size of the ABox [15,16].

The semantics of DL-Lite KBs is standard and based on first order interpretations, all over the same countable domain∆. AninterpretationIis a function·Ithat assigns to each C a subsetCI of∆, and toR a binary relationRI over∆ in such a way that(¬B)I = ∆\BI,(∃R)I = {a | ∃a0.(a, a0)∈ RI}, and(P)I ={(a2, a1) | (a1, a2)∈PI}. We assume that∆contains the constants and thatcI=c, i.e., we adopt standard names. Alternatively, we view an interpretation as a set of atoms and say that A(a)∈ Iiffa∈AIandP(a, b)∈ Iiff(a, b)∈PI. An interpretationI is amodel of a membership assertionA(a)(resp.,¬A(a)) ifa∈AI(resp.,a /∈AI), ofP(a, b)if (a, b)∈PI, and of an inclusion assertionD1vD2ifD1I⊆D2I.

As usual, we useI |=Fto denote thatIis a model of an assertionF, andI |=K to denote thatI |=Ffor each assertionFinK. We useMod(K)to denote the set of all models ofK. A KB issatisfiableif it has at least one model. We use entailment on KBs K |=K0in the standard sense. An ABoxA T-entailsan ABoxA0, denotedA |=T A0, ifT ∪ A |= A0, andAisT-equivalenttoA0, denotedA ≡T A0 ifA |=T A0 and A0|=T A.

The deductiveclosure of a TBoxT (of an ABoxA), denotedcl(T)(resp.,clT(A)), is the set of all TBox (resp., positive ABox) assertionsF such that T |= F (resp., A |=T F). Clearly inDL-LiteR cl(T)(andclT(A)) is computable in time quadratic in the number of assertions ofT, i.e.,|T |, (resp.,|T ∪ A|). In our work we assume that all TBoxes and ABoxes are closed, while results are extendable to arbitrarily KBs.

For satisfiable KBsK = (T,A)afull closure ofA, denotedfclT(A), is the set of all membership assertionsf(both positive and negative) overadom(K)such thatA |=T f.

Ahomomorphismhfrom a model I to a modelJ is a mapping from∆ to∆ satisfying:(i)h(a) =afor every constanta;(ii)ifα∈AI(resp.,(α, β)∈PI), then h(α)∈AJ (resp.,(h(α), h(β))∈PJ) for everyA(resp.,P). We writeI ,→ J if there is a homomorphism fromI toJ. A canonical modelIofK, denoted asIKcanor just IcanwhenKis clear from the context, is a model ofKwhich can be homomorphically embedded in every model ofK[7].

3 Evolution of Knowledge Bases

This section is based on [10]. LetK= (T,A)be aDL-LiteRKB andN a “new” ABox.

We study how to incorporateN’s assertions intoK, that is, howKevolvesunderN [2].

More practically, we studyevolution operatorsthat takeKandN as input and return, possibly inpolynomial time, aDL-LiteRK0= (T,A0)(with the same TBox asK) that captures the evolution, and which we callthe (ABox) evolution ofKunderN. Based on the evolution principles of [10], we requireKandK0to be satisfiable and coherent.

(4)

Mod(K) :I0 I1

J0 J1J2 J3

Mod(T ∪ N) : L ⊆ #

s a global:G

local: set: number:

atoms:

symbols:

Approach

Type of distance Distance is

built upon

Fig. 1. Left: measuring distances between models and finding local minimums.

Right: three-dimensional space of approaches to model-based evolution semantics.

Model-Based Semantics of Evolution. In model-based approaches (MBAs), the result of evolution of a KBKwrt new knowledgeN is a setK N of models. The idea of MBAs is to choose asK Nsome models of(T,N)depending on their distance toK’s models. Katsuno and Mendelzon [4] considered two ways, so calledlocalandglobal, of choosing these models ofN, where the first choice corresponds toknowledge update and the second one toknowledge revision.

The idea of the local approaches is to consider all modelsIofKand for eachIto take those modelsJ of(T,N)that are minimally distant fromI. Formally,

K N = [

I∈Mod(K)

I N, whereI N = arg min

J ∈Mod(T ∪N)

dist(I,J).

wheredist(·,·)is a function whose range is a partially ordered domain andarg minstands for theargument of the minimum,that is, in our case, the set of modelsJ for which the value ofdist(I,J)reaches its minimum value, givenI. The distance functiondistvaries from approach to approach and commonly takes as values either numbers or subsets of some fixed set. To get a better intuition of the local semantics, consider Figure 1, left, where we present two modelI0andI1of a KBKand four modelsJ0, . . . ,J3of(T,N).

We represent the distance between a model ofKand a model ofT ∪ Nby the length of a line connecting them. Solid lines correspond to minimal distances, dashed lines to distances that are not minimal. In this figure{J0}= arg minJ ∈{J0,...,J3}dist(I0,J) and{J2,J3}= arg minJ ∈{J0,...,J3}dist(I1,J).

In the global approach one choses models ofN that are minimally distant fromK: K N = arg min

J ∈Mod(N)

dist(Mod(K),J), (1)

wheredist(Mod(K),J) = minI∈Mod(K)dist(I,J). Consider again Figure 1, left, and assume that the distance betweenI0 andJ0 is the global minimum, hence,{J0} = arg minJ ∈{J0,...,J3}dist({I0,I1},J).

Measuring Distance Between Interpretations. The classical MBAs were developed for propositional theories [5], where interpretations were sets of propositional atoms, two distance functions were introduced, respectively based on symmetric difference “ ” and on the cardinality of symmetric difference:

dist(I,J) =I J and dist#(I,J) =|I J |, (2)

(5)

whereI J = (I \ J)∪(J \ I). Distances underdistare sets and are compared by set inclusion, that is,dist(I1,J1)≤dist(I2,J2)iffdist(I1,J1)⊆dist(I2,J2).

Finite distances underdist#are natural numbers and are compared in the standard way.

One can extend these distances to DL interpretations in two different ways. One way is to consider interpretationsI,J as sets ofatoms. ThenI J is again a set of atoms and we can define distances as in Equation (2). We denote these distances asdista(I,J) anddista#(I,J). Another way is to define distances at the level of the concept and role symbolsin the signatureΣunderlying the interpretations:

dists(I,J) ={S ∈Σ|SI6=SJ}, and dists#(I,J) =|{S∈Σ|SI 6=SJ}|. Summing up across the different possibilities, we have three dimensions, which give eight semantics of evolution according to MBAs by choosing: (1) thelocalor theglobal approach, (2)atomsorsymbolsfor defining distances, and (3)set inclusionorcardinality to compare symmetric differences. In Figure 1, right, we depict these three dimensions.

We denote each of these eight possibilities by a combination of three symbols, indicating the choice in each dimension, e.g.La#denotes the local semantics where the distances are expressed in terms of cardinality of sets of atoms.

Closure Under Evolution. LetDbe a DL andM one of the eight MBAs introduced above. We sayDisclosed under evolution wrtM (or evolution wrtM isexpressible inD) if for any KBsKandN written inD, there is a KBK0written inDsuch that Mod(K0) =K N, whereK N is the evolution result under semanticsM.

We showed in [10,11] thatDL-Lite is not closed under any of the eight model based semantics. The observation underlying these results is that on the one hand, the minimality of change principle intrinsically introduces implicit disjunction in the evolved KB. On the other hand, sinceDL-Liteis a slight extension of Horn logic [17], it does not allow one to express genuine disjunction (see Lemma 1 in [10] for details).

LetM be a set of models that resulted from the evolution of(T,A)withN. A KB (T,A0)is asound approximationofM ifM ⊆Mod(T,A0). A sound approximation (T,A0)isminimalif for every sound approximation(T,A00)inequivalent to(T,A0), it

holdsMod(T,A00)6⊂Mod(T,A0), i.e.(T,A0)is minimal wrt “⊆”.

4 Relationships Between Model-Based Semantics

LetS1andS2be two evolution semantics andDa logic language. ThenS1is subsumed byS2wrtD, denoted(S1 4sem S2)(D), or justS1 4sem S2whenDis clear from the context, ifK S1N ⊆ K S2N for all satisfiable KBsKandN written inD, where K SiN denotes evolution underSi. Two semanticsS1andS2areequivalent(wrtD), denoted(S1sem S2)(D), if(S1 4sem S2)(D)and(S2 4sem S1)(D). Further in this section we will considerKandN written inDL-LiteR. The following theorem shows the subsumption relation between different semantics, which we also depict with solid arrows in Figure 2. The figure is complete in the following sense: there is a solid path between any two semanticsS1andS2iff there is a subsumptionS14semS2.

(6)

Mod(K) :

Mod(K � N) :

Imin I1

Jmin J1

G#s Ls#

Ls Gs

G#a

Ga La

La#

� →

� →

Fig. 2. Left: Subsumptions for evolution semantics.

“ ”: inDL-LiteR(Theorem 1). “ ”: inDL-Lite+R(Theorems 6, 7).

Dashed frame surrounds semantics under whichDL-Lite+Ris closed.

Right: commutation diagram forLasemantics, “I ⇒ J” denotes thatJ ∈ I N. Theorem 1. Letβ ∈ {a, s}andα∈ {⊆,#}. Then

Gαβ4sem Lβα, G#β 4semGβ, and Ls#4sem Ls.

Proof. Letdist be one of the four considered distances, EG = K N wrtGαβ and EL =K N wrtLβαbe corresponding global and local semantics based ondist. For given Kand N, letJ0 ∈ EG. Then, there isI0 |= K such that for everyI00 |= K andJ00 |=T ∪ N itdoes notholddist(I00,J00) dist(I0,J0).In particular, when I00 = I0, there is no J00 |= T ∪ N such that dist(I0,J00) dist(I0,J0), which yields thatJ0 ∈ arg minJ ∈Mod(T ∪N)dist(I0,J), and J0 ∈ EL. We conclude that:

G#a 4sem La#, Ga 4semLa, G#s 4semLs#, Gs 4sem Ls.

Now considerE#=KNwrtLβ#, which is based ondist#, andE=KNwrtLβ, which is based ondist. AssumeJ0 ∈ E#andJ0 6∈ E. Then, from the former assump- tion we conclude existence ofI0 |=Ksuch thatJ0 ∈arg minJ ∈Mod(T ∪N)dist#(I0,J).

From the latter assumption, J0 ∈ E/ , we conclude existence of a modelJ00 such thatdist(I0,J00) ( dist(I0,J0). This yields thatdist#(I0,J00) dist#(I0,J0), which contradicts the fact thatJ0 ∈ E#, assuming thatdist(I0,J0)is finite. Thus, E#4sem Eas soon asdist(I,J)is finite. This finiteness condition always holds for whenβ =ssince the signature ofK ∪ Nis finite. It is easy to check thatdist(I,J) may not be finite whenβ=a, hence,La#64sem La.

Similarly one can show thatG#s 4semGs. It also holds thatG#a 4semGa due to the

finite model property inDL-LiteR. ut

5 Evolution in DL-Lite

+R

Here we consider a restriction ofDL-LiteR, which we callDL-Lite+R. A KBK= (T,A) is in DL-Lite+R if it is in DL-LiteR and T 6|= ∃R v ¬B for any role R and any conceptB. Intuitively, “+” emphasizes absence of disjointness for roles (only positive inclusions involving roles are permitted).DL-Lite+Ris defined semantically, while one can syntactically check (in quadratic time), given aDL-LiteRKBK, whetherKis in DL-Lite+R: one computes a closure ofK, checks that no assertions of the form∃Rv ¬B are in the closure and if it is the case, thenKis inDL-Lite+R.DL-Lite+Ris an extension of RDFS ontology language (of itsFOfragment).DL-Lite+Radds to RDFS the ability of expressing disjointness of concepts (A1v ¬A2) and mandatory participation (Av ∃R).

SinceDL-Lite+RrestrictsDL-LiteR, the semantics relations from Theorem 1) are also correct forDL-Lite+R. We now study what further4sem-relations hold inDL-Lite+R.

(7)

INPUT : consistent KBs(T,A)andN

OUTPUT: a setA0⊆fclT(A)of ABox assertions A0:=∅;S:=fclT(A);

1

repeat

2

choose someφ∈ S; S:=S \ {φ};

3

if{φ} ∪fclT(N)is consistentthenA0:=A0∪ {φ}

4

untilS=∅;

5

Algorithm 1: AlgorithmAlignAlg((T,A),N)forA0deterministic computation

5.1 Capturing Atom-Based Evolution

We first study evolution under La. LetI be an interpretation and N a knowledge base. AnalignmentofIwithN, denotedAlign(I,N), is the interpretationJ ={f | f ∈ Iandf is satisfiable withN }. There is a tight connection (Lemma 2) between an alignment of a model and its evolution underLasemantics. Moreover, alignment preserves homomorphic relationship on interpretations (Lemma 3). LetNbe a set of membership assertions andIan interpretation. A unionI ∪Ndenotes the modelI ∪JN, whereJN ={f |f ∈ N andfis positive}.

Lemma 2. LetK = (T,A)and(T,N)be satisfiable DL-Lite+R KBs, andI |= K. ThenI N ={Align(I,N)∪ N }, underLasemantics.

Lemma 3. LetKandN be satisfiable DL-Lite+R KBs andI be a model ofK. Then Align(IKcan,N),→Align(I,N).

Lemmas 3 and 2 imply that(Ican N),→(I N)for everyI ∈Mod(K)under La, where we apply,→to the sets(Ican N)and(I N), instead of their single models. We depict this phenomenon in Figure 2, right: evolution preserves homomorphic embeddability ofIcanintoI. This fundamental property implies thatIcan N consists of the universal model ofK NwrtLa.

Consider an algorithmAlignAlg(see Algorithm 1) that inputsK,N, and returns the alignmentAlign(Ican,N): it drops all the assertions offclT(A)contradictingN and keeps the rest. UsingAlignAlgwe can computeK N:

Theorem 4. LetK= (T,A)and(T,N)be satisfiable DL-Lite+RKBs. Then:

K N =Mod(T,AlignAlg(K,N)∪ N)(wrtLa).

Note that since computation ofAlignAlg(T,A,N)is polynomial in A andN, computation ofK N is also polynomial.

Example 5. ConsiderT = {B0 v B, B v ¬C},A = {C(a)}, and N = B(a).

Then,fclT(A) ={C(a),¬B0(a),¬B(a)}. The alignment of(T,A)withN is the set AlignAlg((T,A),N) ={B(a),¬B(a)}. Hence, the result of evolution under every the model-based semantics with atom-based distance is(T,{B(a),¬B0(a)}).

(8)

Relationships Between Atom-Based Semantics. Next theorem shows that evolutions wrt all four model-based semantics on atoms coincide wrtDL-Lite+R. We depict these relations between semantics in Figure 2 using a dashed arrow, e.g., betweenLa 4semG#a. Note that there is a path with solid or dashed lines between any two semantics if and only if there is a subsumption wrtDL-Lite+R.

Theorem 6. La#semLasem G#asemGa.

Theorems 6 and 4 imply that inDL-Lite+Rone can useAlignAlgto compute evolution under all MBAs on atoms.

5.2 Capturing Symbol-Based Evolution

Observe that symbol-based semantics behave differently from atom-based ones: two local semantics (on set inclusion and cardinality) coincide, as well as two global ones, while there is no subsumption between local and global ones, as depicted in Figure 2.

Theorem 7. The following relations hold:

(i) La4semG#s, whileG#s 64semLa;

(ii) LssemLs#, andGssemG#s, whileLs64sem Gs#.

As a corollary of Theorem 7, the algorithm of computingK N of Theorem 4 in general does not work for computing evolution under any of the symbol-based semantics.

At the same time what it outputs is a complete approximation of all symbol-based semantics, while it approximates global semantics better than the local ones.

Consider the algorithmSymAlgin Figure 2 that will be used for evolutions on symbols. It works as follows: for every atomφinN it checks whetherφsatisfies a conditionΠ (Line 4). If it does, the algorithm deletes all those literals that share their concept name withφ. Both local and global semantics have their ownΠ:ΠG andΠL. Capturing Global Semantics. ΠG(φ)checks whetherφofN T-contradictsA:ΠG(φ)is true iff¬φ∈fclT(A)\AlignAlg((T,A),N). Intuitively,SymAlgfor global semantics works as follows: having contradiction betweenN andAonφ=B(c), the change of B’s interpretation is inevitable. Since the semantics traces changes on symbols only, and Bis already changed, one can drop fromAall the assertions of the formB(d). Clearly, SymAlg(K,N, ΠG)can be computed in time polynomial in|A ∪ K|. The following theorem shows correctness of this algorithm.

Theorem 8. LetK= (T,A)and(T,N)be satisfiable DL-Lite+RKBs. Then DL-Lite+R is closed underGs andG#s, and for bothK N =Mod(T,SymAlg(K,N, ΠG)), Capturing Local Semantics. Observe thatLsandLs#are not expressible inDL-Lite+R. Theorem 9. DL-Lite+Ris not closed underLsandLs#semantics.

To compute the minimal sound approximations under local semantics on symbols, we use algorithmSymAlgin Figure 2 with the followingΠLL(φ)is true iffφ /∈ S1. That is,ΠLchecks whether the ABoxT-entailsA(c)∈fclT(N), and if it does, the

(9)

INPUT : consistentDL-Lite+RKB(T,A)and ABoxN, a formula propertyΠ OUTPUT: a setA0⊆fclT(A)∪fclT(N)of ABox assertions

A0:=∅;S1:=AlignAlg((T,A),N);S2:=fclT(N);

1

repeat

2

choose someφ∈ S2; S2 :=S2\ {φ};

3

ifΠ(φ) =TRUEthenS1:=S1\ {φ0|φandφ0have the same concept name}

4

untilS2=∅;

5

A0:=S1∪fclT(N)

6

Algorithm 2: AlgorithmSymAlg((T,A),N, Π)for deterministic computation of K N underGs andG#s semantics and minimal sound approximation underLs

andLs#semantics

algorithm deletes all the assertions fromfclT(A)that share the concept name withA(c).

This property is weaker than one for global semantics, since it is easier to get changes in interpretation ofAby choosing a model ofKwhich does not includeA(c). Clearly, SymAlg(K,N, ΠL)can be computed in time polynomial in|A ∪ K|. The following theorem shows correctness of the algorithm.

Theorem 10. LetK= (T,A)and(T,N)be satisfiable DL-Lite+RKBs. Then the KB (T,SymAlg(K,N, ΠL))is a minimal sound approximation ofK N underLsandLs#.

6 Evolution in DL-Lite

RI

In the previous section we considered evolution in DL-Lite+R. Here we show that DL-Lite+R is essentially a maximal fragment ofDL-LiteR closed under atom-based evolution, and present a wide fragment of DL-LiteR, namely DL-LiteRI, that is not closed under atom-based evolution while the evolution can be captured inFO[2].

The following theorem shows that the restriction not to have roles involved into negation relation is essential, and even a minimal violation leads to inexpressibility of evolution. The minimal way to violate conditions ofDL-Lite+Ris to have two assertions:

Av ∃Rand∃Rv ¬Centailed from a TBox.

Theorem 11. Let assertions{A v ∃R, ∃R v ¬C} be entailed from a DL-LiteR TBoxT, for someRandC, Then there exist ABoxesAandN such that(T,A) N is inexpressible in DL-LiteRunderLaand underLa#.

The following example provides reasons whyDL-Lite+Rrestrictions are important for expressibility of atom-based MBAs. We will further use this example to show how to captureLaevolution inFOfor KBs violatingDL-Lite+Rrestrictions.

Example 12. Consider the followingDL-LiteKBsK1= (T1,A1)andN1={C(b)}: T1={Av ∃R,∃Rv ¬C}; A1={A(a), C(e), C(d), R(a, b)}. Consider the following modelIofK1:

(10)

I: AI ={a, x}, CI ={d, e}, RI ={(a, b),(x, b)},

wherex∈∆\adom. We now show that the following models belong toI N1, this will be used further to captureK1 N1.

J0: AI =∅, CI ={d, e, b}, RI =∅, J1: AI ={x}, CI ={e, b}, RI ={(x, d)}, J2: AI ={x}, CI ={d, b}, RI ={(x, e)}, J3: AI ={a, x}, CI ={b}, RI ={(a, d),(x, e)}, J4: AI ={x}, CI ={d, e, b}, RI ={(x, β1)},

whereβ1∈∆\adom(K1). Clearly all five models satisfyN1andT1. To see that they are inI N1observe that every modelJ(I)∈(I N1)can be obtained fromIby making modifications that guarantee thatJ(I)|= (N1∪ T1)and that the distance betweenI andJ(I)is minimal. What are these modifications? Since in every suchJ(I)the new knowledgeC(b)holds and(Cv ¬∃R)∈ T1, there should be noR-arc incoming into binJ(I), hence the necessary modification ofI is to forbid theR-arcs going from aand fromxto point intob. Where inJ(I)should this R-arcs point in? There are only three mutually exclusive cases for each ofaandx:(i)there is noR-arc inJ(I) that points froma(resp.,x), we simply drop it fromI,(ii)it points to some element β ∈∆\ {d, e, b}, that is,J(I)|=R(a, β).(iii)it points to an elementγof{d, e}, that is,J(I)|=R(a, γ). Case (i) for bothaandxcorresponds toJ0, hence,J0∈ I N1. Case (ii) for bothaandxcorresponds toJ4, hence,J4∈ I N1. Case (i) and Case (iii) foracorresponds toJ1andJ2, hence, bothJ1andJ2are inI N1. Case (iii) for both aandxcorresponds toJ3, hence,J3∈ I N1. There are clearly more modelsI N1 than these five.

To show thatK1N1is inexpressible inDL-LiteRconsider a modelJ5∈Mod(T1,N1) which is not inK1 N1, while it gives a property of models fromK1 N1that is respon- sible for inexpressibility ofK1 N1.

J5: AI ={x}, CI ={e, b}, RI ={(x, d),(x, β1)}.

Observe thatJ4 is closer toI thatJ5, hence J5 6∈ (I N1). Indeed, since for ev- ery modelI0 ∈ Mod(K1)it holds thatI0 |=C(d)andI0 6|= R(x, d), we have that {C(d), R(x, d)} ⊆ I0 J5. At the same timeJ4|=C(d)andJ46|=R(x, d), whileJ4 andJ5agree on all the atoms butC(d)andR(x, d). Hence,(I0 J4)((I0 J5)and J56∈(I N1). Moreover, sinceI0is an arbitrary model ofK1,J5∈/(K1 N1).

Let us make a closer look atJ5: it extendsJ1with the atomR(x, β1), and this is the reason why it is not in the result of evolutionK1 N1. This observation gives a restriction on all the modelsJ onK1 N1: if inJ there is oneR-arc from somexinto d, thenJ has no other arcs fromx. In a way, we have a functionality restriction on the roleR, when it connects two specific elements of the domain:xandd. The same kind of functionality holds forxande, and both of them are not expressible inDL-Lite. This functionality forxandd, and forxandecan be formally written as the followingφand ψ, respectively:

φ=∀x∀y.[R(x, d)∧R(x, y)→y=d], ψ=∀x∀y.[R(x, e)∧R(x, y)→y=e]. Since every model ofK1 N1should satisfyφandψ, andDL-Liteis a slight extension of Horn logics [17],K1 N1cannot be captured inDL-Lite.

(11)

This example shows that DL-LiteR is not closed under model-based evolution.

Besides the fact that bothφ andψ are inexpressible inDL-LiteR, while should be entailed by the result of evolution, there is another observation also responsible for inexpressibility ofK1 N1: it has no canonical model. At the same time, one can see that the setK1 N1can be divided in four subsets, where each has a canonical model. We now show that these four canonical models, which we callprototypesforK1 N1, can be used to captureK1 N1inFO[2], in a similar way as we used the unique canonical model of the evolution result inDL-Lite+Rto capture the evolution inDL-Lite+R. Definition 13. LetKbe a DL-LiteRKB andN be an ABox. Aprototypal set forK N is a minimal subset

J

= {J1, . . . ,Jn} ofK N satisfying the property: for every J ∈ K N there isJi

J

such thatJi,→ J. EveryJi

J

is aprototype forK N.

Continuing with Example 12, one can check that the prototypal set

J

forK1 N1 consists of four models:{J0,J1,J2,J6}, where

J6: AI ={x1, x2}, CI ={b}, RI ={(x1, d),(x2, e)}. Note thatJ6,→ J3andJ0,→ J4.

We now introduce a restriction ofDL-LiteRfor which the result of evolution under La can be captured inFOusing prototypes.DL-LiteRI (whereI stands for (mutual) independenceof roles) is a restriction ofDL-LiteRin which TBoxesT satisfy: for any two rolesRandR0,T 6|=∃Rv ∃R0andT 6|=∃Rv ¬∃R0. That is, we forbid direct role interaction (subsumption and disjointness) between role projections. The interaction is still possible but in a simple manner only, e.g., projections may contain the same concept. This restriction allows us to analyze evolution that affects roles independently for every role. For the ease of exhibition of the following procedure constructing

J

, we

further assume that inDL-LiteRI for each roleRthere is exactly one conceptARsuch thatK |=ARv ∃R. As forDL-Lite+R, one can syntactically check (in quadratic time), given aDL-LiteRKBK, whetherKis inDL-LiteRI using the closure ofK.

Iff is an ABox assertion, thenrootT(f)is a set of all the ABox assertions, that T-entailf. The procedureBP(K,N)(whereBP stands for build prototypes) of con- structing

J

for the case ofDL-LiteRI, generalizes what was done in Example 12 in the following way: In Items 1 and 2 it takes into account all the roles and concepts that may be interpreted differently in different resulted models (R,AandCin our example) and all the constants that could not be present at the second coordinate of those roles in the models of the original KB (eanddin our example). In Items 3, 4, and 5 it builds the prototypeJ0. Finally in Item 6 it builds the rest of prototypes basing on J0. A pseudocode of the procedureBP(K,N)is the following:

1. SR={R1, . . . , Rn}is a set of all roles such that (i) N |=T ¬Ri (bi)for somebiand

(ii) for everyRi(ai, bi)infclT(A), an atomAR(ai)∈fclT(A)andR(ai, b0i)∈/ Align(fclT(A),N), whereb0i∈ {/ bj}nj=1.

Sat=Sn

j=1{Rj(a, bj)|Rj(a, bj)∈fclT(A)}. 2. FA(Ri)is equal to the following set

{D(c)∈fclT(A)| ∃Ri (c)∧D(c)|=T ⊥andN 6|=T D(c), andN 6|=T ¬D(c)}. Then FA=S

Ri∈SRFA(Ri), FC={c|D(c)∈FA},

where the acronymsFAandFCstand for forbidden atoms and constants, respectively.

(12)

3. I0:=Align(Ican,N)∪ N, whereIcanis the canonical model ofK. 4. For eachR(a, b)∈ Sat, doI0 :=I0\ {AR(a)}.

5. J0:=I0,

J

:={J0}.

6. For each subsetD={D1(c1), . . . , Dk(ck)} ⊆FAdo

for eachR= (Ri1, . . . , Rik)such thatDj(cj)∈FA(Rij)forj= 1, . . . , kdo J[D,R] :=h

J0\Sk

i=1rootT(Di(ci))i

∪Sk i=1

clT(R0i(xi, ci))∪ {AR0i(xi)} , where allxi’s are different constants from∆\adom(K), fresh forIcan.

J

:=

J

∪ {J[D,R]}.

Theorem 14. LetK= (T,A)be a DL-LiteRI KB, andN a DL-LiteRABox consistent withT. Then the setBP(K,N)is a prototypal set forK N.

We proceed to correctness of BP in capturing evolution inDL-LiteRI.

Theorem 15. LetK= (T,A)be a DL-LiteRI KB,N a DL-LiteRABox consistent with T, andBP(K,N) ={J0, . . . ,Jn}is a prototypal set forK N. Then

K N =Mod(T)∩Mod(A1∨. . .∨ An)∩Mod(Φ∧Ψ), whereAiis a DL-LiteRABox such thatJiis a canonical model for(T,Ai), and

Φ= ^

cj∈FC

∀x.[(R(x, cj)→AR(x))∧ ∀y.(R(x, cj)∧R(x, y)→y=cj)],

Ψ = ^

R(a,b)∈Sat

∃R(a)→AR(a).

Note that Theorem 4 about computation ofK N forDL-Lite+Ris a particular case of Theorem 15 whenFC=∅and there is just one prototype. Next theorem allows us to approximate result of evolution by aDL-LiteRKB, sinceFO[2]is decidable.

Theorem 16. K N underLafor KBs in DL-LiteRI is inFO[2].

7 Conclusion

We studied expressibility of ABox evolution (for both update and revision) over two subfamilies ofDL-LiteR:DL-Lite+RandDL-LiteRI, that both extend RDFS. The first one is closed under most of the evolution semantics, while the second one is not closed even under MBAs were distance is based on atoms. We isolated conditions on TBox assertions that lead to inexpressibility: pairs of assertions of the formA v ∃Rand∃R v ¬C bring inexpressibility. Note that this condition is similar to the notion of unexpected facts introduces in [10], for formula-based semantics of evolutions. ForDL-Lite+Rwe provided algorithms how to compute semantics that are expressible, and how to approximate those that are not. ForDL-LiteRI we captured local model-based semantics, where the distance between models defined on atoms, inFO[2]. For this purpose we introduced prototypes and showed how they can be used to capture results of evolution.

It is the first attempt to provide an understanding of inexpressibility of MBAs for DL-Liteevolution. Without this understanding it is unclear how to proceed with the

(13)

study of evolution in more expressive DLs and what to expect from MBAs in such logics.

Moreover, isolating the smallest fragment ofFOthat is sufficient to capturedDL-Lite evolution is important in understanding whether it is possible and how to do reasonable approximations of evolution results. We also believe that our techniques of capturing semantics based on prototypes give a better understanding of how MBAs behave on FOtheories. We are currently working on extending the results to capture evolution for generalDL-LiteRKBs.

Acknowledgements We are thankful to Diego Calvanese, Balder ten Cate, and Werner Nutt for discussions. The authors are supported by EU projects ACSI (FP7-ICT-257593), Ontorule (FP7-ICT-231875), and ERC FP7 grant Webdam (agreement n. 226513).

References

1. Borgida, A., Brachman, R.J.: Conceptual modeling with description logics. [13] chapter 10 2. Flouris, G., Manakanatas, D., Kondylakis, H., Plexousakis, D., Antoniou, G.: Ontology change: Classification and survey. Knowledge Engineering Review23(2) (2008) 117–152 3. Abiteboul, S., Grahne, G.: Update semantics for incomplete databases. (In: Proc. of VLDB’85) 4. Katsuno, H., Mendelzon, A.: On the difference between updating a knowledge base and

revising it. In: Proc. of KR’91. (1991) 387–394

5. Eiter, T., Gottlob, G.: On the complexity of propositional knowledge base revision, updates and counterfactuals. Artificial Intelligence57(1992) 227–270

6. Liu, H., Lutz, C., Milicic, M., Wolter, F.: Updating description logic ABoxes. In: Proc. of KR 2006. (2006) 46–56

7. Calvanese, D., Giacomo, G.D., Lembo, D., Lenzerini, M., Rosati, R.: Tractable reasoning and efficient query answering in description logics: TheDL-Litefamily. JAR39(3) (2007) 8. Giacomo, G.D., Lenzerini, M., Poggi, A., Rosati, R.: On the update of description logic

ontologies at the instance level. In: Proc. of AAAI 2006. (2006) 1271–1276

9. Wang, Z., Wang, K., Topor, R.W.: A new approach to knowledge base revision in DL-Lite.

In: AAAI. (2010)

10. Calvanese, D., Kharlamov, E., Nutt, W., Zheleznyakov, D.: Evolution ofDL-Liteknowledge bases. In: International Semantic Web Conference (1). (2010) 112–128

11. Calvanese, D., Kharlamov, E., Nutt, W., Zheleznyakov, D.: Evolution of DL-Lite knowledge bases (extended version). Technical Report KRDB11-3, (KRDB Research Centre)

12. Kharlamov, E., Zheleznyakov, D.: Understanding inexpressibility of model-based ABox evolution in DL-Lite (extended version). (Technical Report KRDB11-4)

13. Baader, F., Calvanese, D., McGuinness, D., Nardi, D., Patel-Schneider, P.F., eds.: (The Description Logic Handbook)

14. Artale, A., Calvanese, D., Kontchakov, R., Zakharyaschev, M.: TheDL-Litefamily and relations. J. Artif. Intell. Res. (JAIR)36(2009) 1–69

15. Artale, A., Calvanese, D., Kontchakov, R., Zakharyaschev, M.: TheDL-Litefamily and relations. J. of Artificial Intelligence Research36(2009) 1–69

16. Poggi, A., Lembo, D., C., D., Giacomo, G.D., Lenzerini, M., Rosati, R.: Linking data to ontologies. J. on Data SemanticsX(2008) 133–173

17. Calvanese, D., Kharlamov, E., Nutt, W.: A proof theory forDL-Lite. In: Proc. of DL 2007.

(Volume 250.) 235–242

Références

Documents relatifs

In recent years, there has been growing interest in ontology-based data access, in which information in the ontology is used to derive additional answers to queries posed over

We consider stan- dard decision problems associated to logic-based abduction: (i) existence of an expla- nation, (ii) recognition of a given ABox as being an explanation, and

In Section 4 we give an intuition of our approach to capture WS of evolution for DL-Lite R KBs using prototypes and FO[2] theories.. Finally, we discuss properties and approximation

Also, we present polynomial time algorithms for the evolution of the instance level knowledge bases expressed in DL-Lite A,id , which the most expressive Description Logics of

(i) [3] considers the very expressive DL SHIN , in which all the semantics considered by our approach are intractable ([6]); (ii) the two approaches are based on different se-

Thus, we examined formula-based approaches and proposed two novel semantics for ABox updates, called Naive Semantics and Careful Semantics, which are closed for DL-Lite.. For both,

The DL-Lite family consists of various DLs that are tailored towards conceptual modeling and allow to realize query answering using classical database techniques.. We only

We present three main results: (1) TBoxes formulated in the extension of DL-Lite core with ELI-concepts on the right-hand side of concept inclusions can be learned in polynomial