• Aucun résultat trouvé

3 Model checking uniform CGSOs

N/A
N/A
Protected

Academic year: 2022

Partager "3 Model checking uniform CGSOs"

Copied!
15
0
0

Texte intégral

(1)

To appear in EPTCS.

c Franc¸ois Laroussinie, Nicolas Markey, Arnaud Sangnier This work is licensed under the

Creative Commons Attribution License.

Franc¸ois Laroussinie

LIAFA

Univ. Paris Diderot & CNRS, France

Nicolas Markey

LSV

ENS Cachan & CNRS, France

Arnaud Sangnier

LIAFA

Univ. Paris Diderot & CNRS, France

Alternating-time temporal logic with strategy contexts (ATLsc) is a powerful formalism for expressing properties of multi-agent systems: it extendsCTLwithstrategy quantifiers, offering a convenient way of expressing both collaboration and antagonism between several agents. Incomplete observation of the state space is a desirable feature in such a framework, but it quickly leads to undecidable verification problems. In this paper, we prove thatuniformincomplete observation (where all players have the same observation) preserves decidability of the model checking problem, even for very expressive logics such asATLsc.

1 Introduction

Model checking is a powerful technique for automatically checking properties of computerized sys- tems [Pnu77, CE82, QS82]. Model-checking algorithms classically take as input a model of the system under analysis (e.g. a finite-state automaton), and a formal property (expressed e.g. in some temporal logic, such asLTLorCTL) to be checked; they then automatically and exhaustively verify whether the set of behaviors of the model satisfies the property.

During the last 15 years, model checking has been extended to handle complex systems, whose behaviors are the result of the interactions of several components. Games played on graphs are a convenient model for representing such interactions, and temporal logics have been hence proposed in order to express relevant properties in such a setting. One of the specification language to navigate in the execution trees of multi-agents systems is the temporal logicATL[AHK02] (Alternating-Time Temporal Logic); it is an extension of the branching temporal logicCTLwhich allows to express properties such as the fact that a component can enforce a certain behavior independently of the actions performed by the other components.ATLhas then be enriched in different ways to obtain more expressive logics for multi-agent systems. In particular,ATLsc(ATLwith strategy contexts) [BDLM09, LM15] and Strategy Logic [CHP07, MMV10] are two powerful extensions with similar properties in terms of expressive power and algorithmic properties. It was furthermore proved that those two logics have decidable, but Tower-complete model-checking algorithms.

In the approaches cited above, it is always assumed that all the players in the games have perfect observation of the state of the game, and that they also have perfect recall of the sequence of states that have been visited. In other words, they can choose an action to perform based on the entire sequence of states visited before. However, in many applications, components only have bounded memory, and most often they do not have the ability to fully observe all the other components of the system. While considering imperfect recall—the hypothesis that each player can only store into a finite memory the history of the seen states seen—can greatly simplify verification algorithms (since the number of strategies in the systems becomes finite), partial observation is known to makeATLmodel checking undecidable [AHK02, DT¸ 11].

Such results obviously carry over to more expressive logics likeATLsc. Decidability can be regained by

This work was partly supported by ERC Starting grant EQualIS (FP7-308087) and by FET project Cassting (FP7-601148).

(2)

restricting to imperfect-recall strategies [Sch04], or by considering hierarchical information [BMV15] or special communication architectures in distributed synthesis [KV01, Sch08].

In this paper, we consider a restricted case of partial observation, where all the players have the same information about the state space. We call such a caseuniformpartial observation. We prove that under this hypothesis, model checking concurrent game structure is decidable, even for the powerful logicATLsc. In particular, it is decidable whether there exists a strategy, based only on a subset of atomic propositions (assuming that the precise states and the other propositions are not visible), to enforce a given property.

Note also that the restriction to uniform observation is not significant when one looks for a strategy of a single agentaagainst all other players, since the semantics we use requires that the strategy be winning for any outcome (hence the exact observation of the opponent players is irrelevant). The decidability proof for model-checking under uniform partial observation is obtained by adapting a previous approach developed in [DLM12, LM15], which consists in transforming the model-checking problem forATLscinto a model- checking problem forQCTL, an extension ofCTLwith propositional quantification. A similar technique also allows us to prove that when restricting the strategy quantifiers to range over memoryless strategies, then the model-checking problem forATLscwith partial observation is again decidable. We finally prove that satisfiability checking forATLscwith partial observation (i.e., deciding whether there exists a game structure with partial observation satisfying a given formula ofATLsc) is undecidable, even in the case of turn-based games (where satisfiability is decidable under full observation [LM13]).

2 Definitions

2.1 Game structures with partial observation

In this paper, we consider concurrent games with partial observation. They correspond to classical concurrent game structures [AHK02] where, for each agent, an equivalence relation over the states of the structure defines sets of states that are observationally equivalent for this player. Observation equivalence extends to sequences of states in the obvious way. The strategies of the agents then have to be compatible with their observation, in the sense that after two observationally equivalent plays, a strategy has to return the same action. In this preliminary section, we formalize this setting.

All along this paper, we consider a setAPof atomic propositions. We recall that aKripke structure overAPis a tuplehQ,R, `iwhereQis a countable set of states,R⊆Q×Qis the transition relation and

`:Q→2APis a state-labeling function.

Definition 1. Aconcurrent game structure with partial observation(CGSO) is a tupleC =hQ,R, `,Agt, M,Mov,Edge,(∼a)a∈Agtiwhere:hQ,R, `iis a finite-state Kripke structure;Agt={a1, . . . ,ap}is a finite set of agents(or players); M ={m1, . . . ,mr}is a finite set of moves (or actions); Mov: Q×Agt→ 2Mr{∅}defines the set of available moves of each agent in each state; Edge:Q×MAgt →Q is a transition table associating, with each state q and each set of moves of the agents, the resulting state q0, with the requirement that(q,q0)∈R; finally,(∼a)a∈Agt assigns to each player an equivalence relation over Q.

In the following, we assume w.l.o.g. (w.r.t existence of specific strategies) thatMov(q,a) =M for every q∈Qand a∈Agt: all the actions are available to all the players at any time. A move vector is a vector m∈MAgt; for such a vector and for an agenta∈Agt, we denote by m[a]the move of agentain m. Aturn-based game structure with partial observation(TBGSO) is a CGSO for which there exists a mappingOwn:Q→Agtsuch that for anyq∈Qand for any two move vectorsmandm0 ifm[Own(q)] =m0[Own(q)], thenEdge(q,m) =Edge(q,m0).

(3)

As we mentioned above, each relation∼awitha∈Agtcharacterizes the observation of the agentain the game structure: ifq∼aq0, then agentais not able to make a distinction between the statesqandq0; in such a case, we say thatqandq0area-equivalent. As an important special case, an observation relation (∼a)a∈Agt is said to beuniformwhen∼a=∼a0 for alla,a0∈Agt. In that case we might substitute the set (∼a)a∈Agt by a unique equivalence relation∼.

A finite path in the CGSO is a finite non-empty sequence of states ρ =q0q1q2. . .qk such that (qi,qi+1)∈Rfor alli∈ {0, . . . ,k−1}. Aninfinite path(or run) is an infinite sequence of states such that each finite prefix is a finite path. We denote byPath(resp.InfPath) the set of finite (resp. infinite) paths.

Let|ρ|the length of the pathρ(with|ρ|=∞ifρ is infinite). For 0≤i<|ρ|, we writeρ(i)to represent thei+1-th element of the pathρ. For a pathρ, we writefirst(ρ)for its first elementρ(0)and, when

|ρ|<∞, we writelast(ρ)for its last elementρ(|ρ|). For 0≤i<|ρ|, we denote byρ≤i the prefix ofρ until positioni, i.e. the finite pathρ(0)ρ(1). . .ρ(i). We extend the equivalence relation∼afora∈Agt to paths as follows : two pathsρandρ0 area-equivalent (writtenρ∼aρ0) if, and only if,|ρ|=|ρ0|and ρ(i)∼aρ0(i)for every 0≤i<|ρ|.

Given a CGSOC and one of its statesq0, we writeTC(q0)for the execution treeofC from q0: formally,TC(q0)is the pairhT, `iwhereT is the set of all finite paths (callednodesin the context of trees) inC with first stateq0, and`labels each nodeρwith the labeling oflast(ρ)inC. It will be convenient in the sequel to see execution trees as infinite-state Kripke structures. To alleviate notations, we still write TC(q0)for the Kripke structure hT,R, `iwherehT, `i is the tree defined above, andR⊆T×T is the transition relation such that(ρ,ρ0)∈Rwheneverρis the prefix ofρ0 of length|ρ0| −1.

Astrategyfor agenta∈Agtis a function fa:Path→M; it associates with any finite path a move to be played by agentaafter this path. A strategy fafor agenta∈Agtis said to bememorylesswhenever for any two finite pathsρandρ0 such thatρ(|ρ| −1) =ρ0(|ρ0| −1), it holds fa(ρ) = fa0). Hence the decision of a memoryless strategy depends only on the current control state; for this reason, we may simply give such a strategy as a fonction fa:Q→M. A strategy fAfor a coalition of agentsA⊆Agt is a set of strategy{fa}a∈Aassigning a strategy fato each agenta∈A(note that a strategy for agenta is equivalent to a strategy for coalition{a}). Given a strategy fA={fa}a∈A for coalitionA, we say that a pathρ respects fA from a finite path π if, and only if, for all 0≤i<|π|, we have ρ(i) =π(i)and for all|π| ≤i<|ρ| −1, we have thatρ(i+1) =Edge(ρ(i),m)wheremis a move vector satisfying m(a) = fai)for alla∈A. Given a finite pathπ, we denote byOut(π,fA)the set of infinite pathsρ0 such thatρrespects the strategy fAfromπ. Given a strategygA={ga}a∈Afor a coalitionAand a strategy fB={fb}b∈Bfor a coalitionB, we denote bygA◦fB, the strategy{hc}c∈A∪Bfor coalitionA∪Bsuch that hc=gcfor allc∈Aandhc= fc for allc∈B\A. Finally given a fB={fb}b∈Bfor a coalitionBand a set of agentsA⊆Agt, we denote by(fB)|A (resp.(fB)rA) the strategy{fb}b∈B∩A (resp.{fb}b∈B\A) for coalitionB∩A(resp.B\A).

Partial observation comes into the play by restricting the space of allowed strategies: in our setting, we only consider strategies that arecompatiblewith the observation in the game, which means that after any twoa-equivalent finite pathsρ andρ0, the strategies for agenta have to take the same decisions (i.e. fa(ρ) = fa0)). We could equivalently define a compatible strategy foraas a function from the quotient setPath/∼a toM, such that if[ρ]is the equivalence class ofρwith respect to∼a, then fa([ρ]) gives the move to play forafrom any history equivalent toρ. A strategy fA={fa}a∈A for coalitionA iscompatibleif fais compatible for alla∈A. We writeStrat(A)to denote the unrestricted strategies for coalitionA, Strat(A) for the set of compatible strategies, andStrat0(A) is the set of compatible memoryless strategies forA.

(4)

2.2 ATLwith strategy contexts

We will be interested in the logicATLsc, which extends the alternating-time temporal logic of [AHK02]

with strategy contexts. We assume a fixed set of atomic propositionsAPand a fixed set of agentsAgt.

Definition 2. The formulas ofATLscare defined by the following grammar:

ATLscss::=P| ¬ϕss∨ψs| h·A·iϕp| h·A·iϕp|LAMϕp|LAMϕp

ϕpp::=ϕs| ¬ϕpp∨ψp|Xϕppp

where P ranges overAPand A ranges over2Agt.

We interpretATLscformulas over CGSOs, within a context (i.e., a preselected strategy for a coalition):

state formulasof the formϕsin the grammar above are evaluated over states, whilepath formulasof the formϕpare evaluated along infinite paths. In order to have a uniform definition, we evaluate all formulas at a given position along a path.

In ATLsc, contrary to the case of classical ATL[AHK02], when strategy quantifiers assign new strategies to some players, the other players keep on playing their previously-assigned strategies. This is what “strategy contexts” refer to. Informally, formula h·A·iϕp holds at positionn alongρ under the contextF if it is possible to extendF with astrategyfor the coalitionAsuch that the outcomes of the resulting strategy afterρ≤nall satisfyϕp. In Section 4, we also consider the strategy quantifiers h·A·i0, which quantifies only onmemorylessstrategies. Finally strategies can be dropped from the context using the operatorL-M. Notice that in this paper, all strategy quantifiers are restricted to compatible strategies.

We now define the semantics formally. LetC be a CGSO with agent setAgt,ρ be an infinite path ofC, andn∈N. LetB⊆Agtbe a coalition, and fB∈Strat(B). That a (state or path) formulaϕholds at a positionnalongρinC under strategy context fB, denotedC,ρ,n|=fB ϕ, is defined inductively as follows (omitting atomic propositions and Boolean operators):

C,ρ,n|=fB LAMϕs iff C,ρ,n|=(fB)rA ϕs

C,ρ,n|=fB LAMϕs iff C,ρ,n|=(fB)|A ϕs

C,ρ,n|=fB h·A·iϕp iff ∃gA∈Strat(A).∀ρ0∈Out(ρ≤n,gA◦fB).C,ρ0,n|=gAfB ϕp

C,ρ,n|=fB h·A·iϕp iff C,ρ,n|=fB h·Agt\A·iϕp

C,ρ,n|=fBp iff C,ρ,n+1|=fB ϕp

C,ρ,n|=fBϕpp iff ∃l≥0.C,ρ,n+l|=fBψpand∀0≤m<l.C,ρ,n+m|=fBϕp

Finally, we write C,q0 |=ϕs when C,ρ,0|=f/0 ϕs (with empty context) for all path ρ such that ρ(0) =q0. Notice that this does not depend on the choice ofρ. The usual shorthands such asFandG are defined as forCTL. It will also be convenient to use the constructs J·A·Kϕpas a shorthand for¬ h·A·i ¬ϕp, and h·A·iϕsas a shorthand forh·A·i ⊥Uϕs. TheCTLuniversal path quantifiersEand Acan be expressed inATLsc. For instance, Aϕpis equivalent to L/0Mh·/0·iϕp. Finally the fragmentATLscofATLscis defined as usual, by restricting the set of path formulas to

ϕpp::=¬ϕp|Xϕsss.

Remark 3. The strategy quantifiers for complement coalitions (namely h·A·i and LAM) are only useful in case the setAgtis not known in advance, as it is the case when dealing with satisfiability or with expressiveness questions.

(5)

Remark 4. As opposed to Strategy Logic [CHP07, MMV10], the (existential) strategy quantifiers in ATLscinclude an implicit (universal) quantification over the outcomes of the strategies of the unassigned players. This is indeed a quantification over all the outcomes, and not over the outcomes that would result only from compatible strategies.

Example 5. Consider the CGSOC in Figure 1. It is a turn-based CGS with two players (Player a1plays in circle node, and Player a2 plays in box node). The setM is{1,2,3}. The partial observation for Player a1is then given by∼a1 is≡P (i.e. two states are equivalent if, and only if, the truth value of P is the same in both states). Now consider the formulaϕ=h·a1·iFf . To see thatϕ holds for true in q0 with the standard semantics (where the strategy quantifiers are not restricted to compatible strategies), it is sufficient to consider the (memoryless) strategy for a1consisting in choosing the move m1from q2 and the move m2(or m3) from q3. Note that this strategy is not compatible (after q0q2and q0q3, agent a1 should choose the same move), thus we need to consider another one to show thatϕis satisfied with the partial-observation semantics; for example, the strategy consisting in choosing m1after q0q2and q0q3, m2after q0q1q3and m1after q0q3q2. If furthermore we look for a memoryless compatible strategy for a1, we can use the strategy assigning the move m1to q2and q3, hence the formula h·a1·i0Ff also holds true in q0. Therefore, formulaϕholds true at q0. On the other hand, formula h·a1·i0X X Xf does not hold true in q0, but formula h·a1·iX X Xf , where we drop the memory constraint, does hold true.

q0

q1

P

q2

P

q3

q4

f

q5 m1

m3

m2

m1;m2;m3

m1

m2;m3

m1

m2;m3

C

Fig. 1: The turn-based CGSC

In the sequel we will consider themodel-checking problemofATLsc over CGSOs which takes as input a CGSOC, a control stateq0and anATLscformulaϕ and which answers yes ifC,q0|=ϕand no otherwise. We will also consider thesatisfiability problemwhich given a formula ofATLscdetermines whether there exists a CGSOC and a control stateq0such thatC,q0|=ϕ. We recall that when considering concurrent game structures (with full observation),ATLscmodel-checking is decidable, but its satisfiability problem is undecidable (except when restricting to turn-based games) [LM15].

2.3 From concurrent games to turn-based games

Following our definitions, turn-based game structures can be seen as special cases of concurrent game strcutres, where in each location only one player may have several non-equivalent moves. In this section, we prove that any partial-observation CGSO can be turned into anequivalentpartial-observation TBGSO (whereequivalentwill be made precise later). While all players play at the same time in a CGSO, they play one after the other (in any predefined order) in the correspondig TBGSO, but the intermediary states are made undistinguishable to all players, so that no player can gain information from playing after another one. Figure 2 schematically represents this transformation in the case of two players. Obviously, since we add intermediary states, we also have to modify theATLsc formula to be checked, by making the intermediary states “invisible”; this is a classical construction in temporal logics. In the end, we prove the following result:

(6)

q0

q1 q2 q3

hm1,m1i hm2,m1i

hm1,m2i hm2,m2i

q0

q1 q2 q3

m1 m2

m1 m2 m1 m2

Fig. 2: Turning a CGSO into an equivalent TBGSO

Theorem 6. For any CGSOC=hQ,R, `,Agt,M,Mov,Edge,(∼a)a∈Agti, we can compute a TBGSOT = hQ0,R0, `0,Agt,M,Mov0,Edge0,(≈a)a∈Agtiwith Q⊆Q0, for which there is a logspace translationϕ∈ ATLsc7→ϕe∈ATLscsuch that for anyϕ∈ATLscand any state q ofC, it holds:

C,q|=/0ϕ ⇔ T,q|=/0ϕ.e Furthermore, ifC is uniform, then so is T.

2.4 QCTLin a nutshell

As we explain in the sequel, under some restrictions (uniformity or restriction of the considered strategies), the model-checking problem ofATLscover CGSOs can be reduced to the model-checking problem of the temporal logic QuantifiedCTLover Kripke Structures.QCTLextends the classical branching time temporal logicCTL withatomic propositions quantifiers∃P.ϕs(and its dual∀P.ϕs), wherePis an atomic proposition inAP:

QCTLss::=P| ¬ϕss∨ψs|Eϕp| ∃P.ϕs

ϕpp::=ϕs| ¬ϕpp∨ψp|Xϕppp

where Pranges over AP. We briefly reviewQCTL here, and refer to [LM14] for more details and examples.

QCTLformulas are then interpreted over a (classical) Kripke structureS =hQ,R, `i. Informally

∃P.ϕsis used to specify the existence of a valuation forPoverS such thatϕsis satisfied. We consider two different semantics of∃P.ϕ: thestructure semanticswhere one looks for a valuation of the structureS, and thetree semanticswhere one considers the valuation of its execution tree.

ForX⊆AP, two Kripke structuresS =hQ,R, `iandS0=hQ0,R0, `0iareX -equivalent(denoted by S ≡X S0) wheneverQ=Q0,R=R0, and`≡X`0(i.e.,`(q)∩X =`0(q)∩Xfor anyq∈Q). Thestructure semanticsofQCTL (whose satisfaction relation is denoted by|=st) is derived from the semantics of CTLby adding the following rule:

S,ρ,n|=st∃P.ϕs iff ∃S0.S0APr{P}S andS0,ρ,n|=stϕs.

In other terms,∃P.ϕsmeans that it is possible to (re)label the Kripke structure withPin order to makeϕs

hold. As for CGSO, we writeS,q0|=stϕswheneverS,ρ,0|=stϕsfor all pathρsuch thatρ(0) =q0. Thetree semantics(whose satisfaction relation is denoted by|=tr) is defined as:S,q0|=trϕsif, and only if,TS(q0),q0|=stϕs(whereTS(q0)is the execution tree ofS fromq0, seen as an infinite-state Kripke structure).

(7)

Example 7. Consider formula EXϕ∧ ∀P.EX(ϕ∧P)⇒AX(P⇒ϕ), which we write EX1ϕ in the sequel. This formula states the existence of a unique immediate successor satisfyingϕ. We will reuse it later in our construction.

Theorem 8([LM14]). 1. For the structure semantics, the model-checking problem of QCTL is PSPACE-complete and the satisfiability problem is undecidable.

2. For the tree semantics, the model-checking and satisfiability problems ofQCTLare decidable, and Tower-complete.

3 Model checking uniform CGSOs

As we have already mentioned, it is well known that the model-checking ofATLover CGSO is undecid- able [AHK02, DT¸ 11]. SinceATLis a fragment ofATLsc, this undecidability result holds also forATLsc. We prove in this section that decidability can be regained by restricting to uniform partial observation, i.e., by assuming that the observation is the same for all the agents. The idea consists in reducing the model-checking problem ofATLscover uniform CGSOs to the model-checking problem forQCTLwith tree semantics over a complete Kripke structure representing the possible observations.

In the sequel, we consider a uniform CGSOC =hQ,R, `,Agt,M,Mov,Edge,(∼a)a∈Agtiwith the finite set of statesQ={q0, . . . ,qs}, the finite set of movesM ={m1, . . . ,mr}and∼a=∼for alla∈Agt.

For eachq∈Q, we denote by[q]the equivalence class ofqwith respect to∼,i.e.[q]⊆2Qis such that q0∈[q]if, and only if,q∼q0. We also consider anATLscformulaϕswhose h·-·i-depth (i.e. the maximal number of nested modalitiesh·-·i) isλ(we assume here that modalitiesh·-·i andL-M have been previously removed fromϕs, as the set of agents is known). The idea behind our proof is to consider a Kripke structureSC as a complete graph whose set of nodes is the set of equivalence classes[q]. We then use the power ofQCTL(with the tree semantics) to build paths and strategies in this Kripke Structure. The fact that, in the Kripke structure, the paths go through equivalent classes of states (and not through states), guarantees that the corresponding strategies are compatible with the partial observation.

Before we build the Kripke Structure, we need to introduce some sets of fresh atomic propositions (not appearing inC or inϕs) which we will use to encode paths and strategies in the Kripke structure. Let APQ={q`i |1≤i≤sand 0≤κ≤λ},APaM={ma1, . . . ,mar}for everya∈Agt, andAPM=Sa∈AgtAPaM andAPS={si|1≤i≤s}be those sets of fresh atomic propositions. We assume thatAPQ,APM and APSare subsets ofAP. Intuitively, propositions inAPQwill be used to fix a path ofC at each levelκ of quantification (a path at levelκbeing a sequence of statesqκi ); propositions inAPaM will be used to label the choices corresponding to a strategy of agenta.

The Kripke structureSC associated withC is then defined ashQC,RC, `CiwhereQC ={[q]|q∈Q}, RC =QC×QC and`C([q]) ={si|qi∈[q]}. Compatible strategies can then be encoded as functions labeling the execution trees of this Kripke structure (from the state where we evaluate the formula):

a strategy fafor some agent ais represented as a function fa:Q+C →APaM that labels the execution treesTSC(q)ofSC from the current stateqwith proposition inAPaM. However note thatSC being a complete graph, paths in this structure might not correspond to paths in the associated CGSOC; we will use atomic propositionsqito identify concrete paths ofC.

Given a coalitionBinAgt, an integer 0≤κ≤λ, and a subformula ofϕsath·-·i-depthκ, we define a QCTLformulaϕeB,κinductively as follows:

PeB,κ= _

{i|P∈`(qi)}

qκi LAMϕs

B,κ

s

BrA,κ

(8)

ϕ∧ψB,κeB,κ∧ψB,κ ¬gϕB,κ=¬ϕeB,κ

ϕ·pp

B,κ=fϕp

B,κfp

B,κ

Xfiϕp

B,κ=Xfϕp

B,κ

For a formula of the shape1 h·a·iϕp, the construction is as follows:

·h·a·iϕp

B,κ=∃ma1. . .maκ.hΦstrat({a})∧ ∀qκ+11 . . .qκ+1s .Φpath(κ+1)∧ _

1≤i≤s

(qκi ∧qκ+1i )∧Φout(κ+1,B∪ {a})⇒AG _

1≤i≤s

qκ+1ifϕp

B∪{a},κ+1i

with

Φstrat(A) =^

a∈A

AG _

1≤j≤r

(maj^

j06=j

¬maj0)

Φpath(κ+1) =EG _

1≤i≤s

Äqκ+1i ∧si^

i06=i

¬qκ+1i0

ä∧EX1( _

1≤i≤s

qκ+1i )

Φout(κ+1,B) =EG _

1≤i≤s

_

(m,qj)

m∈MAgt qj=Edge(qi,m)

©

qκ+1i ∧mB∧EXqκ+1j

wheremBis a shorthand for the formulaV{(b,mk)|b∈B∧m[b]=mk}mbk. We point out the fact thatΦout(κ+1,B) is based on the transition tableEdgeofC; consequently, its size is inO(|Q|2· |M||Agt|)(i.e., inO(|Q| ·

|Edge|)).

The intuition behind formula·h·a·iϕp

B,κ

is as follows: the formula first “selects” a strategy for agenta under the form of a labeling of the execution tree ofSC withmai; it then uses subformulaΦstrat({a})to ensure that this labeling correctly encodes a strategy fora; finally, it checks that all the outcomes of the selected strategy satisfyϕp. The latter task is achieved by considering all the labelings of the structure withqκ+1 , and, for the labelings that correspond to one outcome of the selected strategy, by checking the formulaϕp. Ensuring that a labeling corresponds to an outcome is done as follows:

1. it corresponds to an infinite branch in the execution tree ofSC, and each node labeled byqκ+1i should correspond to a node [qi](labeled by si) in SC (both points are ensured by formula2 Φpath(κ+1));

2. at the present position, one of the propositionsqκ+1i has to match with one of the stateqκi of the previous level (this ensures that the path labeled byqκ+1 starts from the “current state” considered in the game);

3. the branch obtained by this labeling effectively follows the choices dictated by the labels mb encoding the strategies forb∈B∪ {a}; this is checked by the formulaΦout(κ+1,B∪ {a}).

Finally, the formula checks that the corresponding path satisfies the formulafϕp

B∪{a},κ+1

. The correctness of the reduction is stated in the following theorem:

1For the sake of readability, we restrict to one-player coalitions here; the construction easily extends to the the general case with a coalition (including the empty coalition).

2See Example 7 for the definition ofEX1

(9)

Theorem 9. Letϕsbe anATLscstate-formula,C be a uniform CGSO, and qαbe a state ofC. Then:

C,qα|=ϕs if, and only if, SC,[qα]|=tr∃q0α.ϕs

/0,0∧q0α

Proof.First we point out the fact that none of the formula used in the reduction checks that the considered strategies are compatible, but in fact this is guaranteed because we evaluate the formula over the Kripke structureSC, and two equivalent paths inC with respect to∼are necessarily matched to a unique path in the execution tree ofSC.

We now prove that our reduction is correct. LetTSC([qα]) =hTC,R0C, `0Cibe the execution tree of the Kripke structureSC from the state[qα]. Note that nodes inTSC([qα])are thus finite paths of the form [q0][q1]. . .[qk](withq0=qα). For a finite pathπ inC such thatπ(0) =qα, we denote by[π]the path inSC such that|π|=|[π]|and for all 0≤i<|π|, we have[π](i) = [π(i)]. We also denote bypath(π) the path inTSC([qα])such that|path(π)|=|π|and for all 0≤i<|π|, path(π)(i) = [π≤i]. We now consider the treeT0=hTC,R0C, `0iwhere`0 extends the function`0S with propositions of typemaj andqκi. This extension is used to encode a strategy context and to describe a path ofC. Given a compatible strategy fB={fb}b∈B for the coalitionB={b1, . . . ,bh}, an infinite pathρinC suchρ(0) =qα and a positionn≥0, we say that`0is:

• an fB-labeling whenever, for every nodeγ∈TC withγ= [π]for some finite pathπinC, for any b∈B, for any 1≤ j≤r, we havembj∈`0(γ)if, and only if, fb(π) =mj;

• a(κ,ρ)-labeling if the following two conditions are verified:

1. for all j≥0, it holdsqκi ∈`0([ρj])if, and only if,ρ(j) =qi

2. ifqi∈`0(π)for a nodeπ∈TC, then there exists j≥0 such thatπ= [ρj].

In other words, the propositionsqκ1, . . . ,qκs label a unique branch in the tree, that can be matched with the pathρ.

We say that for 0≤κ ≤λ, a subformulaϕ ofϕsoccurs at h·-·i-depthκ, if the number of h·-·i- quantifier aboveϕ in tree representing formulaϕsisκ.

Proposition 10. Letϕbe some subformula ofϕswhich occurs at h·-·i-depthκwith0≤κ≤λ,ρbe a run ofC such thatρ(0) =qα, n be a natural number, fBbe a compatible strategy for coalition B. Let T0=hTC,R0C, `0ibe the tree defined as above. If`0is an fB-labeling and a(κ,ρ)-labeling, then we have:

C,ρ,n|=fB ϕ if, and only if, T0,path(ρ),n|=stϕeB,κ.

Proof.The proof is done by structural induction overϕ. In the following, we letqα0=ρ(n).

• caseϕ=P: we haveρ(n)|=fB Pif, and only if,P∈`(qα0). As`0 is a(κ,ρ)-labeling, we know that T0,path(ρ),n|=qκα. By definition ofPeB,κ, the implication follows. Conversely assume T0,path(ρ),n|=PeB,κ, we know thatρ(n)has to be labeled byP, because`0is a(κ,B)-labeling.

• caseϕ=ϕpp: ifC,ρ,n|=fB ϕpp, then there existsi≥ns.t.C,ρ,i|=fBψpand for anyn≤ j<i, we haveC,ρ,j|=fBϕp. By i.h., we getT0,path(ρ),i|=ψfp

B,κand, for anyj,T0,path(ρ),j|= fϕp

B,κ; from this we obtainsT0,path(ρ),n|=ϕeB,κ. The converse is similar.

• caseϕ=h·a·iϕp: AssumeC,ρ,n|=fB h·a·iϕp. Then there exists a∼-compatible strategy fas.t. for anyρ0∈Out(ρ≤n,fa◦fC), we haveρ0|=fafB ϕp. From this strategy fa, we deduce a valuation for propositions ma1, . . . ,mar extending `0 over T (because fa is ∼-compatible), and satisfying Φstrat({a}). Now extend`0with a valuation forqκ+11 , . . . ,qκ+1s following the runρ0(i.e. for every stateρ0(i) =qβ, the corresponding node[ρ0](i)is labeled byqκ+1β , and only[ρ0]-nodes are labeled by these propositions). Let`00be this new valuation forT0.Then clearly we have:

(10)

– T0,path(ρ),n|=Φpath(κ+1), sinceqpropositions label a path;

– T0,path(ρ),n|=qκα0∧qκ+1α0 , because the current position belongs to the runsρandρ0, and the new runρ0is issued from the current position;

– T0,path(ρ),n|=Φout(κ+1,C∪ {a}), meaning that the labeling ofqκ+1 propositions follows the ”correct” pathρ0 fromOut(ρ≤n,fa◦fB);

– finally,T0,path(ρ0),n|=fϕp

B∪{a},κ+1

by i.h.

Therefore we have:T0,path(ρ),n|=ϕeB,κ.

We now prove the converse implication. Assume T0,path(ρ),n|=ϕeB,κ. From the existence of a labeling forma1, . . . ,mar satisfyingΦstrat({a}), we deduce a ∼-compatible strategy fa inC for every finite runs issued fromρ(≤n)Now consider a valuation forqκ+11 , . . . ,qκ+1s . Either it makes the left-hand side subformula of the implication to be false, and there is no consequence, or this subformula is true and in this case, the valuation describes a runρ0inC issued fromρ(n) and belonging in Out(ρ≤n,fa◦fB); this run has to satisfy fϕp

B∪{a},κ+1

, and by i.h. we get that C,ρ0,n|=fafBϕp.

Corollary 11. The model-checking problem of ATLscover uniform CGSOs is decidable (andTower- complete).

Remark 12. Our algorithm can be used to decide whether one player (with partial observation of the system) has a compatible strategy to win against all the other players, whatever is the observation of the other players (since the implicit quantification in strategy quantifiers ranges over all the outcomes).

As a consequence, when considering the fragment ofATLin which strategy quantifiers always involve at most one-player coalitions, the model-checking problem is decidable for possibly non-uniform partial observation. It can be checked that indeed the undecidability proof of [DT¸ 11] involves a strategy quantification over a coalition of two players with different observation.

4 Restriction to memoryless strategies

In this section, we show that another way to obtain decidability for the model-checking problem ofATLsc over game structure with (possibly non-uniform) partial observation is by restricting the set of considered strategies to memoryless strategies. We denote byATLsc,0, the temporal logic obtained fromATLscby replacing respectively the quantifiers over strategiesh·A·i by h·A·i0 and h·A·i by h·A·i0. Given a CGSOC a pathρ ofC, andn∈Nsuch thatn<|ρ|and fB∈Strat0(B), the semantics of these two operators is given by:

C,ρ,n|=fB h·A·i0ϕp iff ∃gA∈Strat0(A).∀ρ0∈Out(ρ≤n,gA◦fB).C,ρ0,n|=gAfBϕp

C,ρ,n|=fB h·A·i0ϕp iff ∃gA∈Strat0(Agt\A).∀ρ0∈Out(ρ≤n,gA◦fB).C,ρ0,n|=g

AfB ϕp

In [LM15], a reduction from model-checkingATLsc,0andATLsc,0toQCTLmodel-checking problem over concurrent game structure with perfect observation is given, the main idea is to use the structure semantics instead of the tree semantics. In this framework the complexity is simpler, since both problems arePSPACE-complete: the number of memoryless strategies for one player being bounded (by|M||Q|), we can easily enumerate all of them, and store each strategy within polynomial space. When considering partial observation, we can use the same approach, we need to also ensure that the chosen strategies are compatible, but this can easily be achieved when considering memoryless strategies, since one only needs to check that a strategy faproposes for two equivalent states (w.r.t.∼a) the same move.

(11)

LetC =hQ,R, `,Agt,M,Mov,Edge,(∼a)a∈Agtibe a CGSO with finite set of statesQ={q0, . . . ,qs}, finite set of movesM={m1, . . . ,mr}and the finite set of agentsAgt={a1, . . . ,ap}. For eacha∈Agt, we suppose that the number of equivalence classes for∼aisnaand we use the notationEiawith 1≤i≤nato represent thei-th equivalence class of∼a. We denote byKC the Kripke structurehQ,R, `KiunderlyingC based on the same states and the same transitions, and where the labeling function extends`by adding labels from the sets{Pq|q∈Q} ∪ {Pi a |a∈Agt∧1≤i≤na}in such a way that the following two conditions hold:

• Pq∈`K(q0)if, and only if,q0=q,

• for alla∈Agt,Pi a∈`K(q)if, and only if,q∈Eia.

Below we show how to translate a formulaϕsofATLsc,0 into a formula ofϕsofQCTL such that we will haveC,q0|=ϕsfor a stateq0ofC iffKC,q0|=stϕs. Note that at the opposite of the previous section, we consider here thestructure semantics[LM14] when performing the model-checking ofKC: this means that instead of ranging over labelings of the execution tree, propositional quantification ranges over labelings of the Kripke structure. This is due to the fact that the compatible memoryless strategies we take here into account are functions mapping equivalent states (and not anymore path of equivalent states) to the same move. Given a coalitionBinAgtandϕ a subformula ofϕs, we define aQCTLformulaϕbB inductively as follows:

PbB=P ÷LAMϕs

B

s

BrA

÷ϕ∧ψBbBψB ¬dϕB=¬ϕbB ϕpp

B=cϕp

Bcp

B

p

B=Xcϕp

B

For a formula of the shape h·a·i0ϕp, we let3:

ÿh·a·i0ϕp

B

=∃mr1. . .maκ.Φstrat0({a})∧Ah0out({a} ∪B))⇒ cϕp

B∪{a}i

where

Φstrat0(A) =^

a∈A

ñ

AG _

1≤j≤r

(maj^

j06=j

¬maj0)^

1≤c≤na

^

1≤j≤r

EF(Pca∧maj)⇒AG(Pca⇒maj) ô

and

Φ0out(B) =G ^

q∈Q

^

(m,q0)

m∈MAgt q0=Edge(q,m)

©

î(Pq∧mB)⇒XPq0ó.

wheremBis a shorthand for the formulaV{(b,mk)|b∈B∧m[b]=mk}mbk. The last FormulaΦ0out(B)characterizes the outcomes of the strategies in use for some coalitionB. We point out that theQCTLformulaϕs

/0has sizeO(|Φ| · |Q| ·(|Agt| · |M|2+|Q| · |Edge|)).

A memoryless and compatible strategy fB can easily be encoded by labeling states ofKC with the moves of the coalitionBthanks to the propositionsma1, . . . ,mar fora∈B. TheΦstrat0(B)is here to ensure both that the labelsmacorrespond to a strategy for coalitionBand to check that such a strategy is memoryless and compatible: for allreachablestates labeled with some move, the formula verifies that all

3As in the previous case, the extension toh·A·i0 is straightforward.

(12)

the equivalent states are labeled by the same move. Note that this property ranges over reachable states:

we could have unreachable states that are labeled in an improper way but this is not a problem as such state will never be reached from the current position (and consequently they cannot impact on the truth value of the formula).

We will now consider the Kripke structureKC0 =hQ,R, `0iwhere`0 extends the function`K with propositions of typemaj. Given a compatible memoryless strategyfB={fb}b∈Bfor the coalitionB, we say that`0is an fB-labeling if for every nodeq∈Q, for anyb∈Band for any 1≤j≤r, we havembj∈`0(q) iff fb(q) =mj(we recall that a memoryless strategy fbcan be seen as a function fromQtoM).

Proposition 13. Letρ be a run ofC, n be a natural number, fB be a compatible strategy inStrat0(B) for coalition B. If`0is an fB-labeling, then we have:

C,ρ,n|=fB ϕs iff KC0,ρ,n|=ϕs

B

Proof.The proof is done by structural induction overΦ.

• Φ=ϕpp: AssumeC,ρ,n|=fBϕpp. Therefore there existsi≥ns.t.C,ρ,i|=fB ψpand for anyn≤ j<i, we haveC,ρ,j|=fBϕp. For every position betweennandi, the induction hypothesis can be applied and we deduceKC0,ρ,n|=ϕcp

Bcp

B. The converse is done similarly.

• Φ=h·a·iϕp:(1)⇒(2). AssumeC,ρ,n|=fB h·a·iϕp. There exists a memoryless compatible strategy fas.t. for anyρ0∈Out(ρ≤n,fa◦fB), we haveC,ρ0,n|=fafB ϕp. Thus we can find a labeling of propositionsma1, . . . ,mar for KC0 to represent fa. This labeling completes the existing one for strategy contextFBand the formulaΦstrat0({a} ∪B)is then satisfied onKC0,ρ,n. And anyKC0-run satisfyingΦ0out({a} ∪B)belongs to the set of outcomes generated by the strategy context fa◦fB, and then satisfiescϕp

{a}∪B

by induction hypothesis.

(2)⇒(1). Now assumeKC0,ρ,n|=ϕbB. Therefore there exists a labeling forma1, . . . ,mar such that Φstrat0({a})holds true. Such a labeling defines a memoryless and compatible strategy for the reachable states (fromρ,n). And finally every run satisfyingΦ0out({a} ∪B)has to satisfyϕcp

{a}∪B

. By induction hypothesis we get the stated result.

Thus we have:

Theorem 14. The model-checking problem ofATLsc,0isPSPACE-complete.

Proof. QCTL model checking can be solved in polynomial space for the structure semantics. This immediately gives aPSPACEalgorithm for our problem. Conversely,ATLsc,0 model checking (with perfect observation) isPSPACE-hard [BDLM09]. This extends immediately to the partial-observation setting.

Remark 15. Quantification over memoryless strategies could also be achieved using the tree semantics, following the presentation of Section 3. To do so, it suffices to label each state with its name (hence adding a few extra atomic propositions) and to require that the labeling of the execution tree with strategies satisfies that whenever some state s is labeled with some move mi, then all the occurrences of s must be labeled with the same mode mi.

While this has little interest for model checkingATLsc,0in terms of complexity, it shows that model checking remains decidable for the logic involving quantification over both memoryful, bounded-memory and memoryless strategies.

(13)

5 Satisfiability and partial observation

In this section, we consider satisfiability checking: given a formulaΦ, we look for a game structureC, an equivalence∼over states, and a control stateq0, such thatC,q0|=Φ. This problem is undecidable for ATLsc(andATLsc) in the classical setting of perfect-observation games.

First note that considering partial observation makes the problem different: there exists formulas that are satisfiable under partial observation, and not satisfiable for full observation. Consider formulaΦ= AX(h·a1·iXf)∧ ¬ h·a1·iX Xf. ClearlyΦis satisfiable when considering games with partial observation:

for example, one can consider the turn-based structure in Figure 3, wherea1plays in circle nodes anda2 plays in box nodes, and where∼is≡P: fromq0, there is no∼-strategy fora1ensuring the propertyX Xf (because from the strategy should play the same move afterq0q1andq0q2), but the subformula h·a1·iXf holds for true inq1and inq2.

q0 P

q1

P

q2

q3

f

q4 1

2

1

1 2 2

C

Fig. 3: The turn-based CGSC s.t.q0|=Φ

But formulaΦis not satisfiable in the classical case: assume that AX(h·a1·iXf)is satisfied in some stateq, then from everyq-successorq0, there is a strategy fq0 fora1to ensureXf. Therefore the strategy for a1 consisting, fromq, to choose an arbitrary move, and then to choose the strategy fq0 for every possible successorq0ensures the propertyX Xf.

From a decidability point of view, considering partial observation does not make satisfiability problems to be simpler: in fact this problem remains undecidable forATLsc. Furthermore, it is even worse than in the classical setting: while the turn-based satisfiability is decidable when perfect information is assumed, it is undecidable when one considers the partial observation case.

Theorem 16. Satisfiability problems forATLsc. ATLsc,ATLsc,0andATLsc,0(with partial observation) are undecidable even restricted to turn-based structures.

Proof.We can mostly reuse the proof of Troquard and Walther [TW12] (we have slighty modified in [LM15]). The key idea of their proof is to reduce S5n satisfiability to ATLsc satisfiability. Given an S5nformulaΦ, one can build anATLscformula h·/0·iÙΦsuch thatΦis satisfiable iff h·/0·iÙΦis satisfiable.

Without considering the details of the proof, one can just note thatÙΦuses Boolean operators and strategies quantifiers h·ai·i (for 1≤i≤n) and formulas of the formXP: there is no U modality and there is no nesting ofX. Therefore with such a formula, every strategies quantifier is interpreted in a unique statew and we only consider the moves done in this statew: adding an equivalence∼over states or considering memoryless strategies do not change the semantics ofÙΦ.

Assume thatΦis satisfiable. From the proof of [TW12], one can build a game structureC satisfying h·/0·iÙΦ. Moreover one can use the reduction of Theorem 6 to get a tun-based games with partial information.

Now assume thath·/0·iÙΦis satisfiable. Therefore there exists a game structureC with an equivalence∼ such thatC,q|= h·/0·iÙΦ. This structure is based on a set of agents{1, . . . ,k}withk≤n. And there is a strategyFforAgtsuch thatC,q|=FÙΦandÙΦmay only modify the choices for playersa1, . . . ,an. Ifk>n,

Références

Documents relatifs

What we want to do in this talk is firstly to explain when (§1) and why (§2) a presupposition trigger becomes obligatory in the very case where it is redundant, and secondly

We illustrate the important expressive power of strategy contexts by proving that they make the extended logics, namely ATL sc and ATL?. sc , equally expressive: any formula

We extend the framework of ATL model-checking to “simply timed” concurrent game structures, i.e., multi-agent structures where each transition carry an integral duration (or

If the single-indexed model checking problem for CPDSs were decidable, so would be the reachability problem. However, it has unfortunately been proven by Ramalingam in [16] that

Although solving a single instance of the satisfiability problem (SAT) is sufficient to decide on the existence of a counterexample for any arbitrary bound typically one starts

First, diagram satisfiability refers to the existence of a model, i.e., an interpretation that satisfies all constraints expressed by the diagram and where at least one class has

This work presents a simple and extensible translation approach from Amalthea to Promela models for analysis with the widely used Spin model checker.. This is a first step

While query answering with Description Logics (DLs) is now well-developed, this is much less the case for reasoning services that support ontology engineering and target query