• Aucun résultat trouvé

A Note on the Value of Zero-Sum Sequential Repeated Games with Incomplete Information By

N/A
N/A
Protected

Academic year: 2022

Partager "A Note on the Value of Zero-Sum Sequential Repeated Games with Incomplete Information By"

Copied!
7
0
0

Texte intégral

(1)

A N o t e o n t h e V a l u e o f Z e r o - S u m S e q u e n t i a l R e p e a t e d G a m e s w i t h I n c o m p l e t e I n f o r m a t i o n

By S. S o r i n , Paris I )

Abstract: We consider repeated two-person zero-sum games with lack of information on both sides.

If the one shot game is played sequentially, it is proved that the sequence v n is monotonic, v n being the value of the n shot game. Moreover the speed of convergence is bounded by K/n, and this is the best bound.

Introduction

The class of games considered in this note are those introduced and studied by A u - m a n n / M a s c h l e r [ 1966, 1967, 1968]. Later it was proved by M e r t e n s / Z a m i r [ 1971 ] that, if v n is the value of an n repeated zero-sum game with lack of information on both sides, lim v n exists and d n = t v n - - lira v n [ is bounded by K / ~ . Then Z a m i r [1972] has proved that it is the best bound. Now, if we consider the "independent" case, where moreover the moves are made sequentially, we can compute v n by using an explicit formula proved by P o n s s a r d [ 1975] for "games with almost perfect information".

Then we prove that the sequence v n is monotonic and that d n is bounded by K i n . We also give an example of a game with d n = 1 / 2 n .

1. The Game

Let ( A r s ) , r E {1 . . . . , R } , s E {1 . . . S}, be m • n matrices viewed as payoff matrices of two-person zero-sum games, with elements 4 f ' i E I = (1 . . . m}, j E J = {1 . . . n}. L e t P (resp. Q) be the simplex of R R (resp. RS).

For eachp C P , q ~ Q, n E N , G n (p, q ) is the n-times repeated game played as follows.

Stage 0. Chance chooses some r (resp. s) according to the probability distribution p (resp. q). Then player I is informed o f r and player II ofs. (All of the above description is common knowledge.)

Stage 1. Player I chooses il E I , player II is told which il was choosen and chooses j1 E J; then player I is informed of i t .

1 ) Dr. S. Sorin, Laboratoire d'Econom~trie, Universit~ Paris 6, 4 Place Jussieu, F-75230 Paris Cedex 05.

(2)

218 S. Sorin

Now this stage is repeated again and again. After the n-stage player I receives from player II the following amount:

1 ~ -Ih~S where (i h , ]h) are the strategies used at the h-th stage. We denote by n h=l ]h'

v n (p, q) the value of G n (p, q).

Le us define, for eachp EP, q EQ:

R S

A 09, q) = ~ ~ pr qS ArS and let u (p, q) be the value of the zero-sum sequential r=l s=l

game with matrix A (p, q). (The game where both players play non revealing strategies.) For each real function f on P • Q we denote by Cf the smallest real function g on P • Q such that:

g (-, q) is concave on P for each q E Q g(p, q)>~ f ( p , q) o n P • Q.

Similarly we define Vf = Vex f.

Q

We can now state the main result of Mertens/Zarnir [1971]:

lim v n (p, q) exists and is the only solution of the system:

/'/---~ o o

x (p, q) = V max ~u (p, q), x (p, q))

x (p, q) = C min {u (p, q), x (p, q)}.

(1)

In order to symplify the notations we shall denote by:

M the maximum over i, i E L m the minimum over/, ] E J,

R S

the expression ~ Z pr qS ~1" (In fact ~ is some ~ (i, ]) but no confusion r=l s=l

will result.) 2. The Results

Proposition 1: For each p EP, q E Q, the sequence v n (p, q) is increasing.

Proof: Using Theorem 1 of Ponssard [1975] we have:

nv n (p, q) = CMVm (Y, + (n -- 1) Vn. 1 (p, q)) for all n/> 1, where vo (p, q) = 0 on P • Q. Then

(2)

2 v2 (p, q) = CMVm (F., + vl (p, q))

>~ CM ( Vm "Z + v l (19, q ) )

(3)

since Vex (a + b)/> Vex (a) + Vex (b) and v n (t 7, q) is convex w.r.t, q, for all n.

Now we have

2 v2 (p, q) >1 C (MVm E + vl (p, q)).

Let us denote by gx (P, q) the function M V m ~, then vl (p, q) = Cgl (p, q).

But Gay (a + Gay (a)) = 2 Cav (a), so that 2v2 (p, q)~> 2Vl (p, q).

Let us assume now that

v n (p, q)>~Vn_ 1 (p, q) o n P • Q.

We have

(n + 1)Vn+ 1 (p, q) = c m v (m ~ + n v n (t 9, q))

>I CMV (m ~ + (n -- 1) Vn. 1 (p, q) + v n (p, q)).

But the right expression is greater than

CM ( V (m Z + (n -- 1) V n. 1 (l 3, q)) + v, (p, q)).

Denoting by g n (p, q) the function M V (m Z + (n -- 1) V n. 1 (p, q)), we obtain (n + 1)Vn+ 1 (p, q) >~C(g n (l 9, q) + n C g n (p, q)) 1

and the right side is n + l

n C g n ( P ' q ) = ( n + l ) v n ( P ' q ) " "

If we denote lim v n (p, q) by v (t9, q) we get obviously.

Corollary 1:

v n (p, q) <<. v (p, q) on P • Q for all n.

Proposition 2 (The Error Term):

I v (lO, q) -- v n (p, q) I <<- Kin on P • Q, for some K E R, and this is the best bound.

(4)

220 S. Sorin

Proof:

We still suppose that player I is the maximizer. We shall write, for i E I,

R S

f i ( p , q ) = - - m ( X

Z p r q S ~ ] ) .

r=l s=l

Note that the 1~ are convex and piecewise linear in both variables.

Let f ( p , q) = Z f/(p, q) - - L where L E R is choosen such that m

i= 1

vl (p, q)/> v (p, q) + f ( p , q) o n P X Q. (vl and v are bounded o n P X Q.) Let us suppose now that:

n v. (p, q) i> n v (p, q) + f (p, q).

Using (2) we have

(n + 1)

Vn+ 1 (p, q) >1 CMVm (Y, + f (p, q) + n v (p, q))

>-CM (V (m Y, + f(p, q)) + n v (p, q))

since v is convex w.r.t.q.

But, by constructions, m 2~ + f ( p , q) is convex w.r.t, q, for each i E I , so that (n + 1)

Vn+ 1 (p, q) >t C (u (p, q) + f ( p , q) + n v (p, q)).

Now Gay (a + b) ~< Cav (a) + Cav (b) so by letting a = u (p, q) + f ( p , q) + n v (p, q) and b = - - f ( p , q), the right member is greater than

C (u (p, q) + n v (p, q)) - C ( - f ( p , q)).

Now - f i s concave w.r.t, p so we obtain

( n + 1)Vn+ 1

( p , q ) > ~ C ( u ( p , q ) + n v ( p , q ) ) + f ( p , q )

/> C ((n + 1) min (u (p, q), v (p, q))) + f ( p , q).

Using (1), the fact that f i s bounded o n P X Q and

v n <<, v

for all n (Cor. 1), we arrive

at the proof. 9

Example:

The following example shows that it is the best bound. Assume that R = 1 and S = 2 (there is lack of information on one side but player I is uninformed). The payoff matrices are given by:

(5)

A l l =

t: 1)tl 2)

h 12 _~_

- 2 0 1

The functions u (q), v n (q), v (q), are given in the diagrams below.

We note that v (1/2) -- v n (1/2) = 1/2n.

Remarks: If there is lack of information on one side, the informed player maximizing and moving first, we obviously have vl (p) ~> Cav ru(p) = v (t7) so that Prop. 1 and Cor. 1 imply v n (p) = v (p) for all n, a result which was already proved by Ponssard/

Zamir [1973].

In the general case of game with lack of information on one side, the sequence v n is monotonic, but decreases if the informed player is the maximizer, as already men- tioned by Aumann/Maschler [ 1968]. This can be seen immediatly if one writes the recursion formula [Zamir] in the following manner:

(n + 1) Vn+ 1 (p) = Max {Min Z p k s k A k t + n Z si V n (Pi))

s t k i

where s = (s I . . . s k . . . . , s r ) , S k is a probability vector over I for all k, t is a probability vector over J , ~ = ~k ~ p k and Pi is the conditional probability over K given i.

Assuming v n (t9) <<, Vn. 1 (p) we have

(n + 1) Vn+ 1 (p) < Max {Min Z p k slc A k t + (n -- 1) )2 si Vn-1 (Pi) +

s t k i

+ Z. Vn (pi)

l

and since v n is concave it follows that

(n + 1) Vn+ 1 (19) ~< Max (Min Z p k s k A k t + (n -- 1) Z g/Vn.l(Pi)) + v n (p).

s t k i

Hence

(n + 1) Vn+ 1 (19) <~ (n -t- 1) v n (t9).

Acknowledgements

This note greatly benefited from extensive discussions with J.P. Ponssard in the framework of ioint research on games with incomplete information.

(6)

I 0 I 0

I 0

o

(7)

References

Aumann, R.J., andM. Masehler: Game Theoretic Aspects of Graduate Disarmament. Report to the U.S.A.C.D.A. Final report on contract ACDA/ST-80, prepared by Mathematica, Princeton, N.J., June 1966, Chapter V, 1-55.

- : Repeated Games with Incomplete Information. A Survey of Recent Results. Report to the U.S.

A.C.D.A. Final report on contract ACDA/ST-116, prepared by Mathematica, Princeton, N.J., September 1967, Chapter III, 287-403.

- : Repeated Games with Incomplete Information. The Zero-Sum Extensive Case. Report to the U.S.A.C.D.A. Final report on contract ACDA/ST-143, prepared by Mathematica, Princeton, N.J., November 1968, Chapter II, 25-108.

Mertens, J.F., and S. Zamir: The Value of Two-Person Zero-Sum Repeated Games with Lack of In- formation on Both Sides. International Journal of Game Theory, 1, 1971, 39-64.

Ponssard, J.P. : Zero-Sum Games with "Almost" Perfect Information. Management Science 21 (7), 1975, 794-805.

Ponssard, J.P., and S. Zarnir: Zero-Sum Sequential Games with Incomplete Information. Interna- tional Journal of Game Theory 2, 1973, 99-107.

Zamir, S.: On the Relation Between Finitely and Infinitely Repeated Games with Incomplete In- formation. International Journal of Game Theory 1, 1992, 179-198.

Received December 1978

Références

Documents relatifs

Personalized diagnostic technologies will aid in the stratifica- tion of patients with specific molecular alterations for a clin- ical trial.. It has been shown in the past

Abstract: Consider a two-person zero-sum stochastic game with Borel state space S, compact metric action sets A, B and law of motion q such that the integral under q of every

Starting with Rosenberg and Sorin (2001) [15] several existence results for the asymptotic value have been obtained, based on the Shapley operator: continuous moves absorbing

• For other classes of stochastic games, where the existence of a limit value is still to establish : stochastic games with a finite number of states and compact sets of

observations, Geophys. Reversing causality, we can use concentration measurements to improve our knowledge about surface fluxes by adopting a probabilistic approach, usually

To address this issue, a small-scale tracking test was developed and applied to twenty-one sealants held at temperatures between 46°C and 82°C, typical temperature maxima in

The phenomenon of violence against children is one of the most common problems facing the developed and underdeveloped societies. Countries are working hard to

Extensions of these results to games with lack of information on both sides were achieved by Mertens and Zamir (1971).. Introduction: Shapley Extensions of the Shapley operator