A NNALES DE L ’I. H. P., SECTION B
B ERNARD D E M EYER
From repeated games to brownian games
Annales de l’I. H. P., section B, tome 35, n
o1 (1999), p. 1-48
<http://www.numdam.org/item?id=AIHPB_1999__35_1_1_0>
© Gauthier-Villars, 1999, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
From repeated games to Brownian games
1
Bernard DE MEYER
(1)
Laboratoire de Probability, Universite Pierre et Marie Curie, Tour 56, 4, place Jussieu, 75252 Paris cedex 05, France
E-mail: [email protected]
Vol. 35, n° 1, 1999, p. 48 Probabilités et Statistiques
ABSTRACT. - The
subject
of this paper is related to theanalysis
of theconvergence rate of the value of the n-times
repeated
zero-sum game withone sided information and full
monitoring. Particularly,
the ultimate aim of this work is theproof
of the existence of anasymptotic expansion
forthis value vn : .
As
suggested
in the conclusion of[6],
thefunction ~ appearing
in thisexpansion
should beregarded
as the value of a"continuously repeated"
game.
In this paper, we propose and
analyze
a game of this kind. In this game, thestrategies
areprogressively
measurable processes on the filtrationgenerated by
a Brownian motion and thepayoff
function is definedby
useof the
Ito-integral.
Our main result is theproof
of the existence ofoptimal strategies
for bothplayers
in this game. @Elsevier,
ParisRESUME. -
L’ objet
de cet article est lie al’analyse
de la vitesse de convergence de la valeur vn desjeux
a somme nulle n foisrepetes
eta information
incomplete
d’un cote. Plusprecisement,
le but ultime de(1 )
This paper is a revised version of the third chapter of my Ph. D. thesis [3]. I am indebtedto Professor J.-F. Mertens, my adviser, for many fruitful comments and discussions. My sincere
thanks go to him. I also want to thank an anonymous referee for his very carefull reading and
his many suggestions on the presentation of the paper.
AMS classifications : 90020; 93E05; 35J60.
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques - 0246-0203
Vol. 35/99/0l/@ Elsevier, Paris
ce travail est la preuve de l’existence d’ un
developpement asymptotique
pour vn :
Comme le
suggerait
la conclusion de l’ article[6],
lafonction 1b qui apparait
dans ce
developpement
devrait etre consideree comme la valeur d’un"jeu repete
entemps
continu".Dans cet
article,
nous definissons etanalysons
un teljeu.
Lesstrategies
des
joueurs
y sont des processusprogressivement
mesurables sur unefiltration
generee
par un mouvement Brownien et la fonction depayement
fait intervenirFintegrale
de Ito de ces processus. (c)Elsevier,
Paris1. INTRODUCTION
The notion of Brownian games that will be introduced in this paper is
intimately
related to theanalysis
of theasymptotic
behavior of the valuevn (p)
of the n timesrepeated
zero sum game with one sided information andmore
specifically
with the existence of thefollowing asymptotic expansion
for v~
(p) :
As it will be recalled in the next
section,
twotypes
ofarguments
werepreviously
invoked to prove the existence of such anexpansion:
on onehand,
the recurrence formula for vnyields
in the limit a Partial DifferentialEquation (PDE) for 03C8
and the main result of[5]
states that if a smoothfunction
f
withappropriate boundary
conditions satisfies to thisPDE,
then( 1 )
holdswith ~
=f.
This PDE is ingeneral strongly
non linear and the existence results of the PDE literature do notapply
to thisequation.
Inparticular
caseshowever,
this PDE can be solvedexplicitly
and its solution is then related to the normaldensity.
On the other
hand,
for theseparticular
cases, theexpansion ( 1 )
isproved
in
[6]
as a consequence of the central limittheorem, giving
in this way a rational to the appearance of this normaldensity.
Annales de l ’lnstitut Henri Poincaré - Probabilités et Statistiques
The
approach
initiated in this paper is anattempt
to prove the existence of solutions the PDEproblem
in thegeneral
non linear caseby
aprobabilistic
argument.
Morespecifically,
we will introduce in thenext
section the notion of Brownian games as a kind of limit in continuous time of the n timesrepeated
games.These Brownian games are stochastic differential games where two
completely antagonistic players
control themartingale
term of a diffusionprocess. Part of the interest of this paper comes from this
particular
feature ofthe Brownian games: "stochastic" differential games are
usually
consideredas "classical" differential games with an additional uncontrolled random
perturbation.
The next
part
of this paper ismainly
devoted to the definition of the Brownian games and to theunderlying
intuition. We alsopresent
there the main resultsconcerning
these games that will beproved
in theremaining parts.
In
part 3,
weessentially
prove that the Brownian games have a value’ljJ,
and bothplayers
haveoptimal strategies.
The recursive structure of the Brownian games and the Markovianproperties
of theseoptimal strategies
are
analyzed
inpart
4.In the last section of the paper, we prove that the
function ~
is a kindof
generalized
solution of the above PDE:if 03C8
was a smoothfunction,
it would
satisfy
a Bellmantype equation
that coincides with the above mentioned PDE.The
regularity
of thefunction ~
will beproved
in aforthcoming
paper[7]
under a strict
ellipticity
condition and thevalidity
of( 1 )
will then follow.2. DEFINITION OF THE BROWNIAN GAMES AND MAIN RESULTS
2.1. On the
history
of theproblem.
Therepeated
zero-sum game withone sided information and full
monitoring
was firstanalyzed
in the nowclassical paper
by
Aumann and Maschler in 1966(see [ 1 ]). They
deal therewith the
following
n-timesrepeated
gameDEFINITION 2.1.1. - For a
finite
set will denote in thefollowing
the
simplex of probability
distributions on B.Let J( be the set
of the
Kpossible
statesof nature.
To each stateof nature
k
corresponds
anelementary
one shot zero sum game. The action setsfor player
1 andplayer
2 in this game arerespectively
denotedby
I and,7.
1-1999.
They
containsrespectively
I and J elements. Apair (i, j ) of actions
in I X~’
yields
apayoff a ~ for player
1 in state k.For p E
0 (JC),
the gameT n (p) proceeds
asfollows:
Nature choosesinitially
and oncefor
all a state k at random with thelottery
p.Player
1is
informed by
nature about the true statek,
whileplayer
2 is not. At eachsuccessive
stage
qofr n (p) (q
=1,
...,n),
theplayers choose, independently of
eachother,
an actioniq
andjq
in theirrespective
action set.They
arethen
informed
about the choiceof
theiropponent.
At the endof
the gameplayer
2 pays theamount n 03A3nq=1 akijq
toplayer
1. We assume that bothplayers
are awareof
the abovedescription of
the game. Inparticular, they
know the
payoff matrices,
as well as the initialprobability
distribution p.Let us remark here that
only players’
actions are communicated at eachstage,
but not thecorresponding payoffs.
As a consequence,player
2ignores
the
elementary
game he isactually playing
up to the end ofr n (p).
The value
vn(P)
ofrn (p)
is known(see
e.g.[ 12] chapter V)
to bea concave function of p which
decreases,
as nincreases,
to a limitvex:;(p)
=cav(u) (p),
whereu(p)
denotes the value of the average one shot gameG(p)
withpayoffs E~=1 cav(u)
denotes theconcavification of this function on the
simplex 0 ( JC) .
Theproof
of thisconvergence
given
in[ 1 ]
is based on a bound on the variation of a boundedmartingale
and leads to the result :=Vn (p) - Vex:; (p) ~
for apositive
constant cdepending
on thepayoffs
In
[13],
:== isproved
to converge to a limit relatedto the normal
density
for aparticular two-states-of-nature-game, leading
tothe
asymptotic expansion ( 1 )
for vn .The
reasoning
of Mertens and Zamir isgeneralized
in[5]
to a broader class of games calledDEFINITION 2.1.2. - The games in are square games
(i. e.
I =J)
such that the
completely
mixed(i.e.
Vi : >~) strategy
~o is theunique optimal strategy for player
1 inG(p), Vp,
and such that the valueu(p) of
G(p)
isidentically
0.The next lemma is
proved
in[5],
Section 4:LEMMA 2.1.3. - For games in
0°0, player
2 has aunique optimal strategy
p(p)
inG(p).
Thisstrategy
iscompletely
mixed(i.e.
>0, Vj)
andthe
mapping
p :0(lC) -~ belongs
toC°°, since, Vj, Pj(p)
may beexpressed
as aquotient of
twopolynomials
in p.Annales de l’lnstitut Henri Poincaré - Probabilités et Statistiques
Furthermore,
theoptimal strategies
inG(p)
areequalizing.
With thevector notations aij :=
(a~
..., and aiT := Tjaij, this means:where
(pia)
stands for the Euclideanproduct
inIRK : (pia)
The
importance
of the class0°o
is stressed in another paper of Mertens and Zamir[14]:
in case I = J = K =2, they
prove a convergence faster than2014
outside0°0 .
As it follows from lemma 4.3 in
[12], chapter V,
there is a recurrence relation between the functions andvn(.).
Since = forgames in
0°0 ,
this recurrence formula for vn becomes one for~n
that canbe written in an abstract way: where
Hn
is a functionaloperator. So, heuristically,
if the were to converge, theirlimit ~
shouldbe
"nearly
a fixedpoint
ofHn,"
as n tends to oo.By giving
themeaning
=
O {n-3~2 )
to theexpression " f
isnearly
a fixedpoint
of
Hn",
the converse of this claim isproved
in[5].
Moreprecisely:
iff
is such a fixedpoint,
bounded on thesimplex A(/C)
andvanishing
atits extreme
points,
then~~
convergesuniformly
tof
with a rateof,
atleast,
Whenreplacing f by
itsTaylor expansion
around p inHn ( f ) (p),
the "fixedpoint" property
off
becomes a Partial DifferentialEquation (PDE). Conversely,
anysufficiently
smooth solution of thisPDE,
withappropriate boundary
conditions will be a "fixedpoint"
forHn
andwill therefore be the limit of the as
proved
in[5],
theorem 5.4.This PDE is
strongly
non linear: it involves the inverse of the Hessian matrix off.
It is however made moreappealing by introducing
Fenchelduality:
DEFINITION 2.1.4. - The Fenchel
conjugate of
afunction f :
D CIRK ~ IR is defined
as thefunction fO : IRK
~ 1R that maps x to-.=
We are then led to consider a dual game
r~
ofrn,
whose valuevn
isthe Fenchel
conjugate
of the concave function vn .DEFINITION 2.1.5. - For x in
IRK, rn (x) is
thefollowing
game:Player
1chooses k E
J’C,
withoutinforming player
2.Next,
as inT n,
theplayers
haveto select
successively
their actions(iq, jq).
Thefinal payoff player
1 has toVol. 35, n° 1-1999.
pay to
player
2 is now~~ - n ~~,-~ (Player
1 is here theminimizer
and ~~ has to be
interpreted
as apenalty for having
chosen the statek. )
If the
factor ~
isreplaced
with2014=
in thepayoff
of this dual game, weget
an other game whose value~
is Inparticular,
for games in0°~ , ~n
is the Fenchelconjugate
ofThe interest for
introducing
a dual game comes from thefollowing
observation: In the
primal
gamern,
as underlined in[19], player
1 hasoptimal strategies
that are Markovian in thefollowing
sense: at eachstage,
thesestrategies only depend
on the aposteriori probability
on ICgiven
the
past
moves. The process of these aposteriori
distributions is thusMarkovian,
since itonly depends
onplayer
1’s mixed moves.In
counterpart
the dual gamer~
isparticularly
well suited toanalyze
the"Markovian" behavior of
player
2 asemphasized
in[6]
section 4.When restricted to concave upper semi continuous
functions,
the Fenchel transform is anisometry
in the uniform norm. The rate of convergence of vn may then beanalyzed through
that ofvn
and the convergence ofthrough
that of
~.
The recursive formula for can be transformed in thefollowing
recursive formula
for ’lj; (see
Lemma 5.1 in[5]): ~n+1
= wherethe dual recurrence
operator Hn
is defined asAs in the
primal model, proving
the convergence of~
turns out to beequivalent
tofinding
a functionsatisfying ~03C8* - H*n(03C8*)~~ =
O( n -3/2).
Thisfinally
leads to thefollowing
dual PDE forwhere is considered as a column vector and
~*"(x)
stands forthe Hessian matrix
of ’ljJ*.
Thisequation
isquasi-linear elliptic (see [10],
p.
257)
and is easier to handle than theprimal
one. Thisequation
is solvedin
[5],
section 6 for a sub-class of0°0 . Surprisingly,
the solutionthen obtained is related to the normal
density
ashappened
to be the casein Zamir’s game
(see [13]).
The appearance of the normal
density
in a subclass ofRao
isexplained
in[6]
as follows:1 )
We first prove(see
theorem 3.1 in[6]) that,
both in andT n (p),
the
knowledge
ofpast
actions( j 1, ... , jq-l)
ofplayer
2 is irrelevant for bothplayers
whenplaying stage
q.Annales de l ’lnstitut Henri Poincaré - Probabilités et Statistiques
This allows the
players
to consideronly
their "reduced"strategies (see
section 3 in
[6]).
Forplayer 2,
a behavioral reducedstrategy
is an n-uple (1(2022),...,n(2022))
whereTq(.)
is amapping
fromIq-1
toa (,7 ) .
Amixed reduced
strategy ~
forplayer 1
in the normal form gamern (x)
is a
probability
distribution over In xJ’C,
while inr n (p)
themarginal
distribution of E on 1C must coincide with p.
Using
thefollowing
notationsthe
payoff
ofplayer
2 inrn (x)
is then written as(see equation (4)
in[6]):
Similarly,
thepayoff
ofplayer
1 in isgiven by:
In
0°0 ,
we have = so, relation(5)
leads to:where,
as in[5],
denotesmin{x1,
...,xK}.
In thisformula,
7r is themarginal
distribution of ~ on In and the minimization over the conditionalprobabilities
on 1C leads to the function~y* .
The min and maxoperators
commute in this formula.
2)
For games in theoptimal strategy
7r* forplayer
1 in formula(7)
written as a min max isproved (see
theorem 5.2. in[6])
to be the i.i.d.ao-distribution: 7r* (i1, ... , in)
:== This facttogether
with theparticular shape
of the vectors aij inRuo
leads to:where the
Aq’s
are i.i.d. random vectors under 7r*. The Central Limit Theoremjustifies
then the appearance of the normal distribution.35, n°
2.2. An heuristic
approach
to the Brownian games. To introduceheuristically
thelimiting
gameF*,
let us first rewrite formula(7)
in away that
emphasizes
the role of x*: aobeing completely mixed,
anyprobability
distribution 7r on In has adensity y
withrespect
to 7r*. So:The central limit theorem indicates
that, asymptotically,
the distribution of the sum of nindependent
F-distributed random variablesonly depends
on the
expectation
and variance of F.This
suggests
thatreplacing
the sum inequation (8) by
a sum of randomvectors with the same conditional
expectation
and covariance matrix shouldnot
crucially
affect the final result.Since
Tq ( ~ )
is a function of( i 1, .. , , i q _ 1 ) ,
weget
withequation (2):
Furthermore,
the covariance matrix of conditional on(iI,
...,iq _ 1 ),
under the
probability
x* isaTiq (vectors
are considered in this paper as column matrices andaT
denotes thetranspose
ofa).
Let us then
replace
the random termaiqTq by zqiaiq o,
wherezq are
independent
I-dimensional standard normal random vectors.Finally, z g
may be seen as the increment of the 7- dimensional Brownian motion W on a filtration E[0,1].
In this way,we
get heuristically:
In
equation (8),
Tqonly depends
on thepast history (i1, ... , zq_1).
Thisfeature will be taken into account in formula
(9) by assuming
it to be0- -measurable. Similarly,
y, thatdepends
on(iI,
...,in)
in(8),
will be.~’1-measurable
in(9).
Letting
now T denote the{~-progressively
measurablestep
processT(W, t)
:= ifg n 1 t ~
relation(9)
becomes:Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
The
limiting
gameF*(x)
isfinally
obtainedby dispensing
with thecondition on T to be a
step
process and thestrategy
space forplayer
2is chosen to be the set of
progressively
measurable processes T valued in0 (,7) .
Before further transformations of formula
(10),
let us observe here that it formalizespretty
well theoptimization problem player
2 is confronted:The infimum
appearing
in this formula is the essential infimum of the random variableV" (x - f01
soplayer
2 has to controlthe
martingale x - f; dWi,t
in such a way that it will remain up to time 1 in the orthantRa := {~
EIRK : ~*(~) ~ a ~
for thehighest possible
a.We can still
modify
formula(10)
in order to obtain apayoff
functionsimilar to
(5),
which is linear in bothstrategies:
Since-y* (x) _
we have:
where P denotes the set of
~-measurable A(/C)-valued
random vectors.Observing
that yp is valued inIRK+
and thatE[yp]
E0(1C)
sinceE[y]
=1,
we
finally
have:In
analogy
with formula(6),
we also should have:Let us stress here that the above link between the
finitely repeated
games and the Brownian games ispurely
heuristic and thesymbol ~
in the aboveformulas has
only
a formalmeaning:
In formula(9),
sinceW2, ~ -
is
normally distributed,
the random variable to theright
of y in(9) ranges
all over JR.
Therefore,
if no further condition isimposed
on y, the minappearing
in this formulaequals -oo !
2.3. Definitions and main results. Let T be in
[0,1],
let be afiltration on a
probability
space(Z, 0, P) .
Vol. 35, n° 1-1999.
DEFINITION 2.3.1. - For a E
(1, oo),
willrefer
hereafter
to the setof
IRn-valued(0t)-progressively
measurable processes csuch that :=
E~(, f~,
oo, where standsfor
theEuclidean norm
of
Ct in IRn.If
c and c’ are suchthat ~c-c’~M03B12
=0,
wesay that
c’
is amodification of
c and we write c ~ c’.M2 (
denotes the
quotient of M03B12 ( by
theequivalence
relation rv.These notations are shorten to
M03B12
andM03B12
when thefiltration
as wellas the
target
space IRn areclearly fixed by
the context.A
strategy
T forplayer
2 in formulas( 11 )
and( 12)
is valued inand is therefore in Va > 1. The
Burkholder-Davis-Gundy inequality (see
e.g. theorem 4.1 inChapter
IV in[ 15])
indicates that theIt6-integrals appearing
in these formulasbelong
to and thecorresponding expectations
are thus finite for all Y in as it results from Holder’sinequality.
We are then led to the
following
definitions of the Brownian games where theorigin
T of the time interval[T, 1]
is taken as aparameter
toanalyze
later their recursive structure:DEFINITION 2.3.2. - Let T be in
~0, l~,
let be afiltration
on aprobability
space(Z, P),
and let W be an I-dimensionalFt-Brownian
motion.
For a process T in
M2 (
weadopt the following
notation:For p E
0 ( 1C ),
theprimal
Brownian gameY ( p, T)
isdefined by
thefollowing
elements:The
strategy
spacefor player
1 is the setyp of IRK+-valued
randomvectors Y in with
E[Y]
= p.The
strategy
space Tfor player
2 is the setof 0394(J)-valued Ft- progressively
measurable processes T.The maximizer in the
primal
gameY(p, T)
isplayer
1 and hispayoff
is
given by
DEFINITION 2.3.3. - For x E and T E set
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
The dual Brownian game
r*(x, T)
is thendefined
asfollows:
Player
l ’s strategy space isy
:=(i. e.
the setof IRK+
-valuedrandom vectors Y in with
E[Y]
E0(lC)).
Player
2’s strategy space is T as in theprimal
game.Player
2 is here the maximizer and hispayoff T)
isgiven by
To
simplify
thepresentation
of the main results of this paper, let us remind here some fundamental definitions of gametheory:
We say that astrategy
Y ofplayer
1guarantees
apayoff a
inr(p,T)
if VT E T :gT(Y, T)
> a.The least upper bound of the
payoffs player
1 canguarantee
isclearly
This
quantity
is referred to assup inf gT .
ForE >
0,
anE-optimal strategy
ofplayer
1 is astrategy
thatguarantees
sup
inf gT -
E. It follows from the definition of supinf gT
that therealways
exist
E-optimal strategies
for E > 0. The0-optimal strategies,
if any, are calledoptimal strategies.
When there exists such anoptimal strategy,
thesup inf of the game is in fact a max inf.
Similar definitions hold for
player
2 who wants to minimize thepayoff.
The lowestpayoff
he ever couldguarantee
isinf sup
gT :=infTET supY~yp gT (Y, T).
Theinf sup
of a game isalways greater
thanthe sup inf. In case both
quantities
areequal,
the game is said to have avalue v
equal
to v := inf sup gT = sup inf gT.These definitions also translate in the framework of the dual Brownian games, but the maximizer is here
player
2.The main results of this paper are
presented
in the next five theorems.The first one deals with the existence of a value for the Brownian games
as well as of
optimal strategies
for bothplayers.
THEOREM 2.3.4. - For all p E
4i($i)
and x EIRK,
VT E[0,1].
thegames
r(p, T)
andT * (x, T)
have a value that will berespectively
denoted’ljJ(p, T)
and Moreover bothplayers
haveoptimal strategies.
Inother words:
These values are
independent of
thefiltration
and on theBrownian motion W on which the games are
defined.
The two first
equalities
in the next theorem indicate thatprimal
and dualgames are indeed dual models and their values are Fenchel
conjugates
ofeach other.
Relations
(21)
and(22) emphasize
the fact that the Brownian gamestarting
at time T E[0, 1]
is in fact arescaling
of the Brownian gamestarting
at 0:THEOREM 2.3.5. - The
function T) (respectively ’lj;* (., T))
is concaveand continuous on
0(7C) (resp.
on Bothfunctions
are linkedby
theduality relationships:
They fulfill further
thefollowing relations,
bT E~0,1 ) ,
Vx EIRK, bp
Eð.(1C):
The next result claims that
player
1 hasoptimal strategies
Y in theBrownian games that are
uniformly
bounded in La andcompletely
mixedi.e.
P[Y
=0]
= 0.THEOREM 2.3.6. - There exist
positive
numbers a > >0, C,
""depending only
on thepayoffs
suchthat, for
all p in0(~’C) (respectively
dx E
IRK), player
1 has anoptimal
strategy Y inT (p, T) (resp.
inr*
(x, T )) fulfilling:
where u :==
( l, ... ,1 )
EIRK.
Inparticular: P[Y
=0]
= 0.The next two theorems are devoted to the recursive structure of the dual Brownian games and its consequences: More
specifically,
astrategy
T ofplayer
2 inr* (x, 0)
can be viewed as apair (T~, T’ )
whereT
andT’
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
denote the restriction of T to the
respective
times intervals[0, T)
and[T, 1].
If X denotes then = and the
payoff
T)
in can be written aswhere YT is defined as The conditional
expectation
in theabove
equation
is in fact thepayoff
inr* (X~,o,T (T~ ), T ) resulting
fromthe
strategies
andT’ .
Thissuggests
that theplayers
will manage to chose forT’
andYjYT
anoptimal strategy
inTwo conclusions can be derived from this kind of
argument:
on onehand, assuming
that all theoptimal strategies
T* inr* (x, T )
start in thesame way, i.e. =
R(x, T)
for anappropriate
functionR, then,
if T isoptimal
inr* (x, 0), T’
isoptimal
in r*(X~,o,T (T~ ), T )
and thus TT =T$
should be
equal
toT ) .
Thisbeing
true for all timesT,
theprocess should
satisfy
a diffusionequation.
This is the content of theorem2.3.8,
where the function R isspecified.
On the other
hand, if,
after timeT,
theplayers play
anoptimal strategy
inr* (Xx,o,T (T~ ), T ), they guarantee
apayoff
r*(x, 0) and
we may thenconjecture
formula(25). Moreover,
sinceoptimal strategies
inT* (x, 0)
should also beoptimal
in(25),
we conclude withtheorem 2.3.6 that
player
1 has acompletely
mixedoptimal strategy
in the second line of(25),
from which(26)
can be derived.THEOREM 2.3.7.
Furthermore, if
T* isoptimal
inr* (x, 0),
thenfor
all t E[0,1],
Let
~~* (x, T)
denote the supergradient
at x of the concave function(., T).
Theduality
relation(19) implies
inparticular: 0 # (x, T)
C0(1C).
We can then introduce thecorrespondence
R that maps(x,T)
EIRK
x[0,1)
toR(x,T):={03C1(p)|p
E~03C8*(x,T)},
where p was introducedin lemma 1.2.3.
THEOREM 2.3.8. - The
correspondence
R issingle
valued and cantherefore
viewed as
a function. This function
mapscontinuously IRK
x[0, 1)
onIf T
isoptimal for Player
2 inr* (x, 0),
then the process T and the processr,
defined
as rt :==t)
aremodifications of
each others. As aconsequence, the process X
defined
asXt
:- is a solutionof
thestochastic
differential equation:
3. EXISTENCE RESULTS FOR UNBOUNDED BROWNIAN GAMES
3.1. The behavioral form of the Brownian games. In this
section,
weprove that Brownian games are, as stated in theorem
2.3.4, independent
ofthe filtration and the Brownian motion on which
they
are defined.This result will allow us to consider the games in their behavioral form.
Let W be a I-dimensional
Ft-Brownian
motion on[T, I],
let{3j
denotesthe process
03A3I03C30,i03B1ijWi and
let9t
be Ey;T
st}.
Wewill then refer to the
9t-adapted strategies
ofplayer
2 and the91 -measurable strategies
ofplayer
1 as the~-strategies.
The set of thesestrategies
will berespectively
denoted and9Y.
We have then the theorem:THEOREM 3.1.1.. - In the Brownian games
T (p, T)
andr* (~, T) defined
on
Ft
andW,
there is no lossfor
theplayers
to restrict themselves to their~-strategies.
In otherwords,
bothplayers
can guarantee the samepayoff
in the Brownian games and in the
corresponding ~-games
wherethey
arerestricted to their
G-strategies. Furthermore,
the~-optimal strategies
in theG-games
are still~-optimal
in thecorresponding
Brownian games.Proof. -
We will prove the result forr(p, T),
the sameargument
holds forr*(x, T).
Let x and
~f
denoterespectively
theimage
of T and97 by
themapping XT,i
introduced in(13).
Obviously,
theoperator E {~ ~ ~1 ~
maps3~p
ontoSimilarly,
if T ET,
is the value at time 1 of theGt- optional projection
of the processXT,t (T)
and can,according
toproposition
7 in
[7],
be written as(T’)
where T’ is theGt-predictable projection
of T.
7-~
is thus one version ofand,
assuch, belongs
a. s. toA(J).
The process T’ is thus a
9-strategy
ofplayer 2,
andE{~~~1~
maps therefore JYonto 9X.
The result follows then from these onto
properties
of themapping E {~ ~ ~1 ~ .
Indeed,
thepayoff guaranteed by
astrategy
Y ofplayer
1 is then alsoAnnales de l’Instatut Henri Poincaré - Probabilités et Statistiques
guaranteed by
its9-strategy
sinceThis indicates that
Moreover,
if Y Eguarantees p -
Eagainst any X
E Y still doesso
against
all X EX,
since =Similarly,
let T be astrategy
ofplayer
2 and let T’ be the~-predictable projection
of T, thenTherefore,
and if a
~-strategy
T ofplayer
2guarantees
him apayoff
lessthan ~
+ Eagainst
the~-strategies
ofplayer 1,
it does soagainst
allstrategies
Y ofplayer 1,
since]
= DIt follows from the last theorem that the
players
may restrict themselves to theirSt-measurable strategies,
T st ~ .
Sincethe
St-Brownian
motion W isisomorphic
to the Wiener Brownian motion defined on the Wienerprobability
space, we conclude that:COROLLARY 3.1.2. - The
payoff a player
can guarantee in a Brownian game does notdepend
on theprobability
space(Z, P),
nor on thefiltration
neither on the
Ft-Brownian
motion on which the game isdefined.
The
~-strategies
Y ofplayer
1 inr(p, T)
are random vectors inwith
E [Y]
= p. Since the Brownian motion has thepredictable representation property
on the filtration(see
e.g. theorem 3.5 inchapter
V of
[15]),
the random variable Y may be written as theIto-integral
Y =
p +
wheref~
is a(K
x~-dimensional
processin If we define the K-dimensional
process b2
asit follows from the definition of
{3j
that Y = p +;
It results from lemma 2.1.3 that = O. It is then convenient to introduce the
following
notations:DEFINITION 3.1.3. - From now on,
A
will denote the linear spaceof
b =
(bl , ... , bI)
in such that03A3i~I03C30,ibi
= 0. Forb,
b’ EA
we set
«b|b’»
:== andfor
T EIRJ,
wedefine a(T)
asa(T) := (alT, ... , a2T).
SinceVi,
>0, ((.1.))
is a scalarproduct
onA and,
due toequation (2), a(.)
is a linearmapping from IRJ
toA.
With these
notations,
thepayoff gT (Y, T)
becomesWe conclude then that the game
F(p, T)
isequivalent
to its behavioral form:DEFINITION 3.1.4. -
For q
E1R~
and aA-valued 0t-measurable
processb,
we setA behavioral
strategy for player
1 inr(p, T)
is aA-valued Ft-measurable
process b in such that
YT,1(p, b) belongs
a.s. toIR+.
Player
l ’spayoff 9T(b, T)
isgiven by
Similarly,
a behavioralstrategy for player
1 inr* (x, T)
is apair (p, b),
where p E0(J’C)
and b is a behavioralstrategy
inI‘(p, T).
Thebehavioral
form of player
2’spayoff
in T *(x, T)
is:( (p, b), T) - (pix) -
’ ,
3.2. The unbounded Brownian Games.
Player
2’sstrategy
space in the Brownian games isclosed, bounded,
and convex inM2 .
It is thereforecompact
in the weaktopology
of the reflexive spaceM2 .
Thepayoff
functions
gT (Y, T)
and are affine and continuous in bothstrategies.
We could thenapply
Sion’s theorem(see [18])
to infer thatthe Brownian games have a value and
player
2 anoptimal strategy.
The main
difficulty
of thisapproach
is nevertheless theproof
of theexistence of
player
l ’soptimal strategies.
Thisdifficulty
comes from theasymmetry
between bothplayers: player
2’sstrategy
space isbounded,
whileplayer
l’s is not.Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
To
bypass
thisdifficulty
it istechnically
convenient to firstanalyze auxiliary
games, referred to as the unbounded Brownian games, where bothstrategy
spaces areunbounded,
and then to prove theequivalence
betweenbounded and unbounded games.
A
strategy
forplayer
2 in the bounded Brownian games takes values in0 (,7) .
In the unbounded games, astrategy
forplayer
2 will be a process T in for a~3’
E( 1, oo )
to be fixedlater,
such that Tj = 1.We have then to reduce
player
1strategy
space inr(p, T)
for thepayoff 9T (b, T)
to be defined. To thisend,
it is convenient to introduce thefollowing
definitions:DEFINITION 3.2.1. - In the
following,
~ and D will denote the setof vectors
h E
IRJ
such thathj
isrespectively equal
to 1 and 0.We also set
E1
:==a(D)
and theorthogonal
space to[1..
inA
withrespect
to((.1.))
will be denoted [. Since isjust
a translateof ~~-,
its intersection with ~ reduces to a
single point e°.
In thefollowing T°
willdenote a
point T°
in ~l such thata(T°)
=e°.
vectors in ~ and ~-valued processes are
referred
to asequalizing.
Similarly,
vectors in[1..
and~1-valued
processes arereferred
to asortho-equalizing.
The set
of equalizing (resp. ortho-equalizing)
processes inM2
will bedenoted [a
(resp. ~l~a).
We can then see the
strategy
T ofplayer
2 as the sumT°
+ 8 of theconstant process
T°
and a D-valued process 8 inSimilarly
thestrategy
b ofplayer
1 can be written in aunique
way as the sum c + z of anequalizing
process c and anortho-equalizing
process z.We have then
since the
expectation collapses according
to thedefinition of ~.
The constant process
T°
is inM2 °,
the firstexpectation
in(29)
is definedfor all
equalizing
processes c in for all a E(1, oo).
The secondexpectation
in turn will be finite for allortho-equalizing
process in where1/a’
+1//3’
=1,
since T EM~.
This leads us to the
following
definition of unbounded Brownian games in behavioral form:Vol. 35, n° 1-1999.