HD28
.M414
0. ^.s ^1^
.rn:^?'^^
^.
amiotm
WORKING
PAPER
ALFRED
P.SLOAN SCHOOL
OF
MANAGEMENT
Continuous Time Stopping
Gaineswith
Monotone Reward Structures
Chi-fu Kuang
and Lode Li
To
appear
in"Mathematics of Operations
Research'
\-rP
#2109-89-EFA
Revised
March
1989
MASSACHUSETTS
INSTITUTE
OF
TECHNOLOGY
50
MEMORIAL
DRIVE
Continuous Time Stopping Games with
Monotone Reward Structures
Chi-fu Kuang
and
Lode
Li
To
appear
in"Mathematics of Operations Research"
Continuous
Time
Stopping
Games
with
Monotone
Reward
Structures*
Chi-fu
Huang
and
Lode
LiSchool
ofManagement
Massachusetts
Institute ofTechnology
Cambridge,
MA
02139
July
1987
Revised,
March
1989
ABSTRACT
We
provethe existence of aNash
equilibrium ofa classofcontinuous time stoppinggames
when
certain monotonicity conditions are satisfied.
'Thisis an extensively revised version of thefirst halfofapaperentitled "Continuous TimeStoppingGames"
by the authors. They would like to thank conversions with Michael Harrison and Andreu
Mas
Colell. Threeanonymous referees and seminar participantsat Princeton University' and Yale Universit>' provided useful com-ments. Remaining errors areofcourse the authors'.
1 Introduction 1
1
Introduction
The game
theoretic extension of the optimal stopping theory in the discrete time framewori<was
initiated byDynkin
[1969] in an analysis ofaclass oftwo-personzero-sum
stopping games.The
extensions of Dynkin'swork
includeChaput
[1974], Elbakidze [1976],Neveu
[1975], andOhtsubo
[1986] in discrete timeand Bensoussan and FViedman
[1974],Bismut
[1977, 1979],Chaput
[1974], Kiefer [1971], Krylov [1971], Lepeltierand
Maingueneau
[1984],and
Stettner [1982] in continuous time.Non-zero-sum
stoppinggames
havebeen
studied byMamer
[1986]and
Ohtsubo
[1986] indiscrete time
and
byBensoussan and
Friedman
[1977],Morimoto
[1986],Nagai
[1987],and
Nakoulima
[1981] in continuous time.The
widespreadand
potential use of stoppinggames
can be found in economics, finance,and
management
science.Examples
include the entryand
exit decisions of firms, job search,optimal investment in research
and
development,and
the technology transition in industries. (See Fineand
Li [1986],Mamer
and
McCardle
[1987],Reinganum
[1982] etc.) In fact,many
stochastic
dynamic games
inwhich
each player's strategy is a singledichotomous
decision at each time can be formulated as a stopping time problem.The
existing literature on continuous time non-zero-sum stoppinggames
mentioned
above, with the exception ofMorimoto
[1986], uses stochasticenvironments
that have theMarkov
property.Morimoto
[1986] considers cyclic stopping games.The
purpose of this paper is to provide an existencetheorem
forNash
equilibria for a classofnon-zero-sum non-cyclic stoppinggames
in anon-Markov
environment.We
basically extend the discrete time analysis ofMamer
[1987] to a continuous time setting.Some
properties of asymmetric Nash
equilibrium are also characterized.The
rest of this paper is organized as follows. In Section 2we
formulate an A'-personcontinuous time non-zero-sum stopping
game.
Reward
processes are optional processes thatmay
beunbounded
and
can take the value-oo
at t=
+co.A
martingaleapproach
isadopted
in Section 3 to
show
the existence of optimal stopping policies of players under fairly general conditions.The
existence of aNash
equilibrium ingames
withmonotone
payoff structures isproved in Section 4 by using Tarski's lattice theoretic fixed point theorem.
We
show
in thesame
section that, for asymmetric
stoppinggame,
there always exists asymmetric
equilibriumwhen
thereward
processes satisfy amonotone
condition. Moreover, asymmetric
equilibrium,when
it exists,must
be uniquewhen
reward processessatisfy anothermonotone
conditionand
are separable in a sense to bemade
precise.We
discusstwo
duopolislic exitgames
in Section 52
The
formulation 2to
demonstrate
thetwo
kinds of monotonicity conditions posited in Section 4. T}ic rewardprocesses of these
two
exitgames
also satisfy the separability conditionmentioned
above.2
The
formulation
Let (n, 7,
P)
be acomplete
probability spaceequipped
with an increasing family ofsub-sigma-field of
7
, or a filtration,F =
{Jj; tG
5?+}.We
shall denote the smallestsigma-field containing{7t] t
€
5R+} by Joo aridassume
thatF
satisfies the usual conditions:1. right continuity: 7t
=
Clg^^t ^>' ^^'^2. complete: 7q contains all the P-null sets.
We
interpreteachw
€
n
tobe acomplete description ofthe state of the world.The
filtrationF
models
theway
informationabout
the true state of the world is revealed over time. In a discrete time finite state setting, a filtration can be thought of as an event tree.Let 5R+ denote the extended positive real line.
An
optional timeT
is a functionfrom
n
into 3i+ such that{ujeQ:T{u>)
<t}
e
7tyte^+,
where
!R+ denotes the positive real line.An
optional timeT
is finite ifP{T
<
oo}=
1.An
optionaJ time
T
is said to bebounded
ifthereexists aconstantK
G
9?+ such thatP{T
<
K) =
1.Let
T
be an optional time.The
sigma-field 7t, the collection of events prior to T, consists ofall eventsA
E
7oo such thatAf]{T<t}e7t
yte^+.
A
processX
in thispaper
is amapping
X
: fi x 3?+ h-> SRmeasurable
with respect to7®B{yi+),
theproduct
sigma-field generated by7
and
the Borel sigma-field of !R+.A
processX
is said to be adapted, if X{t) ismeasurable
with respect to 7( V^G
5R+.The
optional sigma-field, denoted by 0, is the sigma-field onn
x5R+ generated byadapted
processes havingright-continuous paths (see, e.g.,
Chung
and
Williams [1983]).A
process is optional if it ismecisurable with respect to 0. Naturally,
any adapted
process with right-continuous paths isoptional. It is also
known
thatany
optional process isadapted
(see, e.g.,Chung
and
Williams[1983]). For any process
X, we
writeX+{t)
=
max[X(0,0]
and A'"(0
=
max[-A'(0,0].
itis clear that
X{t)
=
X'^{t)-
X~{t). For an optional time T,we
will useX{T)
to denote the2
The
formulation 3We
consider an A' player s£o;)pinffyame.
Players areindexed by »=
1,2,...,A'.The
payoffs ofthisgame
are described by a family of optional reward processes:z,(u;,<;T_.);<e5R+,t=
1,2, ...,7V,where
r_, runs through all (A^-
l)-tuple of optional times. Interpret z,{uj,t;T_,) to be the payoff that player i receives in statew
when
his strategy is T,and
T,{u)=
t, if T-, are the strategiesemployed
by players other than t.Note
that the value ol a reward process "at infinity" specifies the payoff to a player if henever stops.
There
are at leasttwo
possibilitiesdepending
on the nature of thegame.
First, ifa player does not receive any
reward
when
he never stops,we
simply set 2,(w,+oc;T_,)=
0.Second, ifthe valueat time t ofreward process represents
accumulated
payoffs a player receivesfrom
time to time t if he does not stop until time t, then a natural selection of the value at infinity is2,(w,+oo;T_,)
=
limsup,_+^2,(a;,t;r_,).For an
example
ofthe second case, see Section5. In the abovetwo
cases, it is easily verified that the reward processes as defined on 5R+ are optionaland
their values at infinity aremeasurable
with respect toTea-The
followingassumptions
on reward processes will bemade
throughout
this paper:Assumption
1 For any[N
-
l)-tupl£ optional times r_,and
anybounded
optional time T,we
have E[2,{T,;T_,)]>
-co. Moreover, there is a martingale^m
=
{m{t);t£
9?+} such that "ilt^.O>
z,{u,t-T-,) for all {oj,t)e
fi x^+.
Assumption
2The
reward process 2,(w,/;T_,) is upper-semi-continuous on the rightand
quasi-upper-semi-continuous on the left}Assumption
1 ensures that-oo <
sup E[z,{T;T.,)]<
+oo
reT
"The process
m
=
{m{t);t G»+}
is a martingale if it is adapted to {I,;i £ !ft+}, £||m(/)|J < oo for allt€S-)-, and £[m(t)|J,]
=
m(s) a.s. for all t > s. Note that sincem
isdefined on the extended positive realline,it is uniformly integrable. Thus E[m(T)|77.1
=
m{T) a.s. foroptional times t>T.
An
optional process x=
{z[t);t 6 S?+} is upper-semi-continuous from the right iffor every optional time twe have lim8up„_„2(T-f- l/n) < x[t) on theset {r < oo}. Similarly, an optional process z
=
{x{t);t 6 3?+} issaid tobe quasi-upper-semi-continuous on the left iffor each optionaltimet and any sequence ofoptional times
3 Existence of best responses 4
for all T_,,
which
we
will formally state inTheorem
1,where
T
denotes the collection of alloptional times.
The
usefulness ofAssumption
2 willbecome
clear later.Note
that a rewardprocess satisfying
Assumption
1 can beunbounded
from
aboveand from
below,and
can take the value—
oo at t=
+oo.
Given
the strategy ofhisopponents
T_,, the objective of player i is to find an optional time T, that solves the followingprogram:
supE[r.(T;T_,)!. (1)
TeT
Ifsuch an optional time T, exists,
we
willsometimes
say T, is a best response to T_,-.A
Nash
equilibrium ofthe stoppinggame
is an TV-tuple of optional times (T,;«'=
1,2, .. . ,A')
such that T, solves (1) for i
=
1,2,...,A^.3
Existence
of
best
responses
In this section
we
willshow
that there always exists a solution to (1). This solulion willalso be said to be a "best response."
Our
analysis closely follows that ofThompson
[1971 .In
Thompson
[1971], players are allowed to use only the finitely-valued optional times. Here,however, the admissiblestrategy space is
composed
ofall the optional times.Our
generalization ofThompson's
analysis isquite straightforward. For completeness ofthis paperwe
shall providesome
details ofthis generalization.Note
that Fakeev [1970]and
Maingueneau
[1976/77] also analyzed optimal stoppingprob-lems by allowing non-finite optional times in a
non-Markov
setting. Fakeev [1970] required that £'[sup( 2~(i;r_,)]<
ooand
Maingueneau
[1976/77]worked
with reward processes that are ofdass
D,
that is,random
variables{z,{T\T-i);T
G
T}
are uniformly integrable.Assumptions
1
and
2admit
processes that lie outside theframework
of Fakeev [1970]and
Maingueneau
[1976/77].We
proceed first withsome
definitions.Definition
1A
regular supermartingale {X{t);tG
!R+} is an optional process so that for anybounded
optional time T,E[X-{T)]<^
(2)and
for all optional timesS
>T,
E[X{S)]
is well-definedand
3 Existence of best responses 5
Remark
1Out
definition ofa regular supermartingale is a generalization ofMerten
[1969J toinclude values at infinity.
Mertens
/1969,Theorem
l] hasshown
that almost all of the paths of a regular supermartingale have rightand
left limitsand
are upper semi-continuousfrom
theright.
The
regular supermartingale definedin Definition 1 also has thesame
characteristics.An
optional processY
is said to lie above the optional processW
, denoted byy
>
H', ifY{u),t)
>
W{uj,t) forall (w,t) outside a subset ofn
x9?+whose
projection onn
isofP-measure
zero.A
regular supermartingaleY
is theminimum
regular supermartingale(MRS)
lyingabove
an optional processW
i(Y
>
W
and
iiX
>W
for any other regular supermartingale A', thenX
>Y.
The
following propositionshows
that, for any (A'^-
l)-tuple of optional times T^,, there exists aMRS
lyingabove
the reward process {2,(<;T_,);<6
9?+}-Proposition
1 For any[N
—
\)-tuple of optional times T_,, there exists aMRS,
denoted by {Y,{t;T-,);tG
^+}
, lying above the reward process {z,{t;T-,);tG
^-,}.The
MRS
Y,{T-,) isright-continuous on the set {[iu,t)
G
Q
X 3?+ : Y,{uj,t;T-t)>
2,(0;,t;T-,)}. Moreover, fix f>
and
tG
9f+,and
let7,
=
inf{s>
t : 2.(s;T_,)>
y;(s;T_.)-
e}.Then
P{t,<
00}=
1and
E\Y,{u-T.,)\Tt]
=y.(t;T_.)
a.s.Proof
Put
z^{t-T.,)
=
m^x[z,{t-T.,),-M\
V<G ^+.
The
processz^[T-,)
isbounded
above
by amartingale byAssumption
1and
isbounded
belowby
M.
Dellacherieand
Meyer
[1982,Appendix
I,Theorem
22]shows
that there exists aMRS
Y-^{T-i) lying
above
2,^(T_,).Now
definey.(<;T_.)
=
essinf{y,^(t;T_.);M=l,2,...}
for all <
G
^+
.We
claim that Yi{T-i) istheMRS
lying above z,[T-,). First, it is easy tosee that }\(T-,) isoptional. Second,
we
show
that Yf{T-,) is a regular supermartingale. Since Y,{T^,) lies above3 Existence ofbest responses 6
and
5
betwo
optional times withT
<
S.By
Assumption
1, £'[7,(5;T_,)] is strictly less than+00.
Moreover,E[Y,{S-T_,)\Tt]
<
E{Y,'^{S-T_,)\Tt]<
yA^(T;T_.)
as. for allN
=
1,2,...where
the firstinequality followsfrom
thedefinition of y,(T_,)and
the second inequality followsfrom
the fact thatY/^{T-,)
is a regular supermartingale. Relation (4) implies thatE[Yi{S;T-i)\TT]
<
essinfr,^(T;T_.)
=
7.(7; T_,) a.s.M
Thus
Y,{T-i) is a regular supermartingale.The
rest of the cissertion followsfrom
similararguments
ofLemmas
2.2and
4.2-4.5 ofThompson
[1971]. INow
define an optional timer.(T_,)
=
inf{ie
5R+ : Y,{t;T.,)=
z,{t;T^,)}.The
following is ourmain
theorem
of this section.Theorem
1 T,(T_,) is a best response to T_, for player i. Moreover,£[y.(0;T_,)]
=
sup E{z,{T-T^,)]TeT
and
E[\Y,{0;T_i)\]<
oo.Proof
Fix n>
and
letT„
=
inf{s>
: y.(s;r_.)<
2.(s;T_.)+
-}.n
By
Proposition 1, ?„ is finite a.s.and
E[y,(r„;T_.)l
=
^[r.(0;T_.)]. (5)By
the hypothesis that z,{T_,) is upper-semi-continuous on the right(Assumption
2), the fact that y,(r_,) is right-continuouson
the set {Yi[s;T-,)>
2,(s;T_,)} (Proposition 1),and
(5)we
have 1
n
E{Y,{0-T-,)] -n 1 ^[^.(r„;T_.)]>
E[Y,(t„;T-,)] -1>
sup E\z,[T-T^,)]TeT
"
3 Existence of best responses
Letting
n
—> ocand
noting thatsup E[z,{T-T^,)]
>
E[z,{r„-T_,)] Vn,TeT
we
getsup E[z,{T-T-,)]
=
ElY,{0-T-,)]. (6)TeT
Moreover,
Assumption
1 implies that £[|yj(0;T_,)|]<
oo.These
are the secondand
the third assertions.Next
put t'=
limnTn.The
limit is well-defined since t„ is increasing in n. It is clear thatr'
<
T,{T_,).We
haveE[Y,{T';T^,)]>E{z,{r'-T-,)]
>
lim E[^.(r„;T_.)]n—OO
=
£;[y,(0;T_.)]>
E[r.(7';T_.)],where
the second inequality followsfrom
the hypothesis that 2,(T_,) isquasi-upper-semi-continuous
on
the left. It then follows thatE[z,{r';T-,)]
=
E[Y,{t';T-,)]=
E[Y,{0-T-,)\.Recall
from above
that E{\Yi{0;T-,)\]<
oo. Therefore itmust
be that z,(r';T_,)=
Y,{t';T-,) a.s.and
r'=
T,{T^i).By
(6), it follows that T,{T-,) is a best response to T^,. This is the firstassertion. I
The
following propositions givesome
properties of a best responsewhich
we
will find usefulin Section 4.
Proposition
2 Fix T_,. Let t, be a best response for player tand
let Yi{T-,) be theMRS
lying above £,(T_,).Then
y.(7.;T_.)
=
z.(r.;T_,) a.s., (7)and
henceT,{T_,)
<
T, a.s. (8)Proof
By
thefact that F,(T_,) is a regularsupermartingale, r, isa best response, and V, (7"_,)lies
above
2,(T'_,),we
have, almost surely,3 Existence ofbest responses 8
where
the equality followsfrom
the second assertion ofTheorem
1.Hence
E[y.(0;T_.)]
=
E[y.(r.;T_.)l=
E[z,{t,-T.,)].It then follows
from
Y,{T-i)>
z,{T-i) a.s.and
the last assertion ofTheorem
1 thaty,(7-,;T_,)
=
2,(r,;T_,) a.s.This is (7).
By
the definition oi T,{T-i),we
then have T,(T_i)<
t, a.s.,which
is (8).I
Note
that (7) says that any best response r to T-,must
be therandom
times at whichYi{T-t)
meets
2,(r_,). Since T,{T^i) by definition is the first time that Y,{T_,) is equal to2,(T'_,), Ti{T_i) is the
minimum
best response for player i, given that hisopponents
play T.,.This is (8).
The
following propositionshows
that a best response t, chosen by player» at time continuesto be optimal at
any
optional timeS
as long as atS
player » has notstopped
according to r,.
Proposition
3 Let r, be a best response to T_,and
letS
be an optional time.Then
on the set{'«
>
S)
, r, is a solution tosup E{z,{r;T_,)\rs].
t€T,t>5
Proof
If the eventB
=
{r,> 5} €
Ts is of zero probability, there is nothing to prove.Suppose
therefore thatB
is ofa strictly positive probability.Suppose
that the assertion isfalse.Then
there exists an optional time ?>
S
as. such thatP[B) >
0,where
B
= {<^eB:
E{z,{t-T_,)\Ts]>
E\z,{r,;T_,)\rs]. DefineI
T,{u)
Uujen\B;
^ ^'^^
\ f[u)
liujeB.
It is eaisily verified that t' is an optional time
and
E{z,{t'-T^,)]> E[z,{t,-T.,)],
4 Naish equilibria
when
z,'s have amonotone
structure 94
Nash
equilibria
when
2;,'shave
a
monotone
structure
In this sectionwe
willshow
that there exists aNash
equilibriumwhen
the reward processes of players satisfy certainmonotone
structures.Consider the following eissumptions:
Assumption
3 There existsQ
e
T
withP(n)
=
1and
for u;e
h, t, t'e
^+
and
t>
t',
Zi{ijj,t;T-i)
—
Zi[oj,t';T-,) is nondecreasing in T-,.Assumption
4
There existsQ
e
T
withP{h)
-
1and
foru
e
Q, t, t'G
^+
and
t>
t',
Zi[LL),t,T-i)
—
z,[oj,t']T^t) is nonincreasing in T-i.Assumption
5 There existsQ
e
7
withP{Q)
=
1and
for oje
fl, t, t'E
!R+and
t>
t',
Zi{ijj,t]T-,)
-
r,(w,f';T_,) is strictly decreasing in T-i.Assumption
3 says that the longerhisopponents
stay in thegame,
thehigher the "incremen-tal"reward
a player gets by staying in thegame.
For example,we
often observe departnientstores
and
hotels gather in nearby locations. Being clustered together, they can generate a higherdemand
for their services such as convention business for hotels.Assumption
4 is theconverse of
Assumption
3: a player's incrementalreward
decreases the longer hisopponent
stayinthe
game.
This is amore
naturalassumption
for,say, oligopolists facing afixeddemand
func-tion for the industry as a whole.Assumption
5 is a strengthening ofAssumption
4.Examples
of
games
havingreward
processes that satisfyAssumption
3 or 4 can be found in Section 5.Denoting
byT^
the collection of A''-tuple of optional times,we
define <I> : T' >—» T' as$(T,,...,r;v)
=
(T.(T_.)).'li.By Theorem
1, "^ is well-defined. Since$
specifies best responses for players, it willsometimes
be referred to eis a reaction
mapping.
For
two
A'^-vectors of optional times t—
[ri,.. . ,Tfi)and
S
=
(S,,. . .,5^-),we
denote byr
>
5
if r,>
5, a.s. for all i'=
1,2,. .. ,A'^.
The mapping
$
is said to bemonotone
increasingif for
any two
A^-vectors ofMarkov
times t>
S, r'=
$(r)and
5'=
^{S)
imply t'>
S'.The
mapping
$
is said to bemonotone
decreasing iffor anytwo
A^-vectors ofMarkov
times r>
S
,
t'
=
$(r)and
S'=
$(5)
implies t'<
S'.
The
following propositionshows
that$
ismonotone
increasing underAssumption
3 and is4
Nash
equilibriawhen
z,'s have amonotone
structure 10Proposition
4$
ismonotone
increasing ifAssumption
S is validand
ismonotone
decreasingif
Assumption
4 is valid.Proof
Suppose
thatAssumption
3 is satisfied.Choose two
A^-vectors of optional timesT
>
S. Let t'=
$(7-)and
5'=
$(5).Suppose
the setA
=
{.;<
5.'}is ofstrictly positive mear-jre for
some
i.By
Assumption
3we
know,
almost surely,[^.•(5,';r_.)
-
z.(r.';r_.)]l^>
[z.(S:;5_.)-
z.(r,';5_.)]U.
Taking
conditional expectations with respect to T^' on both sides of the above relation gives>
£;[2.(5,';r_.)-2.(r,';r_.)|^r'llA>
E[z,{Sl-S_,)-
r,(r,';5_,)l.';.]U>
0,where
we
have used the fact thatA
G
Tj' (see, e.g., Dellacherieand
Meyer
[1978,Theorem
IV.56]),
where
the inequalities followsfrom
Proposition 3.Thus
£[2,(5,-; S_,)|.%.')
=
z,(t,';5_,) a.s. on the set A. (9)Now
define ct,=
S,'A
r' a.s. It is easily checked that a, is an optional timeand
cr,<
S-,Oi
^
S'-on
a set of strictly positive measure.We
claim that a, is a best response to 5_,.To
see this,
we
note thatE[z,{a,-5_.)]
=
E[z,{tI-S_,)U
+
z.{Sl\5-,)^^]
=
E
E[z,{S[S-,)\A
+
z,[SlS^,)\A^Jr'=
E\z,{S[-S_i)\,where
we
have used (9)and
where
A'^ denotesH
\ >4. This is a contradiction to the fact that S[ is theminimum
best response to 5_,; see Proposition 2.Thus
A
must
be ofmeasure
zero.The
rest of the assertion followsfrom
similar arguments. IThe
following is the firstmain
theorem
of this section:4
Nash
equilibriawhen
z,'s have amonotone
structure 11Proof
Suppose
thatAssumption
3 issatisfied.From
Proposition 4, <C> is amonotone
increas-ingmapping
from
T^
to itself. Proposition VI.1.1 ofNeveu
[1975] implies that (T'^',>) is acomplete
lattice. It then followsfrom
Tarski's fixed pointtheorem
(see Tarski [1955]) tliat there exists a fixed point for <^. It is easily verified that the fixed point is aNash
equilibrium.I
A
stoppinggame
is said to besymmetric
if for any (A^-
l)-tuple of optional limes T_,we
have
2,(u;,t;T_,)=
2j(w,t;T_,) except on a subset of fi x !R+whose
projection toH
is ofmeasure
zero.A
Nash
equilibrium {Ti)f_^ is asymmetric
equilibrium ifT,—
T, a.s. for all i,j.Under
Assumptions
3, the followingtheorem shows
that, in asymmetric game,
there always exists asymmetric
equilibrium.Theorem
3Suppose
thatAssumptions
S are satisfied.Then
there exists asymmetric Nash
equilibrium in asymmetric
stopping game.PROOF
LetD
=
{T
eT^
-.T,^ T, a.s.Vi,;}and
letF
be the restriction of^
to D.Note
that for eachT
e
D, $,(T_,)=
$;(T__,) a.s. forall t,,;',
where
<I>, denotes the t'-thcomponent
of $. Therefore,F
maps
D
into D.Arguments
similar to the proof of Proposition 4
show
thatF
ismonotone
increasing. It is also easily verified thatD
is a complete lattice.The
cissertion then followsfrom
the Tarski's fixed pointtheorem. I
In the next
theorem
we
specialize ourmodel
to the casewhere
thenumber
of players isequal to two. In this case, there exists a
Nash
equilibrium underAssumption
4.Theorem
4Suppose
thatN
=
2.Then
there exists aNash
equilibrium underAssumption
^for the stopping game.
Proof
From
Proposition 4,we
know
$
: T'^ i—T^
ismonotone
decreasing. Let <I>,(r_,) bethe t'-th
component
of ^{T). It is easily seen that $,'s aremonotone
decreasing. Consider thecomposite
mapping
$i
o$2
:T
•—> T. This compositemapping
ismonotone
increasing since$1 and $2
aremonotone
decreasing. It then followsfrom
the Tarski's fixed pointtheorem
(see Tarski [1955]) that there exists a fixed point of the compositemapping,
that is, there exists TiG
T
such that ^i{^2{Ti))=
Tj.Hence
(Tj, <f>2(7'i)) is aNash
equilibrium. IEven
in thetwo
player case, theremay
not exist asymmetric
Nash
equilibrium for a sym-metricgame
under
Assumption
4; for anexample
of nonexistence seeHuang
and
Li [1987].4 Ncish equilibria
when
z,'s have amonotone
structure 12Note
thatwe
only consider pure strategy equilibria in this paper.Our
conjecture is that asymmetric
equilibrium exists for asymmetric
stoppinggame
underAssumption
4 ifmixed
strategies are allowed.
The
following theorem, however,shows
that if there exists asymmetric
Nash
equilibrium for a general A'^ playersymmetric
game
underAssumption
5and
ifthe rewardprocesses satisfy a separability property, then this equilibrium
must
be the uniquesymmetric
Nash
equilibrium.Theorem
5Suppose
thatAssumption
5 is satisfiedand
that there exists asymmetTic
Nash
equilibrium in a
symmetric
stopping game.Then
thisNash
equilibrium is the uniquesymmetric
equilibrium provided that reward processes satisfy the following condition of separability: IfT,S
G
T^~^
and
t=
S
a.s. on a setB
€
', then almost surelyZi{u,,t;T)
=
z,{oj,t;S)yte^+yoje
B.Proof
Let "I* be amapping
from
T^
to all thesubsets ofT'^ that givesall the best responsesto an elementof
T
.By
Theorem
1, thismapping
iswell-defined.We
claim that^
ismonotone
decreasing in the sense that for
two
optional times t> S
a.s.and
r'£
'^{t)and
S'G
'^{S)we
have r'
<
S'.To
see this,we
suppose
that the setA =
{rl>
5.'}is ofstrictly positive probability for
some
i.By
Assumption
5,we
know
almost surely,(^.(r;;r_.)
-
2,(5,';r_.))U
<
{z.irl^S-,)-
z,iS';S.,)) 1^.Taking
conditional expectations with respect to 7^' givesE
k(r:;r-.)-
2.(5,';r_.)l.vl Ia<
E
Uirl-S.,) -
^.(5.';5_.)j.vlU,
where
we
have used the fact thatA
£
Ts' (see Dellacherieand
Meyer
[1978,Theorem
IV.56J).The
left side ofthe relation is nonnegative almost surely by Proposition 3.Hence
the right sideis strictly positive
and
is a contradiction to Proposition 3.Now
letT'
,T'e
D
and T*
#
T' betwo
symmetric
Nash
equilibria.We
willshow
that this leads to contradiction by considering three cases.Case
1:T*
<
T' a.s. SinceT'
and
T' areNash
equilibria,we
must
haveT'
e
*(T") and
T'e
*(r').The
monotonicity of*
implies thatT'
>
T' a.s., henceT'
=
T' a.s.. This a contradiction.Case
2: T'<
T'
. Using the monotonicity of 'i',we
have a contradiction similar to the4
Nash
equilibriawhen
z,'s have amonotone
structure 13Case
3:P{T'
< T'}
e
(0,1). DefineT =
T'
^T'
. SinceD
is a lattice,T
E
D. Letf
e
*(T).
By
the monotonicity of*
and
the fact thatT'
E
*(T") and
T'E
*{T'),we know
T
>
T
a.s.and
T
>
T*
a.s.Let
B =
{T'<
T'}.By
Dellacherieand
Meyer
[1978,Theorem
IV. 56],B
E
It--We
havelB^[^.(T;;T^)|/rl
=
lB^[^.(7;';r_.)|.Vl<
lBE[z,{f,;T^,)\7r]=
lBElz,(f,;ri.)17j-;],where
the first equality followsfrom
the separability assumption, the inequality followsfrom
Proposition 3,and
the second equality follows againfrom
the separability assumption. SinceT,' is a best response to T'_^
and
sinceT
>
T', Proposition 3 implies that the above inequalitymust
be an equality.That
islBE\z,{T:-T'_,)\7r]
=
lBE[z,{f,-T'_,)\Tr] a.s. (10)Now
we
define^
^'^^-\
f(a;)
ifuEB^,
where
B'^= Q\B.
SinceB
E
Tt- "^^f,'^^ 's easily verified that
T°
isa optional time (see, e.g.,Dellacherie
and
Meyer
[1978,Theorem
IV.53]).We
claim thatT° E
*(T).To
see this,we
note thatE[z,{T:;T^,)]
=
E\lBZi{T°;T.,)
+
lB'Z,{T°;T^,)\=
E[lBZ,iTl;T^,)+lB'Z,{f,-T_,)]
=
E[1b2.(T.;T_.)+
1b^^.(T'.;T_,)!=
^[r.(T.;T_.)],where
the third equality followsfrom
(10).Hence
we
proved our claim thatT°
E
*(T). This implies thatT°
>
T'
a,s., by the monotonicity of *.But
on the set B,T°
=
T'<
T', a contradiction. Therefore the setB
must
have a zero probabilityand
T'>
T\T'
^
T'. This is5
Two
duopolistic exitgames
145
Two
duopolistic
exit
games
In this section
we
will considertwo
continuous time duopolistic stochastic exit games.The
firstsatisfies
Assumption
3and
thus has aNash
equilibriumbyTheorem
2. In addition, ifthisgame
is
symmetric,
there is asymmetric
Nash
equilibrium byTheorem
3.The
secondgame
satisfiesAssumption
5and
therefore has aNash
equilibriumby
Theorem
4.Let the probability space
and
filtrationbe as specified in Section 2.There
aretwo
processesX
=
{X{t)]te^+}
and
W
=
{W{t);tG
^+}
adapted
to F.There
aretwo
firms in themarket
at the beginningofboth
games.These two
firms are sissumed to be risk neutraland
are indexed by I=
1,2.The
riskless interest rate is a constant denoted by r.A
firm'sproblem
is to find an exit time in order tomaximize
the present value of its expected profits.In the first
game,
let ;r,i(A'(<)) be the profit rate at time t for firm :when
it is the only firm in themarket and
let 7r,2(^V(<)) be the profit rate at time t for firm twhen
there aretwo
firms in the market.
Suppose
that |7r,j| isbounded.
We
interpret A' to be the industrydemand
when
there is only one firm in the industryand
W
to be thatwhen
there aretwo
firms in the industry.Assume
that thedemand
is higherwhen
there aretwo
firms in the market.Thus
itis natural to require that t,2(V^(<))
>
jr,i(X(<)) for t=
1,2.The
reader can think about theexample
we
gave in Section 4 ondepartment
storesand
hotels clustering together to generatea higher
demand.
Given
that its competitor exits at the optional time T, the reward process for firm t isz,{io,t-T)
=
/o'^^'^'e—V,2(Vy(a;,s))ds+/'
c-";r„(X(u;,5))cf5,te^^,
(11) Zi[u},+oo\T)
=
limsup(_+^
z,{uj,t\T).This
reward
process is absolutely continuous on [0,+oc)and adapted
and
therefore optional.It satisfies
Assumptions
1, 2,and
3,and
there exists aNash
equilibrium byTheorem
2.When
7rjy(-)
=
n2]{') forj=
1,2, this is asymmetric
game
and
there is asymmetric
Nash
equilibrium byTheorem
3.Now
consider the secondgame.
Let ;r,j(A(<)) be the profit rate firm i receives at time twhen
there are j firms in themarket and
when
firm «' is still in the market,where
j
=
1,2.Assume
that |7r,j(-)| isbounded.
Also interpret X[t) to be thedemand.
Unlike in the firstgame,
the totaldemand
here is independent of thenumber
of firms in the market.Thus
it isnatural that a firm's profit is lower
when
it is aduopoly
thanwhen
it is amonopoly.
Hence
we
5
Two
duopolistic exitgames
15Given
that its competitor exits at the optional time T, the reward process of firm i is(12) Zi{u>,+oo;T)
=
limsup,^^^
z,{uj,t;T).This
reward
process can be verified to be optionaland
satisfiesAssumptions
1, 2,and
5. llthen follows
from
Theorem
4 that there exists aNash
equilibrium ofthis duopolistic exitgame.
Readers
interested in detailed analysis of thisgame
and
discussion on refinements ofNash
equilibriawhen
X
is a Diurtiiianmotion
are referred toHuang
and
Li [1987].Finally,
we
remark
that the reward processes defined in (11)and
(12) satisfy the separability6 Bibliography 16
6
Bibliography
1. A.
Bensoussan
and
A. Friedman, Nonlinear variational inequalitiesand
differentialgames
with stopping times. Journal of Functional Analysis
16
(1974), 305-352.2. A.
Bensoussan and
A. Friedman,Nonzero-sum
stochasticdifferentialgames
with stopping timesand
freebomidary
problem. Transactions ofAmerican Mathematical
Society231
(1977), 275-327.3. J.-M.
Bismut,
Surun
probleme
de Dynkin. Z. Wabrscb. Verw. Gebiete39
(1977), 31-53.4. J.-M. Bismut, Control deprocessus alternantset applications. Z. Wabrscb. Verw. Gebiete
47
(1979), 241-288.5. H.
Chaput,
Markov
games, TechnicalReport
No.33,Department
ofOperations Research, Stanford University, 1974.6. K.
Chung
and
R. Williams, 1983,An
Introduction to Stochastic Integration, BirkhauserBoston
Inc.7. C. Dellacherie
and
P.Meyer,
1978, Probabilitiesand
Potential A: GeneralTheory
ofProcess,
North-Holland
PublishingCompany,
New
York.8. C. Dellacherie
and
P. Meyer, 1982, Probabilitiesand
Potential B:Theory
of Martingales,North-Holland
PublishingCompany,
New
York.9. E.
Dynkin,
Game
variant of aproblem
on optimal stopping, SovietMath.
Dokl.10
(1969), 270-274.10. N. Elbakidze, Construction of the cost
and
optimal policiesin agame
problem
ofstoppinga
Markov
process.Theory
Probab. Appl.21
(1976), 163-168.11. A. Fakeev,
Optimal
stopping rules for stochastic processes with continuous parameter. Th. Prob. Appl.15
(1970), 324-331.12. C. Fine
and
L. Li, Stopping time equilibriaingames
withstochcisticallymonotone
payoffs.Working
paper#158-87,
Operations Research Center,MIT,
1987.13. C.
Huang
and
L. Li,Continuous
Time
StoppingGames,
Working Paper #1796-86,
Sloan School ofManagement,
1986.6 Bibliography 17
14. C.
Huang
and
L. Li,Brownian motion
exit games,MIT
mimeo,
1987.15. J. Kiefer,
Optimal
stopping ingames
with continuous time, Prob. Th. Appl.18
(1971),556-562.
16. N. Krylov, Control of
Markov
Processesand W-spaces.
Math.
USSR-Izv
5 (1971),233-266.
17. J. Lepeltier
and
M.
Maingueneau,
Le jeu deDynkin
en theorie general sansrhypolhese
deMokobodski.
Stochastics13
(1984), 25-44.18.
M.
Maingueneau,
Temp
d'arretoptmaux
et theorie generale, Seminarie de Probabilites(1976/1977), 457-467.
19. J.
Mamer, Monotone
stopping games, J. Appl. Prob.24
(1987), 386-401.20. J.
Mamer
and
K.McCardle,
Uncertainty, competition,and
the adoption ofnew
technol-ogy,Afanagement
Science33
(1987), 161-177.21. J. Mertens, Sur la theorie des martingales, C.r. Acad. Sc. Paris
268
(1969), 552-554.22. H.
Morimoto, Non-zero-sum
discreteparameter
stochcisticgames
with stopping times.Probab. Th. Rel. Fields
12
(1986), 155-160.23. H. Nagai,
Nonzero-sum
stoppinggames
ofsymmetric
Markov
processes, Prob. Th. Rel. Fields75
(1987), 487-497.24. O.
Nakoulima,
Inequations variationnelles et quasi-variationnelles bilaterales associees a desproblemes
de jeux stochastiques asomme
nulleou
non
nulle. These, Universite deBordeaux
I, no. 674, 1981.25. J.
Neveu,
DiscreteParameter
Martingales,North-Holland
PublishingCompany,
Amster-dam,
1975.26. Y.
Ohtsubo, Optimal
stoppingin sequentialgames
with or without a constraint ofalways terminating.Math.
Oper. Res. 11 (1986), 591-607.27. Y.
Ohtsubo,
A
nonzero-sum
extension ofDynkin's stopping problem.Math.
Opcr. Res.12
(1987), 277-296.6 Bibliography jg
29. L. Stettner.
Zero-sum Markov games
with stoppingand
impulsive strategies. Appl.Math. Optim. 9
(1982), 1-24.30. A, Tarski,
A
lattice theoretic fixed point theorem, Pacific Journal of Mathematics^ 5 (1955), 285-309.31.
M. Thompson,
Continuous parameter
optimal stopping problem, Z. Wahrscheinlichkeits-theorie verw. Geb.19
(1971), 302-318.Date
Due
H012
'90milIIImimilIII