Montréal
Série Scientifique
Scientific Series
2001s-08
Leader and Follower:
A Differential Game Model
CIRANO
Le CIRANO est un organisme sans but lucratif constitué en vertu de la Loi des compagnies du Québec. Le
financement de son infrastructure et de ses activités de recherche provient des cotisations de ses
organisations-membres, d’une subvention d’infrastructure du ministère de la Recherche, de la Science et de la Technologie, de
même que des subventions et mandats obtenus par ses équipes de recherche.
CIRANO is a private non-profit organization incorporated under the Québec Companies Act. Its infrastructure and
research activities are funded through fees paid by member organizations, an infrastructure grant from the
Ministère de la Recherche, de la Science et de la Technologie, and grants and research mandates obtained by its
research teams.
Les organisations-partenaires / The Partner Organizations
•École des Hautes Études Commerciales
•École Polytechnique
•Université Concordia
•Université de Montréal
•Université du Québec à Montréal
•Université Laval
•Université McGill
•MEQ
•MRST
•Alcan Aluminium Ltée
•AXA Canada
•Banque du Canada
•Banque Laurentienne du Canada
•Banque Nationale du Canada
•Banque Royale du Canada
•Bell Québec
•Bombardier
•Bourse de Montréal
•Développement des ressources humaines Canada (DRHC)
•Fédération des caisses populaires Desjardins de Montréal et de l’Ouest-du-Québec
•Hydro-Québec
•Imasco
•Industrie Canada
•Pratt & Whitney Canada Inc.
•Raymond Chabot Grant Thornton
•Ville de Montréal
© 2001 Hassan Benchekroun et Ngo Van Long. Tous droits réservés. All rights reserved.
Reproduction partielle permise avec citation du document source, incluant la notice ©.
Short sections may be quoted without explicit permission, if full credit, including © notice, is given to the source.
Ce document est publié dans l’intention de rendre accessibles les résultats préliminaires
de la recherche effectuée au CIRANO, afin de susciter des échanges et des suggestions.
Les idées et les opinions émises sont sous l’unique responsabilité des auteurs, et ne
représentent pas nécessairement les positions du CIRANO ou de ses partenaires.
This paper presents preliminary research carried out at CIRANO and aims at
encouraging discussion and comment. The observations and viewpoints expressed are the
sole responsibility of the authors. They do not necessarily represent positions of CIRANO
or its partners.
Leader and Follower: A Differential Game Model
*
Hassan Benchekroun
†
, Ngo Van Long
‡
Résumé / Abstract
On analyse un jeu différentiel entre deux joueurs, dont le premier a
l'avantage de prendre sa décision avant son adversaire. On compare le profil de
stratégies d'équilibre de ce modèle avec celui d'un modèle d'actions symétriques.
On démontre que l'existence d'un leader favorise la conservation dans
l'exploitation du stock commun. On analyse les déviations possibles à partir d'un
équilibre. On démontre que si le leader peux s'engager à une politique
d'exploitation plus modérée, alors le suiveur peux répondre plus ou moins
agressivement, selon la durée de la période d'engagement.
We consider a differential game between two players, where one player
has the first mover advantage. We compare the equilibrium strategy profile of this
model with the one generated by a conventional symmetric model. It is shown that
the existence of a first mover results in more conservationist exploitation in the
aggregate. We also consider the implication of departures from the equilibrium.
We show that if the leader (the first mover) can commit to decrease its effort over
a finite interval of time, then the follower (the second mover) may respond by
increasing, or decreasing, its effort, depending on the length of the commitment
period.
Mots Clés :
Leadership, jeux différentiels
Keywords:
Leadership, differential games
JEL: C72, Q22
*
Corresponding Author: Ngo Van Long, CIRANO, 2020 University Street, 25
th
floor, Montréal, Qc, Canada
H3A 2A5
Tel.: (514) 985-4020
Fax: (514) 985-4039
email: [email protected]
We wish to thank two referees for very helpful comments.
Migratory sh that travel along the coast lineof several nations are subject
to sequential catching. This has been the cause of many disputes between
nations. Forexample, the Pacic salmon disputes between Canadaand the
United States of America are due largely to what is perceived by Canada
as Alaska'sunfair\interceptions". The Canadian salmon,born inCanadian
rivers, have the habit of travelling to the Pacic ocean, staying in the high
seasforafewyears(3to7years,dependingonthespecies)andthenreturning
toCanada forbreeding. On theirreturntrip,they travelalongthe coastline
of AlaskabeforereachingBritishColumbia. Alaska'sshershavethe chance
to catch these sh rst. This is called \interception" 1
. If they catch all the
Canadian salmonthat pass by, thenthere willbenomore sh inthe future.
So they dohave anincentive to conserve the resource. Canadian shers are
thesecondmover,becausetheyobservetheAlaskaninterceptionsbeforethey
decide how muchto catch 2
.
Several interesting theoretical questions arise: (i) what would be a
rea-sonabledenitionofequilibriumstrategiesforthistypeofgame? (ii)inwhat
sensethis isaleader-followergame? (iii)doestheleaderhaveanincentiveto
\set an example" by restraining its catch rate relative to a situation where
the two players are symmetrical? (iv) if an equilibriumis perturbed by one
player, howwould the otherplayerreact? Forexample,if one playerdecides
and commits to decrease its shing eort below its equilibrium value for a
specied time interval, would the other player react by also decreasing its
shing eort? These questions have not been raised in the existing
theoret-ical literature on common property resource management (Plourde (1970),
Sinn (1984), Clemhout and Wan(1985), Bolle (1986), Thomas (1992),
Mis-sios and Plourde (1996,1997)),nor in the literature on sh war, which deals
onlywithsymmetricsituations(LevhariandMirman(1980),Chiarellaetal.
(1984), Kaitala and Pohjola (1988),Plourde and Yeung (1989), Fischer and
Mirman (1992), Datta and Mirman (1994), Missios and Plourde (1996)) or
1
According to Huppert (1995)the interceptionsby Alaskaaccountedfor 20 percent
landedvalueofallsalmonspeciescaughtbyAlaskanshersovertheperiod1990-91. 2
ThePacicsalmondisputealsoinvolvesanotherplayer,namelytheshersfrom
Wash-ington/Oregon. Weneglectthis aspectfor simplicity. Fortechnicalandinstitutional
as-pects of the disputes, see Munroand Stokes(1987), Munro, McDorman and McKelvey
(1997),andHuppert(1995). FortheoreticalanalysesofthePacicsalmondisputes,which
This paper is an attempt to modelthis type of sequential shing game.
We formalize this shing game as a dierential game 3
, involving one state
variable,thesh stock, andtwocontrolvariables,theharvest rates, oreort
levels, of the players. Using the dierential game framework, we shall
in-vestigate the answer to the above questions. We consider a continuous time
modelwhereateach timet, playerone (sayAlaska)harvests fromthe stock
beforeplayertwo(say Canada)can harvest. Playertwoisthe secondmover:
it has the opportunity to observe the rate of harvest of player one, before
deciding on its own rate of harvest. Furthermore, we assume that at any t,
the harvest rate of playerone (therst mover) has anegativeimpact onthe
productivity of player two's shing eorts. We characterize the Stackelberg
equilibrium strategy prole, and compare it with the Naive Nash
equilibri-um. We show that the rst mover (i.e., the Stackelberg leader) in fact has
an incentive to be more restrained in its harvest. The combined harvest of
the two players in this leader-follower equilibrium is smaller than the total
harvest ina NaiveNash equilibrium.
Wealsoconsider adeviationfromthe equilibriumstrategies. Wesuppose
player one decreases its shing eort during a xed period of time.(This
may be due topressure formore conservation fromenvironmentalgroupsin
countryone.) Weshowthatplayertwowouldrespondbymodifyingitseort.
The modication of player two's shing eort is shown to be an increasing
functionoftime. Furthermore,weshowthat thebest responseby playertwo
to player one's decreased shing eort is ambiguous and is sensitive to the
length ofthe periodof timeduring which playerone isdecreasing itsshing
eort. Thisperiodoftime canbeinterpreted asthe commitmentperiodtoa
lowershing eort of player one. When the period of commitmentis short,
playertwoincreases itscatching eort. However, whentheleader'speriodof
commitment is long enough, player two responds by decreasing its catching
eort fora certain periodof time. This decrease could be even greater than
playerone's deviation. Ouranalysis thereforesuggests that ifenvironmental
groups in country one (the leader) want to achieve an overall moderation
in the exploitation rate, but manage only to get their own government to
commit to a short term decrease in shing, they may end up causing more
sh to be caught by the other country. Thus a good intention might well
3
See Clemhout and Wan (1994) for a concise treatment of dierentialgames. For a
the leaderwould servethe conservationists'sobjectives.
In section2, we set up the model, characterize the Stackelberg
equilibri-um, and compare it with the Naive Nash equilibrium. Section 3 deals with
deviation from the equilibrium strategies. Some concluding remarks are
of-fered insection 4.
2 The Model
We use a continuous time model 4
. Let x(t) be the stock of sh at time t.
Playerone's(Alaska's)catchrateattish 1 (t)= 1 (t)x(t),where 1 (t)maybe
interpretedas itsshing eort. Playerone's net benetsis R 1
(h 1
(t)), which
isindependentof playertwo'sshingeort 2
(t),becauseplayerone catches
rst, before player two. The net benets to player two, on the contrary,
depend onboth h 1
(t) and h 2
(t) aswellasthe stockof sh and are given by
R 2 (h 2 (t);h 1 (t);x(t) ), where h 2 (t)= 2 (t)x(t). We assume R 2 is decreasing in h 1
and increasing in x. This may be justied as follows. The greater the
catch rate of player one, the more diÆcult it is for player two, who must
sh after player onehas shed, toachieve agiven catchrate. This negative
impactofplayerone'scatchrateonplayertwo'sbenetsisstrongerthelower
the sh stock. WeassumethatR 2 (h 2 (t);h 1
(t);x(t))takesthemultiplicativelyseparable
form R 2 (h 2 (t);h 1 ;x(t))=R (h 2 (t)) 1 h 1 (t) x(t) ! . 4
In an earlierversionof the paper, adierence-equation game, in discrete time, was
used: weadopted amodication ofthe Levhari-Mirman(L-M,1980) model ofshwars
to allowfor thesequentialityof thegame under consideration. Using theL-M
function-al forms, with modication to capture sequential moves, we obtained apuzzling result:
Canada'sharveststrategyturned outtobeindependentofAlaska'sharvestingrate, i.e.,
Canada would have a dominant harvesting strategy. This puzzling result seems to be
attributableto theL-Mspecicfunctional formsfortheutilityfunction (logutility)and
the reproduction function (a power function, concave in the stock). Unfortunately, to
ourknowledge,thesearetheonlyfunctionalformsforwhichwecancomputeanalytically
equilibrium strategiesin adiscrete-time sequentialgame. We havetherefore decided to
trytocapturesomeaspectsof sequentialityusingacontinuoustimemodel: onegroupof
shers are disadvantaged by the\prior" harvesting by theother group: the externality
1
equation indicatesthat if playerone and playertwocatchthe same quantity
ofsh,thenetbenetstoplayertwowillbelower,becauseoftheexternalities
generated by playerone's catch. Thefact thatplayertwois directlyaected
byplayerone'scatchiscaptured by themultiplicativeterm 1 h 1 ( t) x(t) . The
largerthe extractionrateof playerone andthe smallertheavailablestock of
sh the more costly it is for player two to harvest and the lower will be its
net benet fora given harvesting level.
The rate of growth of the sh stock is
_ x(t)=G(x(t); 1 (t); 2 (t)): (1)
The objective of player one is to maximize J 1
, the discounted sum of its
instantaneous net benets
J 1 = Z 1 0 R 1 (h 1 (t))e rt dt
subject to (1), where r denotes the interest rate which we assume, for
sim-plicity,to be the same for both players.
Similarly,the objective of playertwo isto maximizeJ 2 : J 2 = Z 1 0 R 2 (h 2 (t);h 1 ;x(t))e rt dt subjectto (1).
WeseekaMarkovperfectNashequilibriumwhereeachplayerusesa
time-independent Markovian strategy: 1 = 1 (x) and 2 = 2 ( 1 (x);x). That
is, player one conditions its current eort level only on the current stock,
and player two conditions its eort level both on the current stock, and on
the observed eort level of its rival, whose catch rate is observed by player
two. Thisfeatureoftheplayertwo'scatchstrategiesallowsustocapture the
fact that the shing game is sequential: player one catches rst and player
twois informedof the catch ofplayerone beforetakingits action. Since the
action of player two depends on the action of player one, which it observes
beforetaking itsown action,and since playerone knows this, and takes this
intoaccount,the solutionwe areseekingisinfactwhateconomistsnormally
call a Stackelberg equilibrium. But, as pointed out by game theorists, and
reaction functions 5
.
The Hamilton-Jacobi-Bellman(HJB) equation for playerone is
rV 1 (x)=max 1 [R 1 ( 1 x)+V 0 1 (x)G(x; 1 ; 2 ( 1 (x);x))]
and the HJBequation for player two is
rV 2 (x)=max 2 [R 2 ( 2 x)(1 1 (x)x x )+V 0 2 (x)G(x; 1 (x); 2 )]:
To proceed further, it is convenient to specify explicit functional forms.
Assume the transitionequationis linear inx:
G(x(t); 1 (t); 2 (t))=Ax(t) 1 (t)x(t) 2 (t)x(t)
The assumed linearity allows us to derive analytical solutions. A possible
justication is that even if the true function is, say, quadratic 6
in x, when
the two players are facing a low stock level,a linear approximation may be
a reasonable price topay for tractability 7
. Linearity implies,in the absence
of human harvesting, an exponential rate of growth of the sh stock: the
resource growth is neitherlimitedby spacenor byfoodsupply (see
Dasgup-ta and Heal (1979), Clark (1990)). This assumption is not unreasonable,
because for most kinds of sh that are commerciallyexploited, it is human
harvesting, rather than natural conditions, that is the relevant limit to the
growth of the biomass. Their survival is endangered by mass exploitation
rather than by natural forces.
The net benet function of player 1 isassumed totake the simple form
R 1 ( 1 x)=( 1 x) ; 0< <1;
It is a concave function in the catch rate, 1
x. For example, if the harvest
is sold in an international market at a constant price, then net benet for
5
SeeDockneretal. (2000),forfurtherdiscussion inthecontext ofdierentialgames.
Fora similar point, made in the context of homogenous product duopoly, see Binmore
(1992). 6
Aquadraticfunctionisafrequentlymadeassumptionintherenewable-resource
liter-ature. 7
It ismoreoverarealisticassumptionin thecaseofstockswherefoodandspace
and total cost isconvex, then net benetis concave. Inthe case the harvest
is sold in the home country's market segregated from the world market,
the net benet to the home country is taken to be the sum of consumers'
surplus and producers' surplus; in this case, since the consumers' surplus is
normally concave and the cost function of catching sh is usually convex, it
makes sense to assume that the net benet is concave 8
. This justies the
assumption <1.
Forplayertwo,we specify
R 2 ( 2 x; 1 x;x)=( 2 x) (1 1 ); 0< <1: Notice thatif 1 1
0,thenplayertwowould set itseortlevelat 2
=0.
The equilibrium would thus be trivial in that case. Henceforth, we will
mainly focus on the cases where parameter values ensure that both players
willchoose positiveeortlevels.
Our rst task isto show that there exists a pair of Markov perfect Nash
equilibriumstrategies, asdescribed by the following proposition:
Proposition 1
Assume that r > A. There exists a pair of Markov perfect Nash
equi-librium strategies, where player one's equilibriumstrategy is
1 = r A 1 = 1 >0 (2)
and player two's equilibriumstrategy is
2 = r A+ 1 1 = 2 ( 1 ) (3) if 1 1 > 0, and 2 = 0 if 1 1
0. Given this pair of equilibrium
strategies, if 1 1
>0, the equilibriumeortlevelof player two is
2 = (r A) (1 ) 2 = 1 (1 ) > 1 (4)
Proof: See Appendix A.
Remarks:
8
Thedierence betweenapositiveconcavefunction andapositiveconvexfunction is
Playertwotakesthenumber 1
(tobeobserved) asindependentofitsaction,
and nds, from the set of functions 2
( 1;
x), a function that maximizes
its payo. This optimal choice is found to be the function described by
(3). Playerone, taken as given player two's strategy (3), chooses a function
1
(x) that maximizes its payo. This function is found to have a constant
value which is given by (2). This Nash equilibrium is sometimes called a
Stackelberg equilibrium, in the sense that player one harvests rst (even
though one can say that the strategies are chosen simultaneously).
(ii) Notice that, from (4), player two's harvesting eort is greater than
that of playerone if 1 1
>0.
(iii)Player two'sstrategy,whichmay be calleditsreactionfunction,
dis-plays the property of strategic complementarity: an increase in 1
implies
that 2
willincrease too 9
. Moreover, if >1=2, the increase in 2
is greater
than the increase in 1
.
(iv)Hadplayerone notknown that 2
isafunction, and hadittaken 2
as agiven number 10
, likein Cournot games,we would have ended up witha
pair of values, denoted by ( b 1 ; b 2 );where b i = r A 1 2 ; i=1;2 (5)
we assume r A > 0 and < 1=2. Callthis \equilibrium" the Naive
E-quilibrium. Comparing (5)with (2), wesee that the leader'scatch rate 1
is
smaller that the catch rate b i
of eachplayerat the Naive Equilibrium. This
makessense: the leader knows that if it catches more, the followerwill also
increase itseort. Therefore,the leaderhasanincentivetoexercise
restrain-t. Moreover, the overall catching eort under the Stackelberg equilibrium
( 2
+ 1
)isless thanthe catcheort undertheNaiveequilibrium( b 1 + b 2 ). 9
Thisstrategiccomplimentarityholdsalsoformoregeneralnetbenetfunctions,
pro-vided that theelasticity ofthemarginalnetbenetwiththerespectto thecatch, i.e.,
R 00 2 2x R 0 2
,issuchthatthefollowingconditionissatised:
(r A+ 1 ) 0 2 >1. This
con-dition canbederivedfromthe rstorderconditionoftheHJBequation associatedwith
player2'sproblemtogetherwiththeuseoftheenvelopetheoremappliedtothatequation.
In general,this conditionhasnodirect relationshipwith theconcavity(or otherwise) of
thenet benetsfunction. Inthespecial casewhere isaconstant,theaboveinequality
isequivalentto requiring0<<1. (Aproofisavailableuponrequest.) 10
Not thenumber 2
erisgiven therst moveradvantageisnot worseningthe preservationof the
resource.
(vi)Wehave found aNash equilibrium(witha leaderand a follower) on
the assumption that thefollowerassumesthat the leaderchooses aconstant
eort level 1
. The follower then solves for itsbest reply, which we call the
reaction function. Using this reactionfunction, the leader solves itsoptimal
controlproblem. It turns out thatthe solutionyieldsaconstant eortlevel,
thus justifying the follower's initialassumption.
(vii) The equilibrium found in Proposition 1 has the property that each
player's strategy is individually optimal and is unique, given the player's
strategy. This can be proven by writing down each player's problem, given
the specied strategy of the other player, as an optimal control problem,
and applyingthe standard suÆciency theorem. Note, however, the factthat
we havefound anequilibriumstrategy pair doesnot preclude the possibility
that there may exist non-linear strategy pairs that also constitute a Nash
equilibrium.
(viii) Note that the equilibrium determined in Proposition 1 is not
col-lectively optimal, that is, the sum of net benets of the two players is not
maximized. In additiontoa rst sourceof ineÆciency due tothe absence of
property rights over the resource, thereisa secondsourceof ineÆciency due
to thefact that player one ignoresitsdirect negative impactonplayertwo's
net benets.
Inwhatfollows,weshall focus onparametervaluesyielding positive
sh-ingeorts for both players : 0< r A
1 <1.
3 Departures from the equilibrium
Given playertwo's reactionfunction,playerone's choice of 1
maximizesits
objective function. This may be interpreted as follows: Suppose player two
has the following best replyfunction (which is aversion of (3)):
2 (t)= r A+ 1 (t) 1 = 2 ( 1 (t)) (6)
which can be a function of t to the extent that 1
(t) can in principle be a
function of time. Player one then has an optimal control problem to solve,
where 1
1
fromthis path,evenforashorttimeinterval,couldbeoptimalfortheleader,
if player two would continue to use the rule (6) at all time. But such
continuation is not necessarily optimal for player two. In fact, player two
would exercise constant eort onlyif player one doesso. Therefore, it does
make sense to ask the following question: if player one deviates from the
equilibrium strategy 1
for an interval of time, how will player two react?
One possible reason for such deviation is that the government of country
one may be responding to pressures from conservationist groups within the
country, by promisinga reduction in the catch rate over a specied number
ofyears. (Inadierentscenario,thegovernmentofcountryonemaypromise
to subsidize its shing industry over a xed period, implying an increase in
the catch rate.)
Wesuppose, inthe followinganalysis,thatplayerone decidestoincrease
its eort level to 1
+" over a specied time interval [0;]. (The case of a
decrease can be inferred from the analysis by a simple change of sign.) We
should note that the initial equilibrium strategies constitute a Markovian
equilibrium and therefore remain equilibrium strategies once the aggressive
move of player one ends. Our task willthen be todetermine the reactionof
player two over the interval of time [0;]. Let g(t) denote the modication
of playertwo'sshingeortover[0;] 11
.Beforeproceedingfurtherweshould
rstnotethatifplayeronecancommitpermanently( =1)tothisincrease
inits catch eort, thenplayertwo's best replyis easilyderived from(3)and
is g(t)=d 2 = 1 d 1 >0
Fora nite , player two's problem is to choose a function g that
maxi-mizes the discounted sum of instantaneous prots:
Max fg( t) g Z 0 (( 2 +g(t))x) (1 ( 1 +"))dt+e r V 2 (x()) (7) subjectto x= Ax ( 1 +"+ 2 +g(t))x (8) 11
Notethatwecouldjustaswellmodelizethemodicationofplayertwo'sshingeort
asafunctionoftheshstock(i.e. aMarkovianstrategy). Howeversinceplayertwoisthe
\onlydecisionmaker"during[0;](playeroneisassumedto follow 1
+over[0;])the
2
the initialequilibriumstrategies and where 2 isgiven by 2 = r A+ 1 1 (9)
Playertwo'sproblemisastandardoptimalcontrolproblemandisthussolved
by standard maximum principletechniques (see Leonard and Long (1992)).
LetH denotethe Hamiltonianassociated to (7):
H =(( 2 +g(t))x) (1 1 "))+(Ax ( 1 +"+ 2 +g(t))x)
where isthe shadowprice ofthe stock ofsh. Thesolutiontothe problem
above is characterized by the followingnecessary conditions:
x(( 2 +g(t))x) 1 (1 1 ") x=0 (10) =r ( 2 +g)(( 2 +g)x) 1 (1 1 " ) (A 1 " 2 g) (11)
and the transversality condition
()=V 0 2
(x()) (12)
By dierentiation of (10) with respect to time and substituting into (11) we obtain: g ( 2 +" 1 )g = 2 " 1 +g 2 (13)
where we have made use of (9). Furthermore,substituting from (10) into
(12) and using V 2 (x)=K 2 x ,we obtain (1 ( 1 +"))( 2 +g()) 1 =K 2 (14)
Player two'soptimal response, g(t);to playerone's shing eortincrease is
thus determined by the dierential equation(13) and (14). This dierential
equation (13) can be transformed into a Bernouilli equation after a change
of variable. Fordetails see Appendix B.
Proposition 2: The optimal modication of player two's shing eort,
g(t), to player one's increased shing eort is given by
g(t)= 1 1 ( 1 +") 1 1 1 1 1 2 1 2 " 1 e ( 2 " 1 )( t ) + 1 2 " 1 2 (15)
of time (g 0
(t)<0),and for"=0 we have g 0.
Proposition3: Playertwo'sreactiontoanincrease inplayerone's
sh-ing eort is ambiguous and is sensitive to the period of time during which
player one can commit to such anincrease. If the period of commitmentis
small, playertwo reduces its catching eort. Ifthe periodof commitmentis
large enough, player two's catching eort is initially increased for a certain
period of time though it is ultimately reduced. In the limiting case where
playerone commitspermanentlytoanincrease of itsharvestingeort,
play-er two unambiguously reacts by increasing its own harvesting eort. This
increase could beeven greater than playerone's deviation.
Proof: see Appendix C.
The intuition behind Proposition 3 is as follows. Player two is facing
a trade o: more harvesting increases the instantaneous net benet at the
expense of the stock of shavailablewhen the aggressivemoveof playerone
ends, atthe instant. The main incentivefor playertwo toreduceits catch
before is to enjoy a larger stock once competition becomes less aggressive
(after ). The larger is the less importantisthe presentvalueof the game
after for player two and the smaller are the incentives to conserve the
resource. For a suÆciently long period of commitment of player one (to
more aggressive catching eorts), player two responds by alsoincreasing its
catching eortfor a certaininterval of time.
Accordingto the above propositions, if the commitmentperiod is short,
the result is similar to the outcome of a static Stackelberg modelwith two
rms, a leader and a follower, both selling in the same market. In the case
of strategic substitutes in that static model the best reply of the follower
to any increase of output of the leader is to decrease of its own output.
However, despite this similarity, our modelis dierent. The two players do
not competeinthe samemarket 12
: they eitherconsume theircatches, orsell
them indistinctmarkets. Furthermore,wehaveshown inproposition1that
the twoeortlevelsarestrategiccomplements. Itwould beinterestingtosee
howtheintroductionofrivalryinacommonmarketwouldalterthebehavior
of the two players. Since there is no competition in the market place in the
case of separate markets, as opposed to the oligopolistic case, it could be
expected that the reaction of the follower to an aggressive move of its rival
12
The conclusions would hold if rmsoperate in the sameforeign market where they
thatthelongertheperiodofcommitmenttotheaggressivemove,the\shier"
shouldbetheresponseofplayertwo. AscanbeseenfromPropositions2and
3, such expectation turns out to be incorrect. The reaction of the follower,
in the case of a long period of commitment of the leader to an aggressive
move, turns out to be another aggressive move that could be even stronger
than the one that provokesit.
Asastraightforward implicationofpropositions2and3,wecanconclude
that if country one (the leader) is committed to a more conservationist
ex-ploitation ( i.e., " < 0) over a specied number of years, the follower will
follow suit only if the commitment period is long enough. Conservationist
groupsthat managetoget onlyvery shorttermcommitmentbycountry one
would nd theireorts counter-productive.
4 Concluding Remarks
Wehavedeveloped asimplemodelthat captures tosomeextentthe
sequen-tial aspect of transboundary shery. We have been able to show that in
a Markov perfect equilibrium, the rst mover has an incentive to be more
conservationist than a normal player ina symmetric situation. This picture
musthoweverbeseenwithguardedoptimism,forthereisadarkcloudinthe
horizon for the conservation-minded observer: the leader may deviate from
the equilibrium by making an aggressive move, and such a deviation may
triggerthe followertoreactby aneven moreaggressive move, ifplayerone's
periodofcommitmenttothe deviationis suÆciently long. Anothermessage
from our analysis is that pressures for a unilateral cutback in exploitation
Proof of Proposition 1:
Player two takes player one's constant strategy 1 = 1 as given, and chooses its 2
. The HJBequation for player two is
rV 2 (x)=max 2 [( 2 x) (1 1 )+V 0 2 (x)(A 1 2 )x] (16)
Let ustry the followingfunctional formfor V 2 (:) V 2 (x)=K 2 x (17) where K 2 =0 if 1 1 0 (18) For 1 1 >0, equation (16) becomes rK 2 x =max 2 [( 2 x) (1 1 )+K 2 (A 1 2 )x ] (19)
The maximizationof the right-hand side of (16) yields
2 = 1 1 K 2 1=(1 ) (20) Hence K 2 = 1 1 1 2 (21)
Substituting (21) into(19), we get
2 = r A+ 1 1 = 2 ( 1 )if 1 1 >0 (22) Then, from(18), 2 ( 1 )=0 if 1 1 0 (23)
We turn nowtoplayerone's problem. Weassumethat playerone knows
that playertwo uses the strategy (22)-(23). Playerone's HJB equation is
rV 1 (x)=max 1 [( 1 x) +V 0 1 (x)(A 2 ( 1 ) 1 )x] (24)
Letus trythe functionalformV 1 (x)=K 1 x . Notice that 2 (a 1 ) has akink.
We must solve (24) for two separate cases: case (i), where 1 1
0, and
case (ii), where1 1
>0.
Forcase (i), wenote that 2
=0, and hence we have the HJBequation
rK 1 x =max 1 [( 1 x) +K 1 (A 1 )x ] (25) subjectto 1 1 0: (26) Solving, we get K 1 = " r A 1 # 1 =K 1 and hence 1 = " r A 1 # = 1
which,in viewof the constraint (26), is asolution provided that
"
r A
1
#
>1 (27)
If (27) isnot satised, then (25), subject to (26), has nosolution.
Forcase (ii),we must solve
rK 1 x =max 1 [( 1 x) +K 1 (A r A+ 1 1 1 )x ] (28)
From this we get
1 1 =!K 1 (29) where != 1 1
Thus, substituting(29) into(28), we get
r! = 1 +! " A r 1 ! 1 # (30) 1 = (r A) (1 ) = 1 (31)
K 1 =(1 ) " r A 1 + 2 # 1 =K 1
This completes the proof of proposition1.
Appendix B
In this appendix we determine player two's optimal reply of Canada to
player one'scatch increase by solving the dierentialequation
g ( 2 +" 1 )g = 2 " 1 +g 2 (32)
with the transversality condition
(1 ( 1 +"))( 2 +g()) 1 =K 2 (33) Letus dene G(t)=g(t)+ 2
The dierentialequation (13) can be rewritten in termsof G(t)
G =G(G 2 +" 1 ) (34)
This dierentialequation is aBernoulli equation. LetY = 1 G , (34) gives Y = 2 " 1 ! Y 1
the solutionof whichis
Y (t)=Ce ( 2 " 1 ) t + 1 2 " 1
where C is constant of integration that can be determined by the
transver-sality condition (14).
After substitutionwe have
g(t)= 1 K2 (1 ( 1 +")) 1 1 1 2 " 1 e ( 2 " 1 )( t ) + 1 2 " 1 2
g(t)= 1 1 (1+") 1 1 1 1 1 2 1 2 " 1 e (2 " 1 )( t ) + 1 2 " 1 2 . Note that 1 1 (1 ( 1 +")) ! 1 1 1 2 > 1 2
Since, for ">0and for <1,
0< 1 2 " 1 < 1 2
wehavethatg(:)isacontinuousandadecreasingfunctionoftime. Moreover,
if " =0, g(:)is identically zero.
Appendix C
Wenowshowthatifplayerone increasesitseortlevelby"overthetime
interval [0;] then player two will increase its own eort, i.e. g(t) > 0 for
an interval of time. However player two will ultimately reduce its catching
eort, i.e. g()<0.
Westartbynotingthatg()<0. Indeed,from(15),itisstraightforward
that g()= 2 0 @ 1 1 (1 ( 1 +")) ! 1 1 1 1 A (35)
For ">0 and for <1,we have g()<0.
We can alsocalculateg(0) :
g(0)= 1 1 ( 1 +") 1 1 1 1 1 2 1 2 " 1 e ( 2 " 1 ) + 1 2 " 1 2 (36)
Now consider the implicationof the lengthof the commitment period .
Note that g(0)is acontinuous and increasing function of , and
lim !+1 g(0)= " 1 >0
given there exists t such that for all t 2 [0;t], g(t) > 0: In the
limiting case where =1 wehave
g(t)= "
1
>0for all t2[0;1)
We nowturn to the case of short commitmentperiod. It isclear that
lim !0 g(0)= 2 0 @ 1 1 (1 ( 1 +")) ! 1 1 1 1 A <0
and thus there a exists a positive value 0
such that for each given 0
we have g(0)<0. Since g()<0and g is a continuous function of time we
[1] Binmore, Ken, 1992, Fun and Games: A Text on Game Theory:, D.C.
Heath, Lexington, Mass.
[2] Bolle,F., 1986,On theOligopolisticExtractionofNon-renewable
Com-mon Pool Resources, Economica 53, 519-27.
[3] Chiarella,Carl,Murray C. Kemp,NgoVanLong, Koji Okuguchi,1984,
On the Economics of International Fisheries, International Economic
Review 25, 85-92.
[4] Clark, C. W.,1990,Mathematical Bioeconomics: TheOptimal
Manage-ment of Renewable Resources, second edition,Wiley,New York.
[5] Clemhout, Simone and Henry WanJr., 1985, Dynamic Common
Prop-erty Resources and Environmental Problems, Journal of Optimization
Theory and Applications 46, 471-81.
[6] Clemhout, Simoneand Henry WanJr., 1994, DierentialGames:
Eco-nomic Applications, in R. J. Aumann and S. Hart, eds., Handbook of
Game TheorywithEconomicApplications,Volume 2, Elsevier,
Amster-dam, 801-25.
[7] Crabbe,Philippe,andNgoVanLong,1993,EntryDeterrenceand
Over-exploitationofthe Fishery,Journal of EconomicDynamicsand Control
17, 679-704.
[8] Dasgupta, P. and G.Heal,1979, Economic Theory andExhaustible
Re-sources, Nisbet, Cambridge.
[9] Datta, M.and L.Mirman, 1994,Dynamic CapitalMarketInteractions,
Externalities, and Trade, CORE discussion paper9409.
[10] Dockner, Engelbert, Steen Jorgensen, Ngo Van Long, and Gerhard
Sorger, 2000, Dierential Games in Economics and Management
Sci-ences,CambridgeUniversity Press, Cambridge, UK
[11] Fischer, R. D., and L. Mirman, 1992, Strategic Dynamic Interactions:
the Salmon Wars, School of Marine Aairs, University of Washington,
Seattle, SMA 95-1.
[13] Kaitala, V. , and M. Pohjola, 1988, OptimalRecovery of a Shared
Re-source Stock: a Dierential Game with EÆcient Memory Equilibria,
Natural Resource Modelling 3,91-119.
[14] Leonard, Daniel,andNgoVanLong,1992, OptimalControlTheoryand
Static Optimizationin Economics, CambridgeUniversity Press
[15] Levhari, D., and L. Mirman, 1980, The Great Fish War: An Example
Using a Dynamic Nash-Cournot Solution, Bell Journal of Economics,
11, 322-34.
[16] Miller,K.A.,1996,SalmonStock Variabilityandthe Political Economy
of the Pacic SalmonTreaty. Contemporary Economic Policy,vol XIV,
112-29
[17] Missios, PaulC., and Charles G.Plourde,1996, The Canada-European
Union TurbotWar: A Brief Game-theoretic Analysis, Canadian Public
Policy 22(2)144-50.
[18] Missios, Paul C., and Charles G. Plourde, 1997, Transboundary
Re-newableResources ManagementandConservationMotives, Marine
Re-source Economics 12(1), 27-30.
[19] Munro, Gordon, and Stokes, Robert. L., 1987, The Canada-United
S-tates Pacic Salmon Treaty, Chapter 2 in D. McRae and G. Munro
(eds.), Canadian Oceans Policy: National Strategies and the New Law
of the Sea, UBC Press, pp17-35.
[20] Munro, Gordon, Ted McDorman, and Robert McKelvey, 1997, T
rans-boundary Fishery Resources and the Canada-United States Pacic
Salmon Treaty, Typescript, U.B.C.
[21] Plourde, Charles G., 1970, A Simple Model of Replenishable Natural
Resource Exploitation,American Economic Review 60(3), pp 518-22.
[22] Plourde, Charles G., 1971, Exploitation of Common Property
boundary Replenishable Fish Stock, Marine Resource Economics 6(1),
57-70.
[24] Sinn, Hans-Werner, 1984, Common Property Resources, Storage F
acil-ities, and Ownership Structure: A Cournot Model of the Oil Market,
Economica 51,235-52.
[25] Thomas, J., 1992, Cartel Stability in an Exhaustible Resource Model,
Economica 59 (235),279-93.
[26] Tian, Huilan, 1998, Optimal Management of Transboundary Fishery,
with SpecicReferencetothe PacicSalmon,M.Scethesis,Department