Leader and Follower: A Differential Game Model

(1)

Montréal

Série Scientifique

Scientific Series

2001s-08

Leader and Follower:

A Differential Game Model

(2)

CIRANO

Le CIRANO est un organisme sans but lucratif constitué en vertu de la Loi des compagnies du Québec. Le

financement de son infrastructure et de ses activités de recherche provient des cotisations de ses

organisations-membres, d’une subvention d’infrastructure du ministère de la Recherche, de la Science et de la Technologie, de

même que des subventions et mandats obtenus par ses équipes de recherche.

CIRANO is a private non-profit organization incorporated under the Québec Companies Act. Its infrastructure and

research activities are funded through fees paid by member organizations, an infrastructure grant from the

Ministère de la Recherche, de la Science et de la Technologie, and grants and research mandates obtained by its

research teams.

Les organisations-partenaires / The Partner Organizations

•École des Hautes Études Commerciales

•École Polytechnique

•Université Concordia

•Université de Montréal

•Université du Québec à Montréal

•Université Laval

•Université McGill

•MEQ

•MRST

•Alcan Aluminium Ltée

•AXA Canada

•Banque du Canada

•Banque Laurentienne du Canada

•Banque Nationale du Canada

•Banque Royale du Canada

•Bell Québec

•Bombardier

•Bourse de Montréal

•Développement des ressources humaines Canada (DRHC)

•Fédération des caisses populaires Desjardins de Montréal et de l’Ouest-du-Québec

•Hydro-Québec

•Imasco

•Industrie Canada

•Pratt & Whitney Canada Inc.

•Raymond Chabot Grant Thornton

•Ville de Montréal

© 2001 Hassan Benchekroun et Ngo Van Long. Tous droits réservés. All rights reserved.

Reproduction partielle permise avec citation du document source, incluant la notice ©.

Short sections may be quoted without explicit permission, if full credit, including © notice, is given to the source.

Ce document est publié dans l’intention de rendre accessibles les résultats préliminaires

de la recherche effectuée au CIRANO, afin de susciter des échanges et des suggestions.

Les idées et les opinions émises sont sous l’unique responsabilité des auteurs, et ne

représentent pas nécessairement les positions du CIRANO ou de ses partenaires.

This paper presents preliminary research carried out at CIRANO and aims at

encouraging discussion and comment. The observations and viewpoints expressed are the

sole responsibility of the authors. They do not necessarily represent positions of CIRANO

or its partners.

(3)

Leader and Follower: A Differential Game Model

*

Hassan Benchekroun

†

, Ngo Van Long

‡

Résumé / Abstract

On analyse un jeu différentiel entre deux joueurs, dont le premier a

l'avantage de prendre sa décision avant son adversaire. On compare le profil de

stratégies d'équilibre de ce modèle avec celui d'un modèle d'actions symétriques.

On démontre que l'existence d'un leader favorise la conservation dans

l'exploitation du stock commun. On analyse les déviations possibles à partir d'un

équilibre. On démontre que si le leader peux s'engager à une politique

d'exploitation plus modérée, alors le suiveur peux répondre plus ou moins

agressivement, selon la durée de la période d'engagement.

We consider a differential game between two players, where one player

has the first mover advantage. We compare the equilibrium strategy profile of this

model with the one generated by a conventional symmetric model. It is shown that

the existence of a first mover results in more conservationist exploitation in the

aggregate. We also consider the implication of departures from the equilibrium.

We show that if the leader (the first mover) can commit to decrease its effort over

a finite interval of time, then the follower (the second mover) may respond by

increasing, or decreasing, its effort, depending on the length of the commitment

period.

Mots Clés :

Leadership, jeux différentiels

Keywords:

Leadership, differential games

JEL: C72, Q22

*

Corresponding Author: Ngo Van Long, CIRANO, 2020 University Street, 25

th

floor, Montréal, Qc, Canada

H3A 2A5

Tel.: (514) 985-4020

Fax: (514) 985-4039

email: longn@cirano.qc.ca

We wish to thank two referees for very helpful comments.

(4)

Migratory sh that travel along the coast lineof several nations are subject

to sequential catching. This has been the cause of many disputes between

nations. Forexample, the Pacic salmon disputes between Canadaand the

United States of America are due largely to what is perceived by Canada

as Alaska'sunfair\interceptions". The Canadian salmon,born inCanadian

rivers, have the habit of travelling to the Pacic ocean, staying in the high

seasforafewyears(3to7years,dependingonthespecies)andthenreturning

toCanada forbreeding. On theirreturntrip,they travelalongthe coastline

of AlaskabeforereachingBritishColumbia. Alaska'sshershavethe chance

to catch these sh rst. This is called \interception" 1

. If they catch all the

Canadian salmonthat pass by, thenthere willbenomore sh inthe future.

So they dohave anincentive to conserve the resource. Canadian shers are

thesecondmover,becausetheyobservetheAlaskaninterceptionsbeforethey

decide how muchto catch 2

.

Several interesting theoretical questions arise: (i) what would be a

rea-sonabledenitionofequilibriumstrategiesforthistypeofgame? (ii)inwhat

sensethis isaleader-followergame? (iii)doestheleaderhaveanincentiveto

\set an example" by restraining its catch rate relative to a situation where

the two players are symmetrical? (iv) if an equilibriumis perturbed by one

player, howwould the otherplayerreact? Forexample,if one playerdecides

and commits to decrease its shing eort below its equilibrium value for a

specied time interval, would the other player react by also decreasing its

shing eort? These questions have not been raised in the existing

theoret-ical literature on common property resource management (Plourde (1970),

Sinn (1984), Clemhout and Wan(1985), Bolle (1986), Thomas (1992),

Mis-sios and Plourde (1996,1997)),nor in the literature on sh war, which deals

onlywithsymmetricsituations(LevhariandMirman(1980),Chiarellaetal.

(1984), Kaitala and Pohjola (1988),Plourde and Yeung (1989), Fischer and

Mirman (1992), Datta and Mirman (1994), Missios and Plourde (1996)) or

1

According to Huppert (1995)the interceptionsby Alaskaaccountedfor 20 percent

landedvalueofallsalmonspeciescaughtbyAlaskanshersovertheperiod1990-91. 2

ThePacicsalmondisputealsoinvolvesanotherplayer,namelytheshersfrom

Wash-ington/Oregon. Weneglectthis aspectfor simplicity. Fortechnicalandinstitutional

as-pects of the disputes, see Munroand Stokes(1987), Munro, McDorman and McKelvey

(1997),andHuppert(1995). FortheoreticalanalysesofthePacicsalmondisputes,which

(5)

This paper is an attempt to modelthis type of sequential shing game.

We formalize this shing game as a dierential game 3

, involving one state

variable,thesh stock, andtwocontrolvariables,theharvest rates, oreort

levels, of the players. Using the dierential game framework, we shall

in-vestigate the answer to the above questions. We consider a continuous time

modelwhereateach timet, playerone (sayAlaska)harvests fromthe stock

beforeplayertwo(say Canada)can harvest. Playertwoisthe secondmover:

it has the opportunity to observe the rate of harvest of player one, before

deciding on its own rate of harvest. Furthermore, we assume that at any t,

the harvest rate of playerone (therst mover) has anegativeimpact onthe

productivity of player two's shing eorts. We characterize the Stackelberg

equilibrium strategy prole, and compare it with the Naive Nash

equilibri-um. We show that the rst mover (i.e., the Stackelberg leader) in fact has

an incentive to be more restrained in its harvest. The combined harvest of

the two players in this leader-follower equilibrium is smaller than the total

harvest ina NaiveNash equilibrium.

Wealsoconsider adeviationfromthe equilibriumstrategies. Wesuppose

player one decreases its shing eort during a xed period of time.(This

may be due topressure formore conservation fromenvironmentalgroupsin

countryone.) Weshowthatplayertwowouldrespondbymodifyingitseort.

The modication of player two's shing eort is shown to be an increasing

functionoftime. Furthermore,weshowthat thebest responseby playertwo

to player one's decreased shing eort is ambiguous and is sensitive to the

length ofthe periodof timeduring which playerone isdecreasing itsshing

eort. Thisperiodoftime canbeinterpreted asthe commitmentperiodtoa

lowershing eort of player one. When the period of commitmentis short,

playertwoincreases itscatching eort. However, whentheleader'speriodof

commitment is long enough, player two responds by decreasing its catching

eort fora certain periodof time. This decrease could be even greater than

playerone's deviation. Ouranalysis thereforesuggests that ifenvironmental

groups in country one (the leader) want to achieve an overall moderation

in the exploitation rate, but manage only to get their own government to

commit to a short term decrease in shing, they may end up causing more

sh to be caught by the other country. Thus a good intention might well

3

See Clemhout and Wan (1994) for a concise treatment of dierentialgames. For a

(6)

the leaderwould servethe conservationists'sobjectives.

In section2, we set up the model, characterize the Stackelberg

equilibri-um, and compare it with the Naive Nash equilibrium. Section 3 deals with

deviation from the equilibrium strategies. Some concluding remarks are

of-fered insection 4.

2 The Model

We use a continuous time model 4

. Let x(t) be the stock of sh at time t.

Playerone's(Alaska's)catchrateattish 1 (t)= 1 (t)x(t),where 1 (t)maybe

interpretedas itsshing eort. Playerone's net benetsis R 1

(h 1

(t)), which

isindependentof playertwo'sshingeort 2

(t),becauseplayerone catches

rst, before player two. The net benets to player two, on the contrary,

depend onboth h 1

(t) and h 2

(t) aswellasthe stockof sh and are given by

R 2 (h 2 (t);h 1 (t);x(t) ), where h 2 (t)= 2 (t)x(t). We assume R 2 is decreasing in h 1

and increasing in x. This may be justied as follows. The greater the

catch rate of player one, the more diÆcult it is for player two, who must

sh after player onehas shed, toachieve agiven catchrate. This negative

impactofplayerone'scatchrateonplayertwo'sbenetsisstrongerthelower

the sh stock. WeassumethatR 2 (h 2 (t);h 1

(t);x(t))takesthemultiplicativelyseparable

form R 2 (h 2 (t);h 1 ;x(t))=R (h 2 (t)) 1 h 1 (t) x(t) ! . 4

In an earlierversionof the paper, adierence-equation game, in discrete time, was

used: weadopted amodication ofthe Levhari-Mirman(L-M,1980) model ofshwars

to allowfor thesequentialityof thegame under consideration. Using theL-M

function-al forms, with modication to capture sequential moves, we obtained apuzzling result:

Canada'sharveststrategyturned outtobeindependentofAlaska'sharvestingrate, i.e.,

Canada would have a dominant harvesting strategy. This puzzling result seems to be

attributableto theL-Mspecicfunctional formsfortheutilityfunction (logutility)and

the reproduction function (a power function, concave in the stock). Unfortunately, to

ourknowledge,thesearetheonlyfunctionalformsforwhichwecancomputeanalytically

equilibrium strategiesin adiscrete-time sequentialgame. We havetherefore decided to

trytocapturesomeaspectsof sequentialityusingacontinuoustimemodel: onegroupof

shers are disadvantaged by the\prior" harvesting by theother group: the externality

(7)

1

equation indicatesthat if playerone and playertwocatchthe same quantity

ofsh,thenetbenetstoplayertwowillbelower,becauseoftheexternalities

generated by playerone's catch. Thefact thatplayertwois directlyaected

byplayerone'scatchiscaptured by themultiplicativeterm 1 h 1 ( t) x(t) . The

largerthe extractionrateof playerone andthe smallertheavailablestock of

sh the more costly it is for player two to harvest and the lower will be its

net benet fora given harvesting level.

The rate of growth of the sh stock is

_ x(t)=G(x(t); 1 (t); 2 (t)): (1)

The objective of player one is to maximize J 1

, the discounted sum of its

instantaneous net benets

J 1 = Z 1 0 R 1 (h 1 (t))e rt dt

subject to (1), where r denotes the interest rate which we assume, for

sim-plicity,to be the same for both players.

Similarly,the objective of playertwo isto maximizeJ 2 : J 2 = Z 1 0 R 2 (h 2 (t);h 1 ;x(t))e rt dt subjectto (1).

WeseekaMarkovperfectNashequilibriumwhereeachplayerusesa

time-independent Markovian strategy: 1 = 1 (x) and 2 = 2 ( 1 (x);x). That

is, player one conditions its current eort level only on the current stock,

and player two conditions its eort level both on the current stock, and on

the observed eort level of its rival, whose catch rate is observed by player

two. Thisfeatureoftheplayertwo'scatchstrategiesallowsustocapture the

fact that the shing game is sequential: player one catches rst and player

twois informedof the catch ofplayerone beforetakingits action. Since the

action of player two depends on the action of player one, which it observes

beforetaking itsown action,and since playerone knows this, and takes this

intoaccount,the solutionwe areseekingisinfactwhateconomistsnormally

call a Stackelberg equilibrium. But, as pointed out by game theorists, and

(8)

reaction functions 5

.

The Hamilton-Jacobi-Bellman(HJB) equation for playerone is

rV 1 (x)=max 1 [R 1 ( 1 x)+V 0 1 (x)G(x; 1 ; 2 ( 1 (x);x))]

and the HJBequation for player two is

rV 2 (x)=max 2 [R 2 ( 2 x)(1 1 (x)x x )+V 0 2 (x)G(x; 1 (x); 2 )]:

To proceed further, it is convenient to specify explicit functional forms.

Assume the transitionequationis linear inx:

G(x(t); 1 (t); 2 (t))=Ax(t) 1 (t)x(t) 2 (t)x(t)

The assumed linearity allows us to derive analytical solutions. A possible

justication is that even if the true function is, say, quadratic 6

in x, when

the two players are facing a low stock level,a linear approximation may be

a reasonable price topay for tractability 7

. Linearity implies,in the absence

of human harvesting, an exponential rate of growth of the sh stock: the

resource growth is neitherlimitedby spacenor byfoodsupply (see

Dasgup-ta and Heal (1979), Clark (1990)). This assumption is not unreasonable,

because for most kinds of sh that are commerciallyexploited, it is human

harvesting, rather than natural conditions, that is the relevant limit to the

growth of the biomass. Their survival is endangered by mass exploitation

rather than by natural forces.

The net benet function of player 1 isassumed totake the simple form

R 1 ( 1 x)=( 1 x) ; 0< <1;

It is a concave function in the catch rate, 1

x. For example, if the harvest

is sold in an international market at a constant price, then net benet for

5

SeeDockneretal. (2000),forfurtherdiscussion inthecontext ofdierentialgames.

Fora similar point, made in the context of homogenous product duopoly, see Binmore

(1992). 6

Aquadraticfunctionisafrequentlymadeassumptionintherenewable-resource

liter-ature. 7

It ismoreoverarealisticassumptionin thecaseofstockswherefoodandspace

(9)

and total cost isconvex, then net benetis concave. Inthe case the harvest

is sold in the home country's market segregated from the world market,

the net benet to the home country is taken to be the sum of consumers'

surplus and producers' surplus; in this case, since the consumers' surplus is

normally concave and the cost function of catching sh is usually convex, it

makes sense to assume that the net benet is concave 8

. This justies the

assumption <1.

Forplayertwo,we specify

R 2 ( 2 x; 1 x;x)=( 2 x) (1 1 ); 0< <1: Notice thatif 1 1

0,thenplayertwowould set itseortlevelat 2

=0.

The equilibrium would thus be trivial in that case. Henceforth, we will

mainly focus on the cases where parameter values ensure that both players

willchoose positiveeortlevels.

Our rst task isto show that there exists a pair of Markov perfect Nash

equilibriumstrategies, asdescribed by the following proposition:

Proposition 1

Assume that r > A. There exists a pair of Markov perfect Nash

equi-librium strategies, where player one's equilibriumstrategy is

1 = r A 1 = 1 >0 (2)

and player two's equilibriumstrategy is

2 = r A+ 1 1 = 2 ( 1 ) (3) if 1 1 > 0, and 2 = 0 if 1 1

0. Given this pair of equilibrium

strategies, if 1 1

>0, the equilibriumeortlevelof player two is

2 = (r A) (1 ) 2 = 1 (1 ) > 1 (4)

Proof: See Appendix A.

Remarks:

8

Thedierence betweenapositiveconcavefunction andapositiveconvexfunction is

(10)

Playertwotakesthenumber 1

(tobeobserved) asindependentofitsaction,

and nds, from the set of functions 2

( 1;

x), a function that maximizes

its payo. This optimal choice is found to be the function described by

(3). Playerone, taken as given player two's strategy (3), chooses a function

1

(x) that maximizes its payo. This function is found to have a constant

value which is given by (2). This Nash equilibrium is sometimes called a

Stackelberg equilibrium, in the sense that player one harvests rst (even

though one can say that the strategies are chosen simultaneously).

(ii) Notice that, from (4), player two's harvesting eort is greater than

that of playerone if 1 1

>0.

(iii)Player two'sstrategy,whichmay be calleditsreactionfunction,

dis-plays the property of strategic complementarity: an increase in 1

implies

that 2

willincrease too 9

. Moreover, if >1=2, the increase in 2

is greater

than the increase in 1

.

(iv)Hadplayerone notknown that 2

isafunction, and hadittaken 2

as agiven number 10

, likein Cournot games,we would have ended up witha

pair of values, denoted by ( b 1 ; b 2 );where b i = r A 1 2 ; i=1;2 (5)

we assume r A > 0 and < 1=2. Callthis \equilibrium" the Naive

E-quilibrium. Comparing (5)with (2), wesee that the leader'scatch rate 1

is

smaller that the catch rate b i

of eachplayerat the Naive Equilibrium. This

makessense: the leader knows that if it catches more, the followerwill also

increase itseort. Therefore,the leaderhasanincentivetoexercise

restrain-t. Moreover, the overall catching eort under the Stackelberg equilibrium

( 2

+ 1

)isless thanthe catcheort undertheNaiveequilibrium( b 1 + b 2 ). 9

Thisstrategiccomplimentarityholdsalsoformoregeneralnetbenetfunctions,

pro-vided that theelasticity ofthemarginalnetbenetwiththerespectto thecatch, i.e.,

R 00 2 2x R 0 2

,issuchthatthefollowingconditionissatised:

(r A+ 1 ) 0 2 >1. This

con-dition canbederivedfromthe rstorderconditionoftheHJBequation associatedwith

player2'sproblemtogetherwiththeuseoftheenvelopetheoremappliedtothatequation.

In general,this conditionhasnodirect relationshipwith theconcavity(or otherwise) of

thenet benetsfunction. Inthespecial casewhere isaconstant,theaboveinequality

isequivalentto requiring0<<1. (Aproofisavailableuponrequest.) 10

Not thenumber 2

(11)

erisgiven therst moveradvantageisnot worseningthe preservationof the

resource.

(vi)Wehave found aNash equilibrium(witha leaderand a follower) on

the assumption that thefollowerassumesthat the leaderchooses aconstant

eort level 1

. The follower then solves for itsbest reply, which we call the

reaction function. Using this reactionfunction, the leader solves itsoptimal

controlproblem. It turns out thatthe solutionyieldsaconstant eortlevel,

thus justifying the follower's initialassumption.

(vii) The equilibrium found in Proposition 1 has the property that each

player's strategy is individually optimal and is unique, given the player's

strategy. This can be proven by writing down each player's problem, given

the specied strategy of the other player, as an optimal control problem,

and applyingthe standard suÆciency theorem. Note, however, the factthat

we havefound anequilibriumstrategy pair doesnot preclude the possibility

that there may exist non-linear strategy pairs that also constitute a Nash

equilibrium.

(viii) Note that the equilibrium determined in Proposition 1 is not

col-lectively optimal, that is, the sum of net benets of the two players is not

maximized. In additiontoa rst sourceof ineÆciency due tothe absence of

property rights over the resource, thereisa secondsourceof ineÆciency due

to thefact that player one ignoresitsdirect negative impactonplayertwo's

net benets.

Inwhatfollows,weshall focus onparametervaluesyielding positive

sh-ingeorts for both players : 0< r A

1 <1.

3 Departures from the equilibrium

Given playertwo's reactionfunction,playerone's choice of 1

maximizesits

objective function. This may be interpreted as follows: Suppose player two

has the following best replyfunction (which is aversion of (3)):

2 (t)= r A+ 1 (t) 1 = 2 ( 1 (t)) (6)

which can be a function of t to the extent that 1

(t) can in principle be a

function of time. Player one then has an optimal control problem to solve,

where 1

(12)

1

fromthis path,evenforashorttimeinterval,couldbeoptimalfortheleader,

if player two would continue to use the rule (6) at all time. But such

continuation is not necessarily optimal for player two. In fact, player two

would exercise constant eort onlyif player one doesso. Therefore, it does

make sense to ask the following question: if player one deviates from the

equilibrium strategy 1

for an interval of time, how will player two react?

One possible reason for such deviation is that the government of country

one may be responding to pressures from conservationist groups within the

country, by promisinga reduction in the catch rate over a specied number

ofyears. (Inadierentscenario,thegovernmentofcountryonemaypromise

to subsidize its shing industry over a xed period, implying an increase in

the catch rate.)

Wesuppose, inthe followinganalysis,thatplayerone decidestoincrease

its eort level to 1

+" over a specied time interval [0;]. (The case of a

decrease can be inferred from the analysis by a simple change of sign.) We

should note that the initial equilibrium strategies constitute a Markovian

equilibrium and therefore remain equilibrium strategies once the aggressive

move of player one ends. Our task willthen be todetermine the reactionof

player two over the interval of time [0;]. Let g(t) denote the modication

of playertwo'sshingeortover[0;] 11

.Beforeproceedingfurtherweshould

rstnotethatifplayeronecancommitpermanently( =1)tothisincrease

inits catch eort, thenplayertwo's best replyis easilyderived from(3)and

is g(t)=d 2 = 1 d 1 >0

Fora nite , player two's problem is to choose a function g that

maxi-mizes the discounted sum of instantaneous prots:

Max fg( t) g Z 0 (( 2 +g(t))x) (1 ( 1 +"))dt+e r V 2 (x()) (7) subjectto x= Ax ( 1 +"+ 2 +g(t))x (8) 11

Notethatwecouldjustaswellmodelizethemodicationofplayertwo'sshingeort

asafunctionoftheshstock(i.e. aMarkovianstrategy). Howeversinceplayertwoisthe

\onlydecisionmaker"during[0;](playeroneisassumedto follow 1

+over[0;])the

(13)

2

the initialequilibriumstrategies and where 2 isgiven by 2 = r A+ 1 1 (9)

Playertwo'sproblemisastandardoptimalcontrolproblemandisthussolved

by standard maximum principletechniques (see Leonard and Long (1992)).

LetH denotethe Hamiltonianassociated to (7):

H =(( 2 +g(t))x) (1 1 "))+(Ax ( 1 +"+ 2 +g(t))x)

where isthe shadowprice ofthe stock ofsh. Thesolutiontothe problem

above is characterized by the followingnecessary conditions:

x(( 2 +g(t))x) 1 (1 1 ") x=0 (10) =r ( 2 +g)(( 2 +g)x) 1 (1 1 " ) (A 1 " 2 g) (11)

and the transversality condition

()=V 0 2

(x()) (12)

By dierentiation of (10) with respect to time and substituting into (11) we obtain: g ( 2 +" 1 )g = 2 " 1 +g 2 (13)

where we have made use of (9). Furthermore,substituting from (10) into

(12) and using V 2 (x)=K 2 x ,we obtain (1 ( 1 +"))( 2 +g()) 1 =K 2 (14)

Player two'soptimal response, g(t);to playerone's shing eortincrease is

thus determined by the dierential equation(13) and (14). This dierential

equation (13) can be transformed into a Bernouilli equation after a change

of variable. Fordetails see Appendix B.

Proposition 2: The optimal modication of player two's shing eort,

g(t), to player one's increased shing eort is given by

g(t)= 1 1 ( 1 +") 1 1 1 1 1 2 1 2 " 1 e ( 2 " 1 )( t ) + 1 2 " 1 2 (15)

(14)

of time (g 0

(t)<0),and for"=0 we have g 0.

Proposition3: Playertwo'sreactiontoanincrease inplayerone's

sh-ing eort is ambiguous and is sensitive to the period of time during which

player one can commit to such anincrease. If the period of commitmentis

small, playertwo reduces its catching eort. Ifthe periodof commitmentis

large enough, player two's catching eort is initially increased for a certain

period of time though it is ultimately reduced. In the limiting case where

playerone commitspermanentlytoanincrease of itsharvestingeort,

play-er two unambiguously reacts by increasing its own harvesting eort. This

increase could beeven greater than playerone's deviation.

Proof: see Appendix C.

The intuition behind Proposition 3 is as follows. Player two is facing

a trade o: more harvesting increases the instantaneous net benet at the

expense of the stock of shavailablewhen the aggressivemoveof playerone

ends, atthe instant. The main incentivefor playertwo toreduceits catch

before is to enjoy a larger stock once competition becomes less aggressive

(after ). The larger is the less importantisthe presentvalueof the game

after for player two and the smaller are the incentives to conserve the

resource. For a suÆciently long period of commitment of player one (to

more aggressive catching eorts), player two responds by alsoincreasing its

catching eortfor a certaininterval of time.

Accordingto the above propositions, if the commitmentperiod is short,

the result is similar to the outcome of a static Stackelberg modelwith two

rms, a leader and a follower, both selling in the same market. In the case

of strategic substitutes in that static model the best reply of the follower

to any increase of output of the leader is to decrease of its own output.

However, despite this similarity, our modelis dierent. The two players do

not competeinthe samemarket 12

: they eitherconsume theircatches, orsell

them indistinctmarkets. Furthermore,wehaveshown inproposition1that

the twoeortlevelsarestrategiccomplements. Itwould beinterestingtosee

howtheintroductionofrivalryinacommonmarketwouldalterthebehavior

of the two players. Since there is no competition in the market place in the

case of separate markets, as opposed to the oligopolistic case, it could be

expected that the reaction of the follower to an aggressive move of its rival

12

The conclusions would hold if rmsoperate in the sameforeign market where they

(15)

thatthelongertheperiodofcommitmenttotheaggressivemove,the\shier"

shouldbetheresponseofplayertwo. AscanbeseenfromPropositions2and

3, such expectation turns out to be incorrect. The reaction of the follower,

in the case of a long period of commitment of the leader to an aggressive

move, turns out to be another aggressive move that could be even stronger

than the one that provokesit.

Asastraightforward implicationofpropositions2and3,wecanconclude

that if country one (the leader) is committed to a more conservationist

ex-ploitation ( i.e., " < 0) over a specied number of years, the follower will

follow suit only if the commitment period is long enough. Conservationist

groupsthat managetoget onlyvery shorttermcommitmentbycountry one

would nd theireorts counter-productive.

4 Concluding Remarks

Wehavedeveloped asimplemodelthat captures tosomeextentthe

sequen-tial aspect of transboundary shery. We have been able to show that in

a Markov perfect equilibrium, the rst mover has an incentive to be more

conservationist than a normal player ina symmetric situation. This picture

musthoweverbeseenwithguardedoptimism,forthereisadarkcloudinthe

horizon for the conservation-minded observer: the leader may deviate from

the equilibrium by making an aggressive move, and such a deviation may

triggerthe followertoreactby aneven moreaggressive move, ifplayerone's

periodofcommitmenttothe deviationis suÆciently long. Anothermessage

from our analysis is that pressures for a unilateral cutback in exploitation

(16)

Proof of Proposition 1:

Player two takes player one's constant strategy 1 = 1 as given, and chooses its 2

. The HJBequation for player two is

rV 2 (x)=max 2 [( 2 x) (1 1 )+V 0 2 (x)(A 1 2 )x] (16)

Let ustry the followingfunctional formfor V 2 (:) V 2 (x)=K 2 x (17) where K 2 =0 if 1 1 0 (18) For 1 1 >0, equation (16) becomes rK 2 x =max 2 [( 2 x) (1 1 )+K 2 (A 1 2 )x ] (19)

The maximizationof the right-hand side of (16) yields

2 = 1 1 K 2 1=(1 ) (20) Hence K 2 = 1 1 1 2 (21)

Substituting (21) into(19), we get

2 = r A+ 1 1 = 2 ( 1 )if 1 1 >0 (22) Then, from(18), 2 ( 1 )=0 if 1 1 0 (23)

We turn nowtoplayerone's problem. Weassumethat playerone knows

that playertwo uses the strategy (22)-(23). Playerone's HJB equation is

rV 1 (x)=max 1 [( 1 x) +V 0 1 (x)(A 2 ( 1 ) 1 )x] (24)

(17)

Letus trythe functionalformV 1 (x)=K 1 x . Notice that 2 (a 1 ) has akink.

We must solve (24) for two separate cases: case (i), where 1 1

0, and

case (ii), where1 1

>0.

Forcase (i), wenote that 2

=0, and hence we have the HJBequation

rK 1 x =max 1 [( 1 x) +K 1 (A 1 )x ] (25) subjectto 1 1 0: (26) Solving, we get K 1 = " r A 1 # 1 =K 1 and hence 1 = " r A 1 # = 1

which,in viewof the constraint (26), is asolution provided that

"

r A

1

#

>1 (27)

If (27) isnot satised, then (25), subject to (26), has nosolution.

Forcase (ii),we must solve

rK 1 x =max 1 [( 1 x) +K 1 (A r A+ 1 1 1 )x ] (28)

From this we get

1 1 =!K 1 (29) where != 1 1

Thus, substituting(29) into(28), we get

r! = 1 +! " A r 1 ! 1 # (30) 1 = (r A) (1 ) = 1 (31)

(18)

K 1 =(1 ) " r A 1 + 2 # 1 =K 1

This completes the proof of proposition1.

Appendix B

In this appendix we determine player two's optimal reply of Canada to

player one'scatch increase by solving the dierentialequation

g ( 2 +" 1 )g = 2 " 1 +g 2 (32)

with the transversality condition

(1 ( 1 +"))( 2 +g()) 1 =K 2 (33) Letus dene G(t)=g(t)+ 2

The dierentialequation (13) can be rewritten in termsof G(t)

G =G(G 2 +" 1 ) (34)

This dierentialequation is aBernoulli equation. LetY = 1 G , (34) gives Y = 2 " 1 ! Y 1

the solutionof whichis

Y (t)=Ce ( 2 " 1 ) t + 1 2 " 1

where C is constant of integration that can be determined by the

transver-sality condition (14).

After substitutionwe have

g(t)= 1 K2 (1 ( 1 +")) 1 1 1 2 " 1 e ( 2 " 1 )( t ) + 1 2 " 1 2

(19)

g(t)= 1 1 (1+") 1 1 1 1 1 2 1 2 " 1 e (2 " 1 )( t ) + 1 2 " 1 2 . Note that 1 1 (1 ( 1 +")) ! 1 1 1 2 > 1 2

Since, for ">0and for <1,

0< 1 2 " 1 < 1 2

wehavethatg(:)isacontinuousandadecreasingfunctionoftime. Moreover,

if " =0, g(:)is identically zero.

Appendix C

Wenowshowthatifplayerone increasesitseortlevelby"overthetime

interval [0;] then player two will increase its own eort, i.e. g(t) > 0 for

an interval of time. However player two will ultimately reduce its catching

eort, i.e. g()<0.

Westartbynotingthatg()<0. Indeed,from(15),itisstraightforward

that g()= 2 0 @ 1 1 (1 ( 1 +")) ! 1 1 1 1 A (35)

For ">0 and for <1,we have g()<0.

We can alsocalculateg(0) :

g(0)= 1 1 ( 1 +") 1 1 1 1 1 2 1 2 " 1 e ( 2 " 1 ) + 1 2 " 1 2 (36)

Now consider the implicationof the lengthof the commitment period .

Note that g(0)is acontinuous and increasing function of , and

lim !+1 g(0)= " 1 >0

(20)

given there exists t such that for all t 2 [0;t], g(t) > 0: In the

limiting case where =1 wehave

g(t)= "

1

>0for all t2[0;1)

We nowturn to the case of short commitmentperiod. It isclear that

lim !0 g(0)= 2 0 @ 1 1 (1 ( 1 +")) ! 1 1 1 1 A <0

and thus there a exists a positive value 0

such that for each given 0

we have g(0)<0. Since g()<0and g is a continuous function of time we

(21)

[1] Binmore, Ken, 1992, Fun and Games: A Text on Game Theory:, D.C.

Heath, Lexington, Mass.

[2] Bolle,F., 1986,On theOligopolisticExtractionofNon-renewable

Com-mon Pool Resources, Economica 53, 519-27.

[3] Chiarella,Carl,Murray C. Kemp,NgoVanLong, Koji Okuguchi,1984,

On the Economics of International Fisheries, International Economic

Review 25, 85-92.

[4] Clark, C. W.,1990,Mathematical Bioeconomics: TheOptimal

Manage-ment of Renewable Resources, second edition,Wiley,New York.

[5] Clemhout, Simone and Henry WanJr., 1985, Dynamic Common

Prop-erty Resources and Environmental Problems, Journal of Optimization

Theory and Applications 46, 471-81.

[6] Clemhout, Simoneand Henry WanJr., 1994, DierentialGames:

Eco-nomic Applications, in R. J. Aumann and S. Hart, eds., Handbook of

Game TheorywithEconomicApplications,Volume 2, Elsevier,

Amster-dam, 801-25.

[7] Crabbe,Philippe,andNgoVanLong,1993,EntryDeterrenceand

Over-exploitationofthe Fishery,Journal of EconomicDynamicsand Control

17, 679-704.

[8] Dasgupta, P. and G.Heal,1979, Economic Theory andExhaustible

Re-sources, Nisbet, Cambridge.

[9] Datta, M.and L.Mirman, 1994,Dynamic CapitalMarketInteractions,

Externalities, and Trade, CORE discussion paper9409.

[10] Dockner, Engelbert, Steen Jorgensen, Ngo Van Long, and Gerhard

Sorger, 2000, Dierential Games in Economics and Management

Sci-ences,CambridgeUniversity Press, Cambridge, UK

[11] Fischer, R. D., and L. Mirman, 1992, Strategic Dynamic Interactions:

(22)

the Salmon Wars, School of Marine Aairs, University of Washington,

Seattle, SMA 95-1.

[13] Kaitala, V. , and M. Pohjola, 1988, OptimalRecovery of a Shared

Re-source Stock: a Dierential Game with EÆcient Memory Equilibria,

Natural Resource Modelling 3,91-119.

[14] Leonard, Daniel,andNgoVanLong,1992, OptimalControlTheoryand

Static Optimizationin Economics, CambridgeUniversity Press

[15] Levhari, D., and L. Mirman, 1980, The Great Fish War: An Example

Using a Dynamic Nash-Cournot Solution, Bell Journal of Economics,

11, 322-34.

[16] Miller,K.A.,1996,SalmonStock Variabilityandthe Political Economy

of the Pacic SalmonTreaty. Contemporary Economic Policy,vol XIV,

112-29

[17] Missios, PaulC., and Charles G.Plourde,1996, The Canada-European

Union TurbotWar: A Brief Game-theoretic Analysis, Canadian Public

Policy 22(2)144-50.

[18] Missios, Paul C., and Charles G. Plourde, 1997, Transboundary

Re-newableResources ManagementandConservationMotives, Marine

Re-source Economics 12(1), 27-30.

[19] Munro, Gordon, and Stokes, Robert. L., 1987, The Canada-United

S-tates Pacic Salmon Treaty, Chapter 2 in D. McRae and G. Munro

(eds.), Canadian Oceans Policy: National Strategies and the New Law

of the Sea, UBC Press, pp17-35.

[20] Munro, Gordon, Ted McDorman, and Robert McKelvey, 1997, T

rans-boundary Fishery Resources and the Canada-United States Pacic

Salmon Treaty, Typescript, U.B.C.

[21] Plourde, Charles G., 1970, A Simple Model of Replenishable Natural

Resource Exploitation,American Economic Review 60(3), pp 518-22.

[22] Plourde, Charles G., 1971, Exploitation of Common Property

(23)

boundary Replenishable Fish Stock, Marine Resource Economics 6(1),

57-70.

[24] Sinn, Hans-Werner, 1984, Common Property Resources, Storage F

acil-ities, and Ownership Structure: A Cournot Model of the Oil Market,

Economica 51,235-52.

[25] Thomas, J., 1992, Cartel Stability in an Exhaustible Resource Model,

Economica 59 (235),279-93.

[26] Tian, Huilan, 1998, Optimal Management of Transboundary Fishery,

with SpecicReferencetothe PacicSalmon,M.Scethesis,Department

(24)