• Aucun résultat trouvé

Regulation of renewable resource exploitation

N/A
N/A
Protected

Academic year: 2021

Partager "Regulation of renewable resource exploitation"

Copied!
26
0
0

Texte intégral

(1)

HAL Id: hal-03013860

https://hal.sorbonne-universite.fr/hal-03013860

Submitted on 19 Nov 2020

HAL

is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire

HAL, est

destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Regulation of renewable resource exploitation

Idris Kharroubi, Thomas Lim, Thibaut Mastrolia

To cite this version:

Idris Kharroubi, Thomas Lim, Thibaut Mastrolia. Regulation of renewable resource exploitation.

SIAM Journal on Control and Optimization, Society for Industrial and Applied Mathematics, 2019,

58 (1), pp.551-579. �10.1137/19M1265740�. �hal-03013860�

(2)

Regulation of renewable resource exploitation

Idris Kharroubi

Thomas Lim

Thibaut Mastrolia

November 25, 2019

Abstract

We investigate the impact of a regulation policy imposed on an agent exploiting a pos- sibly renewable natural resource. We adopt a principal-agent model in which the Principal looks for a contract, i.e.taxes/compensations, leading the Agent to a certain level of ex- ploitation. For a given contract, we first describe the Agent’s optimal harvest using the BSDE theory. Under regularity and boundedness assumptions on the coefficients, we ex- press almost optimal contracts as solutions to HJB equations. We then extend the result to coefficients with less regularity and logistic dynamics for the natural resource. We end by numerical examples to illustrate the impact of the regulation in our model.

Key words: Contract Theory, BSDEs, HJB PDE, Logistic SDE.

1 Introduction

The exploitation of natural resources is fundamental for the survival and development of the growing human population. However, natural resources are limited since they are either non renewable (e.g. minerals, oil, gas and coal) so that the available quantity is limited, or renewable (e.g. food, water and forests) and in this case the natural resource is limited by its ability to renew itself. In particular, an excessive exploitation of such resources might lead to their extinc- tions and therefore affect the depending economies with, for instance, high increases of prices and higher uncertainty on the future. Thus, the natural resource manager faces a dilemma:

either harvesting intensively the resource to increase her incomes, or taking into account the potential externalities induced by an overexploitation of the resource and impacting her future ability to harvest the resource. It has been nevertheless emphasized in [7] that in some cases it is optimal for natural resource manager to harvest until the extinction of the resource. This optimal harvesting strategy thus leads to costs for the global welfare related to the environment degradation.

Therefore, the management and the monitoring of the exploitation of natural resources are a balance between optimal harvest for the natural resource manager and ecological implications for public organizations. This second issue has attracted a lot of interest, especially from governance institutions. For example in its last annual report on sustainable development, the statistical office of the European Union Eurostat dedicates a full section to the question of sustainable consumption and production (see [11], Section 12).

The management of natural resources have also attracted a lot of interest from the academic community. Many studies on natural resources exploitation tried to describe the possible effect of economic incentives on the exploitation (see e.g. [5, 14, 27, 15]). These references stress the need of an incentive policy to ensure the sustainability of the resource. However, even if

Sorbonne Universit´e, LPSM,idris.kharroubi @upmc.fr.

Laboratoire d’Analyse et Probabilit´es, Universit´e d’Evry-Val d’Essonne and ENSIIE,lim@ensiie.fr.

CMAP, Ecole Polytechnique, IP Paris,thibaut.mastrolia@polytechnique.edu. This author acknowledges the financial supports of the Chaire Financial Risks hosted by the Louis Bachelier Institute and the ANR project PACMAN ANR-16-CE05-0027.

(3)

the regulator have access to the abundance level of natural resource, the unobservability of the natural resource manager behavior induces moral hazards. Thus, the regulator’s issue is to incentivize the resource manager to optimally reduce the cost of the resource degradation, together with ensuring a minimal incomes for the manager, under moral hazard. To the best of our knowledge, this question has been addressed only in the discrete-time framework (see for instance [13]) without considering any randomness in the dynamics of the resource. The aim of this work is to investigate this problem in continuous time with randomness in the system.

To deal with this issue, we consider a principal-agent model under moral hazard. The first elements of contract theory with moral hazard appeared in the 60’s with the articles [3, 30] in which the mechanisms of controlled management were investigated. Then, it has been extended and named asagency problem (see among others [29, 24]) by considering discrete-time models.

Concerning the continuous-time framework, the agency problem with moral hazard has been first studied in [17] by modelling the uncertainty of risky incomes with a Brownian motion.

The agency problem can be roughly described as follows. We associate a moral hazard problem with a Stackelberg game in which the leader (named the Principal) proposes at time 0 a compensation to the follower (named the Agent) given at a maturity T > 0 fixed by the contract, to manage the wealth of the leader. Moreover, the Principal has to propose a compensation high enough (called the reservation utility) to ensure a certain level of utility for the Agent. Although the Principal cannot directly observe the action of the Agent, the former can anticipate the best reaction effort of the latter with respect to a fixed compensation. Hence the agency problem remains to design an optimal compensation proposed by the Principal to the Agent given all the constraints mentioned above under moral hazard.

The common approach to solve this problem consists in proceeding in two steps. The first step is to compute the optimal reaction of the Agent given a fixed compensation proposed by the Principal, i.e. solving the utility maximization problem of the Agent. In all the papers mentioned above, the shape of considered contracts is fundamental to solve the Agent problem by assuming that the compensation is composed by

– a constant part depending on the reservation utility of the Agent, – a part indexed by the (risky) incomes of the Principal,

– the certain equivalent gain of utility appearing in the Agent maximization.

Using the theory of Backward Stochastic Differential Equations (BSDE for short), [9] proved that this class ofsmooth contracts, having a relevant economic interpretation, is not restrictive to solve the agency problem. The second step consists in solving the Principal problem. Taking into account this optimal reaction of the Agent, the goal is to compute the optimal compensa- tion. As emphasized in [26] and then in [8, 9], this problem remains to a (classical) stochastic control problem with the wealth of the Principal and the continuation utility value of the Agent as state variables.

In this paper, we identify the natural resource manager as the Agent. The Principal refers to a regulator, which can be a public institution that monitors the resource manager’s activities.

The resource manager can either harvest or renew the natural resource. In the first case the production is sold at a given price on the market and in the second case the resource manager pays for each unit of renewed natural resource. To regulate the natural resource exploitation, the Principal imposes a tax/compensation to the Agent depending on the remaining level of resource at the terminal time horizon. We suppose here that the Agent is risk-averse and we model his preference with an exponential utility function1. For a given harvesting strategy, the Agent total gain is composed by the cumulated amounts paid/earned by renewing/harvesting the natural resource and the regulation compensation/tax. The Agent’s aim is then to maximize the expected utility of his total gain over possible harvesting strategies.

On the other side, given the previous optimal harvest of the Agent, the regulator aims at fixing a tax/compensation policy that incentives the Agent to let a reasonable remaining level

1See for instance [4] for more details on this kind of utility function and the economical interpretations of it.

(4)

of natural resource. As a public institution, we assume that the regulator is risk-neutral.

The main features to model the dynamic of a renewable natural resource are its birth and death rates and the inter-species competition. Besides, due to random evolution of the popu- lation, we consider uncertainty in the available abundance. Following [12, 2, 22] we choose to model the evolution of the natural resource by a stochastic logistic diffusion.

We then focus on the Principal-agent problem. We first characterize the Agent behavior for a fixed regulation policy represented by a random variableξ. Following the BSDEs approaches to deal with exponential utility maximization, we get a unique optimal harvesting strategy as a function of the Z component of the solution to a quadratic BSDE with terminal condition ξ (see [25, 18]).

We next turn to the regulator problem which consists of maximizing an expected terminal reward depending on the regulation taxξand the level of remaining natural resource according to the Agent’s optimal response. By writing the explicit form of the resource manager’s optimal strategy, we turn the regulator problem into a Markov stochastic control problem of a diffusion with controlled drift. We then look for a regular solution to the related PDE to proceed by verification. However, in our case we face the following three issues.

– By considering the logistic dynamics for the resource abundance population, the HJB PDE related to the Principal problem involves a term of the form x2xv wherexstands for the resource population abundance andv is the Principals value function. This term, induced by the inter-species competition in the classical logistic case, prevents us from using existence results of regular solutions to PDEs.

– The shape of the optimal harvest of the manager leads to irregular coefficients for the related PDE, which also prevents from getting regular solutions.

– Due to the exponential preferences of the Agent, the Principal’s admissible strategies need to satisfy an exponential integrability condition. However, the linear preferences of the Principal leads to an optimal contract that is not necessarily exponential integrable.

Therefore, the regulator problem might not have an optimal regulation policy.

To deal with these issues, we first study a model for which the inter-species competition coeffi- cientµof the population is bounded. Hence, the termx2xvis replaced byxµ(x)∂xv. We then construct a regular approximation of the Hamiltonian. By considering the related PDE, we de- rive a regular solution (see Proposition 4.1) together with an almost optimal control satisfying the admissibility condition (see Theorem 4.2). We notice that our approach can be related to that of Fleming and Soner [28], which consists of an approximation of the value function by a sequence of smooth value functions to derive a dynamic programming principle. We next turn to the logistic casei.e. µ(x) =xfor which we show that the almost optimal strategy obtained for a truncation of µ remains an almost optimal strategy for a large value of the truncation parameter (see Theorem 4.3).

We finally illustrate our results by numerical experiments. We compute the almost optimal strategies using approximations of solutions to HJB PDEs and show that the regulation has a significant effect on the level of remaining natural resource.

The remainder of the paper is the following. In Section 2 we describe the considered mathe- matical problem. We then solve in Section 3 the manager’s problem for a given regulation policy.

In Section 4, we first provide almost optimal strategies in the case where the coefficient µ is bounded and we extend our result to the logisitic dynamics. We end Section 4 by economical insights and numerical experiments.

(5)

Notations and spaces

We give in this part all the notations used in this paper. Let (Ω,F,P) be a complete probability space. We assume that this space is equipped with a standard Brownian motion W and we denote byF:= (Ft)t≥0its right-continuous and complete natural filtration.

Letp≥1 and a time horizonT >0, we introduce the following spaces

– P(R) (resp. Pr(R)) will denote the σ-algebra of R-progressively mesurable, F-adapted (resp. F-predictable) integrable processes.

– STp is the set of processesX,P(R)-mesurable and continuous satisfying E[ sup

0≤t≤T

|Xt|p]<+∞. – HpT is the set of processesX,Pr(R)-mesurable satisfying

E h Z T

0

|Xt|2dtp2i

<+∞.

– For an integerq≥0, a subsetD ofRq and for anyν ∈(0,1), we denote byC1+ν(D) the set of continuously differentiable functions f : D→Rsuch that

|f|1= sup

x∈D

|f(x)|+ X

1≤i≤q

|∂xif(x)|+ sup

x,y∈D

X

1≤i≤q

|∂xif(x)−∂xif(y)|

|x−y|ν

<∞,

and by C2+ν(D) the set of twice continuously differentiable functions f : D →R such that

|f|2+ν = sup

x∈D

|f(x)|+ X

1≤i≤q

|∂xif(x)|+ X

1≤i,j≤q

|∂xi,xjf(x)|

+ sup

x,y∈D

X

1≤i,j≤q

|∂xi,xjf(x)−∂xi,xjf(y)|

|x−y|ν < ∞.

2 The model

2.1 The natural resource

We fix a deterministic time horizonT >0 and we suppose that the natural resource abundance Xtµ at timet≥0 is given by

Xtµ=X0+ Z t

0

Xsµ(λ−µ(Xsµ))ds+ Z t

0

σXsµdWs, t∈[0, T], (2.1) where X0, λ andσ are positive constants. The quantities X0 and λcorrespond to the initial natural resource abundance and the growth rate respectively. The map µrepresents the com- petition inside the species considered or more generally an auto-degradation parameter for a natural resource. We assume that the mapµsatisfies the following assumption

(H0)µis a map fromR+ toR+ such that (2.1) admits a unique strong solution inST2. Note that Assumption(H0)holds for instance if the mapx7−→xµ(x) is Lipschitz continuous.

Another important example is the so-called logistic equation where µ(x) = xon R+, see for example in [12]. In this last case, SDE (2.1) admits an explicit unique solution that will be denoted in the sequel byX and given by

Xt= X0e(λ−σ22)t+σWt 1 +X0Rt

0e(λ−σ22)s+σWsds

, t∈[0, T].

(6)

The ecological interpretation of this model is the following. At timet, if the coefficientµ(Xtµ) is larger thanλthen the drift of the diffusion is negative. Therefore the abundance of the natural resource Xtµ decreases in mean. Conversely, if µ(Xtµ) is smaller thanλ then the drift of the diffusion is positive. Hence, the abundance Xtµ increases in mean. For more details see for instance [22, Proposition 3.4].

More general models can be used in practice and one of the main challenges, see [22], is to rely branching processes with birth and death intensities to the solutions of continuous SDEs.

2.2 The Agent’s problem

We consider an agent who tries to make profit from the natural resource. We suppose that this agent owns facilities to either harvest or renew this resource. We assume that his action happends continuously in time and we denote byαthis intervention rate at timet,i.e. the abun- danceXtµwill decrease of an amountαtXtµper unit of time. This means that if the intervention rateαtis positive (resp. negative), the Agent harvests (resp. renews) the natural resource. We denote byA the set ofF-adapted processes defined on [0, T] and valued in [−M , M] whereM and M are two nonnegative constants. If the Agent is prohibited to renew the resource then M = 0. This setAis called the set of admissible actions.

To take into account the control α of the Agent on the natural resource abundance, we introduce the probability measurePα defined by its densityHαw.r.t. Pgiven by

dPα dP

F

T

:=HTα, where the processHαis defined by

Htα:= exp

− Z t

0

αs

σ dWs−1 2

Z s

0

αs

σ

2

ds

, t∈[0, T].

In the sequel, we denote byEα andEαt the expectation and conditional expectation given Ft respectively, for anyt∈[0, T], under the probability measurePα.

Forα∈ A, we get from Girsanov Theorem (see e.g. Theorem 5.1 in [19]) that the process Wα defined by

Wtα:=Wt+ Z t

0

αs

σds , t∈[0, T],

is a Brownian motion under the probabilityPα. Thus, for a given admissible effortα∈ A, the dynamics ofX can be rewritten under the probabilityPαas

Xtµ=x+ Z t

0

Xsµ(λ−µ(Xsµ))−αsXsµ ds+

Z t

0

σXsµdWsα, t∈[0, T].

This new dynamics reflects the evolution of the population with a rate αt per unit of time.

Hence,αtXtµ has to be seen as the speed of the exploitation of the natural resource at timet.

Remark 2.1. We choose a harvesting/renewing component of the form αXµ. This allows to take into account not only the impact of the effort of the agent on the system but also the abundance of the resource itself. We note that this form of strategy is a particular case of the form presented in [16, (2.4)]. From a technical point of view, this particular form is due to the presence of a quadratic cost (see (2.2)) and allows to derive optimal efforts under a closed form. Possible extensions which are let for further research are to consider more general forms including finite variations and jumps as in the dynamic proposed in [16].

We then are given a price function p: R+ →R+ and we suppose that the price per unit of the natural resource on the market is given byp(Xtµ) at timet≥0. We make the following assumption on the price functionp.

(7)

(Hp)There exists a constantP such thatp(x)x≤P for allx∈[0,+∞).

This price functionpallows to take into account the dependence w.r.t. the abundance (the more abundant the resource is, the cheaper it will be and conversely). Such a price dependence has already been used to model liquidity effects on financial market, where empirical studies showed that the impact is of the formp(x) =P e−β1xβ2,x∈R+, for some positive constantsP, β1andβ2(see e.g. [1, 21]). In particular,(Hp)is satisfied for this type of dependence. Another basic example for which(Hp)holds is the casep(x) =P x−1,x >0. This last example reflects the inability to buy the natural resource once it is extinct.

We assume that the manager sells the harvested resource on the market at pricep(Xtµ) per unit at timet ifαtis positive, and pays the pricep(Xtµ) per unit of natural resource at timet ifαt is negative to renew this one. This provides the global amountRT

0 p(Xtµ)Xtµαtdtover the time horizon [0, T].

We also suppose that giving an effort is costly for the manager and we consider the classical quadratic cost functionk:R→R+ given byk(α) = |α|22, α∈R. Thus, the Agent is penalized by the instantaneous amountk(αt) per unit of time for a given effort α∈ A. This leads to the global paymentRT

0 k(αt)dtover the considered time horizon [0, T].

In our investigation, we recall that the activity of the natural resource manager is regulated by an institution (usually an environment administration) who is taking care about the size of the remaining natural resource. To avoid an over-exploitation, the regulator imposes a tax on the Agent depending on the remaining resource. This tax amount is represented by an FT-measurable random variable ξand is paid at timeT. Note thatξ can be either positive or negative. In this last case, it means that the regulator gives a compensation to the manager.

Throughout the paper we assume that the Agent’s preferences are given by the exponential utility functionuA defined by

uA(x) :=−exp −γx

, x∈R,

where γ is a positive constant corresponding to the risk aversion of the Agent. We define the value functionVA(ξ) of the Agent associated to the taxation policyξby

VA(ξ) := sup

α∈AEα

h−exp

−γ Z T

0

p(Xsµ)Xsµαsds− Z T

0

s|2

2 ds−ξi

. (2.2)

For a fixed taxξ, we denote byA(ξ) the set of effortsα∈ Asatisfying the following equality Eα

h

−exp

−γ Z T

0

p(Xsµ)Xsµαsds− Z T

0

s|2

2 ds−ξi

=VA(ξ). An effortα∈ A(ξ) is said to be optimal for the fixed taxξ.

2.3 The Principal’s problem

The aim of the regulator is to stabilize the resource population at a fixed target size at the maturityT. For that, a taxξis chosen to incentivize the Agent to manage the natural resource so that the remaining population is close to the targeted size. Hence, the regulator benefits from the tax paid by the Agent and is penalized through a cost functionf depending on the size of the resource at maturityT. The expected reward under the action α∈ A of the Agent is then given by

Eα

ξ−f(XTµ) .

Typically, we have in mind f(x) =c(β−x)+ meaning that the regulator targets a population size β > 0 at timeT for the sustainability of the resource and pays the costc per unit if the

(8)

natural resource is over-consumed. This functionf can be seen as the amount that the regulator must pay to reintroduce the missing resource.

We suppose that the resource manager is rational. Therefore, the Principal anticipates that for a taxξ, the Agent will choose an effortαin the setA(ξ). Note that this set is not necessarily reduced to a singleton2, hence, as usual in moral hazard problems (see for instance [17] for the formulation of the moral hazard problem), the regulator solves

sup

ξ

VP(ξ), withVP(ξ) = sup

α∈A(ξ)

Eα h

ξ−f(XTµ)i

, (2.3)

whereξ lives in a set of suitable contracts defined in the following section.

2.4 Class of contracts and utility reservation

We now introduce a reserve utility R which is a negative constant. This reserve means that the regulator cannot penalize too strongly the Agent for economical reasons so that the utility VA(ξ) expected by the Agent has to be greater thanR. For instance, we can chooseRsuch that the regulator monitors the Agent by promising the same expected utility as the case without regulation (see Section 4.3.1 for more details). This example reflects a non-punitive taxation policy in which the regulator purely monitors the activities of the Agent. In our model, the sign of the tax ξis on purpose. This means that the natural resource manager pays the fee to the regulator whenξis positive and conversely, the regulator compensates the Agent’s activity when ξ is negative. Moreover, we need to impose an exponential integrability on the tax ξ to ensure the well-posdness of VA(ξ). We therefore introduce the class CRµ of admissible taxes defined as the set ofFT-measurable random variablesξsuch that

VA(ξ)≥R , (2.4)

and there exists a constantγ0>2γsuch that E

exp(γ0|ξ|)

<+∞. (2.5)

This last condition is very convenient since it allows to deal with the problem by using the theory of BSDEs. Moreover, a straightforward application of Cauchy-Schwarz inequality ensures that the optimization problemsVA(ξ) andVP(ξ) take finite values.

3 Optimal effort of the natural resource’s manager

We first solve the optimal problem of the Agent (2.2) under taxation policyξ∈ CµR. As in [9], the following result shows that solving the Agent problem gives both an optimal effortα and a particular representation of the taxξwith respect to the solution of a BSDE.

Theorem 3.1. Let ξ ∈ CµR and Assumption (Hp) be satisfied. There exists a unique pair (Y0, Z)∈(−∞,R]˜ ×H2T withR˜ :=log(−R)γ such that

(i) the tax has the following decomposition ξ=Y0

Z T

0

g(Xtµ, Zt) +σ2 2 γ|Zt|2

dt+ Z T

0

σZtdWt, (3.6) whereg is defined for any(x, z)∈R+×Rby

g(x, z) = |a(x, z)|2

2 −p(x)xa(x, z)−a(x, z)z , and

a(x, z) = (p(x)x+z)∨(−M)

∧M , (3.7)

2In our investigation, we will show that the setA(ξ) is reduced to a single element.

(9)

(ii) the value of the Agent is given by

VA(ξ) =−exp(γY0),

(iii) the process α(ξ) defined by αt(ξ) = a(Xtµ, Zt) is the unique optimal effort associated with the taxξ given by (3.6).

Proof. The proof is divided in three steps and is related to the BSDE associated with the value function of the Agent. We first introduce a dynamic extension of the optimization problem (2.2). We denote byJ(t, ξ) the dynamic value function of the Agent at timetfor a taxξwhich is defined by

J(t, ξ) := ess inf

α∈A Eαt

h exp

−γ Z T

t

p(Xsµ)Xsµαsds− Z T

t

k(αs)ds−ξi . Note thatVA(ξ) =−J(0, ξ).

Step 1. Dynamic utility of the Agent and BSDE.We characterizeJ(·, ξ) as the unique solution of a BSDE and we derive the optimal control by using comparison results.

Letα∈ A, we introduce the processJα(ξ) defined by Jtα(ξ) :=EP

α

t

h exp

−γ Z T

t

p(Xsµ)Xsµαsds− Z T

t

k(αs)ds−ξi , so that

J(t, ξ) := ess inf

α∈At

Jtα(ξ). (3.8)

Step 1a. Martingale representation and integrability.

We know that the processHtα(exp(γRt

0 k(αs)−p(Xsµ)Xsµαs

ds)Jtα(ξ))0≤t≤T is a (P,F)-martingale.

In view of the condition (2.5), there exists ε >0 and q >1 such that (2 +ε)qγ ≤γ0. Hence, forp >1 such that 1p+1q = 1, sinceαis bounded and Condition (2.5) is satisfied, we get from H¨older’s inequality

E[|HαTJTα(ξ)|2+ε] ≤ E[|HαT|(2+ε)p]1pE[|e(2+ε)qγξ|]1q < +∞, whereHαt :=Htαexp

γRt

0 k(αs)−p(Xsµ)Xsµαs

ds

. Hence, by using Doob’s maximal inequal- ity,HαJα(ξ)∈ S2+ε. So by using the martingale representation theorem, we know there exists a processZα∈H2+εT such that

HαtJtα(ξ) = J0α+ Z t

0

σZαsdWs, t∈[0, T]. Therefore,Jαsatisfies

dJtα(ξ) = (αttα−γ(k(αt)−p(Xtµ)Xtµαt)Jtα(ξ))dt+σZ˜tαdWt, where ˜Ztα= Z

α t

Hαt +Jtαασ2t for anyt∈[0, T], andJTα(ξ) = exp(γξ).

We now prove that ˜Zα ∈ H2T. From (2.5), the boundedness of α and Assumption (Hp), there exists a positive constantC >0 such that

E hZ T

0

|Zαt

Hαt +Jtααt

σ2|2dti

≤ 2 E

hZ T

0

|Zαt Hαt|2dti

+E hZ T

0

|Jtααt

σ2|2dti

≤ C 1 +E

hZ T

0

|Zαt Hαt|2dti

.

(10)

We set ˜q:= 1 +ε2 and ˜p >1 such that 1p˜+1q˜= 1. Using H¨older and BDG Inequalities and since Zα∈H2+εT , we get

E[ Z T

0

|Zαt

Hαt |2dt] ≤ E[ sup

t∈[0,T]

|(Hαt)−1|2 Z T

0

|Zαt|2dt]

≤ E[ sup

t∈[0,T]

|(Hαt)−1|2 ˜p]p1˜E[ Z T

0

|Zαt|2dtq˜ ]1q˜

< +∞. Consequently, we get ˜Zα∈H2T.

Step 1b. Comparison of BSDEs and optimal effort. We now turn to the characterization of the solution to (3.8) by a BSDE. We introduce the following BSDE

dJt(ξ) = − inf

a∈[−M ,M]

G(Xtµ, Jt(ξ),Z˜t, a)dt+σZ˜tdWt, JT(ξ) = exp(γξ), (3.9) where

G(x, j,z, a)˜ := γ(k(a)−p(x)xa)j−a˜z .

This BSDE has a Lipschitz generator and square integrable terminal condition from (2.5).

Therefore it admits a unique solution in S2×H2T. Moreover, for any α∈ A, we notice that (Jα(ξ),Z˜α) satisfies the following BSDE

dJtα(ξ) = −G(Xtµ, Jtα(ξ),Z˜tα, αt)dt+σZ˜tαdWt, JTα(ξ) = exp(γξ). By classical comparison Theorem (see for instance [10, Theorem 2.2]), we have

Jt(ξ) ≤ J(t, ξ), ∀t∈[0, T]. Then, we notice that BSDE (3.9) can be rewritten

dJt(ξ) = −G

Xtµ, Jt(ξ),Z˜t, a Xtµ, Z˜t γJt(ξ)

dt+σZ˜tdWt, JT(ξ) = exp(γξ).

In particular, we have J(ξ) = Ja Xµ,

Z˜ γJ(ξ)

(ξ) by uniqueness of the solution to BSDE (3.9).

Therefore, we get

Jt(ξ) = J(t, ξ) and a Xµ, Z˜ γJ(ξ)

∈ A(ξ). (3.10)

We now prove that this optimal effort is unique. Let ˜α∈ Abe another optimal effort, then we have

J0α˜ = Ja

Xµ,γJ(ξ)Z˜

0 .

From strict comparison Theorem (see again [10, Theorem 2.2]) we getJα˜=Ja Xµ,

Z˜ γJ(ξ)

and G

Xµ, J(ξ),Z, a˜ Xµ, Z˜ γJ(ξ)

= G

Xµ, J(ξ),Z,˜ α˜

, dt⊗dP−a.e.

By the uniqueness of the minimizer ofG(Xtµ, Jt(ξ),Z˜t,·),we deduce that ˜α=a Xµ,γJ(ξ)Z˜ . Step 2. Representation ofξ.

(11)

Since, by definition, the processJa Xµ,

Z˜ γJ(ξ)

is positive, we can define the processesY andZ by

Y :=

log

Ja Xµ,

Z˜ γJ(ξ)

γ and Z :=

Z˜ γJa Xµ,

Z˜ γJ(ξ)

. (3.11)

We obtain

dYt=− k(a(Xµ, Zt))−p(Xtµ)Xtµa(Xµ, Zt)−a(Xµ, Zt)Zt2 2 γ|Zt|2

dt+σZtdWt, YT =ξ .

We first proveY ∈ S2. Note for anyt∈[0, T], by using Jensen inequality, we have 1

γlog(Jt(ξ)) ≥ Eα

t

hZ T

t

k(αs)−p(Xsµ)Xsµαs ds+ξi

≥ Eα

t [ξ]−T P M , whereα stands fora Xµ,γJ(ξ)Z˜

andM =M ∨M. We then notice – ifJt(ξ)≥1 we have

0 ≤ log(Jt(ξ)) ≤ Jt(ξ), – if 0≤Jt(ξ)<1 we have

1

γlog(Jt(ξ))

≤ T P M+Eα

t [|ξ|]. Hence, there exists a constantC >0 such that

1

γlog(Jt(ξ))

2

≤ C(1 +Eα

t [|ξ|]2) + 1

γ2|Jt(ξ)|2, t∈[0, T]. From Young inequality, we get

E h

sup

t∈[0,T]

1

γlog(Jt(ξ))

2i

≤ 2C 1 +E

h sup

t∈[0,T]

HTα Htα

4i

+E[|ξ|4] + 1

γ2E h

sup

t∈[0,T]

|Jt(ξ)|2i .

Sinceα is bounded, we haveE

hsupt∈[0,T]Hα T

Htα

4i

<+∞. UsingJ(ξ)∈ S2, we obtain E[ sup

t∈[0,T]

1

γlog(Jt(ξ))

2

]<+∞. Which impliesY ∈ S2.

We now checkZ ∈ H2T. To this end, we use a localization procedure by introducing the sequence of stopping times (τn)n≥1defined by

τn := infn

t∈[0, T], Z t

0

|Zs|2ds≥no

∧T ,

for anyn≥1. Similarly to the proof of [6, Theorem 2], we apply Itˆo’s Formula toι(|Y|) where ι(x) =γ12(eγx−γx−1) forx∈R. We obtain

ι(|Y0|) = ι(|Yτn|) + Z τn

0

ι0(|Ys|)sgn(Ys) g(Xsµ, Zs) +σ2γ 2 |Zs|2

−1

00(|Ys|)σ2|Zs|2 ds

− Z τn

0

σι0(|Ys|)sgn(Ys)ZsdWs.

(12)

Sinceι00−γι0 = 1 andι0(x)≥0 for x≥0, we get from BDG and Young inequalities E

hZ τn

0

|Zs|2dsi

≤ C 1 +E

h sup

t∈[0,T]

eγ|Yt|+ Z T

0

eγ|Yt| 1 +|Ys|i

. (3.12)

From the definition of Y and sinceα is bounded, there exists a constantC such that 2γ|Yt| ≤ C+ 2γEα

t [|ξ|].

Using Jensen and H¨older inequalities we get another constantC0 such that E

h sup

t∈[0,T]

e2γ|Yt|i

≤C0E h

sup

t∈[0,T]

HTα Htα

γ

0 γ0 −γiγ0 −γγ0

E h

e0|ξ|iγγ0

.

Sinceα is bounded, we haveE h

supt∈[0,T]Hα T

Htα

γ

0 γ0 −γi

<+∞and we get from (2.5) E

h sup

t∈[0,T]

e2γ|Yt|i

< +∞. Sending nto ∞in (3.12), we get from Fatou’s LemmaZ∈H2.

Step 3. Conclusion. We directly deduce (ii) and (iii) from (3.11) together with (3.10) given

that (i) has been proved in Step 2.

4 The problem of the regulator

In this section, we focus on the regulation policy. In view of (2.3) and Theorem 3.1 the regula- tor’s problem turns to be

VRP = sup

ξ∈CµR

Eα

(ξ)[ξ−f(XTµ)]. (4.13)

We first provide almost optimal contracts for a bounded parameterµby a PDE approach. We then extend the study to the logistic case withµ(x) =x.

4.1 Almost optimal strategies for bounded auto-degradation and cost parameters

We introduce the following class of contracts Ξµ := n

YTY0,Z,µ=Y0− Z T

0

h(Xtµ, Zt) +σ2 2 γ|Zt|2

dt+ Z T

0

σZtdWt, Y0≤R , Z˜ ∈ Zo

, (4.14)

whereZ denotes the subset of predictable processes ofH2T such that

E[exp(γ0|YTY0,Z,µ|)] < +∞, (4.15) for someγ0 >2γand we recall that ˜R= log(−R)γ . Whenµis the identity, we omit the exponent µin the previous definitions.

From Theorem 3.1, constraint (2.4) and integrability conditions (2.5) and (4.15), the set CRµ coincides with Ξµ so that the regulator’s problem (4.13) becomes

VRP = sup

Y0R, Z∈Z˜

Ea

(Xµ,Z)[YTY0,Z,µ−f(XTµ)], (4.16)

(13)

with

YtY0,Z,µ = Y0− Z t

0

k(αs)−p(Xsµ)Xsµa(Xsµ, Zs) +σ2 2 γ|Zs|2

dt+ Z t

0

σZsdWs , t∈[0, T], where W stands for Wa(Xµ,Z). We notice that the function to maximize in VRP is non- deacreasing w.r.t. the variableY0. Therefore the constraintY0≤R˜ is saturated and (4.16) can be rewritten under the following form

VRP = sup

Z∈ZEa

(Xµ,Z)[YTR,Z,µ˜ −f(XTµ)]. (4.17) To construct a solution to the problem (4.17), we introduce the related HJB PDE given by

( −∂tv−H

x, ∂xv(t, x), ∂xxv(t, x)

= 0, (t, x)∈[0, T)×R+, v(T, x) =−f(x), x∈R+,

(4.18) where the HamiltonianH is given by

H(x, δ1, δ2) = sup

z∈R

xp(x)a(x, z)−k(a(x, z))−σ2

2 γz2+x(λ−µ(x)−a(x, z))δ1

2

2 x2δ2, (x, δ1, δ2)∈R+×R×R,

andais given by (3.7). We first extend PDE (4.18) to the whole domain [0, T]×Rby considering the change of variablew(t, y) :=v(t, ey) for any (t, y)∈[0, T]×R. We get the following PDE

( −∂tw− H

y, ∂yw(t, y), ∂yyw(t, y)

= 0, (t, y)∈[0, T)×R, w(T, y) =−f(ey), y∈R,

(4.19) where

H(y, δ1, δ2) := sup

z∈R

eyp(ey)a(ey, z)−a(ey, z)2 2 −σ2

2 γz2+ (λ−σ2

2 −µ(ey)−a(ey, z))δ1

2

2 δ2, (y, δ1, δ2)∈R×R×R.

Our aim is to construct a regular solution to this PDE to proceed by verification. Unfortunately, the coefficients of PDE (4.19) are not smooth enough to do so. To overcome this issue, we provide a smooth approximationHεofHfor which we get regular solutions.

Moreover, we introduce the following assumption, which ensure that the optimal control derived from the PDE satisfies the admissibility condition,i.e. belongs toZ.

(H’)There existsν ∈(0,1) such that

(i) the mapy7→µ(ey) belongs toC1+ν(R), (ii) the mapy7→f(ey) belongs toC2+ν(R), (iii) the mapy7→p(ey)ey belongs toC1+ν(R).

Proposition 4.1. Under (H’), there exists a family{Hε, ε >0} of functions from R3 to R such that the PDE

( −∂twε− Hε

y, ∂ywε(t, y), ∂yywε(t, y)

= 0, (t, y)∈[0, T)×R, wε(T, y) =−f(ey), y∈R,

(4.20) admits a unique solutionwε inC2+ν([0, T]×R) and

sup

R3

H − Hε

≤ ε (4.21)

for any ε >0.

(14)

The proof of Proposition 4.1 consists in an approximation by regularization of the original HamiltonianH. As it is quite technical we postpone this proof to the appendix.

We are now able to describe almost optimal contracts and related almost optimal efforts using the functionswεgiven by Proposition 4.1.

Theorem 4.2. Suppose that (H’)holds. For any ε >0, the tax policyξεgiven by ξε= ˜R−

Z T

0

g(Xtµ, Ztε) +1

2γ|Ztε|2+Ztε(λ−µ(Xtµ)) dt+

Z T

0

Ztε XtµdXtµ , where

Ztε = −∂xwε(t,log(Xtµ))

1 +γσ2 , t∈[0, T], (4.22)

is2T ε-optimal for the regulator problem:

VRP ≤ Ea

(Xµ,Zε)

ξε−f(XTµ)

+ 2T ε .

Proof. We fix some control Z ∈ Z and we apply Itˆo’s formula to the process YtR,Z,µ˜ + wε(t,log(Xtµ))

t∈[0,T]

YTR,Z,µ˜ +wε(T,log(XTµ)) = R˜+wε(0,log(X0)) +

Z T

0

twε(s,log(Xsµ)) +(λ−σ2

2 −µ(Xsµ)−a(Xsµ, Zs))∂xwε(s,log(Xsµ)) +p(Xsµ)a(Xsµ, Zs)Xsµ−k(a(Xsµ, Zs))−σ2

2 γ|Zs|22

2 ∂xxwε(s,log(Xsµ)) ds +σ

Z T

0

(∂xwε(s,log(Xsµ)) +Zs)dWs, whereW stands forWa(Xµ,Z). Sincewε∈C2+ν([0, T]×R) andZ ∈ Z we get

Ea

(Xµ,Z)h

YTR,Z,µ˜ +wε(T,log(XTµ))i

≤ R˜+wε(0,log(X0)) +

Z T

0

Ea

(Xµ,Z)h

twε+H

., ∂ywε, ∂yywε

(s,log(Xsµ))i ds . From (4.21) we get

Ea

(Xµ,Z)h

YTR,Z,µ˜ +wε(T,log(XTµ))i

≤ R˜+wε(0,log(X0)) +T ε +

Z T

0

Ea

(Xµ,Z)h

twε+Hε

., ∂ywε, ∂yywε

(s,log(Xsµ))i ds ,

and sincewε is solution to (4.20), we get Ea

(Xµ,Z)h

YTR,Z,µ˜ −f(XTµ)i

≤ R˜+wε(0,log(X0)) +T ε . SinceZ is arbitrarily chosen in Z we get

VRP ≤ R˜+wε(0,log(X0)) +T ε . (4.23)

(15)

We now take Z = Zε where Zε is given by (4.22). We now notice that Zε ∈ Z since Zε is bounded, and by definition of Zεwe have

H

log(Xµ), ∂ywε(.,log(Xµ)), ∂yywε(.,log(Xµ))

= Xµp(Xµ)a(Xµ, Zε)−a(Xµ, Zε)2

2 −σ2

2 γ Zε

22

2 ∂yywε(.,log(Xµ)) + λ−σ2

2 −µ(Xµ)−a(Xµ, Zε)

ywε(.,log(Xµ))

for any [0, T]. A straightforward application of Itˆo’s formula and Girsanov Theorem give Ea

(Xµ,Zε)[YTR,Z˜ ε−f(XTµ)] = R˜+wε(0,log(X0)) +

Z T

0

Ea

(Xµ,Zε)h

twε+H

., ∂ywε, ∂yywε

(s,log(Xsµ))i ds . From Propositions (4.20) and (4.21) we get

Ea

(Xµ,Zε)[YTR,Z˜ ε−f(XTµ)] ≥ R˜+wε(0,log(X0))−T ε . Hence, we get from (4.23)

VRP ≤ Ea

(Xµ,Zε)

[YTR,Z˜ ε−f(XTµ)] + 2T ε . Therefore, we getξε=YTR,Z˜ ε is a 2T ε-optimal policy for the regulator.

4.2 Extension to the logistic equation and continuous cost

We consider in this section an approximation method to build a sequence of almost optimal taxes in the case the classical logistic dynamic for SDE (2.1), i.e. µ(x) = x. More precisely, we introduce a sequence of approximated models from which we derive almost optimal strategy from the previous section. We show that this sequence remains almost optimal for the logistic model. We also weaken the assumption(H’)(ii) as follows.

(Hf’)The function f is bounded and continuous onR.

We introduce the sequence of mollifiersρn: R→R,n≥1, defined by ρn(x) := nρ(nx)

R

Rρ(u)du , x∈R, where the functionρ: R→Ris defined by

ρ(x) := exp −1 1− |x|2

1|x]<1. We then define the functionsfn,n≥1, by

fn(x) :=

Z

R

f(y)ρn(x−y)dy , x∈R.

From classical results, we know thatfn satisfies(H’)(ii) for alln≥1 andfn converges tof as ngoes to infinity uniformly on every compact subset ofR.

We also define the functionsµn: R→R,n≥1, by

µn(x) := x Θ(x+en+ 1)−Θ(x−(en+ 1))

, x∈R, (4.24)

where the function Θ :R→Ris given by Θ(u) :=

Ru

−∞ρ(r)dr R

Rρ(r)dr , u∈R. (4.25)

Références

Documents relatifs

/ La version de cette publication peut être l’une des suivantes : la version prépublication de l’auteur, la version acceptée du manuscrit ou la version de l’éditeur. For

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des

By monitoring for the first time the virus concentration flowing in host plant sap, in parallel with the number of viral genome founders in each successive leaf, we provide

Based on this observation, a CIRAD team developed a methodology, Self Land Policies, whose effects at the local and national levels can still be seen 15 years after its

La température de sortie du fluide de travail, les températures du fluide primaire, les débits et les mesures de pression, sont notés au même instant que les

Together the four remaining programmes aim to provide an all-encompassing integrated watershed management programme catering for clearing invasive alien plants (through Working

Je voudrais sentir son souffle dans mon cou, qu'il pose sa main dans le creux de mon dos, sentir cette sensation inouïe de bien être et de sécurité quand il me caresserait le bras

Les dispositifs en alternance sont moins utilisés dans les DOM qu'en métropole : 57 % des contrats dans le secteur marchand sont des contrats en alternance, contre plus de 80 %