
on the occasion of his 70th birthday

ON THE CORRECT SOLUTION

OF A TRIVIAL OPTIMAL CONTROL PROBLEM IN MATHEMATICAL ECONOMICS

ŞTEFAN MIRICĂ and TOUFFIK BOUREMANI

We use the Dynamic Programming approach to obtain the correct solution of the optimal control problem studied in [1] using Pontryagin’s Maximum Principle. We prove that all the optimal trajectories of the problem are constant functions, hence the “solution” in [1] is wrong and the problem is rather trivial.

AMS 2000 Subject Classification: 49L20, 49K05, 49K24, 49N35, 49N90, 90B30.

Key words: optimal control, mathematical economics, maximum principle, dynamic programming, value function, verification theorem.

1. INTRODUCTION

This work is intended, first, as a warning to the, unfortunately, increasing number of authors who try to solve concrete optimal control problems without sufficient knowledge or even basic mathematical skills; second, our aim is to show that the Dynamic Programming approach in [6]–[7] is much more efficient in the study of this type of problem.

For the sake of completeness we present in Section 2 the (not very convincing) "economic problem", the corresponding mathematical model and the "solution" obtained in [1] using a non-existent variant of Pontryagin's Maximum Principle (PMP); we also point out the main errors made by the authors of [1].

In Section 3 we use the Dynamic Programming approach (adapted to non-autonomous problems) to obtain the correct solution of the same problem, and prove that the solution in [1] is wrong and that their problem is rather trivial since all the optimal trajectories are constant.

MATH. REPORTS 9(59), 1 (2007), 77–86


2. THE ECONOMIC PROBLEM, THE MATHEMATICAL MODEL AND THE WRONG SOLUTION IN [1]

The aim of paper [1] is to solve the optimal control problem of minimizing the cost functional

(2.1)  $J(P(\cdot), I(\cdot)) := \int_0^T e^{-\rho t}\,[k(P(t)) + h(I(t))]\,\mathrm{d}t$,

subject to the constraints

(2.2)  $I'(t) = P(t) - D(t, I(t)), \quad I(0) = I_0, \quad I(T) = I_T, \quad P(t) \ge D(t, I(t))$,

where the functions involved have the following "economic significances":

$I(t)$: inventory level (state variable) at time $t \ge 0$;

$P(t)$: the chosen production rate (control variable);

$D(t, I(t))$: the demand rate (minimal production rate);

$h(I(t))$: the holding cost rate of the inventory $I(t)$;

$k(P(t))$: the cost rate of the chosen production rate $P(t)$;

$\rho \ge 0$: the discount rate;

$I(0) = I_0$, $I(T) = I_T$: the initial and, respectively, terminal values of the inventory (fixed but, apparently, arbitrary).

At a first stage, the authors assume that the functions $h(\cdot)$, $k(\cdot)$, $D(\cdot,\cdot)$ have "non-negative" values, are of class $C^2$, and that $k(\cdot)$ is increasing.

In fact, the authors studied the problem in more detail only in the particular case

(2.3)  $k(u) := \dfrac{k u^2}{2}, \quad h(x) := \dfrac{h x^2}{2}, \quad D(t, x) := d_1(t) + d_2 x$,

where $k, h, d_2 > 0$ and $d_1(\cdot) \ge 0$ is of class $C^1$.

One may note that, in the absence of a more concrete example, the economic problem does not seem very convincing, since the "customers" should know the level $I(t)$ of the inventory in order to choose their "minimal demand rate" $D(t, I(t))$, while the "patron" should know the choice $D(t, I(t))$ of the customers.

Anyway, the authors of [1] invoke the classical variant of Pontryagin's Maximum Principle (PMP) in [9] to write some differential equations and, in the particular case (2.3), to conclude in their Theorem 1 that there exist constants $a_1, a_2, m_1, m_2$ and a function $Q(\cdot)$ such that the optimal trajectory of problem (2.1)–(2.2) is given by

(2.4)  $\overline{I}(t) = a_1 e^{m_1 t} + a_2 e^{m_2 t} + Q(t), \quad t \in [0, T]$.

However, as we shall prove in the next section, in the particular case (2.3) the only optimal trajectories are the constant functions $I(t) \equiv I_0$, $t \in [0, T]$; hence the solution in [1] is wrong and problem (2.1)–(2.2) is rather trivial.

Even if, by any chance, the solution $\overline{I}(\cdot)$ were correct, the authors of [1], among several others, made the following errors:

(1) they apply the classical variant of PMP (in which the set of control parameters, $U$, is constant) to problem (2.1)–(2.2), where the set of control parameters, $U(t, x) := [D(t, x), \infty)$, is variable; in this case the dynamics in (2.2) is defined by a differential inclusion of the form

(2.5)  $x' \in f(t, x, U(t, x)), \quad x(0) = y$,

for which PMP has more complicated statements, as shown in [2], [4], etc.;

(2) even if a correct variant of PMP were used, the authors could not conclude the optimality of the extremal $\overline{I}(\cdot)$ without additional arguments proving:

(2a) the existence of an optimal trajectory joining $I_0$ and $I_T$;

(2b) the uniqueness of the extremal $\overline{I}(\cdot)$ in (2.4);

(3) the authors are apparently solving the elementary problem of maximizing the real function $u \mapsto \alpha(t, p, u)$ in (3.7) below on the interval $u \in [D(t, x), \infty)$ using the "Lagrange multiplier rule".

3. THE DYNAMIC PROGRAMMING CORRECT SOLUTION

In order to use the Dynamic Programming approach in [5]–[7], adapted to non-autonomous problems as in [8], we reformulate problem (2.1)–(2.2) using standard notation in Optimal Control Theory and embedding this problem in a set of problems associated with each initial point in the phase-space; thus, we obtain the "standard" Lagrange optimal control problem below.

Problem 3.1. Given $T > 0$, $\rho \ge 0$ and smooth non-negative functions $D(\cdot,\cdot)$, $h(\cdot)$, $k(\cdot)$, find

(3.1)  $\inf_{u(\cdot)} C(s, y; u(\cdot)), \quad (s, y) \in E_0 := [0, T) \times \mathbb{R}_+$,

subject to

(3.2)  $C(s, y; u(\cdot)) := g(T, x(T)) + \int_s^T f_0(t, x(t), u(t))\,\mathrm{d}t$,

(3.3)  $x'(t) = f(t, x(t), u(t)), \quad x(s) = y, \quad u(t) \in U(t, x(t))$ a.e. $([s, T])$,

(3.4)  $(t, x(t)) \in E_0 \ \forall t \in [s, T), \quad (T, x(T)) \in E_1 := \{T\} \times \mathbb{R}_+$,

defined by the data

(3.5)  $f(t, x, u) := u - D(t, x), \quad f_0(t, x, u) := e^{-\rho t}[h(x) + k(u)]$,
 $E_0 := [0, T) \times \mathbb{R}_+, \quad E_1 := \{T\} \times \mathbb{R}_+, \quad g(T, \xi) \equiv 0 \ \forall (T, \xi) \in E_1$,
 $U(t, x) := [D(t, x), \infty) \ \forall (t, x) \in E := E_0 \cup E_1 = [0, T] \times \mathbb{R}_+$.
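For the particular case (2.3), the data in (3.5) can be written down explicitly. The sketch below encodes them in Python; the constants $k = h = d_2 = 1$, $d_1(t) \equiv 1$, $\rho = 0.1$, $T = 1$ are hypothetical values chosen only for illustration and are not taken from [1].

```python
import math

# Illustrative constants for the particular case (2.3); not from [1].
K, H, D2, RHO, T = 1.0, 1.0, 1.0, 0.1, 1.0
def d1(t): return 1.0                        # d1(.) >= 0, of class C^1

def D(t, x):  return d1(t) + D2 * x          # demand rate D(t,x) = d1(t) + d2*x
def k(u):     return K * u * u / 2.0         # production cost rate k(u) = k u^2/2
def h(x):     return H * x * x / 2.0         # holding cost rate h(x) = h x^2/2

def f(t, x, u):  return u - D(t, x)          # dynamics in (3.5)
def f0(t, x, u): return math.exp(-RHO * t) * (h(x) + k(u))   # running cost
def U(t, x):     return (D(t, x), math.inf)  # control set [D(t,x), +infinity)

# The boundary control u = D(t,x) gives f = 0, i.e. a constant inventory:
assert f(0.3, 2.0, D(0.3, 2.0)) == 0.0
```

Note that taking the control at the lower bound of $U(t,x)$ freezes the state, which is exactly the behavior of the optimal trajectories obtained below.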

One may note here that Problem 3.1 solves problem (2.1)–(2.2) for the particular initial point $(s, y) = (0, I_0)$ in the case the (possible) optimal trajectory, $x(\cdot)$, takes the terminal value $x(T) = I_T$; however, as we shall prove in what follows, if $I_T \neq I_0$, then Problem 3.1 (and also problem (2.1)–(2.2)) does not have an optimal control (hence no optimal trajectory either).

In what follows we shall solve Problem 3.1 in the case the data $h(\cdot)$, $k(\cdot)$, $D(\cdot,\cdot)$ verify the assumptions

(3.6)  $D(t, x) > 0, \quad k'(u) > 0, \quad h'(x) + k'(D(t, x)) \cdot \dfrac{\partial D}{\partial x}(t, x) > 0 \quad \forall t \in [0, T],\ x > 0,\ u > 0$,
 $k''(u) > 0 \ \forall u \ge 0, \quad \lim_{u \to \infty} k'(u) = \infty$,

which are obviously satisfied by the particular data in (2.3); a more general case does not seem worth the effort in the absence of some convincing examples.

Characterization of the Hamiltonian. The first step of the Dynamic Programming approach consists in the characterization of the "true Hamiltonian" of the problem.

The "pseudo-Hamiltonian" $H(t, x, p, u) := \langle p, f(t, x, u)\rangle + f_0(t, x, u)$ is given in our case by

(3.7)  $H(t, x, p, u) = e^{-\rho t} h(x) - p D(t, x) + \alpha(t, p, u), \quad \alpha(t, p, u) := p u + e^{-\rho t} k(u)$.

The Hamiltonian and the corresponding multifunction of minimum points are given by the formulas

(3.8)  $H(t, x, p) = \min_{u \in U(t, x)} H(t, x, p, u) = e^{-\rho t} h(x) - p D(t, x) + \min_{u \in U(t, x)} \alpha(t, p, u)$,
 $\widehat{U}(t, x, p) := \{u \in U(t, x);\ H(t, x, p, u) = H(t, x, p)\}$.

In order to obtain a more explicit description of the Hamiltonian $H(\cdot,\cdot,\cdot)$ in (3.8), we introduce the following notation: for $(t, x, p) \in Z := [0, T] \times \mathbb{R}_+ \times \mathbb{R}$ and $u \in \mathbb{R}_+$, we set

(3.9)  $\widehat{\alpha}(t, x, p) := \min_{u \in U(t, x)} \alpha(t, p, u), \quad \beta(t, x, p) := p + e^{-\rho t} k'(D(t, x))$.

Proposition 3.2. If the assumptions in (3.6) hold, then for any $(t, x, p) \in Z$ there exists a unique real number $\widehat{u}(t, x, p) \in [D(t, x), \infty)$ such that:

(3.10)  $\widehat{\alpha}(t, x, p) = \alpha(t, p, \widehat{u}(t, x, p))$.

Moreover,

(1) if $\beta(t, x, p) \ge 0$ then $\widehat{u}(t, x, p) = D(t, x)$;

(2) if $\beta(t, x, p) < 0$ then there exists a unique real number $\widetilde{u}(t, p) \in (D(t, x), \infty)$ such that

(i) $\dfrac{\partial \alpha}{\partial u}(t, p, \widetilde{u}(t, p)) = p + e^{-\rho t} k'(\widetilde{u}(t, p)) = 0$;

(ii) $\widehat{u}(t, x, p) = \widetilde{u}(t, p)$;

(3) the function $\widehat{u}(\cdot,\cdot,\cdot)$ is given by

(3.11)  $\widehat{u}(t, x, p) := \begin{cases} D(t, x) & \text{if } \beta(t, x, p) \ge 0 \\ \widetilde{u}(t, p) & \text{if } \beta(t, x, p) < 0 \end{cases}$

and is locally Lipschitz on $Z$.

Proof. Using the fact that $k''(u) > 0\ \forall u \in [0, \infty)$, we deduce that $k'(\cdot)$ is strictly increasing; hence $u \mapsto \dfrac{\partial \alpha}{\partial u}(t, p, u) = p + e^{-\rho t} k'(u)$ is strictly increasing.

(1) If $\beta(t, x, p) = \dfrac{\partial \alpha}{\partial u}(t, p, D(t, x)) \ge 0$, then $\dfrac{\partial \alpha}{\partial u}(t, p, u) \ge \dfrac{\partial \alpha}{\partial u}(t, p, D(t, x)) \ge 0\ \forall u \ge D(t, x)$, hence $\alpha(t, p, u) \ge \alpha(t, p, D(t, x))\ \forall u \ge D(t, x)$ and therefore $\widehat{u}(t, x, p) = D(t, x)$.

(2) If $\beta(t, x, p) < 0$, since $\dfrac{\partial \alpha}{\partial u}(t, p, \cdot) = p + e^{-\rho t} k'(\cdot)$ is strictly increasing and $\lim_{u \to \infty} \dfrac{\partial \alpha}{\partial u}(t, p, u) = \lim_{u \to \infty} [p + e^{-\rho t} k'(u)] = \infty$, it follows from the Darboux property that there exists a unique point $\widetilde{u}(t, p) \in (D(t, x), \infty)$ such that $\dfrac{\partial \alpha}{\partial u}(t, p, \widetilde{u}(t, p)) = 0$ and, moreover, $\dfrac{\partial \alpha}{\partial u}(t, p, u) < 0\ \forall u \in [D(t, x), \widetilde{u}(t, p))$ and $\dfrac{\partial \alpha}{\partial u}(t, p, u) > 0\ \forall u > \widetilde{u}(t, p)$; hence $\widetilde{u}(t, p)$ is the global minimum point of the function $\alpha(t, p, \cdot)$, therefore $\widehat{u}(t, x, p) = \widetilde{u}(t, p)$.

Since $\dfrac{\partial^2 \alpha}{\partial u^2}(t, p, u) = e^{-\rho t} k''(u) > 0\ \forall u \in [0, \infty)$, it follows from the implicit functions theorem that $\widetilde{u}(\cdot,\cdot)$ is given by

(3.12)  $\widetilde{u}(t, p) := (k')^{-1}(-p e^{\rho t})$

and is of class $C^1$ on the subset $Z_- := \{(t, x, p) \in Z;\ \beta(t, x, p) < 0\}$.

Since $D(\cdot,\cdot)$ and $\widetilde{u}(\cdot,\cdot)$ are locally Lipschitz, according to a well-known "quasi-elementary" result it follows that $\widehat{u}(\cdot,\cdot,\cdot)$ is locally Lipschitz.
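In the quadratic case (2.3) one has $k'(u) = ku$, so the interior minimizer in (3.12) is simply $\widetilde{u}(t, p) = -p e^{\rho t}/k$. The following sketch (hypothetical constants, chosen only for illustration) implements the selection (3.11) and checks against a brute-force grid search that it indeed minimizes $\alpha(t, p, u) = pu + e^{-\rho t} k(u)$ over $[D(t,x), \infty)$:

```python
import math

K, RHO = 1.0, 0.1                # illustrative constants for case (2.3)
def kprime(u): return K * u      # k'(u) = k*u in the quadratic case

def alpha(t, p, u):
    return p * u + math.exp(-RHO * t) * K * u * u / 2.0

def u_hat(t, x, p, D):
    """Minimizer of alpha(t,p,.) on [D, inf): the selection (3.11)."""
    beta = p + math.exp(-RHO * t) * kprime(D)    # beta(t,x,p) from (3.9)
    if beta >= 0:
        return D                                  # boundary minimum
    return -p * math.exp(RHO * t) / K             # interior minimum, (3.12)

# brute-force comparison on a grid of candidate controls
t, x, D0 = 0.5, 2.0, 3.0
for p in (-5.0, -1.0, 0.5, 2.0):
    u_star = u_hat(t, x, p, D0)
    grid = [D0 + 0.001 * i for i in range(20000)]
    assert all(alpha(t, p, u_star) <= alpha(t, p, u) + 1e-9 for u in grid)
```

For $p \ge 0$ the selection always returns the boundary point $D(t,x)$, which is the case relevant for the flow constructed below, where $P_+(t,\xi) \ge 0$.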

Corollary 3.3. The Hamiltonian $H(\cdot,\cdot,\cdot)$ in (3.8) is given by

(3.13)  $H(t, x, p) = \begin{cases} H_+(t, x, p) := e^{-\rho t}[h(x) + k(D(t, x))] & \text{if } (t, x, p) \in Z_+ \\ H_0(t, x, p) := e^{-\rho t}[h(x) + k(D(t, x))] & \text{if } (t, x, p) \in Z_0 \\ H_-(t, x, p) := e^{-\rho t} h(x) - p[D(t, x) - \widetilde{u}(t, p)] + e^{-\rho t} k(\widetilde{u}(t, p)) & \text{if } (t, x, p) \in Z_- \end{cases}$

where $\widetilde{u}(\cdot,\cdot)$ is the function defined in (3.12); hence $H(\cdot,\cdot,\cdot)$ is $C^1$-stratified and the strata $Z_\pm$, $Z_0$ are defined by

(3.14)  $Z_+ := \{(t, x, p) \in Z;\ \beta(t, x, p) > 0\}, \quad \beta(t, x, p) := p + e^{-\rho t} k'(D(t, x))$,
 $Z_0 := \{(t, x, p) \in Z;\ \beta(t, x, p) = 0\}, \quad Z_- := \{(t, x, p) \in Z;\ \beta(t, x, p) < 0\}$.

Proof. The formula in (3.13) follows from Proposition 3.2, while, since the functions $D(\cdot,\cdot)$, $k(\cdot)$, $k'(\cdot)$ are differentiable, it follows that the subsets $Z_\pm$ are relatively open and $Z_0$ is a 2-dimensional differentiable manifold.

The Hamiltonian system. Since the Hamiltonian in (3.13) is $C^1$-stratified, we choose the stratified Hamiltonian orientor field $d^\#_S H(\cdot,\cdot,\cdot)$ defined by the formula

(3.15)  $d^\#_S H(t, x, p) = \{(x', p') \in \mathbb{R}^{2n};\ (1, x', p') \in T_{(t,x,p)}Z,\ x' \in f(t, x, \widehat{U}(t, x, p)),\ \langle \bar{x}, p'\rangle - \langle \bar{p}, x'\rangle = DH(t, x, p) \cdot (0, \bar{x}, \bar{p})\ \forall (0, \bar{x}, \bar{p}) \in T_{(t,x,p)}Z\}$.

Since the manifolds $Z_\pm \subset Z$ are open subsets, the Hamiltonian orientor fields $d^\#_S H_\pm(\cdot,\cdot,\cdot)$ in (3.15) coincide with the classical Hamiltonian vector fields

(3.16)  $d^\#_S H_\pm(t, x, p) := \left(\dfrac{\partial H_\pm}{\partial p}(t, x, p),\ -\dfrac{\partial H_\pm}{\partial x}(t, x, p)\right) \quad \text{if } (t, x, p) \in Z_\pm$.

The terminal transversality conditions. As specified in the Dynamic Programming Algorithm, in the case of a stratified Hamiltonian a generalized Hamiltonian flow is obtained by "backward integration" for $t \le T$ of the Hamiltonian inclusion

(3.17)  $(x', p') \in d^\#_S H(t, x, p), \quad (T, x(T), p(T)) = (T, \xi, q) \in Z_1$,

where the set of terminal "transversality" points is defined in the general case by

$Z_1 := \{(\tau, \xi, q) \in Z;\ \langle q, \bar{\xi}\rangle - \bar{\tau}\, H(\tau, \xi, q) = Dg(\tau, \xi) \cdot (\bar{\tau}, \bar{\xi})\ \forall (\bar{\tau}, \bar{\xi}) \in T_{(\tau,\xi)}E_1\}$.

Since $g(\tau, \xi) \equiv 0$ and $T_{(T,\xi)}E_1 = \{0\} \times \mathbb{R}$, in our case we have

$Z_1 = \{(T, \xi, q) \in Z;\ \langle q, \bar{\xi}\rangle = 0\ \forall \bar{\xi} \in \mathbb{R}\} = \{(T, \xi, 0);\ \xi > 0\}$.

Further, since $\beta(T, \xi, 0) = e^{-\rho T} k'(D(T, \xi))$, it follows that in the general case the set $Z_1$ above admits the stratification

(3.18)  $Z_{1,+} := \{(T, \xi, 0);\ k'(D(T, \xi)) > 0\} \subset Z_+$,
 $Z_{1,0} := \{(T, \xi, 0);\ k'(D(T, \xi)) = 0\} \subset Z_0$,
 $Z_{1,-} := \{(T, \xi, 0);\ k'(D(T, \xi)) < 0\} \subset Z_-$,

while, under the additional hypothesis in (3.6) according to which $k'(u) > 0\ \forall u > 0$, one has $Z_{1,0} = Z_{1,-} = \emptyset$, hence all the trajectories end on the stratum $Z_+$.

Construction of the generalized Hamiltonian flow. Since $k'(D(T, \xi)) > 0$, we have $(T, \xi, 0) \in Z_+$; hence on an interval $(\tau_+(\xi), T)$ the trajectory $\overline{X}_+(\cdot) = (X_+(\cdot), P_+(\cdot))$ of the Hamiltonian system in (3.17) will remain on the stratum $Z_+$, hence it is a solution of the "smooth Hamiltonian system"

(3.19)  $x' = 0, \quad x(T) = \xi > 0$,
 $p' = -e^{-\rho t}\left[h'(x) + k'(D(t, x)) \cdot \dfrac{\partial D}{\partial x}(t, x)\right], \quad p(T) = 0$.

Therefore, we obtain its solution in the form of a "maximal flow" $\overline{X}_+(\cdot,\cdot) = (X_+(\cdot,\cdot), P_+(\cdot,\cdot))$ whose first component is given by $X_+(t, \xi) \equiv \xi$, while $P_+(\cdot, \xi)$ is the solution of the elementary differential equation

(3.20)  $p' = -e^{-\rho t}\left[h'(\xi) + k'(D(t, \xi)) \cdot \dfrac{\partial D}{\partial x}(t, \xi)\right], \quad p(T) = 0$,

hence it is a primitive of the function on the right-hand side.

It follows from the Dynamic Programming algorithm in [6] that, on the stratum $Z_+$, we must retain only the trajectories $\overline{X}_+(\cdot, \xi)$, $\xi > 0$, that satisfy the conditions

(3.21)  $(t, \overline{X}_+(t, \xi)) \in Z_+, \quad (t, X_+(t, \xi)) \in E_0 \quad \forall t \in (\tau_{1+}(\xi), T),\ \xi > 0$,

on the maximal intervals $(\tau_{1+}(\xi), T)$, $\xi > 0$; hence the extremity $\tau_{1+}(\xi) < T$ is defined by

(3.22)  $\tau_{1+}(\xi) := \inf\{\tau < T;\ \beta_+(t, \xi) > 0,\ X_+(t, \xi) > 0\ \forall t \in [\tau, T)\}$,
 $\beta_+(t, \xi) := \beta(t, \xi, P_+(t, \xi)) = P_+(t, \xi) + e^{-\rho t} k'(D(t, \xi))$,

where $P_+(\cdot, \xi)$, $\xi > 0$, is the only solution of equation (3.20).

It follows from hypothesis (3.6) (satisfied, in particular, by the data (2.3)) that $h'(\xi) + k'(D(t, \xi)) \cdot \dfrac{\partial D}{\partial x}(t, \xi) > 0\ \forall t \in [0, T],\ \xi > 0$; hence $P_+(\cdot, \xi)$ is strictly decreasing and $P_+(t, \xi) > 0\ \forall t \in [0, T)$, therefore $\beta_+(t, \xi) > 0\ \forall t \in [0, T],\ \xi > 0$, which proves that the extremity $\tau_{1+}(\cdot)$ in (3.22) is given by

(3.23)  $\tau_{1+}(\xi) = 0 \quad \forall \xi > 0$.

Therefore, the trajectories $X_+(\cdot, \xi)$, $\xi > 0$, "cover" the domain $E_{0+}$ defined by

(3.24)  $E_{0+} := \{(t, X_+(t, \xi));\ t \in [0, T),\ \xi > 0\} = [0, T) \times \mathbb{R}_+ = E_0$,
 $B_+ := \mathrm{dom}(\overline{X}_+(\cdot,\cdot)) = [0, T) \times \mathbb{R}_+$.
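For the quadratic data (2.3), the adjoint component $P_+(\cdot, \xi)$ of the flow is the explicit primitive $P_+(t, \xi) = \int_t^T e^{-\rho\sigma}[h\xi + k\,d_2(d_1(\sigma) + d_2\xi)]\,\mathrm{d}\sigma$ obtained from (3.20). A short numerical check (hypothetical constants, simple trapezoidal quadrature) confirms that $P_+(t, \xi) > 0$ and $\beta_+(t, \xi) > 0$ on $[0, T)$, the facts used to derive (3.23):

```python
import math

K, H, D2, RHO, T = 1.0, 1.0, 1.0, 0.1, 1.0   # illustrative constants
def d1(s): return 1.0
def D(s, x): return d1(s) + D2 * x

def P_plus(t, xi, n=2000):
    """P_+(t, xi): primitive of the right-hand side of (3.20), trapezoidal rule."""
    rhs = lambda s: math.exp(-RHO * s) * (H * xi + K * D(s, xi) * D2)
    hstep = (T - t) / n
    s_vals = [t + i * hstep for i in range(n + 1)]
    return hstep * (sum(rhs(s) for s in s_vals) - 0.5 * (rhs(t) + rhs(T)))

def beta_plus(t, xi):
    # beta_+(t, xi) = P_+(t, xi) + e^{-rho t} k'(D(t, xi)), cf. (3.22)
    return P_plus(t, xi) + math.exp(-RHO * t) * K * D(t, xi)

xi = 2.0
for i in range(10):
    t = i * T / 10.0
    assert P_plus(t, xi) > 0 and beta_plus(t, xi) > 0
assert abs(P_plus(T, xi)) < 1e-12            # terminal condition p(T) = 0
```

Since $\beta_+ > 0$ everywhere, the flow never leaves the stratum $Z_+$, which is the numerical counterpart of (3.23).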

An essential step in using the general algorithm in [6]–[7] consists in the fact that the value of the cost functional in (3.2) is given by the function $V(\cdot,\cdot)$ defined by

(3.25)  $V(t, \xi) := -\int_t^T \left[\left\langle P_+(\sigma, \xi), \dfrac{\partial X_+}{\partial \sigma}(\sigma, \xi)\right\rangle - H(\sigma, \overline{X}_+(\sigma, \xi))\right] \mathrm{d}\sigma = \int_t^T f_0(\sigma, X_+(\sigma, \xi), \widehat{u}(\sigma, \overline{X}_+(\sigma, \xi)))\,\mathrm{d}\sigma = \int_t^T H_+(\sigma, \xi, P_+(\sigma, \xi))\,\mathrm{d}\sigma$,

where $H_+(\cdot,\cdot,\cdot)$ is defined in (3.13).
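Since $X_+(\sigma, \xi) \equiv \xi$ and $H_+$ does not depend on $p$, the value in (3.25) reduces to $V(s, y) = \int_s^T e^{-\rho\sigma}[h(y) + k(D(\sigma, y))]\,\mathrm{d}\sigma$, which is exactly the discounted cost of the constant trajectory driven by the control $u = D(\cdot, y)$. The sketch below (hypothetical constants) checks this identity by comparing a quadrature of $V$ against a direct Euler simulation of the controlled system:

```python
import math

K, H, D2, RHO, T = 1.0, 1.0, 1.0, 0.1, 1.0   # illustrative constants
def D(s, x): return 1.0 + D2 * x             # case (2.3) with d1(s) = 1
def k(u): return K * u * u / 2.0
def h(x): return H * x * x / 2.0

def quad(fn, a, b, n=4000):
    """Trapezoidal quadrature of fn on [a, b]."""
    step = (b - a) / n
    pts = [a + i * step for i in range(n + 1)]
    return step * (sum(fn(p) for p in pts) - 0.5 * (fn(a) + fn(b)))

def V(s, y):
    # value function (3.25): integral of H_+(sigma, y, .) over [s, T]
    return quad(lambda sig: math.exp(-RHO * sig) * (h(y) + k(D(sig, y))), s, T)

def cost_by_simulation(s, y, n=4000):
    """Euler discretization of the cost (3.2) under the control u = D(t, x)."""
    step, x, total = (T - s) / n, y, 0.0
    for i in range(n):
        t = s + i * step
        u = D(t, x)                           # the feedback control (3.27)
        total += step * math.exp(-RHO * t) * (h(x) + k(u))
        x += step * (u - D(t, x))             # x' = f = 0: the state stays at y
    return total

assert abs(V(0.0, 2.0) - cost_by_simulation(0.0, 2.0)) < 1e-3
```

The agreement of the two computations illustrates numerically the second equality in (3.25).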

Thus, the Hamiltonian system in (3.19) generates the smooth characteristic flow $\overline{C}_+(\cdot,\cdot) := (\overline{X}_+(\cdot,\cdot), V(\cdot,\cdot))$ on the open stratum $Z_+$, which, according to well-known classical results (e.g., Mirică [6]), satisfies the basic differential relation

(3.26)  $DV(t, \xi) \cdot (\bar{t}, \bar{\xi}) = \langle P_+(t, \xi), DX_+(t, \xi) \cdot (\bar{t}, \bar{\xi})\rangle \quad \forall (\bar{t}, \bar{\xi}) \in T_{(t,\xi)}B_+$,

which may be proved directly in this case.

On the other hand, it follows from (3.25) that, for any initial point $(s, y) = (s, X_+(s, \xi)) = (s, \xi) \in E_0$, the mapping $u_{s,y}(\cdot)$ defined by

(3.27)  $u_{s,y}(t) := \widehat{u}(t, \overline{X}_+(t, y)) = D(t, y), \quad t \in [s, T]$,

is an admissible control for which the cost functional is given by

(3.28)  $C(s, y; u_{s,y}(\cdot)) = V(s, y) = \int_s^T H_+(\sigma, y, P_+(\sigma, y))\,\mathrm{d}\sigma$.

Therefore, we have obtained the "feasible selection" of admissible controls

(3.29)  $A(s, y) := \{u_{s,y}(\cdot)\}, \quad (s, y) \in E_0$,

whose value function is given by

(3.30)  $W_+(s, y) := \begin{cases} 0 & \text{if } (s, y) \in E_1 := \{T\} \times \mathbb{R}_+ \\ W_{0+}(s, y) := V(s, y) & \text{if } (s, y) \in E_0 := [0, T) \times \mathbb{R}_+ \end{cases}$

where $V(\cdot,\cdot)$ is the function defined in (3.25).

Since the value function $W_+(\cdot,\cdot)$ in (3.30) is of class $C^1$, the optimality of the controls $u_{s,y}(\cdot)$ in (3.27), and therefore of the corresponding trajectories

(3.31)  $x_{s,y}(t) = y \quad \forall t \in [s, T], \quad (s, y) \in E_0$,

follows from the so-called Elementary Verification Theorem (e.g., [3], [5], [7], etc.), according to which a sufficient optimality condition for the admissible controls $u_{s,y}(\cdot)$ in (3.27) is the verification of the differential inequality

(3.32)  $DW_{0+}(s, y) \cdot (1, f(s, y, u)) + f_0(s, y, u) \ge 0 \quad \forall u \in U(s, y),\ (s, y) \in E_0$.

Thus, we obtain the main result of this paper.


Theorem 3.4. The selection $A(\cdot,\cdot)$ of admissible controls in (3.29), with value function $W_+(\cdot,\cdot)$ in (3.30), is optimal.

Proof. First, it follows from (3.26) and (3.30) that

(3.33)  $DW_{0+}(s, y) \cdot (1, f(s, y, u)) = DV(s, y) \cdot (1, f(s, y, u)) = \langle P_+(s, y), DX_+(s, y) \cdot (1, f(s, y, u))\rangle$.

Since $X_+(s, y) = y$, it follows that $DX_+(s, y) \cdot (\bar{t}, v) = v$, hence $DX_+(s, y) \cdot (1, f(s, y, u)) = f(s, y, u)$, and therefore it follows from (3.33) that

$DW_{0+}(s, y) \cdot (1, f(s, y, u)) + f_0(s, y, u) = P_+(s, y) f(s, y, u) + f_0(s, y, u) = H(s, y, P_+(s, y), u) \ge H_+(s, y, P_+(s, y)) = e^{-\rho s}[h(y) + k(D(s, y))] > 0 \quad \forall u \in U(s, y)$,

which proves inequality (3.32), hence the optimality of the selection $A(\cdot,\cdot)$.

Remark 3.5. It follows, in particular, from Theorem 3.4 above that if $I_0, I_T \in \mathbb{R}_+$ are such that $I_0 \neq I_T$ (in fact, one should have $I_T > I_0$), then there does not exist an optimal trajectory joining $I_0$ and $I_T$, since the only optimal trajectory starting at $(0, I_0)$ is the constant function $I(t) \equiv I_0$ (in fact, for any admissible control $u(\cdot) \neq u_{0,I_0}(\cdot)$ one has $C(0, I_0; u(\cdot)) \ge C(0, I_0; u_{0,I_0}(\cdot)) = W_+(0, I_0)$).
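The conclusion of Remark 3.5 can be probed numerically: any admissible control that actually moves the inventory away from $I_0$ costs strictly more than the constant-trajectory control $u_{0,I_0}$. The small simulation below (hypothetical constants, Euler discretization; it merely illustrates Theorem 3.4, it does not prove it) compares a few perturbed controls against the optimal one:

```python
import math

K, H, D2, RHO, T, N = 1.0, 1.0, 1.0, 0.1, 1.0, 4000   # illustrative constants
def D(t, x): return 1.0 + D2 * x
def k(u): return K * u * u / 2.0
def h(x): return H * x * x / 2.0

def cost(control, x0):
    """Euler discretization of (2.1)-(2.2) under an admissible feedback control."""
    step, x, total = T / N, x0, 0.0
    for i in range(N):
        t = i * step
        u = max(control(t, x), D(t, x))      # enforce the constraint u >= D(t, x)
        total += step * math.exp(-RHO * t) * (k(u) + h(x))
        x += step * (u - D(t, x))
    return total

I0 = 2.0
c_opt = cost(lambda t, x: D(t, x), I0)       # optimal: keep I(t) = I0
# any control pushing the inventory above I0 costs strictly more
for extra in (0.1, 0.5, 1.0):
    c_pert = cost(lambda t, x, e=extra: D(t, x) + e, I0)
    assert c_pert > c_opt
```

The perturbed controls both pay a higher production cost $k(u)$ and accumulate a higher holding cost $h(x)$, in agreement with the strict inequality in the proof of Theorem 3.4.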

REFERENCES

[1] M. Bounkhel, L. Tadj and Y. Benhadid, Optimal control of a production system with inventory-level-dependent demand. Appl. Math. E-Notes 5 (2005), 36–43.

[2] A. Cernea and Şt. Mirică, Minimum principle for some classes of nonconvex differential inclusions. An. Ştiinţ. Univ. "Al.I. Cuza" Iaşi Mat. XLI (1995), 307–324.

[3] L. Cesari, Optimization-Theory and Applications. Springer-Verlag, New York–Berlin, 1983.

[4] H. Frankowska, The maximum principle for an optimal solution to a differential inclusion with end point constraints. SIAM J. Control Optim. 25 (1987), 145–157.

[5] V. Lupulescu and Şt. Mirică, Verification theorems for discontinuous value functions in optimal control. Math. Reports 2(52) (2000), 3, 299–326.

[6] Şt. Mirică, Constructive Dynamic Programming in Optimal Control. Autonomous Problems. Editura Academiei Române, Bucharest, 2004.

[7] Şt. Mirică, User's Guide on Dynamic Programming for autonomous differential games and optimal control problems. Rev. Roumaine Math. Pures Appl. 49 (2004), 5–6, 501–529.

[8] Şt. Mirică and C. Neculăescu, On the solution of an optimal control problem in Mathematical Economics. Anal. Univ. Bucureşti Mat. XLVII (1998), 49–57.

[9] L.S. Pontryagin, V.G. Boltyanskii, R.V. Gamkrelidze and E.F. Mishchenko, The Mathematical Theory of Optimal Processes. Wiley, New York, 1962.

Received 8 September 2006 University of Bucharest

Faculty of Mathematics and Computer Science Str. Academiei 14, 010014 Bucharest, Romania

mirica@fmi.unibuc.ro, bouremani@yahoo.com
