TIME MINIMAL CONTROL OF BATCH REACTORS
B. BONNARD AND G. LAUNAY
Abstract. In this article we consider a control system modelling a batch reactor in which three speciesX1X2X3 are reacting according to the scheme X1 ! X2 !X3, each reaction being irreversible. The control is the temperature T of the reactions or the derivative of this temperature with respect to time. The terminal constraint is to obtain a given concentration of the productX2 at the end of the batch. The objective of our study is to introduce and to apply all the mathematical tools to compute the time optimal control as a closed-loop function.
This work can be used to optimize the yield of chemical batch reactors.
1. Introduction
Until now, the chemical batch reactors are mainly operating at constant temperature. Substantial gain in the yield of such reactors can obviously be obtained by controlling the temperature during the batch. This leads to optimal control problems. Recent developments in geometric optimal control allow to handle the mathematical complexity of such problems.
In 3], the authors consider the time minimal problem for a batch reactor in which three species
X
1X
2X
3are reacting according to the schemeX
1!X
2 !X
3, each reaction being irreversible and of the rst order, while the nal constraint is to obtain a given ratio of the two concentrations ofX
1X
2 and the control is the derivative of the temperature with respect to time.Due to symmetry the problem can be reduced to a time minimal control problem for a planar system and the time minimal control is computed as a closed-loop function. The objective of this article is to generalize this analysis for a network of the form
X
1 !X
2 !X
3, each reaction being irreversible but of any order, while the terminal constraint is now to obtain a desired production of the intermediate specieX
2. If the control is the derivative of the temperature with respect to time, this leads to a time minimal control problem for a system in R3 much more complex than a planar one.The main objective of our study is to present general techniques and results to handle the problem they are mainly threefold:
Bernard Bonnard: Universite de Bourgogne, Laboratoire de Topologie, UMR 5584, BP 400, 21011 Dijon Cedex, France. Email: bbonnard@satie.u-bourgogne.fr.
Genevieve Launay: INRIA Rocquencourt, Domaine de Voluceau, BP 105, 78153 Le Chesnay Cedex, France. Email: launay@pascal.inria.fr.
Received by the journal November 18, 1997. Accepted for publication August 24, 1998.
c
Societe de Mathematiques Appliquees et Industrielles. Typeset by LATEX.
rst we compute the time optimal control as a closed-loop function near the terminal constraint. This analysis is intricated it has motivated and it uses the mathematical developments of 5], 14],
secondly we need global estimates concerning the number of switchings of an optimal control,
endly we have to introduce the concept of focal points and give an algorithm to compute such points. This part is related to techniques introduced in 4].
The combination of the three previous analysis with numerical simulations allows to compute the optimal synthesis in many situations. Also it can be applied to more general reaction scheme to get closed-loop sub-optimal control laws.
This work was motivated by practical control problem and is currently implemented on a batch reactor located at Caen, France. Preliminary simu- lations indicate that in our experiment the gain of the optimal law is about 15% with respect to a constant operating temperature.
In order to present our main results we need to introduce precisely our problem.
1.1. Statement of the problem
1.1.1. Chemical kinetics. In this article, we shall restrict our study to a batch reactor in which three species
X
1X
2X
3 are reacting according to the schemeX
1 !X
2 !X
3, where the two reactions are irreversible and of then
i-th order with respect to the speciesX
i,i
= 12. Assuming that the reactions are at constant volume, denote byc
j, 1j
3 the molar concentration ofX
j and byT
the temperature of the reactions from the laws of chemical kinetics 7] we know that they satisfy:8
>
<
>
:
c
_1 =;k
1c
n11c
_2 =k
1c
n11 ;k
2c
n22c
_3 =k
2c
n22 (1.1)where _
c
denotesd dt
(c
)The parameters
k
i are depending onT
according to Arrhenius lawk
i=A
iexp(;E
i=RT
)i
= 12 (1.2) whereA
iE
i are respectively the frequency factor and the activation energy of thei
-th reaction andR
is the gas constant all these parameters are positive. Similar equations can be obtained when dealing with a network of two simple reactionsn
1X
1 !n
2X
2 !n
3X
3, where then
j's are the stoichiometric coecients of the reactions.Our optimal control problem is the following: minimize the batch time when a desired production quantity is xed for a batch. The desired product can be
z
=c
2=c
1 (problems of index 1) orc
2 (problems of index 2). Hence we shall solve time minimal problems with nal constraintz
(t
F) =d
1 (resp.c
2(t
F) =d
2), wheret
F is the batch time, and with initial conditionz
(0)< d
1 (resp.c
2(0)< d
2).1.1.2. Controls. Two mathematical controls are possible:
T
, subject to constraintsT
mT
T
M the associated problems areP1 (targetz
=d
1) andP2 (targetc
2 =d
2)or
u
= _T
, subject to constraintsu
;u
u
+ withu
;<
0< u
+ the associated problems are _P1 (targetz
=d
1) and _P2 (targetc
2=d
2).The choice of the temperature as control is clearly related to the chemical process. If we introduce
v
=k
1=
E
2=E
1=
A
2=A
1 (1.3) then system (1.1) becomes:8
>
<
>
:
c
_1 =;vc
n11c
_2 =vc
n11;v
c
n22c
_3 =v
c
n22 (1.4)and
v
can be taken as the control, sincev
is an increasing one-to-one function ofT
, the constraintsT
mT
T
M being equivalent tov
mv
v
M. Observe that system (1.4) is ane with respect tov
if and only if = 1 in this particular case the ratiodc
2=dc
1 does not depend onv
and the system is not controllable. Since in general the system is not ane with respect to the input it may happen that there is no admissible time optimal control law this is due to a relaxation phenomenon analyzed in 6].If we cannot directly track the optimal temperature prole in the previous system (1.4), we choose
u
= _T
as the control and we get the following system (which is ane with respect tou
):8
>
>
>
<
>
>
>
:
c
_1 =;vc
n11c
_2 =vc
n11 ;v
c
n22c
_3 =v
c
n22v
_ =h
(v
)u
withh
(v
) = (Rv=E
1)ln2(v=A
1)(1.5) By considering
u
= _T
as a control law, we compute an optimal law related only to the chemical network this choice is an idealization and the tem- perature can become negative. To handle this diculty we must introduce constraint on the state- coordinatev
this problem will not be considered in our study.1.1.3. Physical space. The physical space
P
of our problem is dened as follows:8
>
<
>
:
c
1>
0c
20c
2(t
)>
0 ift
is not the initial time]c
30z < d
1 orc
2< d
2 except at nal time,0
< v < A
1 sincev
=k
1 =A
1exp(;E
1=RT
) withT >
0:
(1.6) Then the following inequality:h
(v
)>
0 (1.7)comes from (1.5) and (1.6).
1.1.4. An example. The main properties of our study can be well under- stood by considering the following example. Notice that according to system (1.1),
z
=c
2=c
1 satises:z
_= (v
+vz
)c
n11;1;v
z
n2c
n12;1Therefore _
z
is independant ofc
1if and only ifn
1 =n
2 = 1. In this casen
1 =n
2 = 1 the structure of P1 and _P1 is particular and allows simplications.This introduces the lightning analysis of the next section.
1.2. Solving problem P1 in the case
n
1 =n
2 = 11.2.1. Computations. According to section 1.1, problem P1 is the time minimal problem of reaching the target (
z
=d
1) from an initial valuez
0 2 0d
1 ofz
for the dynamical system (1.4) which becomes in the coordinatesc
1zc
3 whenn
1=n
2 = 1:8
>
<
>
:
c
_1 =;vc
1z
_ =v
+vz
;v
z
c
_3 =v
c
2 (1.8)the variable
v
such thatv
mv
v
M being considered as the control.Note that the optimal law is: maximize _
z
subject tov
mv
v
M. Note also that the target (z
=d
1) is not accessible fromz
0 2 0d
1 such that maxvtz
_ 0, where maxvtz
_ is the maximum of _z
for anyv
2v
mv
M] and for any time t.Consider then
H
(c
1zv
) = _z
. From (1.8) it comesH
= 0 whenv
= 0 (1.9)@H @v
= 1 +z
;v
;1z
(1.10)@
2H
@v
2 =;(;1)v
;2z
(1.11) So if 6= 1,H
is a non-linear function ofv
such that there exists a single positive value ^v
ofv
satisfying@H
@v
= 0 and so corresponding to an extremum ofH
, which is from (1.11):a maximum (positive since (1.9) holds) if
>
1,a minimum (negative since (1.9) holds) if
<
1.Let us introduce
G
(resp.S
) the set of points (v >
0,z >
0) such thatH
= 0 (resp.@H
@v
= 0). From (1.8),G
is dened byz
(v
;1;1) = 1, and from (1.10),S
is dened byz
(v
;1;1) = 1: so if 6= 1,G
\S
= . 1.2.2. Results. Figure 1 showsG
andS
in the cases<
1,>
1 and a careful observation of this gure gives the optimal trajectories. In order to describe these optimal trajectories, let us denote by m (resp. M^) any arc satisfying (1.8) with controlv
m (resp.v
M^v
). Then, for abcin f
mM^g,abc denotes the concatenation of a a arc followed by a b arc and then ended by a c arc where each arc of the sequence may be empty.O
v H>0
@H@v >0
H<0 z
d
1
vA
O
() 1
;1
1
;1 S(@H@v =0) G(H =0)
@H@v <0
H<0
z v=vM
@H@v >0
z
d
1
H>0
^ v
1 vm
O
H>0 G(H=0) S(@H@v =0)
@H@v >0 @H@v <0
1
;1
() 1
;1
^ zm
^ zM
v=^v v=vm
H<0
v=vM
z v
@H@v <0
vM
O
Figure 1 (i) :
<
1 Figure 1 (ii) :>
1 Figure 1Case
<
1. For anyz
0 in 0d
1,if
v
M< v
A, wherev
A =
d
1 1 +d
11
1; is the
v
-coordinate ofG
\(z
=d
1), the target (z
=d
1) is not accessible,if
v
Mv
A, optimal trajectories areM. Case>
1. For anyz
0 in 0d
1,if
v
Mv
^1, where ^v
1=d
1 1 +d
11
1; is the
v
-coordinate ofS
\(z
=d
1), optimal trajectories areM,if
v
mv
^1< v
M, optimal trajectories areM^,M being empty if and only ifz
0> z
^M where ^z
M = 1=
(v
M;1;1) is thez
-coordinate ofS
\(v
=v
M),if ^
v
1< v
m<
1;1 , optimal trajectories are M^ m, M being empty if and only ifz
0> z
^M, and M^ being empty if and only ifz
0> z
^mwhere ^
z
m = 1=
(v
m;1;1) is thez
-coordinate ofS
\(v
=v
m),if
v
m1;1 , the target (z
=d
1) is not accessible.1.2.3. Conclusion. This example shows the following phenomenon. The value
= 1 is a bifurcation value for the parameter which is the ratio of activation energiesE
1=E
2(cf. (1.3) in section 1.1.2) the optimal law appears to be radically dierent in the case<
1 and>
1. More precisely, the optimal policy when the target is accessible is:when
<
1, to apply the maximal temperatureT
Mwhereas if
>
1, there exists an intermediate optimal heat law in some domain of the state space which is neitherT
m norT
M. Such a law is called a singular optimal control, see 3] it plays an important role in our problem it is related to the convexity of the functionv
7!H
= _z
and all the analysis relies on the behavior ofH
.1.3. Maximum Principle and Extremals
In order to make this article self-contained and to give the results as clearly as possible, let us recall some basic concepts and results about Pontryagin's Maximum Principle (in brief PMP) and extremals.
1.3.1. Statement of PMP. Consider a system of the form
x
_(t
) =f
(x
(t
)u
(t
))x
(t
)2Rnu
(t
)2Rm (1.12) wheref
is an analytic mapping from RnRto Rn and where the set U of admissible controls is the set of measurable mappingsu
() dened on an intervalt
0(u
)0] of R;and taking their values inu
1u
2]. LetN
be a regular analytic submanifold ofRn. The PMP asserts that ifu
(t
),t
2t
00]is an optimal control for the time minimal control problem with terminal manifold
N
, then there exists a so-called adjoint vectorp
(t
) 2 Rnnf0Rng withp
absolutely continuous, such that the following equations are satised almost everywhere ont
00]:dt d
(x
) =@H
@p
(x
p
u
)ddt
(p
) =;@H
@x
(x
p
u
) (1.13)H
(x
p
u
) =M
(x
p
) (1.14)where
H
(xpu
) =hpf
(xu
)i,hibeing the canonical inner product inRn and whereM
(xp
) = maxu2u1u2]
H
(xpu
). Moreover we have:t
7!M
(x
(t
)p
(t
)) is constant and non-negative (1.15) and at the nal time 0 the so-called transversality conditions are satised:x
(0)2N p
(0) is orthogonal toT
x(0)N
(1.16) whereT
xN
denotes the tangent space toN
atx
.1.3.2. Definitions.
System (1.13) is called the hamiltonian lift of (1.12), and
H
is called the Hamiltonian.Any (
xpu
) solution to (1.13), (1.14) and (1.15) is called an extremal (sometimes the adjoint statep
will be omitted).Any extremal (
xpu
) that satises the transversality conditions (1.16) is called a BC-extremal.Any extremal (
xpu
) such thatM
(xp
) = 0 is called exceptional.An extremal (
xpu
) is called regular if and only if for almost allt
,u
(t
) is equal tou
1 or tou
2 if moreoveru
is piecewise constant, the extremal is called bang-bang.An extremal (
xpu
) is called singular if and only if for every time@H @u
(xpu
) = 0Let (
xpu
) be an extremal. A time is called a switching time if and only if belongs to the closure of the set of times wherex
is notC1 the corresponding pointx
() is then called a switching point.1.3.3. Notation. As in section 1.2.2, if
a and b are two arcs solutions to system (1.12), ab denotes the concatenation of a followed by b (this corresponds to the concatenation of the controls associated to a and then to b).1.3.4. Singular extremals. By denition a singular extremal (
xpu
) is a solution to (1.13), (1.14) and (1.15) satisfying@H
@u
= 0 from (1.14) it has to satisfy the Legendre condition@
2H
@u
2 j(xpu)0 if this inequality is strict for every timet
, it is called the strong Legendre condition. Then by the implicit function theorem the singular control can be locally computed as a function of (xp
) by solving@H
@u
= 0 equivalently the singular control can be obtained as solution to the Cauchy problem for the dierential equationd
dt @H
@u
= 0, the strong Legendre condition implying the existence of such a solution.Recall that, in order to be admissible, the singular control has to belong to
u
1u
2]. Any point of a singular extremal where the singular control is equal tou
1 or tou
2 is called a saturation point.1.4. Statement of the main results
We shall give the main results of our article. To simplify the presentation we shall assume that each reaction is of rst order, i.e. that
n
1=n
2 = 1.1.4.1. ProblemP_1 in the case
n
1 =n
2 = 1. According to section 1.1, _P1 is the time minimal control problem of reaching the target (z
=d
1) from an initial valuez
0 20d
1 ofz
for the dynamical system (1.5) and for controlu
such thatu
;u
u
+ withu
;<
0< u
+. Whenn
1 =n
2 = 1, system (1.5) becomes in the state-coordinatesc
1zc
3v
:8
>
>
>
<
>
>
>
:
c
_1 =;vc
1z
_ =v
+vz
;v
z c
_3 =v
c
2v
_ =h
(v
)u
withh
(v
) = (Rv=E
1)ln2(v=A
1)(1.17) We shall prove subsequently (cf. section 3.1) that we can restrict our study to the planar system:
(
z
_ =v
+vz
;v
z
v
_ =h
(v
)u
withh
(v
) = (Rv=E
1)ln2(v=A
1) (1.18) Let us discuss the corresponding optimal synthesis.We introduce the following notations:
Let
u
s be the singular control, and let us denote by ; (resp. +s) any arc satisfying (1.18) with controlu
; (resp.u
+u
s).Let
A
be the set of the points of the target (z
=d
1) that are accessible from(
z < d
1).Let
G
be the set of the points (z >
0,v
2]0A
1) such that _z
= 0.Let
S
be the set of the points (z >
0,v
2]0A
1) such thatd dt @H
@u
= 0, whereH
denotes the Hamiltonian corresponding to system (1.18) and letS
1 =S
\N
. From section 1.3.4,S
contains all the singular extremals.Computations show that the singular control
u
s is negative and that there is onS
at most one saturation pointS
sat, whereu
s=u
;.In the following gures (Figures 2 and 3) we assume that
v
A< A
1, wherev
A denotes the v-coordinate ofG
\(z
=d
1) ifA
1v
A, these gures have to be restricted to (v < A
1).Case
<
1. Near the target, every optimal trajectory is of the form + and the local synthesis is given by Figure 2.S G(z_ =0)
S
1
A
1 vA
+
+ z
d
1
v
A, empty if and only if
A
1v
AFigure 2
Case
>
1. In this case we can describe the global synthesis. Letz
sat denote thez
-coordinate of the saturation pointS
sat. We have two situations:if
z
satd
1, optimal trajectories are+;and the synthesis is represented on Figure 3(i),if
z
sat< d
1, optimal trajectories are+;s and the synthesis is given by Figure 3(ii)where each arc of these sequences may be empty
1.4.2. Problem P2 in the case
n
1 =n
2 = 1. According to section 1.1,P
2 is the time minimal problem of reaching the target (
c
2 =d
2) from an initial value (c
2)0 2 0d
2 ofc
2 for the system (1.4) where the control isv
such thatv
mv
v
M.We shall prove subsequently (cf. section 3.1) that we can restrict our study to the planar system:
(
c
_1 =;vc
1c
_2 =vc
1;v
c
2 (1.19)Let us denote by ^
v
the singular control and by m (resp. M^) any arc satisfying (1.19) with controlv
m (resp.v
M^v
).Let
A
be the set of the points of the target (c
2 =d
2) that are accessible from (c
2< d
2).Let
E
m (resp.E
M) be the point of target such that _c
2 = 0 whenv
=v
m(resp.
v
=v
M).A
1
v G(z_=0)
d
1
G(z_=0)
; S
S
+
+
+
+
;
;
G(z_=0) S
S
A
1
v d
1
;
G(z_=0)
s
+
+
+
+
+
+
;
;
A, O
O z z
S
1
S
1 v
1
v
A
v
1
v
A
S z
sat
S
sat
v
sat
S
sat z
sat
v
sat
+ Figure3(i):z
sat d
1
Figure3(ii):z
sat
<d
1
setofswitchingp ointsfor
+
; ,
setofrstswitchingp ointsfor+;s
Figure 3
Let
S
m (resp.S
M) be the point of the target such that 0 =@H
@v
jv=vm, i.e.v
m = ^v
jt=0 (resp. 0 =@H
@v
jv=vM, i.e.v
M = ^v
jt=0) whereH
is the Hamiltonian (recall that, from section 1.3.4, the singular control ^v
is solution to@H
@v
= 0).Case
<
1. Each optimal trajectory is of the form M and the synthesis is represented on Figure 4.A M
EM
M M
c
1 d
2 c
2
Figure 4 Case
>
1. Results are threefold:Near the target, each optimal policy has at most one switching.
Each optimal trajectory is of the form
M^m, where each arc of the sequence may be empty, and each open-loop control isC0.Numerical simulations(cf. section 8) show that the optimal synthesis is given by Figure 5.
A
Em Sm SM
^ v=vM
^
^
c
1
^ v=vM
M
M
M
M
M
m
^ v=vm
^ v=vM
^
m
d
2 c
2
Figure 5
1.4.3. ProblemP_2 in the case
n
1 =n
2 = 1. According to section 1.1, _P2 is the time minimal problem of reaching the target (c
2=d
2) from an initial value (c
2)0 2 0d
2 ofc
2 for the control system (1.5) where the controlu
satisesu
;u
u
+ withu
;<
0< u
+.We shall prove subsequently (cf. section 3.1) that we can restrict our study to the three dimensional system:
8
>
<
>
:
c
_1 =;vc
1c
_2 =vc
1;v
c
2v
_ =h
(v
)u
withh
(v
) = (Rv=E
1)ln2(v=A
1) (1.20)Let us denote by
u
s the singular control and by; (resp. +s) any arc satisfying (1.20) with controlu
; (resp.u
+u
s).The set
A
of the points of the target (c
2 =d
2) that are accessible from (c
2< d
2) is included in the set A of the points of the target such thatc
_20, the frontier of which is the setE of the points of the target such thatc
_2= 0.Let
S
2 be the set of the points of the target such thatd dt @H
@u
= 0, whereH
denotes the Hamiltonian (from section 1.3.4 any singular extremal satises this condition). It turns out that at the nal time the singular control is neg- ative and that there is onS
2 one saturation pointS
sat, whereu
;=u
sjt=0. On Figure 6 we describe the stratication of the target by the optimal control in the (vc
1) coordinates it is deduced from the equations of the Maximum Principle, except near the set E where the situation is intricated because of accessibility problems.non-accessible points: and onE, withE+ non-accessible
u=u
+ u=u
+
u=u
;
onE, withE; accessible
A
1 v u
;
<usjt=0
S
2
v
sat
E c
1
O
E
; S
sat
Figure 6 (ii) :
>
1 non-accessible points: andusjt=0<u;
A
1 c
1 S
2 E
E
+
O
S
2 E
v
Figure 6 (i) :
<
1Figure 6 : stratication of the target
Case
<
1. It turns out that near the target the optimal policy isu
=u
+. Moreover there exists one point E+ on E such thatd
2dt
2(c
2) = 0 near this point the local synthesis is given by Figure 7and
non-accessible points: onE, withE+ non-accessible
v E
+
+ E
+ c
1 E
+
A
1
Figure 7 : local synthesis near E+ when
<
1O
Case
>
1. We got the following local and global results:Lo cal synthesisNear the target the optimal synthesis is described as fol- lows:
For optimal trajectories arriving near
S
2, the synthesis can be topologically described by an invariant foliation (v
=v
0). Letv
satdenote thev
-coordinate of the saturation pointS
sat the syntheses in each leaf (v
=v
0) are shown by Figure 8:if
v
0< v
sat, optimal trajectories are of the form+;s(see Figure 8 (i)) ifv
0v
sat, optimal trajectories are of the form+; (see Figure 8 (ii)) where each arc of these sequences may be empty.For optimal trajectories arriving nearE, the situation is intricated. Near the target, close enough to E, the optimal policy is
u
=u
;. Moreover there exists one point E; on E whered
2dt
2(c
2) = 0 letv
;denote thev
-coordinate of E;. Near a point E0 6=E; of E, the optimal synthesis is described by aC
0 invariant foliationF: (
v
=v
0) the leaves of which are given by Figures 9 (i) and (ii) near E; there is no such foliation and the synthesis is given by Figure 9 (iii).Global switching rulesEach optimal control law has at most two switch- ings and each optimal trajectory is of the form
+;s, where each arc of this sequence may be empty.1.5. Summary
This paper is organized as follows. In section 2 we recall extremality results for time minimal problems with single input ane systems. In section 3 we introduce the concepts of projected and reduced problems, which are straightforward but important tools in our study. The use of projected problems is related to symmetry properties of our systems. The concept of
c
1
c
2
E S
2
E (c
2
=d
2 )
;
+
+
+
+
+
;
s
(c
2
=d
2 )
+
+
S
2
+
;
set of switching points for
+; set of rst switching points for+;sFigure 8 (ii) :
v
0v
sat Figure 8 (i) :v
0< v
satFigure 8 : local synthesis near
S
2 in the leafv
=v
0 when>
1 reduced problem is more acute it is related to Goh's transformation and lightens the relationship between problems Pi(with controlT
) and _Pi(with control _T
),i
= 12. In section 4 we analyze problem P2 rst we solve P2 near the target when6= 1 secondly we nd global bounds on the number of switchings. In section 5 (resp. 6) we apply results of 5] and 14] to compute in the case_ 6= 1 the closed-loop optimal control near the target for problemP
1 when
n
1 =n
2 = 1 (resp. problem _P2 for anyn
1n
2) secondly in the case>
1, we nd global switching rules for _P1 whenn
1 =n
2 = 1 (resp._
P
2, except when
n
2 = 1 andn
1 2). Section 7 deals with the concept of conjugate and focal points. Endly, section 8 is an appendix regrouping commented numerical experiments.2. Extremals for time minimal problems with single input affine systems
2.1. Definitions and elementary results
2.1.1. Statement of the problem. Throughout this section 2, the con- sidered system is:
x
_(t
) =X
(x
(t
)) +u
(t
)Y
(x
(t
))x
(t
)2Rn andu
(t
)2R (2.1) whereX
andY
are analytic vector elds. As in section 1.3.1, the set U of admissible controls is the set of measurable mappingsu
() dened on an intervalt
0(u
)0] of R; and taking their values inu
1u
2] and againN
denotes a regular analytic submanifold ofRn, which is the target to be reached within minimal time.This problem is said to be at if and only if
Y
is everywhere tangent toN
. Note that problems _P1 and _P2 dened in section 1.1 match the abovev E
;
;
;
; c
1
Figure 9 (i):
v
0< v
;E
0
;
;
Figure 9 (ii):
v
0> v
;E
0 no optimal
trajectory
local synthesis nearE0 =E\(
v
=v
0) in the leafv
=v
0 when>
1non-accessible points: and on E, with E; accessible Figure 9 (iii): local synthesis near E; when
>
1
;
A
1 v
;
;
;
E
;
Figure 9 conditions and are both at.
NotationsLet us denote by
u
s the singular control and by1 (resp. 2s) any arc satisfying (2.1) with controlu
1 (resp.u
2u
s).2.1.2. Switching function. For any extremal (
xpu
) the Hamiltonian isH
(xpu
) =hpX
(x
)+u Y
(x
)i(from section 1.3.1 applied to system (2.1)).Hence the equation
@H
@u
= 0 (which is satised by any singular extremal) is equivalent tohpY
(x
)i= 0. The mapping::
t
7!hp
(t
)Y
(x
(t
))i (2.2) evaluated along (xp
) is called the switching function.Computing we get:
_(
t
) =hp
(t
)XY
](x
(t
))i (2.3) and along any smooth extremal:(
t
) =hp
(t
)ad2X
(Y
)(x
(t
));u
(t
)ad2Y
(X
)(x
(t
))i (2.4)where the Lie bracket of two analytic vector elds
Z
1Z
2 is computed with the convention:Z
1Z
2](x
) =@Z
2@x
(x
)Z
1(x
) ;@Z
1@x
(x
)Z
2(x
), and where adZ
1(Z
2) is the mapping dened by adZ
1(Z
2) =Z
1Z
2].2.1.3. Elementary regular cases.
Proposition 2.1. Let
z
0 = (x
0p
0) 2 Rn (Rnnf0Rng) be such thath
p
0Y
(x
0)i 6= 0. In a neighborhood ofz
0, any extremal is a 1 (resp. 2) arc if and only if hp
0Y
(x
0)i is negative (resp. positive).Proof. From PMP any extremal maximizes the Hamiltonian.
Corollary 2.2. Let (
xpu
) be an extremal with switching time then necessarilyhp
()Y
(x
())i= 0, i.e. () = 0.Proposition 2.3. Let
z
0 = (x
0p
0)2Rn(Rnnf0Rng) be such thatY
(x
0)6= 0, hp
0Y
(x
0)i = 0 and hp
0XY
](x
0)i 6= 0: such a pointz
0 is called a normal switching point. Then close enough toz
0 there is only one extremal containingz
0, which is 21 (resp. 12) if and only if hp
0XY
](x
0)i is negative (resp. positive),x
0 being the switching point.Proof. Consider (
xpu
) an extremal that containsz
0, and let 0 be the time such thatz
0 = (x
(0)p
(0)). Since hp
0XY
](x
0)i 6= 0 i.e. (from (2.3)) _(0)6= 0, close enough toz
0 the considered extremal has no singular arc (indeed, from section 2.1.2, any singular arc satises 0). Then the result comes from the maximization of the Hamiltonian (cf. PMP) after expanding at0 up to the rst order.2.1.4. Singular extremals: generalities.
Definitions and notations. A point (
xp
) is called ordinary if and only if hp
ad2Y
(X
)(x
)i6= 0 let be the set of non-ordinary points. A singular extremal (xpu
) such that (x
(t
)p
(t
)) 2 R2nn is said to be of order 2.Let ! be the variety f(
xp
) 2 Rn(Rnnf0Rng) hpY
(x
)i = 0g and let!0=f(
xp
)2Rn(Rnnf0Rng) hpY
(x
)i=hp XY
](x
)i= 0g included in!.Let
H
s be the restriction to !0n of the mapping: (xp
) 7! hpX
(x
) +u
s(xp
)Y
(x
)i, where:u
s(xp
) = hp
ad2X
(Y
)(x
)ih
p
ad2Y
(X
)(x
)i (2.5) Proposition 2.4. Any singular extremal (xpu
) is such that (x
(t
)p
(t
))2!0. Moreover the singular extremals (