Optimisation TD 4 - correction

(1)

Optimisation TD 4 - correction

lanquetuit.cyril@gmail.com, Université de Cergy Pontoise

Optimiser une fonction, c’est minimiser ou maximiser, en ses optima sa dérivée s’annule et leur nature (min ou max) dépend de la concavité ou convexité de celle-ci

1 Composition et convexité

f :C⊂R→Rest convexe

∀(x, y)∈C²,∀λ∈]0,1[, f(λx+ (1−λ)y)≤λf(x) + (1−λ)f(y) etg:R→Rest croissante donc

∀(x, y)∈C²,∀λ∈]0,1[, gof(λx+ (1−λ)y)≤g(λf(x) + (1−λ)f(y))≤gof(x) + (1−λ)gof(y) puisque g est convexe

Donc gof est convexe.

2 Sous ensemble convexe

Sα={x∈C/f(x)≤α}est convexe dansRⁿ⇔∀(x, y)∈Sα,∀λ∈]0,1[, λx+ (1−λ)y∈Sα

c’est à dire si et seulement si∀(x, y)∈C²tels quef(x)≤αetf(y)≤α, f(λx+ (1−λy)≤α or f est convexe doncf(λx+ (1−λ)y)≤λf(x) + (1−λ)f(y)≤λα+ (1−λ)α=α

DoncSαest convexe.

3 Convexité en 3D

f(x, y, z) =x²+ 2y²+ 3z²−xy+ 2yz+xz sera convexe si et seulement si sa matrice hessienne est définie positive⇔Spec(Hf)⊂R⁺^∗

Bf

Bx= 2x−y+ 2,^B²^f

Bx² = 2, ^B²^f

BxBy =

−1,

Bf

By = 4y−x+ 2z,^B²^f

By² = 4, ^B²^f

ByBz = 2,

Bf

Bz = 6z+2y+x,^B²^f

Bz² = 6, ^B²^f

BzBx= 1

Hf =⎡⎢

⎢⎢⎢⎢

⎣

2 −1 1

−1 4 2

1 2 6

⎡⎢⎢⎢

⎢⎢⎣ det(Hf−XI3) =

⎡⎢⎢⎢

⎢⎢⎣

2−X −1 1

−1 4−X 2

1 2 6−X

⎡⎢⎢⎢

⎢⎢⎣

χ_Hf(X) = (2−X)((4−X)(6−X)−4) + 1((X−6)−2) + 1(−2 +X−4) =−X³+ 12X²−42X+ 30 χ^′_Hf(X) =−3X²+ 24X−42<0siX≤0

∆ = 576−504 = 72

x₁=⁻²⁴₋⁻₆^√⁷²>0, x₂=⁻²⁴⁺₋₆^√⁷²>0

donc surR⁻χ_Hf est décroissante etχ_Hf(0) = 30 DoncSpec(Hf)⊂R⁺^∗et f est convexe.

1

(2)

4 Convexité paramétrée

f(x1, x2) =x^α₁x^β₂

Bf

Bx1 =αx^α₁⁻¹x^β₂

Bf

Bx2 =βx^α₁x^β₂⁻¹

B²f

Bx²₁ =α(α−1)x^α₁⁻²x^β₂

B²f

Bx²₂ =β(β−1)x^α₁x^β₂⁻²

B²f

Bx₁Bx₂ =αβx^α₁⁻¹x^β₂⁻¹

Hf =f⎡⎢

⎢⎢⎢⎣

α(α−1) x²₁

αβ x₁x₂ αβ

x1x2

β(β−1) x²₂

⎡⎢⎢⎢

⎢⎣

Siα+β= 1 Hf =f⎡⎢

⎢⎢⎢⎣

−αβ x²₁

αβ x1x2

−αβ x²₂

⎡⎢⎢⎢

⎢⎣, T r(Hf)<0etdet(Hf)≥0⇒Spec(Hf)⊂R⁻∪0

Siα+β <1 Hf est définie négative si et seulement si tout les mineurs principaux d’ordre impaire sont négatif et ceux d’ordre paire sont positifs or

−αβ x²₁ <0 det(Hf) = (_x^f

1x2)²(α(α−1)β(β−1)−(αβ)²) = (_x^f

1x2)²(αβ(1−(α+β)))>0 Hf définie négative donc f est stritement concave.

5 Le retour de la convexité paramétré

f(x1, x2) =x^α₁ +µx^β₂, µ>0

Bf

Bx1 =αx^α₁⁻¹

Bf

Bx2 =µβx^β₂⁻¹

B²f

Bx²₁ =α(α−1)x^α₁⁻²

B²f

Bx²₂ =µβ(β−1)x^β₂⁻²

B²f Bx1Bx2 = 0

Hf =f[α(α−1)x^α₁⁻² 0 0 µβ(β−1)x^β₂⁻²[

Si0<α<1,0<β<1 T r(Hf)<0, det(Hf)>0⇒f strictement concave Siα>1, β>1 T r(Hf)>0, det(Hf)>0⇒f strictement convexe

6 Homogénéité et concavité

fC²homogène de degré 1=k⇔∑ix_i^Bf

Bxi =kfdoncx^Bf

Bx+y^Bf

By =f x^B^f

Bx+x^B²^f

Bx² +y ^B²^f

BxBy =^B^f

Bx

Doncx^B²^f

Bx² =−^y_x_BxBy^B²^f =f

x^B²^f

ByBx+^B^f

By+y^B²^f

By² =^B^f

By

Doncx^B²^f

By² =−^x_y_ByBx^B²^f =f

Hf = ^B²^f

BxBy[−^y_x 1 1 −^x_y[

det(Hf) = 0donc f concave⇔−_x^y_BxBy^B²^f ≤0⇔ _BxBy^B²^f ≥0 2

(3)

Optimisation TD 5 - correction

—– Recherche des extrema d’une fonction —–

f(X+h) =f(X) +hf^′(X) +^h₂²f”(X) +o(h²)formule de Taylor à l’ordre 2 (***)

extremum⇒f’ s’annule, f”>0⇒f concave, maximum, f”<0⇒f convexe, minimum.

—– Méthode —–

1/ Déterminer les points critiques solutions du système∀1≤i≤n,_Bx^Bf

i = 0

2/ Etude globale, la matrice hessienne Hf peut être :

• d.p.(Spec(Hf)⊂R⁺^∗)⇒f strictement convexe, minimum global strict

• d.n.(Spec(Hf)⊂R⁻)⇒f strictement concave, maximum global strict

• s.d.p.(Spec(Hf)⊂R⁺)⇒f convexe, minimum global

• s.d.n.(Spec(Hf)⊂R⁻∪{0})⇒f concave, maximum global

• Dans le dernier cas l’étude locale en chaque point critique est nécessaire 3/ Etude locale, pour chaque point critique X, Hf(X) peut être :

• définie positive⇒minimum local strict

• définie négative⇒maximum local strict

• semi définie positive⇒on ne peut conclure (signe deo(h²)inconnu (***))

• semi définie négative⇒on ne peut conclure

• Hf comporte des valeurs propres positives, d’autres négatives, selon la direction dans laquelle on choisi la variation h, f(X+h) sera donc supérieur ou inférieur à f(X)⇒le point critique considéré est singulier (ni min, ni max)

1

(4)

1 x

²

+ y

²

+ z

²

− xy − x − 2z

0 = ^Bf_Bx = 2x−y−1,0 = ^Bf_By = 2y−x,0 = ^Bf_Bz =

2z−2, X= (²₃,¹₃,1)point critique

B²f

Bx² = 2,^B_By²^f2 = 2,^B_Bz²^f2 = 2, _BxBy^B²^f =−1,_BxBz^B²^f = 0,_ByBz^B²^f = 0, Hf =⎡⎢

⎢⎢⎢⎢

⎣

2 −1 0

−1 2 0

0 0 2

⎡⎢⎢⎢

⎢⎢⎣ mineurs principaux : 2, 3, 6⇒Hf d.p. donc X minimum global strict.

2 x

³

+ y

³

− (x + y)

²

0 =^B^f

Bx = 3x²−2(x+y),0 =^B^f

By = 3y²−2(x+y), X1= (⁴₃,⁴₃)etX2= (0,0)points critiques

Hf = [6x−2 −2

−2 6y−2[,Hf(X1) =[ 6 −2

−2 6[,Hf(X2) =[−2 −2

−2 −2[

T r(Hf(X₁) = 12, det(Hf(X₁) = 32 ⇒ valeurs propres : 4 et 8 ⇒ Hf(X₁) d.p.

doncX₁ minimum local strict.

T r(Hf(X₂) =−4, det(Hf(X₂) = 0⇒valeurs propres : -4 et 0⇒Hf(X₂)s.d.n.

on ne peut conclure.

3 x

³

z + y

³

− 3x

²

y − 2z

²

0 = ^Bf_Bx = 3x²z−6xy,0 = ^Bf_By = 3y²−3x²,0 = ^Bf_Bz =x³−4z X_1,2,3= (0,0,0),(2,2,2),(−2,2,−2)points critiques, Hf =⎡⎢

⎢⎢⎢⎢

⎣

6xz−6y −6x 3x²

−6x 6y 0 3x² 0 −4

⎡⎢⎢⎢

⎢⎢⎣ Hf(X₁) =⎡⎢

⎢⎢⎢⎢

⎣

0 0 0 0 0 0 0 0 −4

⎡⎢⎢⎢

⎢⎢⎣

,Hf(X₂) =⎡⎢

⎢⎢⎢⎢

⎣

12 −12 12

−12 12 0 12 0 −4

⎡⎢⎢⎢

⎢⎢⎣

,Hf(X₃) =⎡⎢

⎢⎢⎢⎢

⎣

12 12 12 12 12 0 12 0 −4

⎡⎢⎢⎢

⎢⎢⎣ PourX1 : Hessienne s.d.n.⇒on ne peut conclure.

Pour X2, X3 : T r = 20 > 0, det = −12³ < 0 ⇒ 2 valeurs propres positives, une négative⇒points singuliers.

4 x

³

+ 3xy

²

− 15x − 12y

0 = ^Bf_Bx = 3x²+ 3y²−15,0 = ^Bf_By = 6xy−12

X_0,1,2,3= (2,1),(−2,−1),(1,−2),(−1,−2)points critiques, Hf = 6[x y y x[ PourX₀ : det > 0, Tr > 0⇒Hf(X₀)d.p.⇒minimum local strict.

(5)

1 ⇒ 1 ⇒

PourX_2,3 : det < 0⇒2 valeurs propres de signe opposé⇒points singuliers.

5 (x

²

+ y

²

)e

^x²⁻^y²

0 = ^Bf_Bx = 2x(1 +x²+y²)e^x²⁻^y²,0 = ^Bf_By = 2y(1−x²−y²)e^X²⁻^y² X_1,2,3= (0,0),(0,−1),(0,1)points critiques

B²f

Bx² = (6x²+ 2 + 2y²+ 4x²(1 +x²+y²))e^x²⁻^y²,

B²f

By² = (−6y²−2x²+ 2−4y²(1−x²−y²))e^x²⁻^y²,

B²f

BxBy = (−4xy+ 4xy(1−x²−y²))e^x²⁻^y² Hf(X₁) =[2 0

0 2[ d.p.⇒X₁ minimum local strict.

Hf(X₁) =Hf(X₂) =[⁴^e 0

0 ⁻_e⁴[ 2 valeurs propre opposées⇒X₂, X₃ singuliers.

6 ln(x − 2) + ln(y − 3) − x − 2y

0 = ^Bf_Bx = _x₋¹₂ −1,0 = ^Bf_By = _y₋¹₃−2, X = (3,⁷₂)point critique Hf =[ ⁻

1

(x−2)² 0

0 _(y⁻₋¹₃₎2[d.n.⇒X maximum global strict.

7 x

²

− 2xy + 2y

²

+ 2y + z

²

− 4z + 6

0 = ^Bf_Bx = 2x−2y,0 = ^Bf_By =−2x+ 4y+ 2,0 = ^Bf_Bz = 2z−4, X = (−1,−1,2)point critique, Hf =⎡⎢

⎢⎢⎢⎢

⎣

2 −2 0

−2 4 0

0 0 2

⎡⎢⎢⎢

⎢⎢⎣

mineurs principaux 2,4,8 poistifs ⇒Hf d.p.⇒X minimum global strict.

8 xln(y) + zln(x) − y

0 = ^Bf_Bx =lny+²_x,0 = ^Bf_By =^x_y −1,0 = ^Bf_Bz =lnx, X = (1,1,0)point critique

Hf = ⎡⎢

⎢⎢⎢⎢

⎣

−z x²

1 y

1 1 x

y −x y² 0

1

x 0 0

⎡⎢⎢⎢

⎢⎢⎣

,Hf(X) =⎡⎢

⎢⎢⎢⎢

⎣

0 1 1 1 −1 0 1 0 0

⎡⎢⎢⎢

⎢⎢⎣

det > 0, Tr < 0⇒2 valeurs propres négatives, une positive ⇒X point singulier.

(6)

Optimisation TD 6 - correction

Extrema de f(X) sous la contrainte g(X)=0 1/ Points critiques solutions du système∀1≤i≤n, ^BL

Bxi = 0, g(X) = 0oùL=f+λg 2/ Convexité du Lagrangien dépendant de la matrice hessienneHL

1 x

²

− xy + 4y

²

/1 − 3x − 4y = 0

L=f+λg

0 = ^BL_Bx = 2x−y−3λ,0 =^BL_By = 8y−x−4λ,0 =g(x, y) = 1−3x−4y

⎡⎢⎢⎢

⎢⎢⎣

2 −1 −3

−1 8 −4

−3 −4 0

⎡⎢⎢⎢

⎢⎢⎣

⎡⎢⎢⎢

⎢⎢⎣ x y λ

⎡⎢⎢⎢

⎢⎢⎣

=⎡⎢

⎢⎢⎢⎢

⎣ 0 0

−1

⎡⎢⎢⎢

⎢⎢⎣ x=₁₂₈²⁸, y=₁₂₈¹¹, λ=₁₂₈¹⁵ H_L=H_f =[2 −1

−1 8[,T r(H_L) = 10>0, det(H_L) = 15>0⇒H_Ld.p.⇒Lconvexe⇒(₃₂⁷,₁₂₈¹¹) minimum global strict

2 3x

²

− xy + y

²

/1 − 2x − 3y = 0

L=f+λg0 =^BL

Bx = 3x−y−2λ,0 =^BL

By =−x+ 2y−3λ,0 =g(x, y) = 1−2x−3y

⎡⎢⎢⎢

⎢⎢⎣

3 −1 −2

−1 2 −3

−2 −3 0

⎡⎢⎢⎢

⎢⎢⎣

⎡⎢⎢⎢

⎢⎢⎣ x y λ

⎡⎢⎢⎢

⎢⎢⎣

=⎡⎢

⎢⎢⎢⎢

⎣ 0 0

−1

⎡⎢⎢⎢

⎢⎢⎣

x = ₇₄⁷, y = ²⁰₇₄, λ = ¹¹₇₄, H_L = H_f = [6 −1

−1 2[, T r(H_L) = 12 > 0, det(H_L) = 11 > 0 ⇒ HLd.p.⇒Lconvexe⇒(₇₄⁷,¹⁰₃₇)minimum global strict

3 xy/2 − x

²

− y

²

= 0

L=f+λg0 =^BL

Bx =y−2xλ,0 =^BL

By =x−2yλ,0 =g(x, y) = 1−x²−y²

(x, y, λ) = (1,1,¹₂),(−1,−1,¹₂),(1,−1,−¹₂),(−1,1,¹₂)points critiquesHLf =[−2λ 1 1 −2λ[ Pourλ=¹₂ : det = 0, Tr = -2⇒Lconcave⇒maximum global.

Pourλ=−¹₂ : det = 0, Tr = 2⇒Lconvexe⇒minimum global.

(7)

4 x + 2y/1 − x − y = 0

L=f+λg 0 = ^BL

Bx = 1−2xλ,0 =^BL

By = 2−2yλ,0 =g(x, y) = 1−x²−y² (x, y, λ) = (√¹

5,√²

5,^√₂⁵),(−^√¹₅,−^√²₅,−^√₂⁵)points critiques,HLf=[−2λ 0 0 −2λ[ Pourλ=^√₂⁵ : det = 0, Tr < 0⇒Lconcave⇒(√¹

5,√²

5)maximum global.

Pourλ=−^√₂⁵ : det = 0, Tr > 0⇒Lconvexe⇒(−^√¹₅,−^√²₅)minimum global.

5 z/1 − x − y − z = 0 = 1 − x

²

− y

²

− z

²

L=f+λg+µh 0 = ^B^L

Bx =−λ−2xµ=^B^L

By =−λ−2yµ=^B^L

Bz = 1−λ−2zµ=^B^L

Bλ = 1−x−y−z=^B^L

Bµ = 1−x²−y²−z² (1−3λ)²=λ²+λ²+ (1−λ)²,6λ²= 4λ

(x, y, z, λ, µ) = (0,0,1,0,¹₂)ou(²₃,²₃,−¹₃,²₃,−¹₂), HLf=−2µI3

Pourµ=^√₂¹ : définie négative⇒Lconcave⇒z= 1maximum global.

Pourµ=−^√₂¹ : définie positive⇒Lconvexe⇒z=−¹₃ minimum global.

6 2x + y + z/4 − x − 2y − z = 1 = x

²

+ y

²

L=f+λg+µh

0 = ^BL_Bx = 2−λ−2xµ,0 = ^BL_By = 1−2λ−2yµ,0 =^BL_Bz = 1−λ,0 =^BL_Bλ = 3−x−2y−z,0 = ^BL_Bµ = 1−x²−y²

(x, y, z, λ, µ) = (√¹ 2,√⁻¹

2,3+√¹ 2,1,√¹

2)ou(√⁻¹ 2,√¹

2,3−^√¹₂,1,−^√¹₂), HL=

⎡⎢⎢⎢

⎢⎢⎣

−2µ 0 0 0 −2µ 0

0 0 0

⎡⎢⎢⎢

⎢⎢⎣ µ>0: L concave sur tout plan contenant pas de droite verticale,3 +√

2maximum global strict.

µ<0: L convexe sur tout plan contenant pas de droite verticale,3−√

2minimum global strict.