• Aucun résultat trouvé

Multi-task Bolasso based aircraft dynamics identification

N/A
N/A
Protected

Academic year: 2021

Partager "Multi-task Bolasso based aircraft dynamics identification"

Copied!
90
0
0

Texte intégral

(1)

HAL Id: hal-01643177

https://hal.inria.fr/hal-01643177

Submitted on 21 Nov 2017

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Multi-task Bolasso based aircraft dynamics identification

Cédric Rommel, Joseph Frédéric Bonnans, Baptiste Gregorutti, Pierre Martinon

To cite this version:

Cédric Rommel, Joseph Frédéric Bonnans, Baptiste Gregorutti, Pierre Martinon. Multi-task Bolasso

based aircraft dynamics identification. PGMODays, Nov 2017, Paris, France. �hal-01643177�

(2)

C. Rommel , J. F. Bonnans , B. Gregorutti

2

and P. Martinon

1

CMAP Ecole Polytechnique - INRIA1 Safety Line2

November 14

th

2017

(3)

Motivation

20 000 airplanes — 80 000 flights per day,

Should double until 2033,

Responsible for 3% of CO

2

emissions,

Accounts for 30% of operational cost for an airline,

Rectilinear climb trajectories at full thrust.

(4)

Motivation

20 000 airplanes — 80 000 flights per day,

Should double until 2033,

(5)

Motivation

20 000 airplanes — 80 000 flights per day, Should double until 2033,

Responsible for 3% of CO

2

emissions,

Accounts for 30% of operational cost for an airline,

Rectilinear climb trajectories at full thrust.

(6)

Motivation

20 000 airplanes — 80 000 flights per day, Should double until 2033,

Responsible for 3% of CO

2

emissions,

Accounts for 30% of operational cost for an airline,

(7)

Motivation

20 000 airplanes — 80 000 flights per day, Should double until 2033,

Responsible for 3% of CO

2

emissions,

Accounts for 30% of operational cost for an airline,

Rectilinear climb trajectories at full thrust.

(8)

Should double until 2033,

Responsible for 3% of CO

2

emissions,

Accounts for 30% of operational cost for an airline,

Rectilinear climb trajectories at full thrust.

(9)

Example of optimized trajectory

Reference Optimized

1311kg 1141kg

(10)

(x,u)∈

min

X×U

Z

tf

0

C (t, u(t), x(t))dt,

s.t.

 

 

Φ(x(0), x(t

f

)) ∈ K

Φ

,

u(t) ∈ U

ad

, for a.e. t ∈ [0, t

f

],

c

j

(x(t)) ≤ 0, j = 1, . . . , n

c

, for all t ∈ [0, t

f

],

˙

x = g (t, u, x), for a.e. t ∈ [0, t

f

].

(11)

Optimal Control Problem

(x,u)∈

min

X×U

Z

tf

0

C (t, u(t), x(t))dt,

s.t.

 

 

Φ(x(0), x(t

f

)) ∈ K

Φ

,

u(t) ∈ U

ad

, for a.e. t ∈ [0, t

f

],

c

j

(x(t)) ≤ 0, j = 1, . . . , n

c

, for all t ∈ [0, t

f

],

˙

x = g (t, u, x), for a.e. t ∈ [0, t

f

].

(12)

(x,u)∈

min

X×U

Z

tf

0

C (t, u(t), x(t))dt,

s.t.

 

 

Φ(x(0), x(t

f

)) ∈ K

Φ

,

u(t) ∈ U

ad

, for a.e. t ∈ [0, t

f

],

c

j

(x(t)) ≤ 0, j = 1, . . . , n

c

, for all t ∈ [0, t

f

],

˙

x = g (t, u, x), for a.e. t ∈ [0, t

f

].

(13)

Optimal Control Problem

(x,u)∈

min

X×U

Z

tf

0

C (t, u(t), x(t))dt,

s.t.

 

 

Φ(x(0), x(t

f

)) ∈ K

Φ

,

u(t) ∈ U

ad

, for a.e. t ∈ [0, t

f

],

c

j

(x(t)) ≤ 0, j = 1, . . . , n

c

, for all t ∈ [0, t

f

],

˙

x = g (t, u, x), for a.e. t ∈ [0, t

f

].

(14)

(x,u)∈

min

X×U

Z

tf

0

C (t, u(t), x(t))dt,

s.t.

 

 

Φ(x(0), x(t

f

)) ∈ K

Φ

,

u(t) ∈ U

ad

, for a.e. t ∈ [0, t

f

],

c

j

(x(t)) ≤ 0, j = 1, . . . , n

c

, for all t ∈ [0, t

f

],

˙

x = g (t, u, x), for a.e. t ∈ [0, t

f

].

(15)

Optimal Control Problem

(x,u)∈

min

X×U

Z

tf

0

C (t, u(t), x(t))dt,

s.t.

 

 

Φ(x(0), x(t

f

)) ∈ K

Φ

,

u(t) ∈ U

ad

, for a.e. t ∈ [0, t

f

],

c

j

(x(t)) ≤ 0, j = 1, . . . , n

c

, for all t ∈ [0, t

f

],

˙

x = g (t, u, x), for a.e. t ∈ [0, t

f

].

(16)

QAR data

(17)

QAR data

Massive (> 1000 variables recorded every second),

x ˙ = g (t, u, x)

(18)

QAR data

(19)

QAR data

Massive (> 1000 variables recorded every second),

x ˙ = g (t, u, x)

(20)

Massive (> 1000 variables recorded every second),

x ˙ = g (t, u, x)

(21)

Flight mechanics and state equation

Classic flight mechanics model

(22)

 

 

 

 

 

 

 

 

 

 

V ˙ = T cos α − D − mg sin γ

m ,

˙

γ = T sin α + L − mg cos γ

mV ,

˙

m = − T

I

sp

.

(23)

Flight mechanics and state equation

 

 

 

 

 

 

 

 

 

 

h ˙ = V sin γ ,

V ˙ = T cos α − D − mg sin γ

m ,

˙

γ = T sin α + L − mg cos γ

mV ,

˙

m = − T I

sp

.

State variables: x = [h, V , γ, m]

(24)

 

 

 

 

 

 

 

 

 

 

V ˙ = T cos α − D − mg sin γ

m ,

˙

γ = T sin α + L − mg cos γ

mV ,

˙

m = − T I

sp

.

State variables: x = [h, V , γ, m]

Control variables: u = [α, N

1

]

(25)

Flight mechanics and state equation

 

 

 

 

 

 

 

 

 

 

h ˙ = V sin γ,

V ˙ = T cos α − D − mg sin γ

m ,

˙

γ = T sin α + L − mg cos γ

mV ,

˙

m = − T I

sp

.

State variables: x = [h, V , γ, m]

Control variables: u = [α, N

1

]

Unknown functions of the state and control variables

(26)

 

 

 

 

 

 

 

 

 

 

V ˙ = T (x, u) cos α − D(x, u) − mg sin γ

m ,

˙

γ = T (x, u) sin α + L(x, u) − mg cos γ

mV ,

˙

m = − T (x, u) I

sp

(x, u) .

State variables: x = [h, V , γ, m]

Control variables: u = [α, N

1

]

Unknown functions of the state and control variables

(27)

Model requirements

 

 

 

 

T function of (M , ρ, N

1

), D function of (M , ρ, q),

L function of (M , ρ, q), I

sp

function of (M , h, SAT ),

Need for smooth models,

Need for models which are fast to compute,

Need for interpretable models for safety,

Need for models which are rich enough.

(28)

Model requirements

 

 

 

 

T function of (M , ρ, N

1

) = ϕ

T

(x, u), D function of (M , ρ, q) = ϕ

D

(x, u),

L function of (M , ρ, q) = ϕ

L

(x, u),

I

sp

function of (M , h, SAT ) = ϕ

Isp

(x, u),

(29)

Model requirements

 

 

 

 

T function of (M , ρ, N

1

) = ϕ

T

(x, u), D function of (M , ρ, q) = ϕ

D

(x, u),

L function of (M , ρ, q) = ϕ

L

(x, u), I

sp

function of (M , h, SAT ) = ϕ

Isp

(x, u),

Need for smooth models,

Need for models which are fast to compute,

Need for interpretable models for safety,

Need for models which are rich enough.

(30)

Model requirements

 

 

 

 

T function of (M , ρ, N

1

) = ϕ

T

(x, u), D function of (M , ρ, q) = ϕ

D

(x, u),

L function of (M , ρ, q) = ϕ

L

(x, u), I

sp

function of (M , h, SAT ) = ϕ

Isp

(x, u),

Need for smooth models,

Need for models which are fast to compute,

(31)

Model requirements

 

 

 

 

T function of (M , ρ, N

1

) = ϕ

T

(x, u), D function of (M , ρ, q) = ϕ

D

(x, u),

L function of (M , ρ, q) = ϕ

L

(x, u), I

sp

function of (M , h, SAT ) = ϕ

Isp

(x, u),

Need for smooth models,

Need for models which are fast to compute, Need for interpretable models for safety,

Need for models which are rich enough.

(32)

Model requirements

 

 

 

 

T = X

T

· θ

T

, D = X

D

· θ

D

, L = X

L

· θ

L

, I

sp

= X

Isp

· θ

Isp

.

Need for smooth models,

Need for models which are fast to compute,

Need for interpretable models for safety,

(33)

Model requirements

 

 

 

 

T = X

T

· θ

T

, with X

T

6= ϕ

T

(x, u), D = X

D

· θ

D

, with X

D

6= ϕ

D

(x, u), L = X

L

· θ

L

, with X

L

6= ϕ

L

(x, u), I

sp

= X

Isp

· θ

Isp

, with X

Isp

6= ϕ

Isp

(x, u).

Need for smooth models,

Need for models which are fast to compute, Need for interpretable models for safety,

Need for models which are rich enough.

(34)

 

 

 

 

T T T T

D = X

D

· θ

D

, with X

D

6= ϕ

D

(x, u), L = X

L

· θ

L

, with X

L

6= ϕ

L

(x, u), I

sp

= X

Isp

· θ

Isp

, with X

Isp

6= ϕ

Isp

(x, u).

Need for smooth models,

Need for models which are fast to compute,

Need for interpretable models for safety,

Need for models which are rich enough.

(35)

Model requirements

 

 

 

 

T = X

T

· θ

T

, with X

T

= Φ

d

◦ ϕ

T

(x, u), D = X

D

· θ

D

, with X

D

= Φ

d

◦ ϕ

D

(x, u), L = X

L

· θ

L

, with X

L

= Φ

d

◦ ϕ

L

(x, u), I

sp

= X

Isp

· θ

Isp

, with X

Isp

= Φ

d

◦ ϕ

Isp

(x, u).

Need for smooth models,

Need for models which are fast to compute,

Need for interpretable models for safety,

Need for models which are rich enough.

(36)

 

 

 

 

T T T 1

D = X

D

· θ

D

, with X

D

= q(1, α, M , α

2

, αM , M

2

, ...), L = X

L

· θ

L

, with X

L

= q(1, α, M, α

2

, αM , M

2

, ...), I

sp

= X

Isp

· θ

Isp

, with X

Isp

= (1, h, M , h

2

, hM, M

2

, ...).

Need for smooth models,

Need for models which are fast to compute,

Need for interpretable models for safety,

Need for models which are rich enough.

(37)

Regression problems

 

 

 

 

h ˙ = V sin γ

m V ˙

r

= T cos α − D − mg sin γ mV

r

γ ˙ = T sin α + L − mg cos γ

˙

m = −

IT

sp

.

Targets to fit Unknown Random error

(38)

Regression problems

 

 

 

 

h ˙ = V sin γ

m V ˙

r

= T cos α − D − mg sin γ mV

r

γ ˙ = T sin α + L − mg cos γ

˙

m = −

IT

sp

.

(39)

Regression problems

 

 

 

 

h ˙ = V sin γ m V ˙

r

+ mg sin γ = T cos α − D mV

r

γ ˙ + mg cos γ = T sin α + L

˙

m = −

IT

sp

.

Targets to fit Unknown Random error

(40)

Regression problems

 

 

 

 

h ˙ = V sin γ m V ˙

r

+ mg sin γ = T cos α − D mV

r

γ ˙ + mg cos γ = T sin α + L

0 = T + ˙ mI

sp

.

(41)

Regression problems

 

 

 

 

h ˙ = V sin γ m V ˙

r

+ mg sin γ = T cos α − D mV

r

γ ˙ + mg cos γ = T sin α + L

0 = T + ˙ mI

sp

.

 

 

Y

1

= X

T

cos α · θ

T

− X

D

· θ

D

1

Y

2

= X

T

sin α · θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ ˙ mX

Isp

· θ

Isp

3

Targets to fit Unknown Random error

(42)

Multi-task regression framework

 

 

Y

1

= X

T

cos α · θ

T

− X

D

· θ

D

1

Y

2

= X

T

sin α · θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ ˙ mX

Isp

· θ

Isp

3

 Y

1

Y

2

0

| {z }

Y

=

X

T1>

−X

D>

0 0 X

T2>

0 X

L>

0

X

T>

0 0 X

Ispm>

| {z }

X

 θ

T

θ

D

θ

L

θ

Isp

| {z }

θ

+

 ε

1

ε

2

ε

3

| {z }

ε

.

(43)

Multi-task regression framework

 

 

Y

1

= X

T

cos α · θ

T

− X

D

· θ

D

1

Y

2

= X

T

sin α · θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ mX ˙

Isp

· θ

Isp

3

 Y

1

Y

2

0

| {z }

Y

=

X

T1>

−X

D>

0 0 X

T2>

0 X

L>

0

X

T>

0 0 X

Ispm>

| {z }

X

 θ

T

θ

D

θ

L

θ

Isp

| {z }

θ

+

 ε

1

ε

2

ε

3

| {z }

ε

.

Ensures all components of ˆ g to share the same thrust ˆ T , Better predictive accuracy from tight coupling,

Helps in high correlations setting.

(44)

Multi-task regression framework

 

 

Y

1

= X

T1

· θ

T

− X

D

· θ

D

1

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

 Y

1

Y

2

0

| {z }

Y

=

X

T1>

−X

D>

0 0 X

T2>

0 X

L>

0

X

T>

0 0 X

Ispm>

| {z }

X

 θ

T

θ

D

θ

L

θ

Isp

| {z }

θ

+

 ε

1

ε

2

ε

3

| {z }

ε

.

(45)

Multi-task regression framework

Y = X θ + ε,

 Y

1

Y

2

0

| {z }

Y

=

X

T1>

−X

D>

0 0 X

T2>

0 X

L>

0

X

T>

0 0 X

Ispm>

| {z }

X

 θ

T

θ

D

θ

L

θ

Isp

| {z }

θ

+

 ε

1

ε

2

ε

3

| {z }

ε

.

Ensures all components of ˆ g to share the same thrust ˆ T , Better predictive accuracy from tight coupling,

Helps in high correlations setting.

(46)

Multi-task regression framework

Y = X θ + ε,

 Y

1

Y

2

0

| {z }

Y

=

X

T1>

−X

D>

0 0 X

T2>

0 X

L>

0

X

T>

0 0 X

Ispm>

| {z }

X

 θ

T

θ

D

θ

L

θ

Isp

| {z }

θ

+

 ε

1

ε

2

ε

3

| {z }

ε

.

(47)

Multi-task regression framework

Y = X θ + ε,

 Y

1

Y

2

0

| {z }

Y

=

X

T1>

−X

D>

0 0 X

T2>

0 X

L>

0

X

T>

0 0 X

Ispm>

| {z }

X

 θ

T

θ

D

θ

L

θ

Isp

| {z }

θ

+

 ε

1

ε

2

ε

3

| {z }

ε

.

Ensures all components of ˆ g to share the same thrust ˆ T ,

Better predictive accuracy from tight coupling,

Helps in high correlations setting.

(48)

Multi-task regression framework

Y = X θ + ε,

 Y

1

Y

2

0

| {z }

Y

=

X

T1>

−X

D>

0 0 X

T2>

0 X

L>

0

X

T>

0 0 X

Ispm>

| {z }

X

 θ

T

θ

D

θ

L

θ

Isp

| {z }

θ

+

 ε

1

ε

2

ε

3

| {z }

ε

.

Ensures all components of ˆ g to share the same thrust ˆ T ,

Better predictive accuracy from tight coupling,

(49)

Multi-task regression framework

Y = X θ + ε,

 Y

1

Y

2

0

| {z }

Y

=

X

T1>

−X

D>

0 0 X

T2>

0 X

L>

0

X

T>

0 0 X

Ispm>

| {z }

X

 θ

T

θ

D

θ

L

θ

Isp

| {z }

θ

+

 ε

1

ε

2

ε

3

| {z }

ε

.

Ensures all components of ˆ g to share the same thrust ˆ T ,

Better predictive accuracy from tight coupling,

(50)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations,

{X

i

, Y

i

}

Ni=1

,

Polynomial regression high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(51)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

Polynomial regression high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(52)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Empirical Risk Minimization min

θ

1 N

N

X

i=1

L(Y

i

, X

i

θ),

Polynomial regression high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(53)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

Least Squares Regression min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

,

Polynomial regression high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(54)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Least Squares Regression min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

,

Polynomial regression high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(55)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

Least Squares Regression min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

,

Polynomial regression high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(56)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

L

1

penalization min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λkθk

1

,

(57)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

L

1

penalization min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λkθk

1

,

' Lasso [Tibshirani, 1994]

Polynomial regression high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(58)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

Block sparse Lasso min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λkθk

1

,

' Lasso [Tibshirani, 1994]

(59)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

Block sparse Lasso min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λkθk

1

,

' Lasso [Tibshirani, 1994]

Polynomial regression

high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(60)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

Block sparse Lasso min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λkθk

1

,

' Lasso [Tibshirani, 1994]

Polynomial regression high correlations between elements of X

i

(61)

Feature selection

Let {(x

i

, u

i

, x ˙

i

)}

Ni=1

set of N observations, {X

i

, Y

i

}

Ni=1

,

Maybe not all monomials are relevant for T , D, L and/or I

sp

model...

Overfitting...

Block sparse Lasso min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λkθk

1

,

' Lasso [Tibshirani, 1994]

Polynomial regression high correlations between elements of X

i

Unstable selections...

⇒ Bolasso [Bach, 2008]

(62)

Block sparse Lasso min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λkθk

1

,

' Lasso [Tibshirani, 1994]

Polynomial regression high correlations between elements of X

i

Unstable selections...

(63)

Bootstrap implementation

Block sparse Bolasso

Require:

training data T = {(X

i

, Y

i

)}

Ni=1

, number of bootstrap replicates m, L

1

penalization parameter λ,

1:

for k = 1 to m do

2:

Generate bootstrap sample T

k

,

3:

Compute Block sparse Lasso estimate ˆ θ

k

from T

k

,

4:

Compute support J

k

= {j , θ ˆ

kj

6= 0},

5:

end for

6:

Compute J = T

m k=1

J

k

,

7:

Compute ˆ θ

J

from T

J

= {(X

Ji

, Y

i

)}

Ni=1

using Least-Squares.

Consistency under high correlations proved in [Bach, 2008],

Efficient implementations exists: LARS [Efron et al., 2004].

(64)

Bootstrap implementation

Block sparse Bolasso

Require:

training data T = {(X

i

, Y

i

)}

Ni=1

, number of bootstrap replicates m, L

1

penalization parameter λ,

1:

for k = 1 to m do

2:

Generate bootstrap sample T

k

,

3:

Compute Block sparse Lasso estimate ˆ θ

k

from T

k

,

4:

Compute support J

k

= {j , θ ˆ

kj

6= 0},

5:

end for

6:

Compute J = T

m k=1

J

k

,

7:

Compute ˆ θ

J

from T

J

= {(X

Ji

, Y

i

)}

Ni=1

using Least-Squares.

Consistency under high correlations proved in [Bach, 2008],

(65)

Bootstrap implementation

Block sparse Bolasso

Require:

training data T = {(X

i

, Y

i

)}

Ni=1

, number of bootstrap replicates m, L

1

penalization parameter λ,

1:

for k = 1 to m do

2:

Generate bootstrap sample T

k

,

3:

Compute Block sparse Lasso estimate ˆ θ

k

from T

k

,

4:

Compute support J

k

= {j , θ ˆ

kj

6= 0},

5:

end for

6:

Compute J = T

m k=1

J

k

,

7:

Compute ˆ θ

J

from T

J

= {(X

Ji

, Y

i

)}

Ni=1

using Least-Squares.

Consistency under high correlations proved in [Bach, 2008],

(66)

Identifiability issues

(67)

Identifiability issues

 

 

Y

1

= X

T1

· θ

T

− X

D

· θ

D

1

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

Use prior ˜ I

sp

{ ˜ I

spi

= ˜ I

sp

(x

i

, u

i

)}.

(68)

Identifiability issues

 

 

Y

1

= X

T1

· θ

T

− X

D

· θ

D

1

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

(69)

Identifiability issues

 

 

Y

1

= X

T1

· θ

T

− X

D

· θ

D

1

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λ

1

kθk

1

,

Use prior ˜ I

sp

{ ˜ I

spi

= ˜ I

sp

(x

i

, u

i

)}.

(70)

Identifiability issues

 

 

Y

1

= X

T1

· θ

T

− X

D

· θ

D

1

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λ

1

kθk

1

,

⇒ θ ˆ

T

= ˆ θ

Isp

= 0 !

(71)

Identifiability issues

 

 

Y

1

= X

T1

· θ

T

− X

D

· θ

D

1

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λ

1

kθk

1

,

⇒ θ ˆ

T

= ˆ θ

Isp

= 0 !

(72)

 

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λ

2

k ˜ I

spi

− X

Ispi

· θ

Isp

k

22

+ λ

1

kθk

1

,

⇒ θ ˆ

T

= ˆ θ

Isp

= 0 !

(73)

Identifiability issues

 

 

 

 

Y

1

= X

T1

· θ

T

− X

D

· θ

D

1

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

λ

2

˜ I

sp

= λ

2

X

Isp

· θ

Isp

4

min

θ

1 N

N

X

i=1

kY

i

− X

i

θk

22

+ λ

2

k ˜ I

spi

− X

Ispi

· θ

Isp

k

22

+ λ

1

kθk

1

,

⇒ θ ˆ

T

= ˆ θ

Isp

= 0 !

(74)

 

 

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

λ

2

˜ I

sp

= λ

2

X

Isp

· θ

Isp

4

min

θ

1 N

N

X

i=1

k Y ˜

i

− X ˜

i

θk

22

+ λ

1

kθk

1

,

⇒ θ ˆ

T

= ˆ θ

Isp

= 0 !

(75)

Identifiability issues

 

 

 

 

Y

1

= X

T1

· θ

T

− X

D

· θ

D

1

Y

2

= X

T2

· θ

T

+ X

L

· θ

L

2

0 = X

T

· θ

T

+ X

Ispm

· θ

Isp

3

λ

2

˜ I

sp

= λ

2

X

Isp

· θ

Isp

4

min

θ

1 N

N

X

i=1

k Y ˜

i

− X ˜

i

θk

22

+ λ

1

kθk

1

,

Y ˜

i

=

 Y

1i

Y

2i

0 λ

2

˜ I

spi

, X ˜

i

=

(X

T1i

)

>

−(X

Di

)

>

0 0 (X

T2i

)

>

0 (X

Li

)

>

0

(X

Ti

)

>

0 0 (X

Ispmi

)

>

0 0 0 λ

2

(X

Ispi

)

>

,

(76)
(77)

Feature selection results

25 different B737-800,

10 471 flights = 8 261 619 observations,

Block sparse Bolasso used for T , D, L and I

sp

,

We expect similar model structures,

(78)

Feature selection results

25 different B737-800,

10 471 flights = 8 261 619 observations,

(79)

Feature selection results

25 different B737-800,

10 471 flights = 8 261 619 observations, Block sparse Bolasso used for T , D, L and I

sp

,

We expect similar model structures,

(80)

Block sparse Bolasso used for T , D, L and I

sp

,

We expect similar model structures,

(81)

Feature selection results

(82)
(83)

Effect of λ 2 on hidden elements

(84)
(85)

Identification results assessment

x − ˆ x

u − u ˆ

(86)

s.t. x ˙ = ˆ g (x, u)

(87)

Identification results assessment

x,u∈

min

X×U

Z

tf

0

ku(t) − u

m

(t)k

2u

+ kx(t ) − x

m

(t )k

2x

dt

s.t. x ˙ = ˆ g (x, u)

(88)

s.t. x ˙ = ˆ g (x, u)

(89)

THANK YOU FOR YOUR ATTENTION !

(90)

pages 33–40. Proceedings of the 25th International Conference on Machine Learning (ICML).

Bonnans, J. F., Giorgi, D., Grelard, V., Heymann, B., Maindrault, S., Martinon, P., Tissot, O., and Liu, J. (2017).

Bocop – A collection of examples.

Technical report, INRIA.

Efron, B. et al. (2004).

Least angle regression.

Annals of Statistics, 32:407–499.

Tibshirani, R. (1994).

Regression shrinkage and selection via the Lasso.

Références

Documents relatifs

Louisette Wendling, Jean Pierre Triquigneaux, Eric Cregut, Cédric Petiteau, Daniel Claudel, Lionel Odie, Vincent Gaudefroy, Stéphane Ollier. To cite

Nathalie Bochard, Cédric Marchand, Oto Peťura, Lilian Bossuet, Viktor Fischer1. To cite

Pierre-Frédéric Villard, Michaël Beuve, Behzad Shariat, Vincent Baudet, Fabrice Jaillet, Jean-Yves Bayle, Thierry Quesnel. To cite

Cédric Duée, Marie Hénault, Thomas Lecas, Laifa Boufendi, Henry Pillière, Xavier Bourrat.. To cite

Robust sliding mode control of nonlinear systems with delay: A design via polytopic formulation Frédéric Gouaisbaut, Yann Blanco, Jean-Pierre Richard.. To cite this version:

Frédéric Choulet, Thomas Wicker, Etienne Paux, Cyrille Saintenac, Pierre Sourdille, Beat Keller, Rudi Appels, Catherine Feuillet1. To cite

Cédric Lemaire, Yann Quilichini, Jérémie Santini, Nicole Brunel-Michac, Julien Cartailler, Romain Souchal, Pierre Conchon, Stéphane Herbette.. To cite

Luc Massou, Pierre Morelli, Catherine Kellner, Stéphanie Fleck, Jean-Baptiste Lanfranchi, Christian Bastien, Brian Chauvel.. To cite