HAL Id: hal-01643177
https://hal.inria.fr/hal-01643177
Submitted on 21 Nov 2017
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Multi-task Bolasso based aircraft dynamics identification
Cédric Rommel, Joseph Frédéric Bonnans, Baptiste Gregorutti, Pierre Martinon
To cite this version:
Cédric Rommel, Joseph Frédéric Bonnans, Baptiste Gregorutti, Pierre Martinon. Multi-task Bolasso
based aircraft dynamics identification. PGMODays, Nov 2017, Paris, France. �hal-01643177�
C. Rommel , J. F. Bonnans , B. Gregorutti
2and P. Martinon
1CMAP Ecole Polytechnique - INRIA1 Safety Line2
November 14
th2017
Motivation
20 000 airplanes — 80 000 flights per day,
Should double until 2033,
Responsible for 3% of CO
2emissions,
Accounts for 30% of operational cost for an airline,
Rectilinear climb trajectories at full thrust.
Motivation
20 000 airplanes — 80 000 flights per day,
Should double until 2033,
Motivation
20 000 airplanes — 80 000 flights per day, Should double until 2033,
Responsible for 3% of CO
2emissions,
Accounts for 30% of operational cost for an airline,
Rectilinear climb trajectories at full thrust.
Motivation
20 000 airplanes — 80 000 flights per day, Should double until 2033,
Responsible for 3% of CO
2emissions,
Accounts for 30% of operational cost for an airline,
Motivation
20 000 airplanes — 80 000 flights per day, Should double until 2033,
Responsible for 3% of CO
2emissions,
Accounts for 30% of operational cost for an airline,
Rectilinear climb trajectories at full thrust.
Should double until 2033,
Responsible for 3% of CO
2emissions,
Accounts for 30% of operational cost for an airline,
Rectilinear climb trajectories at full thrust.
Example of optimized trajectory
Reference Optimized
1311kg 1141kg
(x,u)∈
min
X×UZ
tf0
C (t, u(t), x(t))dt,
s.t.
Φ(x(0), x(t
f)) ∈ K
Φ,
u(t) ∈ U
ad, for a.e. t ∈ [0, t
f],
c
j(x(t)) ≤ 0, j = 1, . . . , n
c, for all t ∈ [0, t
f],
˙
x = g (t, u, x), for a.e. t ∈ [0, t
f].
Optimal Control Problem
(x,u)∈
min
X×UZ
tf0
C (t, u(t), x(t))dt,
s.t.
Φ(x(0), x(t
f)) ∈ K
Φ,
u(t) ∈ U
ad, for a.e. t ∈ [0, t
f],
c
j(x(t)) ≤ 0, j = 1, . . . , n
c, for all t ∈ [0, t
f],
˙
x = g (t, u, x), for a.e. t ∈ [0, t
f].
(x,u)∈
min
X×UZ
tf0
C (t, u(t), x(t))dt,
s.t.
Φ(x(0), x(t
f)) ∈ K
Φ,
u(t) ∈ U
ad, for a.e. t ∈ [0, t
f],
c
j(x(t)) ≤ 0, j = 1, . . . , n
c, for all t ∈ [0, t
f],
˙
x = g (t, u, x), for a.e. t ∈ [0, t
f].
Optimal Control Problem
(x,u)∈
min
X×UZ
tf0
C (t, u(t), x(t))dt,
s.t.
Φ(x(0), x(t
f)) ∈ K
Φ,
u(t) ∈ U
ad, for a.e. t ∈ [0, t
f],
c
j(x(t)) ≤ 0, j = 1, . . . , n
c, for all t ∈ [0, t
f],
˙
x = g (t, u, x), for a.e. t ∈ [0, t
f].
(x,u)∈
min
X×UZ
tf0
C (t, u(t), x(t))dt,
s.t.
Φ(x(0), x(t
f)) ∈ K
Φ,
u(t) ∈ U
ad, for a.e. t ∈ [0, t
f],
c
j(x(t)) ≤ 0, j = 1, . . . , n
c, for all t ∈ [0, t
f],
˙
x = g (t, u, x), for a.e. t ∈ [0, t
f].
Optimal Control Problem
(x,u)∈
min
X×UZ
tf0
C (t, u(t), x(t))dt,
s.t.
Φ(x(0), x(t
f)) ∈ K
Φ,
u(t) ∈ U
ad, for a.e. t ∈ [0, t
f],
c
j(x(t)) ≤ 0, j = 1, . . . , n
c, for all t ∈ [0, t
f],
˙
x = g (t, u, x), for a.e. t ∈ [0, t
f].
QAR data
QAR data
Massive (> 1000 variables recorded every second),
x ˙ = g (t, u, x)
QAR data
QAR data
Massive (> 1000 variables recorded every second),
x ˙ = g (t, u, x)
Massive (> 1000 variables recorded every second),
x ˙ = g (t, u, x)
Flight mechanics and state equation
Classic flight mechanics model
V ˙ = T cos α − D − mg sin γ
m ,
˙
γ = T sin α + L − mg cos γ
mV ,
˙
m = − T
I
sp.
Flight mechanics and state equation
h ˙ = V sin γ ,
V ˙ = T cos α − D − mg sin γ
m ,
˙
γ = T sin α + L − mg cos γ
mV ,
˙
m = − T I
sp.
State variables: x = [h, V , γ, m]
V ˙ = T cos α − D − mg sin γ
m ,
˙
γ = T sin α + L − mg cos γ
mV ,
˙
m = − T I
sp.
State variables: x = [h, V , γ, m]
Control variables: u = [α, N
1]
Flight mechanics and state equation
h ˙ = V sin γ,
V ˙ = T cos α − D − mg sin γ
m ,
˙
γ = T sin α + L − mg cos γ
mV ,
˙
m = − T I
sp.
State variables: x = [h, V , γ, m]
Control variables: u = [α, N
1]
Unknown functions of the state and control variables
V ˙ = T (x, u) cos α − D(x, u) − mg sin γ
m ,
˙
γ = T (x, u) sin α + L(x, u) − mg cos γ
mV ,
˙
m = − T (x, u) I
sp(x, u) .
State variables: x = [h, V , γ, m]
Control variables: u = [α, N
1]
Unknown functions of the state and control variables
Model requirements
T function of (M , ρ, N
1), D function of (M , ρ, q),
L function of (M , ρ, q), I
spfunction of (M , h, SAT ),
Need for smooth models,
Need for models which are fast to compute,
Need for interpretable models for safety,
Need for models which are rich enough.
Model requirements
T function of (M , ρ, N
1) = ϕ
T(x, u), D function of (M , ρ, q) = ϕ
D(x, u),
L function of (M , ρ, q) = ϕ
L(x, u),
I
spfunction of (M , h, SAT ) = ϕ
Isp(x, u),
Model requirements
T function of (M , ρ, N
1) = ϕ
T(x, u), D function of (M , ρ, q) = ϕ
D(x, u),
L function of (M , ρ, q) = ϕ
L(x, u), I
spfunction of (M , h, SAT ) = ϕ
Isp(x, u),
Need for smooth models,
Need for models which are fast to compute,
Need for interpretable models for safety,
Need for models which are rich enough.
Model requirements
T function of (M , ρ, N
1) = ϕ
T(x, u), D function of (M , ρ, q) = ϕ
D(x, u),
L function of (M , ρ, q) = ϕ
L(x, u), I
spfunction of (M , h, SAT ) = ϕ
Isp(x, u),
Need for smooth models,
Need for models which are fast to compute,
Model requirements
T function of (M , ρ, N
1) = ϕ
T(x, u), D function of (M , ρ, q) = ϕ
D(x, u),
L function of (M , ρ, q) = ϕ
L(x, u), I
spfunction of (M , h, SAT ) = ϕ
Isp(x, u),
Need for smooth models,
Need for models which are fast to compute, Need for interpretable models for safety,
Need for models which are rich enough.
Model requirements
T = X
T· θ
T, D = X
D· θ
D, L = X
L· θ
L, I
sp= X
Isp· θ
Isp.
Need for smooth models,
Need for models which are fast to compute,
Need for interpretable models for safety,
Model requirements
T = X
T· θ
T, with X
T6= ϕ
T(x, u), D = X
D· θ
D, with X
D6= ϕ
D(x, u), L = X
L· θ
L, with X
L6= ϕ
L(x, u), I
sp= X
Isp· θ
Isp, with X
Isp6= ϕ
Isp(x, u).
Need for smooth models,
Need for models which are fast to compute, Need for interpretable models for safety,
Need for models which are rich enough.
T T T T
D = X
D· θ
D, with X
D6= ϕ
D(x, u), L = X
L· θ
L, with X
L6= ϕ
L(x, u), I
sp= X
Isp· θ
Isp, with X
Isp6= ϕ
Isp(x, u).
Need for smooth models,
Need for models which are fast to compute,
Need for interpretable models for safety,
Need for models which are rich enough.
Model requirements
T = X
T· θ
T, with X
T= Φ
d◦ ϕ
T(x, u), D = X
D· θ
D, with X
D= Φ
d◦ ϕ
D(x, u), L = X
L· θ
L, with X
L= Φ
d◦ ϕ
L(x, u), I
sp= X
Isp· θ
Isp, with X
Isp= Φ
d◦ ϕ
Isp(x, u).
Need for smooth models,
Need for models which are fast to compute,
Need for interpretable models for safety,
Need for models which are rich enough.
T T T 1
D = X
D· θ
D, with X
D= q(1, α, M , α
2, αM , M
2, ...), L = X
L· θ
L, with X
L= q(1, α, M, α
2, αM , M
2, ...), I
sp= X
Isp· θ
Isp, with X
Isp= (1, h, M , h
2, hM, M
2, ...).
Need for smooth models,
Need for models which are fast to compute,
Need for interpretable models for safety,
Need for models which are rich enough.
Regression problems
h ˙ = V sin γ
m V ˙
r= T cos α − D − mg sin γ mV
rγ ˙ = T sin α + L − mg cos γ
˙
m = −
ITsp
.
Targets to fit Unknown Random error
Regression problems
h ˙ = V sin γ
m V ˙
r= T cos α − D − mg sin γ mV
rγ ˙ = T sin α + L − mg cos γ
˙
m = −
ITsp
.
Regression problems
h ˙ = V sin γ m V ˙
r+ mg sin γ = T cos α − D mV
rγ ˙ + mg cos γ = T sin α + L
˙
m = −
ITsp
.
Targets to fit Unknown Random error
Regression problems
h ˙ = V sin γ m V ˙
r+ mg sin γ = T cos α − D mV
rγ ˙ + mg cos γ = T sin α + L
0 = T + ˙ mI
sp.
Regression problems
h ˙ = V sin γ m V ˙
r+ mg sin γ = T cos α − D mV
rγ ˙ + mg cos γ = T sin α + L
0 = T + ˙ mI
sp.
⇓
Y
1= X
Tcos α · θ
T− X
D· θ
D+ε
1Y
2= X
Tsin α · θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ ˙ mX
Isp· θ
Isp+ε
3Targets to fit Unknown Random error
Multi-task regression framework
Y
1= X
Tcos α · θ
T− X
D· θ
D+ε
1Y
2= X
Tsin α · θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ ˙ mX
Isp· θ
Isp+ε
3
Y
1Y
20
| {z }
Y
=
X
T1>−X
D>0 0 X
T2>0 X
L>0
X
T>0 0 X
Ispm>
| {z }
X
θ
Tθ
Dθ
Lθ
Isp
| {z }
θ
+
ε
1ε
2ε
3
| {z }
ε
.
Multi-task regression framework
Y
1= X
Tcos α · θ
T− X
D· θ
D+ε
1Y
2= X
Tsin α · θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ mX ˙
Isp· θ
Isp+ε
3
Y
1Y
20
| {z }
Y
=
X
T1>−X
D>0 0 X
T2>0 X
L>0
X
T>0 0 X
Ispm>
| {z }
X
θ
Tθ
Dθ
Lθ
Isp
| {z }
θ
+
ε
1ε
2ε
3
| {z }
ε
.
Ensures all components of ˆ g to share the same thrust ˆ T , Better predictive accuracy from tight coupling,
Helps in high correlations setting.
Multi-task regression framework
Y
1= X
T1· θ
T− X
D· θ
D+ε
1Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3
Y
1Y
20
| {z }
Y
=
X
T1>−X
D>0 0 X
T2>0 X
L>0
X
T>0 0 X
Ispm>
| {z }
X
θ
Tθ
Dθ
Lθ
Isp
| {z }
θ
+
ε
1ε
2ε
3
| {z }
ε
.
Multi-task regression framework
Y = X θ + ε,
Y
1Y
20
| {z }
Y
=
X
T1>−X
D>0 0 X
T2>0 X
L>0
X
T>0 0 X
Ispm>
| {z }
X
θ
Tθ
Dθ
Lθ
Isp
| {z }
θ
+
ε
1ε
2ε
3
| {z }
ε
.
Ensures all components of ˆ g to share the same thrust ˆ T , Better predictive accuracy from tight coupling,
Helps in high correlations setting.
Multi-task regression framework
Y = X θ + ε,
Y
1Y
20
| {z }
Y
=
X
T1>−X
D>0 0 X
T2>0 X
L>0
X
T>0 0 X
Ispm>
| {z }
X
θ
Tθ
Dθ
Lθ
Isp
| {z }
θ
+
ε
1ε
2ε
3
| {z }
ε
.
Multi-task regression framework
Y = X θ + ε,
Y
1Y
20
| {z }
Y
=
X
T1>−X
D>0 0 X
T2>0 X
L>0
X
T>0 0 X
Ispm>
| {z }
X
θ
Tθ
Dθ
Lθ
Isp
| {z }
θ
+
ε
1ε
2ε
3
| {z }
ε
.
Ensures all components of ˆ g to share the same thrust ˆ T ,
Better predictive accuracy from tight coupling,
Helps in high correlations setting.
Multi-task regression framework
Y = X θ + ε,
Y
1Y
20
| {z }
Y
=
X
T1>−X
D>0 0 X
T2>0 X
L>0
X
T>0 0 X
Ispm>
| {z }
X
θ
Tθ
Dθ
Lθ
Isp
| {z }
θ
+
ε
1ε
2ε
3
| {z }
ε
.
Ensures all components of ˆ g to share the same thrust ˆ T ,
Better predictive accuracy from tight coupling,
Multi-task regression framework
Y = X θ + ε,
Y
1Y
20
| {z }
Y
=
X
T1>−X
D>0 0 X
T2>0 X
L>0
X
T>0 0 X
Ispm>
| {z }
X
θ
Tθ
Dθ
Lθ
Isp
| {z }
θ
+
ε
1ε
2ε
3
| {z }
ε
.
Ensures all components of ˆ g to share the same thrust ˆ T ,
Better predictive accuracy from tight coupling,
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations,
{X
i, Y
i}
Ni=1,
Polynomial regression high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
Polynomial regression high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Empirical Risk Minimization min
θ
1 N
N
X
i=1
L(Y
i, X
iθ),
Polynomial regression high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
Least Squares Regression min
θ1 N
N
X
i=1
kY
i− X
iθk
22,
Polynomial regression high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Least Squares Regression min
θ1 N
N
X
i=1
kY
i− X
iθk
22,
Polynomial regression high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
Least Squares Regression min
θ1 N
N
X
i=1
kY
i− X
iθk
22,
Polynomial regression high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
L
1penalization min
θ1 N
N
X
i=1
kY
i− X
iθk
22+ λkθk
1,
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
L
1penalization min
θ1 N
N
X
i=1
kY
i− X
iθk
22+ λkθk
1,
' Lasso [Tibshirani, 1994]
Polynomial regression high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
Block sparse Lasso min
θ
1 N
N
X
i=1
kY
i− X
iθk
22+ λkθk
1,
' Lasso [Tibshirani, 1994]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
Block sparse Lasso min
θ
1 N
N
X
i=1
kY
i− X
iθk
22+ λkθk
1,
' Lasso [Tibshirani, 1994]
Polynomial regression
high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Feature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
Block sparse Lasso min
θ
1 N
N
X
i=1
kY
i− X
iθk
22+ λkθk
1,
' Lasso [Tibshirani, 1994]
Polynomial regression high correlations between elements of X
iFeature selection
Let {(x
i, u
i, x ˙
i)}
Ni=1set of N observations, {X
i, Y
i}
Ni=1,
Maybe not all monomials are relevant for T , D, L and/or I
spmodel...
Overfitting...
Block sparse Lasso min
θ
1 N
N
X
i=1
kY
i− X
iθk
22+ λkθk
1,
' Lasso [Tibshirani, 1994]
Polynomial regression high correlations between elements of X
iUnstable selections...
⇒ Bolasso [Bach, 2008]
Block sparse Lasso min
θ
1 N
N
X
i=1
kY
i− X
iθk
22+ λkθk
1,
' Lasso [Tibshirani, 1994]
Polynomial regression high correlations between elements of X
iUnstable selections...
Bootstrap implementation
Block sparse Bolasso
Require:
training data T = {(X
i, Y
i)}
Ni=1, number of bootstrap replicates m, L
1penalization parameter λ,
1:
for k = 1 to m do
2:
Generate bootstrap sample T
k,
3:
Compute Block sparse Lasso estimate ˆ θ
kfrom T
k,
4:
Compute support J
k= {j , θ ˆ
kj6= 0},
5:
end for
6:
Compute J = T
m k=1J
k,
7:
Compute ˆ θ
Jfrom T
J= {(X
Ji, Y
i)}
Ni=1using Least-Squares.
Consistency under high correlations proved in [Bach, 2008],
Efficient implementations exists: LARS [Efron et al., 2004].
Bootstrap implementation
Block sparse Bolasso
Require:
training data T = {(X
i, Y
i)}
Ni=1, number of bootstrap replicates m, L
1penalization parameter λ,
1:
for k = 1 to m do
2:
Generate bootstrap sample T
k,
3:
Compute Block sparse Lasso estimate ˆ θ
kfrom T
k,
4:
Compute support J
k= {j , θ ˆ
kj6= 0},
5:
end for
6:
Compute J = T
m k=1J
k,
7:
Compute ˆ θ
Jfrom T
J= {(X
Ji, Y
i)}
Ni=1using Least-Squares.
Consistency under high correlations proved in [Bach, 2008],
Bootstrap implementation
Block sparse Bolasso
Require:
training data T = {(X
i, Y
i)}
Ni=1, number of bootstrap replicates m, L
1penalization parameter λ,
1:
for k = 1 to m do
2:
Generate bootstrap sample T
k,
3:
Compute Block sparse Lasso estimate ˆ θ
kfrom T
k,
4:
Compute support J
k= {j , θ ˆ
kj6= 0},
5:
end for
6:
Compute J = T
m k=1J
k,
7:
Compute ˆ θ
Jfrom T
J= {(X
Ji, Y
i)}
Ni=1using Least-Squares.
Consistency under high correlations proved in [Bach, 2008],
Identifiability issues
Identifiability issues
Y
1= X
T1· θ
T− X
D· θ
D+ε
1Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3Use prior ˜ I
sp{ ˜ I
spi= ˜ I
sp(x
i, u
i)}.
Identifiability issues
Y
1= X
T1· θ
T− X
D· θ
D+ε
1Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3Identifiability issues
Y
1= X
T1· θ
T− X
D· θ
D+ε
1Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3min
θ1 N
N
X
i=1
kY
i− X
iθk
22+ λ
1kθk
1,
Use prior ˜ I
sp{ ˜ I
spi= ˜ I
sp(x
i, u
i)}.
Identifiability issues
Y
1= X
T1· θ
T− X
D· θ
D+ε
1Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3min
θ1 N
N
X
i=1
kY
i− X
iθk
22+ λ
1kθk
1,
⇒ θ ˆ
T= ˆ θ
Isp= 0 !
Identifiability issues
Y
1= X
T1· θ
T− X
D· θ
D+ε
1Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3min
θ1 N
N
X
i=1
kY
i− X
iθk
22+ λ
1kθk
1,
⇒ θ ˆ
T= ˆ θ
Isp= 0 !
Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3min
θ1 N
N
X
i=1
kY
i− X
iθk
22+ λ
2k ˜ I
spi− X
Ispi· θ
Ispk
22+ λ
1kθk
1,
⇒ θ ˆ
T= ˆ θ
Isp= 0 !
Identifiability issues
Y
1= X
T1· θ
T− X
D· θ
D+ε
1Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3λ
2˜ I
sp= λ
2X
Isp· θ
Isp+ε
4min
θ1 N
N
X
i=1
kY
i− X
iθk
22+ λ
2k ˜ I
spi− X
Ispi· θ
Ispk
22+ λ
1kθk
1,
⇒ θ ˆ
T= ˆ θ
Isp= 0 !
Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3λ
2˜ I
sp= λ
2X
Isp· θ
Isp+ε
4min
θ1 N
N
X
i=1
k Y ˜
i− X ˜
iθk
22+ λ
1kθk
1,
⇒ θ ˆ
T= ˆ θ
Isp= 0 !
Identifiability issues
Y
1= X
T1· θ
T− X
D· θ
D+ε
1Y
2= X
T2· θ
T+ X
L· θ
L+ε
20 = X
T· θ
T+ X
Ispm· θ
Isp+ε
3λ
2˜ I
sp= λ
2X
Isp· θ
Isp+ε
4min
θ1 N
N
X
i=1
k Y ˜
i− X ˜
iθk
22+ λ
1kθk
1,
Y ˜
i=
Y
1iY
2i0 λ
2˜ I
spi
, X ˜
i=
(X
T1i)
>−(X
Di)
>0 0 (X
T2i)
>0 (X
Li)
>0
(X
Ti)
>0 0 (X
Ispmi)
>0 0 0 λ
2(X
Ispi)
>
,
Feature selection results
25 different B737-800,
10 471 flights = 8 261 619 observations,
Block sparse Bolasso used for T , D, L and I
sp,
We expect similar model structures,
Feature selection results
25 different B737-800,
10 471 flights = 8 261 619 observations,
Feature selection results
25 different B737-800,
10 471 flights = 8 261 619 observations, Block sparse Bolasso used for T , D, L and I
sp,
We expect similar model structures,
Block sparse Bolasso used for T , D, L and I
sp,
We expect similar model structures,
Feature selection results
Effect of λ 2 on hidden elements
Identification results assessment
x − ˆ x
u − u ˆ
s.t. x ˙ = ˆ g (x, u)
Identification results assessment
x,u∈
min
X×UZ
tf0