Resampling-based estimation of the accuracy of satellite ephemerides
Sylvain Arlot 1,2,3
joint work with J. Desmars 4,5, J.-E. Arlot 4, V. Lainey 4, A. Vienne 4,6
1 CNRS
2 École Normale Supérieure de Paris
3 Sierra team, Inria Paris-Rocquencourt
4 Institut de Mécanique Céleste et de Calcul des Éphémérides — Observatoire de Paris
5 Université Pierre et Marie Curie
6 Université de Lille
Precision of ephemerides of natural satellites of Saturn
Dynamical model + Observations ⇒ Ephemerides
Problem: how accurate are the ephemerides, in particular far from the observation period?
Two main examples: Mimas (revolution period: 0.942 days) and Titan (revolution period: 15.945 days)
The models: TASS & NUMINT
TASS 1.7 (Analytic Theory of Saturnian Satellites): TASS 1.6 (Vienne & Duriez, 1995) + Hyperion motion theory (Duriez & Vienne, 1997)
⇒ semi-analytic theory
Numerical integration (NUMINT)
⇒ Model: x(t) = ϕ(c, t) ∈ X, where c ∈ C ⊂ R^p is the parameter space
e.g., x(t) = (α(t), δ(t)) (right ascension and declination)
Internal error of the models
True position: P(t) ≠ ϕ(c, t) for every c ∈ C
⇒ Internal error (or bias): inf_{c ∈ C} d(P(t), ϕ(c, t))
Neglected terms in the analytic formulas (TASS)
⇒ can be evaluated by comparison with numerical integration (∼10 milliarcseconds for the Saturnian satellites, except Hyperion and Japet)
Neglected (or unknown) physical effects, in both TASS and NUMINT
Observations
(t_1, X_1), ..., (t_N, X_N) ∈ R × X, with X_i = P(t_i) + ε_i and E[ε_i] = 0
Multiple error sources:
observer, reading of the measurement
instrument used
star catalogue used for the reduction (bias depending on the position on the celestial sphere)
corrections taken into account or not (refraction, aberration, etc.)
mass center ≠ photocenter (phase, albedo)
uncertainty of the observation time (especially for old observations)
Time distribution of observations
Time distribution of observation nights
Data fit: (weighted) least-squares
Observations (t_1, X_1), ..., (t_N, X_N) ∈ R × X
Model: x(t) = ϕ(c⋆, t), where c⋆ ∈ C ⊂ R^p is the parameter space
c⋆ estimated by
ĉ ∈ argmin_{c ∈ C} { (1/N) Σ_{i=1}^N w_i (ϕ(c, t_i) − X_i)² }
where w_i ≈ σ_i⁻¹ is roughly estimated from the name of the observer, the instrument, and the observed satellite (Vienne & Duriez, 1995)
Optimization method: start from an initial value of c + linearization
⇒ iterate until convergence
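Concretely, the fit can be implemented as a Gauss–Newton loop: linearize ϕ around the current parameter value, solve the resulting weighted linear least-squares step, and iterate. A minimal Python/NumPy sketch, with a toy sinusoidal model standing in for the real ϕ from TASS or NUMINT (fit_wls, phi, jac and all names here are illustrative assumptions, not the authors' code):

import numpy as np

def fit_wls(phi, jac, c0, t, X, w, n_iter=50, tol=1e-12):
    # Gauss-Newton for ĉ ∈ argmin_c (1/N) Σ w_i (ϕ(c, t_i) − X_i)²;
    # residuals are scaled by w_i, matching Λ = (Pᵀ Wᵀ W P)⁻¹ later on.
    c = np.asarray(c0, dtype=float)
    for _ in range(n_iter):
        r = w * (X - phi(c, t))            # weighted residuals
        P = w[:, None] * jac(c, t)         # weighted Jacobian (∂ϕ/∂c_k)
        step, *_ = np.linalg.lstsq(P, r, rcond=None)
        c = c + step                       # linearized update
        if np.linalg.norm(step) <= tol * (1.0 + np.linalg.norm(c)):
            break                          # converged
    return c

# toy model x(t) = c_0 + c_1 sin(c_2 t), standing in for the ephemeris model
phi = lambda c, t: c[0] + c[1] * np.sin(c[2] * t)
jac = lambda c, t: np.column_stack(
    [np.ones_like(t), np.sin(c[2] * t), c[1] * t * np.cos(c[2] * t)])

rng = np.random.default_rng(0)
t = np.linspace(0.0, 40.0, 500)
X = phi([0.1, 2.0, 1.3], t) + rng.normal(0.0, 0.05, t.shape)
c_hat = fit_wls(phi, jac, c0=[0.0, 1.8, 1.25], t=t, X=X, w=np.ones_like(t))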
Error sources for the ephemerides: summary
Internal error
Observation errors
Optimization error
Representation error when using the ephemerides
+ take into account the time distribution & heterogeneity of the observations
Monte-Carlo method on the Covariance Matrix (MCCM)
Draw ĉ^(1), ..., ĉ^(B) ∼ N(ĉ, Λ), where Λ = (Pᵀ Wᵀ W P)⁻¹, P = (∂ϕ(ĉ, t_i)/∂c_k)_{i,k}, W = diag(w_1, ..., w_N)
⇒ ∀k ∈ {1, ..., B}, an orbit (ϕ(ĉ^(k), t))_{t≥0}
⇒ ∀t ≥ 0, region of possible positions: {ϕ(ĉ^(k), t)}_{1≤k≤B}
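A sketch of the MCCM draw under the stated Gaussian assumption; c_hat, the Jacobian P and the weights w are assumed to come out of the least-squares fit (illustrative names, not the authors' code):

import numpy as np

def mccm_samples(c_hat, P, w, B=200, seed=0):
    # Λ = (Pᵀ Wᵀ W P)⁻¹ with W = diag(w_1, ..., w_N)
    WP = w[:, None] * P
    Lam = np.linalg.inv(WP.T @ WP)
    rng = np.random.default_rng(seed)
    return rng.multivariate_normal(c_hat, Lam, size=B)   # rows are ĉ^(k)

# toy usage with a 2-parameter linear model ϕ(c, t) = c_0 + c_1 t
t = np.linspace(0.0, 40.0, 200)
P = np.column_stack([np.ones_like(t), t])                # ∂ϕ/∂c_k
c_ks = mccm_samples(np.array([0.0, 1.0]), P, np.ones_like(t), B=200)
# evaluating ϕ(ĉ^(k), t) for each row then traces the region of possible positions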
Monte-Carlo method applied to the Observations (MCO)
∀k ∈ {1, . . . ,B}, DN(k) = (t1,X1(k)), . . . ,(tN,XN(k)) where ∀i ∈ {1, . . . ,N}, Xi(k)−Xi =εi,k ∼ N(0,σc2)
⇒ ∀k ∈ {1, . . . ,B},bc(k) =bc
DN(k)
⇒ ∀k ∈ {1, . . . ,B}, ϕ(bc(k),t)
t≥0
⇒ ∀t ≥0, region of possible positions: ϕ(bc(k),t)
1≤k≤B
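A sketch of MCO, assuming a generic refit callable in place of the least-squares estimator (here a plain polynomial fit, purely illustrative):

import numpy as np

def mco(t, X, refit, sigma_c, B=200, seed=0):
    # perturb the observations with N(0, σ_c²) noise and refit, B times
    rng = np.random.default_rng(seed)
    return np.array([refit(t, X + rng.normal(0.0, sigma_c, X.shape))
                     for _ in range(B)])   # row k holds ĉ^(k)

# toy usage: linear trend + noise; refit stands in for the ephemeris fit
rng = np.random.default_rng(1)
t = np.linspace(0.0, 40.0, 200)
X = 2.0 + 0.5 * t + rng.normal(0.0, 0.1, t.shape)
c_ks = mco(t, X, refit=lambda t, X: np.polyfit(t, X, deg=1), sigma_c=0.1)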
Another method is needed
MCCM assumes that ĉ ∼ N(c⋆, Λ)
⇒ wrong results, especially in the long term
MCO assumes that the errors X_i − P(t_i) = ε_i are i.i.d. Gaussian
⇒ but real errors are non-Gaussian, dependent, and not identically distributed
Asteroids with few observations: MCCM and MCO can already yield satisfactory results (Milani, 1999; Muinonen & Bowell, 1993; Virtanen et al., 2001; ...), which can still be improved
Many observations ⇒ long-term ephemerides & complex models ⇒ new methods needed
Resampling heuristics (bootstrap, Efron 1979)
Real world: P —(sampling)→ P_n ⇒ ĉ = ĉ(P_n)
Bootstrap world: P_n —(resampling)→ P_n^W ⇒ ĉ^W = ĉ(P_n^W)
precision = F_t(P, P_n) = (ϕ(c⋆, t) − ϕ(ĉ, t))² ≈ F_t(P_n, P_n^W) = (ϕ(ĉ, t) − ϕ(ĉ^W, t))²
The bootstrap for estimating the extrapolated error
∀k ∈ {1, ..., B}, D_N^(k) = (t_{I_1^(k)}, X_{I_1^(k)}), ..., (t_{I_N^(k)}, X_{I_N^(k)}), where ∀k, I_1^(k), ..., I_N^(k) are i.i.d. ∼ U({1, ..., N})
⇒ ∀k ∈ {1, ..., B}, ĉ^(k) = ĉ(D_N^(k))
⇒ ∀k ∈ {1, ..., B}, an orbit (ϕ(ĉ^(k), t))_{t≥0}
⇒ ∀t ≥ 0, region of possible positions: {ϕ(ĉ^(k), t)}_{1≤k≤B}
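A sketch of this pair bootstrap: draw the indices with replacement and refit on each resampled set (refit is again an illustrative stand-in for the least-squares estimator):

import numpy as np

def bootstrap(t, X, refit, B=200, seed=0):
    # I_1^(k), ..., I_N^(k) i.i.d. uniform on {1, ..., N}, for each k
    rng = np.random.default_rng(seed)
    N = len(t)
    out = []
    for _ in range(B):
        I = rng.integers(0, N, size=N)    # draw with replacement
        out.append(refit(t[I], X[I]))     # ĉ^(k) on the resampled data
    return np.array(out)

rng = np.random.default_rng(1)
t = np.linspace(0.0, 40.0, 200)
X = 2.0 + 0.5 * t + rng.normal(0.0, 0.1, t.shape)
c_ks = bootstrap(t, X, refit=lambda t, X: np.polyfit(t, X, deg=1))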
The block bootstrap
Implicit assumption of the bootstrap: i.i.d. data
⇒ How to deal with dependence (between the t_i and between the errors)?
Solution: the block bootstrap (e.g., Politis, 2003):
first, group the data into blocks (t_i, X_i)_{i ∈ B_ℓ} for 1 ≤ ℓ ≤ N_b
then, resample the blocks
⇒ dependence within the blocks is captured by the block bootstrap
Assumption: the blocks are (almost) independent
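A sketch of the block bootstrap, resampling whole blocks instead of single observations; grouping a 4-day series into yearly blocks is an illustrative choice (the right block size is exactly the open question discussed later in the talk):

import numpy as np

def block_bootstrap(blocks, refit, B=200, seed=0):
    # draw N_b block indices with replacement, concatenate, refit
    rng = np.random.default_rng(seed)
    Nb = len(blocks)
    out = []
    for _ in range(B):
        L = rng.integers(0, Nb, size=Nb)
        t = np.concatenate([blocks[l][0] for l in L])
        X = np.concatenate([blocks[l][1] for l in L])
        out.append(refit(t, X))           # dependence inside blocks is preserved
    return np.array(out)

# toy usage: yearly blocks over a 1960-2000 series (illustrative block choice)
rng = np.random.default_rng(1)
t = 1960.0 + np.arange(3650) * 4.0 / 365.25
X = 2.0 + 0.5 * (t - 1960.0) + rng.normal(0.0, 0.1, t.shape)
year = t.astype(int)
blocks = [(t[year == y], X[year == y]) for y in np.unique(year)]
c_ks = block_bootstrap(blocks, refit=lambda t, X: np.polyfit(t, X, deg=1))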
Data generation
N = 3650 observation dates (t_i)_{1≤i≤N}, with t_{i+1} − t_i = 4 days, from 1960 to 2000
Initial orbit: ∀i, X_i^(0) = x^(0)(t_i) = ϕ(ĉ, t_i), where ĉ is estimated from real data
Simulated k-th observation set: X_i^(k) = X_i^(0) + σ_{M(i)} ξ_i, where ξ_1, ..., ξ_N are i.i.d. N(0, 1), M(i) is the month to which t_i belongs, and σ_1, ..., σ_{M(N)} are i.i.d. N(µ, τ²) with µ = 0.15″ and τ = 0.05″
⇒ estimate ĉ^(k) by least-squares ⇒ x^(k)(t) = ϕ(ĉ^(k), t)
⇒ angular separation at time t: s_k(t) = √( [(α^(k)(t) − α^(0)(t)) cos δ^(0)(t)]² + [δ^(k)(t) − δ^(0)(t)]² )
⇒ Dependent observations, of rather good quality
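A sketch of this generation scheme; the initial orbit X^(0) is set to zero as a stand-in for ϕ(ĉ, t_i), and calendar months are approximated by 30-day bins (both are assumptions of the sketch, not the authors' setup):

import numpy as np
rng = np.random.default_rng(0)

t = np.arange(3650) * 4.0                      # days since 1960, 4-day spacing
month = (t // 30).astype(int)                  # M(i): 30-day bins approximate months
mu, tau = 0.15, 0.05                           # arcseconds
sigma = rng.normal(mu, tau, month.max() + 1)   # one σ_m per month, σ_m ~ N(µ, τ²)

X0 = np.zeros_like(t)                          # stand-in for ϕ(ĉ, t_i)
X_k = X0 + sigma[month] * rng.standard_normal(t.shape)   # k-th simulated set
# observations within a month share σ_{M(i)} ⇒ dependent errors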
Region of possible motions (K = 200)
Region of possible motions: Mimas (TASS)
Region of possible motions: Titan (TASS)
Average size of the region of possible motions
σ_S(t) = √( (1/K) Σ_{k=1}^K s_k(t)² )
where s_k(t) = √( [(α^(k)(t) − α^(0)(t)) cos δ^(0)(t)]² + [δ^(k)(t) − δ^(0)(t)]² ) is the (angular) separation between the k-th orbit and the initial orbit at time t
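In code (radians for α and δ, and (K, T) array shapes for the K resampled orbits, are assumptions of the sketch):

import numpy as np

def separation(alpha_k, delta_k, alpha_0, delta_0):
    # s_k(t): angular separation between the k-th and the initial orbit
    return np.sqrt(((alpha_k - alpha_0) * np.cos(delta_0)) ** 2
                   + (delta_k - delta_0) ** 2)

def sigma_S(alpha, delta, alpha_0, delta_0):
    # alpha, delta: (K, T) arrays; alpha_0, delta_0: (T,) initial orbit
    s = separation(alpha, delta, alpha_0[None, :], delta_0[None, :])
    return np.sqrt(np.mean(s ** 2, axis=0))   # RMS over k, one value per t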
Size of the region of possible motions: Mimas (TASS)
Size of the region of possible motions: Titan (TASS)
Principle of simulations
Performance of MCCM: Mimas (B = 200), TASS
Performance of MCO: Mimas (B = 200), TASS
Performance of the Bootstrap: Mimas (B = 200), TASS
Performance of the Block Bootstrap: Mimas (B = 200)
Correlation coefficient and multiplying factor (TASS)
correlation coefficient ρ_S = corr(σ_S^sim(t), σ_S^estim(t)); multiplying factor κ_S

                      Mimas            Titan
Method             ρ_S     κ_S     ρ_S     κ_S
MCCM               0.511   1.876   0.955   0.790
MCO                0.999   1.001   0.994   0.966
Bootstrap          1.000   1.458   0.999   1.456
Block Bootstrap    0.999   1.484   0.999   1.441
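A sketch of how the two summary numbers can be computed from the σ_S(t) curves; the slide does not spell out the exact definition of κ_S, so taking the mean ratio of the simulated to the estimated curve is an assumption here:

import numpy as np

def rho_kappa(sigma_sim, sigma_est):
    # ρ_S: correlation between the simulated and estimated σ_S(t) curves
    rho = np.corrcoef(sigma_sim, sigma_est)[0, 1]
    # κ_S: assumed here to be the mean ratio σ_S^sim(t) / σ_S^estim(t)
    kappa = np.mean(sigma_sim / sigma_est)
    return rho, kappa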
Comments
MCO shouldn't / can't be used on real data:
the data generation clearly favours MCO in the simulations (noise truly Gaussian, with constant variance)
unknown noise level(s), difficult to estimate precisely
inhomogeneity of the real observations (different coordinates, different kinds of observations) ⇒ one can't easily "add" noise
Problem of choosing the blocks:
dependent blocks ⇒ slight overestimation of the error
too large blocks ⇒ more variable estimation
question: when are two observations independent?
Multiplying factor for the bootstrap ∈ [1.4, 1.5]: why? how general is this?
κ_S seems much closer to 1 for NUMINT
How many resamples do we need? ρ_S (TASS)
How many resamples do we need? m_S (TASS)
Application: old vs. recent observations
Precision of old observations: Mimas (TASS)
Precision of recent observations: Mimas (TASS)
Precision when using all the observations: Mimas (TASS)
Precision of old observations: Titan (TASS)
Precision of recent observations: Titan (TASS)
Precision when using all the observations: Titan (TASS)
Precision when using all the observations: Japet (TASS)
Astronomical conclusions
Qualitative differences between satellites:
fast motion (Mimas) / slow motion (Titan)
main term of the mean longitude
Accurate observations over a short period can be less useful than noisy observations over a long period
⇒ old observations are indeed useful
Other applications (Desmars' Ph.D., 2009):
expected error reduction from the Gaia mission (a few very accurate observations + improvement of the accuracy of past observations)
asteroids: Toutatis (time-space accuracy of close approaches to the Earth)
Toutatis: will December 12th be the end of the world?
Mathematical conclusions
Bootstrap: a versatile and robust method for estimating the extrapolated error
Building blocks ⇒ handles dependence between observations
Open problems:
multiplying factor κ_S
formal proofs: known results in simpler statistical frameworks only
theoretical link between sensitivity to initial conditions and resampling-based estimators of the extrapolated error
what about other resampling methods (e.g., subsampling)?
Expected improvement of precision thanks to Gaia results: Mimas
Expected improvement of precision thanks to Gaia results: Encelade
Toutatis orbit
Toutatis: time-space precision of the close approach to Earth on December 12th, 2012
Toutatis: time-space precision of the close approach to Earth in October 2322
Results with NUMINT instead of TASS (B = 30 samples)

                      Mimas            Titan
Method             ρ_S     κ_S     ρ_S     κ_S
MCO                0.989   0.848   0.997   0.723
Bootstrap          0.999   1.041   0.997   0.832
Block Bootstrap    0.981   0.999   0.997   0.842