HAL Id: hal-02746369
https://hal.inrae.fr/hal-02746369
Submitted on 3 Jun 2020
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Modeling and estimating parameters for epidemics using
diffusion processes. Application to influenza surveillance
Romain Guy, Catherine Laredo, Elisabeta Vergu
To cite this version:
Romain Guy, Catherine Laredo, Elisabeta Vergu. Modeling and estimating parameters for epidemics
using diffusion processes. Application to influenza surveillance. Dynstoch Meeting 2013, Apr 2013,
Copenhague, Denmark. pp.43. �hal-02746369�
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Modeling and estimating parameters for epidemics using diusion
processes.
Application to inuenza surveillance
Romain GUY1,2
Joint work with C. Larédo1,2and E. Vergu1 1UR 341, MIA, INRA, Jouy-en-Josas 2 UMR 7599, LPMA, Université Paris Diderot
Dynstoch April 18th, 2013
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes Outline
Main goal: provide a framework to analyze epidemic data, from modeling to parameter estimation
1 Characteristics of epidemic processes
Constraints related to the observation of the epidemic process Mechanistic model, mathematical formalism and statistical framework 2 Diusion approximation for density dependent Markov Jump processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models 3 Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval Partial observations of the epidemics
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes Inference for epidemics using diusion processes
Constraints related to the observation of the epidemic process Mechanistic model, mathematical formalism and statistical framework
Outline
1 Characteristics of epidemic processes
Constraints related to the observation of the epidemic process Mechanistic model, mathematical formalism and statistical framework
2 Diusion approximation for density dependent Markov Jump processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
3 Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval Partial observations of the epidemics
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes Inference for epidemics using diusion processes
Constraints related to the observation of the epidemic process
Mechanistic model, mathematical formalism and statistical framework
Incomplete Data
Imperfect data
Incomplete observations Temporally aggregated Sampling & reporting error Unobserved cases
Dierent dynamics
Ex.: Inuenza like illness cases (Sentinelles surveillance network)
One outbreak (3 months)
Recurrent oubreaks (27 years)
Main goal: key parameter estimation
Basic reproduction number, R0 (nb. of secondary cases generated by one primary case in an entirely susceptible population) Average infectious time period (d)
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes Inference for epidemics using diusion processes
Constraints related to the observation of the epidemic process
Mechanistic model, mathematical formalism and statistical framework
First step (Mechanistic model): Compartmental representation of the population dynamics Dene: Nb. of health states, possible transitions and associated rates
Notations N population size, λ transmission rate, γ recovery rate (R0=λγ, d = 1γ) S, I , R numbers of susceptible, infected, removed individuals
One outbreak: SIR model
Closed population ⇒ N=S + I + R Well-mixing population
⇒ (S, I )λSI /N→ (S − 1, I + 1)
Several outbreak: SIRS with seasonality
δ: waning immunity rate (years)−1 µ: demog. renewal rate (decades)−1 λ(t) = λ0(1 + λ1sin(2πTt
per))
λ0 basic trans. rate, λ1seasonal eect λ1=0 ⇒ oscillations vanish
Observations: taken as new infecteds or new removeds (short infectious period)
Zt+∆ t λSIdt or
Zt+∆ t γIdt
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes Inference for epidemics using diusion processes
Constraints related to the observation of the epidemic process
Mechanistic model, mathematical formalism and statistical framework
Second & Third steps: mathematical formalisms and statistical inference techniques
Stochastic modeling
Pure jump processes (State space Nd) d nb of dierent health states, Exponential holding times ⇒ Markov processes.
Deterministic modeling with noise
ODEs solutions on Rd
Statistical inference usually based on Least squares methods
Statistical inference for stochastic models
Observation of all the jumps: MLE,
⇒Infectious and recovery datesobserved for all individuals,
Incomplete observations: data augmentation methods (e.g. Breto et al (09), Toni et al (09)),
⇒Computer intensive simulations
⇒Computation times: too large especially for large population sizes Not addressed: parameter identiabilitygiven the data.
Consider statistics for diusion processes for an analytic approach (third step) ⇒Before the third step: need of a rigorous and user-friendly method to build diusion processes (second step).
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
Outline
1 Characteristics of epidemic processes
Constraints related to the observation of the epidemic process Mechanistic model, mathematical formalism and statistical framework
2 Diusion approximation for density dependent Markov Jump processes Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
3 Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval Partial observations of the epidemics
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes
Links with other approximations
Application to SIR and SIRS epidemic models
General result for time dependent Markov Jump processes
Results from (Guy et al (13)), extending (Ethier & Kurtz (05))
Density dependent Markov process
The state space is E = {0, .., N}d, E−= {−N, ..., N}d
Dene non negative measurable functions: αl(.) :E → R+indexed by l ∈ E− (ZN(t), t ≥ 0) Markov process with state space E, starting from Z(0) = Nz0 Transition rates k → k + l : qk,k+l(t) = αl(t, k)
Fundamental assumption for density dependent Markov process
(H): The functions αl(.)satisfy: ∀x ∈ [0, 1]d, ∀t ∈ [0, T ],N1αl(t, [Nx]) → βl(t, x) as N → ∞
(for y = (x1, . . . ,xd) ∈ Rd, [y] = ([y1], . . . , [yd])with [yi]integer part of yi
Main idea
The deterministic limit of ZN(t)
N , the Gaussian approximation ofZ(t)N −x(t) and the diusion approximation ofZN(t)
N are completely determined by the collection of functions (βl)l∈E−
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes
Links with other approximations
Application to SIR and SIRS epidemic models
Detailed result
Denition of drift and diusion functions
Dene b(t, x) = X l∈E− lβl(t, x), and Σ(t, x) = X l∈E− llτβ l(t, x) Approximation result Deterministic limit: x(t) = z0+ Z t 0 b(s, x(s))ds satises, sup t∈[0,T ]k ZN(t) N −x(t)k −→N→∞0 in probability.
CLT: Dene Φ(t, s) as the solution of dΦ(t,s)dt =∂∂bx(t, x(t))Φ(t, s), Φ(s, s) = Id √
N(Z(t)N −x(t)) −→
N→∞g(t) centered Gaussian process with covariance Cov(g(t), g(r)) = R0t∧rΦ(t ∧ r, s)Σ(s, x(s))Φ(t ∧ r, s)τds
Di. Approx.: Dene σ(t, x(t)) a solution of σστ = Σ, Xt solution of dXt=b(t, Xt)dt +√1
Nσ(t, Xt)dBt,X0=z0: sup
t∈[0,T ]
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes
Links with other approximations
Application to SIR and SIRS epidemic models
Other links between the processes
Taylor stochastic expansion of the diusion
Under regularity assumptions, Xt=x(t) +√1
Ng(t) + OP(N1), ⇒g(t) =
Zt
0 Φ(t, s)σ(x(s))dBs
Conclusion: Gaussian approximation and diusion approximation of the Markov Jump process are mainly equivalent as N → ∞
→Diusion approximation
→Expansion in N of the process →Taylor's stochastic expansion
Characteristics of the approximation
Diusion with small diusion coecient b(t, x) = X l∈E− lβl(t, x), Σ(t, x) = X l∈E− llτβ l(t, x) ⇒same parameter in the drift and diusion functions
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
Application: diusion approximation of the SIR epidemics (1/2)
Collection functions αL(.)
Only two transitions: (S, I ) → (S − 1, I + 1) and (S, I ) → (S, I − 1),
L = (−1, +1) ⇒ αL(S, I ) = λSNI, L = (0, −1) ⇒ αL(S, I ) = γI . Q-matrix of Z(t): qk,k+L= αL(k).
Computation of the limit functions βL(x)
Let x = (s, i) ∈ [0, 1]2 1 Nα(−1,1)([Nx]) = N1Nλ[Ns][Ni] −→ N→+∞ β(−1,1)(s, i) = λsi 1 Nα(0,−1)([Nx]) = N1γ[Ni] −→ N→+∞ β(0,−1)(s, i) = γi. Functions b and Σ
depend on the two parameters (λ, γ) ⇒ b(λ,γ)(s, i), Σ(λ,γ)(s, i), b(λ,γ)(s, i) = −λsi −1 1 + γi 0 −1 = −λsi λsi − γi . Σ(λ,γ)(s, i) = λsi −1 1 −1 1 + γi 0 −1 0 −1 = λ−λsisi λsi + γi−λsi .
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
Application: diusion approximation of the SIR epidemics (2/2)
Diusion approximation Assume (S0 N,IN0)N→+∞−→ (s0,i0) =x0; Choose σ(s, i) = √ λsi 0 −√λsi √γi solution of σστ= Σ ( dSt = −λStItdt +√1 N √ λStItdB1(t) dIt = (λStIt− γSt)dt −√1 N √ λStItdB1(t) +√1 N √ γItdB2(t)
Small values of R0=λγ: appropriate signal/noise ratio in large population
Figure:Number of infected Itthrough time for N = 10000, R0=1.5,γ = 1/3,(s0,i0) = (0.9990.001) (signal/noise ratio :√It
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
Signal/noise ratio and observations for Sentinelles surveillance network data
Figure:Reported (ρ) new removed (ρZtk+1 tk
γItdt) through time for N = 6.107(signal/noise ratio ∈ [0.1, 6])
Specicity of the SIR observations
Zt k+1 tk ρλStItdt = ρ(Stk −St k+1), and Zt k+1 tk ργItdt = ρ(Rtk+1 −Rt k)
⇒transform integrated observations in discrete observations of increments of other coordinates
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
Diusion approximation of the SIRS model
Description of ZN(t) = (S(t), I (t)) Notations: (s, i) = (S N,NI) ∈ [0, 1]2 β(−1,+1)(s, i) = λs(i + η), β(0,−1)(s, i) = (γ + µ)i, β(−1,0)(s, i) = µs, β(1,0)(s, i) = µ + δ(1 − s − i)
ODE and Diusion approximation
b(t, (s, i)) = PLLβL(t, (s, i)) =−λ(t)s(i + η) + δ(1 − s − i) + µ(1 − s) λ(t)s(i + η) − (γ + µ)i
Σ(t, (s, i)) = PLβL(t, (s, i))LLτ=
λ(t)s(i + η) + δ(1 − s − i) + µ(1 + s) −λ(t)s(i + η) −λ(t)s(i + η) λ(t)s(i + η) + (γ + µ)i
.
Characteristics of epidemic processes
Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
A model appropriate for very large populations
Dierent behaviors depending on the parameters (λ1 bifurcation parameter)
Proportion of infected over time (N = 107, R0=1.5 1
γ =3(day), Tper=365(day), 1
δ =2(yrs), η = 10
−6 and λ1∈ {0.05, 0.15} (λ(t) = λ0(1 + λ1cos(2πt/T ))
⇒Due to reporting rate ρ, signal/noise ratio is lower for Sentinelles data
Observations for SIRS model
Ztk+1 tk ρλStItdt6=ρ(Stk−Stk+1) ⇒≈ ρλ∆StkItk Ito req.: b(t, x) ⇒ b(t, x) 1 ∆ Z t k+1 tk ργItdt ≈ ργItk
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval Partial observations of the epidemics
Outline
1 Characteristics of epidemic processes
Constraints related to the observation of the epidemic process Mechanistic model, mathematical formalism and statistical framework
2 Diusion approximation for density dependent Markov Jump processes
Diusion approximation for time dependent Markov jump processes Links with other approximations
Application to SIR and SIRS epidemic models
3 Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval Partial observations of the epidemics
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
Specicities of the statistical framework
Model: (Xt)Multidimensional diusion process on Rd, dXt=b(θ1,Xt)dt + σ(θ2,Xt)dBt,X0=x0∈ Rd.
Observations: Discrete observations with sampling ∆ on [0, T ] Xtk for tk=k∆, k ∈ {0, .., n}, tk∈ [0, T ] (n∆ = T ), T is xed
Dierent rates of convergence for parameters in the drift and in the diusion coecient⇒splitting the parameters θ1, θ2required
Continuous observation of the diusion on [0, T ] (Kutoyants (80))
MLE: −1 θMLE
1 − θ01 → N 0, Ib(θ10, θ20)−1
Discrete observations of the diusion on [0, T ]
Minimum contrast approaches: estimation of θ2 at rate√n (⇒ ∆ = ∆n→0).
Diusion approximation of density dependent processes and Epidemics
=√1
N; Identical parameters in the drift and diusion coecients: θ1= θ2. N >> n ⇒Focus on θ1for a more accurate estimation
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
Case of Time dependent diusion
Existing results for multidim. autonomous di. with small di. coe., discretely obs.
For , ∆ → 0 (Sorensen & Uchida (03), Gloter & Sorensen (09), Guy et al. (12)): −1( ˆθ01− θ10) → N (0, Ib−1(θ10, θ02))
For → 0, ∆ xed (Guy et al. (2012)):−1( ˆθ0
1− θ10) → N (0, I∆−1(θ01, θ02))
Contrast for discrete observations from (Guy et al. (13))
˜ U= n X k=1 log(det(Ck(θ1))) + 1 2∆ tN k(θ1)Ck−1(θ1)Nk(θ1), with Ck(θ1) = 1∆Ztk tk−1Φ(tk, s)Σ(x(s))Φ(tk, s)τ ds, Nk(θ1) = Xtk − xθ1(tk) − Φ(tk, tk−1)(Xtk − xθ1(tk))
Fundamental property based on the Gaussian process g
g(tk) = Φ(tk,tk−1)g(tk−1) + Z tk
tk−1Φ(tk,s)σ(x(s))dBs
Extension
Results hold for dXt=b(t, Xt)dt + σ(t, Xt)dBt
Taylor Stoch. expansion (Azencott (82)) still valid for non autonomous di. Rem: Results do not hold for general model dXt=b(t, Xt)dt + σ(t, Xt)dBt ⇒But hold for b(t, x) = b0(t, Xt) + 2b1(t, Xt)(case of new infecteds)
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
Application of the time dependent case: estimation on simulations of the SIRS model
Figure:Mean point estimation (on 1000 runs) and theoretical condence ellipsoid for a simulation case close to inuenza dynamics (N = 106, R
0=1.5, d = 3(days), λ1=0.15 ,1/δ = 2(years)) for ∆ = 1 and 7 days
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
The Sentinelles surveillance network data
The data
Reported cases over time during 22 years (1990-2011, 1obs/day)
SIR model
Each year, starting point s0,i0not known
Estimation of i0dicult even with complete data
SIRS model
Bifurcation
Deterministic limit can not completely t the data
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
Theoretical results in the case of diusion with small diusion coecient Additional assumption: Only the rst component is discretely observed
Existing results for partial observations
(Kutoyants (94)): dXt=ft(θ)Ytdt + dBt1,X0=0; dYt=bt(θ)Ytdt + σt(θ)dBt2,Y0=y0; Continuous observation of Xt, functions ft,bt, σtknown:
consistence and asymptotic normality (at rate −1) for MLE of parameter θ. Generalization by linearization of the drift function around its deterministic limit Identiability assumption based on identiability of the deterministic limit of ft(θ)E Yt|Xs≤t
(James & Le Gland (95)): Consistence result for the MLE estimator in the general case of bidimensionnal process (Xt,Yt)with only the coordinate Xtobserved ⇒identiability of parameters based on x(t) deterministic limit of Xt ⇒If we observe only the increments of ργIt
kfor the SIRS model: the deterministic
identiability is ensured for θ1= (ρ, λ0, λ1, γ, δ)with η, µ, s0,i0 xed.
Our result (ongoing work)
Consistency and CLT, in the asymptotics → 0, ∆ → 0: −1( ˆα − α0)
√
n( ˆβ − β0)
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
Application to the estimation from real data (Sentinelles surveillance network)(1/2)
Figure: Deterministic trajectories associated to estimated values α R0 d (days) 1δ(years) λ1 ρ
ˆ
α1 1.43 2.73 1.8404 0.1304 27.84
ˆ
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
Application to the estimation from real data (Sentinelles surveillance network)(2/2)
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
References
(Guy et al (13)) Approximation of epidemic models by diusion processes and their statistical inference. submitted
(Guy et al (12)) Parametric inference for discretely observed multidimensional diusions with small diusion coecient. to be published in S.P.A.
(Ethier & Kurtz (05)) Markov Processes: Characterization and Convergence (Kutoyants (94)) Identication of Dynamical Systems with Small Noise (James & Le Gland (95)) Consistent parameter estimation for partially observed diusions with small noise
(Sorensen & Uchida (03)) Small-diusion asymptotics for discretely sampled stochastic dierential equations
(Sorensen & Gloter (09)) Estimation for stochastic dierential equations with a small diusion coecient
(Azencott (82)) Formule de Taylor stochastique et developpement asymptotique d'integrales de Feynmann
(Breto et al (09)) Time series analysis via mechanistic models
(Toni et al (09)) Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems
Characteristics of epidemic processes Diusion approximation for density dependent Markov Jump processes
Inference for epidemics using diusion processes
Theoretical results for discrete observations on a nite time interval
Partial observations of the epidemics
Hints for proof of the diusion approximation
Complete proof by application of Theorems from Jacod & Shirayev
Representation Theorem (Ethier & Kurtz (05) Ch. 6.4)
Z(t) = Z(0) + Pl∈E−lYl(
Rt
0αl(Z(s))ds) , where (Yl)is a familly of independent Poisson process
Theorem: Representation of X (t) with random time changes
X (t) = X (0) + PlalWl(Rt
0βl(X (s))ds) + R0t X l∈E−
lβl(X (s))ds, where (Wl)is a familly of independent Brownian motions