• Aucun résultat trouvé

Epidemics with Random Sampling

N/A
N/A
Protected

Academic year: 2022

Partager "Epidemics with Random Sampling"

Copied!
37
0
0

Texte intégral

(1)

Epidemics with Random Sampling

Amaury Lambert

MCEB 2013, Hameau de l’Étoile, 28 mai 2013

(2)

Epidemics with sampling : 2 problems

Assumption :

Each infective is sampled/detected after some random time (symptoms, medical exam)

Goal :

Infer epidemiological dynamics from...

Problem 1.Pathogen sequence data sampled atall detection times

= w/ T. Stadler and H. Alexander (ETH Zürich)

Problem 2.Hospital data in antibiotic-resistant epidemics stoppedat the first detection time

= w/ P. Trapman (U. Stockholm)

(3)

Epidemics with sampling : 2 problems

Assumption :

Each infective is sampled/detected after some random time (symptoms, medical exam)

Goal :

Infer epidemiological dynamics from...

Problem 1.Pathogen sequence data sampled atall detection times

= w/ T. Stadler and H. Alexander (ETH Zürich)

Problem 2.Hospital data in antibiotic-resistant epidemics stoppedat the first detection time

= w/ P. Trapman (U. Stockholm)

(4)

Epidemics with sampling : 2 problems

Assumption :

Each infective is sampled/detected after some random time (symptoms, medical exam)

Goal :

Infer epidemiological dynamics from...

Problem 1.Pathogen sequence data sampled atall detection times

= w/ T. Stadler and H. Alexander (ETH Zürich)

Problem 2.Hospital data in antibiotic-resistant epidemics stoppedat the first detection time

= w/ P. Trapman (U. Stockholm)

(5)

Problem 1 : Stoppingat all dots before timet

Problem 2 : Stoppingat first dot = timeT

r

r r

t

0

r r r

t

T

0

(6)

General framework

Continuous time

Branching process assumption :Excess of susceptibles

Age-dependentdeath rate :The duration of infectiousness can have anarbitrarydistribution

Constantbirth rate(transmission)

Constantdetection rate

(7)

General framework

Continuous time

Branching process assumption :Excess of susceptibles

Age-dependentdeath rate :The duration of infectiousness can have anarbitrarydistribution

Constantbirth rate(transmission)

Constantdetection rate

(8)

General framework

Continuous time

Branching process assumption :Excess of susceptibles

Age-dependentdeath rate :The duration of infectiousness can have anarbitrarydistribution

Constantbirth rate(transmission)

Constantdetection rate

(9)

General framework

Continuous time

Branching process assumption :Excess of susceptibles

Age-dependentdeath rate :The duration of infectiousness can have anarbitrarydistribution

Constantbirth rate(transmission)

Constantdetection rate

(10)

General framework

Continuous time

Branching process assumption :Excess of susceptibles

Age-dependentdeath rate :The duration of infectiousness can have anarbitrarydistribution

Constantbirth rate(transmission)

Constantdetection rate

(11)

Splitting tree in forward time (Geiger & Kersting 97)

Thetransmission treewith samplingcan be described by an asexual population withmarkswhere

6 t

r r

r

r r

individuals reproduce independently

death rate may depend onage

birth rate is constant

detection rate is constant.

The population size process(Nt;t≥0)is anon-Markovian birth–death process.

(12)

Splitting tree in forward time (Geiger & Kersting 97)

Thetransmission treewith samplingcan be described by an asexual population withmarkswhere

6 t

r r

r

r r

individuals reproduce independently

death rate may depend onage

birth rate is constant

detection rate is constant.

The population size process(Nt;t≥0)is anon-Markovian birth–death process.

(13)

Without marks (1)

Thetransmission tree stopped attcan beorientedas follows...

6

?

?

?

t 0 1 2 3

H1 H2

H3

b b

b b b

?H4 4

...where the timesH1,H2,H3. . .are thenode depths.

(14)

Without marks (2)

Theoriented,reconstructed tree attcan be represented...

Like this...

...Or like that

? ?

? ?

? ?

t

H1 H1

H2 H2

H3 H3

c

c c c c

? ?

H4 c c c c H4 c

(15)

Coalescent point process

Mathematical result :The node depthsH1,H2,H3. . .of the tree form a sequence ofindependent and identically distributed positive random variables killed at its first value larger thant

(Lambert 2010, Lambert & Stadler 2013)

In particular, conditional onNt6=0, the population sizeNt follows ageometric distribution.

Such a tree is called acoalescent point-process:

fast simulationof reconstructed trees

Easy computation of thelikelihood of a tree(product form).

(16)

Coalescent point process

Mathematical result :The node depthsH1,H2,H3. . .of the tree form a sequence ofindependent and identically distributed positive random variables killed at its first value larger thant

(Lambert 2010, Lambert & Stadler 2013)

In particular, conditional onNt6=0, the population sizeNt follows ageometric distribution.

Such a tree is called acoalescent point-process:

fast simulationof reconstructed trees

Easy computation of thelikelihood of a tree(product form).

(17)

Coalescent point process

Mathematical result :The node depthsH1,H2,H3. . .of the tree form a sequence ofindependent and identically distributed positive random variables killed at its first value larger thant

(Lambert 2010, Lambert & Stadler 2013)

In particular, conditional onNt6=0, the population sizeNt follows ageometric distribution.

Such a tree is called acoalescent point-process:

fast simulationof reconstructed trees

Easy computation of thelikelihood of a tree(product form).

(18)

Coalescent point process

Mathematical result :The node depthsH1,H2,H3. . .of the tree form a sequence ofindependent and identically distributed positive random variables killed at its first value larger thant

(Lambert 2010, Lambert & Stadler 2013)

In particular, conditional onNt6=0, the population sizeNt follows ageometric distribution.

Such a tree is called acoalescent point-process:

fast simulationof reconstructed trees

Easy computation of thelikelihood of a tree(product form).

(19)

Coalescent point process

6

t 1 2 3 4 5 6

?

?

?

?

?

?

H1 H2

H3 H4

H5 H6

Likelihood of tree withnode depths(hi)1≤i≤n−1

L(T) =p(t)

n−1

i=1

f(hi).

(20)

Adding marks : 2 problems

Problem 1.Likelihood of the treespanned by all the distinct detection times ?

;Previously known : case when death rate is constant(Stadler et al 2012)

Problem 2.Likelihood of the treestopped at the first detection timeT?

;Previously known :NT is (still) geometric(Trapman & Bootsma 2009)

(21)

Adding marks : 2 problems

Problem 1.Likelihood of the treespanned by all the distinct detection times ?

;Previously known : case when death rate is constant(Stadler et al 2012)

Problem 2.Likelihood of the treestopped at the first detection timeT?

;Previously known :NT is (still) geometric(Trapman & Bootsma 2009)

(22)

Problem 1 : Assumptions and notation

Asampled individualimmediatelyleavesthe infective population.

Si:=sampling timeof individuali

Ri:=coalescence timebetween individualsi−1 andi.

t

t t

t

0 1

2

3

? 6

?

S1 6

S2

? 6

?

R1 6R2

(23)

Problem 1 : Result

Theorem (Lambert, Alexander & Stadler 2013) The pairs(Si,Ri)form akilled Markov chainwith semi-explicit transitions, where the transition probability only depends on the second component(Si).

;Inference of model parameters from viral phylogenies (HIV, flu).

For any givenorientedtreeT withcoalescence times(xi)2≤i≤nand sampling times(yi)1≤i≤n,

L(T) =g(y1)k(yn)

n

i=2

f(yi−1;xi,yi).

Limitations :

1 Transitions are only semi-explicit ;

2 The Markov chain property depends on the orientation...

(24)

Problem 1 : Result

Theorem (Lambert, Alexander & Stadler 2013) The pairs(Si,Ri)form akilled Markov chainwith semi-explicit transitions, where the transition probability only depends on the second component(Si).

;Inference of model parameters from viral phylogenies (HIV, flu).

For any givenorientedtreeT withcoalescence times(xi)2≤i≤nand sampling times(yi)1≤i≤n,

L(T) =g(y1)k(yn)

n

i=2

f(yi−1;xi,yi).

Limitations :

1 Transitions are only semi-explicit ;

2 The Markov chain property depends on the orientation...

(25)

Problem 1 : Result

Theorem (Lambert, Alexander & Stadler 2013) The pairs(Si,Ri)form akilled Markov chainwith semi-explicit transitions, where the transition probability only depends on the second component(Si).

;Inference of model parameters from viral phylogenies (HIV, flu).

For any givenorientedtreeT withcoalescence times(xi)2≤i≤nand sampling times(yi)1≤i≤n,

L(T) =g(y1)k(yn)

n

i=2

f(yi−1;xi,yi).

Limitations :

1 Transitions are only semi-explicit ;

2 The Markov chain property depends on the orientation...

(26)

Problem 2 : Assumptions

Patients havei.i.d lengths of stayin the hospital, all distributed as some r.v.K

Conditionalon infection, the length of stay of a patient is a size-biasedversion ofK

Detection rate per patient=:δ

(27)

Problem 2 : Notation

For individuali, set

Ui:=time elapsed from entrance of the hospital up to infection

Ai:=time elapsed from infection up toT

Ri:=residual lifetime in the hospital afterT.

Setm:=E(K)and letφdenote the inverse of the convex function x7→x−λ

m Z

(0,∞]

(1−e−xy)P(K>y)dy.

T 6

? 6

? 6

? 6

transmission

U A s R

? 6

? 6

J K

(28)

Problem 2 : Result

Theorem (Lambert & Trapman 2013)

Conditional on NT=n, the triples(Ui,Ai,Ri)of the n infectives at time T areindependent and identically distributed, distributed as

E(f(U,A,R)) = λ

m

φ(δ) φ(δ)−δ

Z

u=0

du Z

a=0

da Z

z=u+aP(K∈dz)e−φ(δ)af(u,a,z−u−a),

In particular, the timesJi=Ui+Aispent in the hospital up to time T areindependent and identically distributed, distributed as the r.v. J

P(J∈dy) = λ/m

φ(δ)−δ P(K>y) 1−e−φ(δ)y dy.

;Inference from hospital data (dates of entrance in the hospital).

(29)

Problem 2 : Result

Theorem (Lambert & Trapman 2013)

Conditional on NT=n, the triples(Ui,Ai,Ri)of the n infectives at time T areindependent and identically distributed, distributed as

E(f(U,A,R)) = λ

m

φ(δ) φ(δ)−δ

Z

u=0

du Z

a=0

da Z

z=u+aP(K∈dz)e−φ(δ)af(u,a,z−u−a),

In particular, the timesJi=Ui+Aispent in the hospital up to time T areindependent and identically distributed, distributed as the r.v. J

P(J∈dy) = λ/m

φ(δ)−δ P(K>y) 1−e−φ(δ)y dy.

;Inference from hospital data (dates of entrance in the hospital).

(30)

Problem 2 : Result

Theorem (Lambert & Trapman 2013)

Conditional on NT=n, the triples(Ui,Ai,Ri)of the n infectives at time T areindependent and identically distributed, distributed as

E(f(U,A,R)) = λ

m

φ(δ) φ(δ)−δ

Z

u=0

du Z

a=0

da Z

z=u+aP(K∈dz)e−φ(δ)af(u,a,z−u−a),

In particular, the timesJi=Ui+Aispent in the hospital up to time T areindependent and identically distributed, distributed as the r.v. J

P(J∈dy) = λ/m

φ(δ)−δ P(K>y) 1−e−φ(δ)y dy.

;Inference from hospital data (dates of entrance in the hospital).

(31)

Jumping contour of a tree

a) Binary tree with edge lengths and b) Contour process of its truncation below timet.

6

@

@

@ @

@

@ a)

b) T

@

@

@@@@ @

@

@

- 6

T

(32)

Jumping contour of a tree with marks

a) Binary tree with marks, b) its reconstructed tree and c) its contour process.

r r r

r r

t r

0 0

1 1

2 2

3 3

? 6

? 6

? 6

? 6

? ?

6 6

S1 S1

S2 S3 S2 S3

? 6

? 6

? 6

?

R2 R3 R2 R63

a) b)

r@ r r

@

@@

@

@

@

@

@

@

@@

@

@

@

@

@

@

@

t6

0 -

c)

(33)

Co-authors

Helen ALEXANDER(ETHZ) . . . .

Tanja STADLER(ETHZ) . . . .

Pieter TRAPMAN(U. Stockholm) . . . .

(34)

SMILE : A cross-disciplinary group in CIRB

CIRB =Center for Interdisciplinary Research in Biology (Collège de France)

SMILE =Stochastic Models for the Inference of Life Evolution

(35)

SMILE group in June 2012

(36)

Institutions

Stochastic Models for the Inference of Life Evolution(SMILE)

Center for Interdisciplinary Research in Biology

Collège de France

Stochastics & Biology group

Laboratoire de Probabilités et Modèles Aléatoires

UPMC University Paris 06

ANRModèles Aléatoires eN Écologie, Génétique, Évolution(MANEGE)

(37)

Conference announcement Mathematics for an Evolving Biodiversity

Montréal, Canada September 16–20, 2013

Organizers : Jonathan Davies (McGill), Nicolas Lartillot (CNRS & U.

Montréal) and myself

http://www.crm.umontreal.ca/2013/Biodiversity13/

Références

Documents relatifs

The shapes of the observed DSDs for eight classes of R are compared: (a) with curves fitted by using X i (R) relations in the N(D) lognormal expression, (b) with curves fitted from

With a careful choice of functions f and g, one can compute the asymptotic behaviour of the consistent maximal displacement at time n, which is [12, Theorem 1] and [13, Theorem 1.4];

The phasing of precession and obliquity appears to influence the persistence of interglacial conditions over one or two in- solation peaks: the longest interglacials are

The main objective of the present study was to determine the effects of age, exercise duration, and test site on HRV variables at rest and during exercise and recovery in young

In this paper, we use the theory of generalized likelihood ra- tio tests (GLRT) to study the adaptive version of the asymp- totical Bayesian Optimum Radar Detector (BORD) built with

As  one  can  see  in  Figure  7,  in  the  present  work,  the  rate  constant  of  reaction  (1)  was  observed  to  be  temperature  independent  at  T 

Thus, from the hydrogen peroxide concentration and pH time-courses, which were correlated to the ozone mass-transfer rate assuming a pseudo-first order absorption regime, the

OPTIMAL CONTROL OF A POPULATION DYNAMICS MODEL WITH MISSING BIRTH RATE9. SIAM Journal on Control and Optimization, Society for Industrial and Applied Mathematics,