• Aucun résultat trouvé

Quasi-stationary distributions and resilience: what to get from a sample?

N/A
N/A
Protected

Academic year: 2022

Partager "Quasi-stationary distributions and resilience: what to get from a sample?"

Copied!
39
0
0

Texte intégral

(1)

Quasi-stationary distributions and resilience: what to get from a sample?

Tome 7 (2020), p. 943-980.

<http://jep.centre-mersenne.org/item/JEP_2020__7__943_0>

© Les auteurs, 2020.

Certains droits réservés.

Cet article est mis à disposition selon les termes de la licence LICENCE INTERNATIONALE DATTRIBUTIONCREATIVECOMMONSBY 4.0.

https://creativecommons.org/licenses/by/4.0/

L’accès aux articles de la revue « Journal de l’École polytechnique — Mathématiques » (http://jep.centre-mersenne.org/), implique l’accord avec les conditions générales d’utilisation (http://jep.centre-mersenne.org/legal/).

Publié avec le soutien

du Centre National de la Recherche Scientifique

Publication membre du

Centre Mersenne pour l’édition scientifique ouverte

(2)

QUASI-STATIONARY DISTRIBUTIONS AND RESILIENCE:

WHAT TO GET FROM A SAMPLE?

by Jean-René Chazottes, Pierre Collet, Servet Martínez

& Sylvie Méléard

Abstract. — We study a class of multi-species birth-and-death processes going almost surely to extinction and admitting a unique quasi-stationary distribution (qsd for short). When rescaled by K and in the limit K +∞, the realizations of such processes get close, in any fixed finite-time window, to the trajectories of a dynamical system whose vector field is defined by the birth and death rates. Assuming this dynamical system has a unique attracting fixed point, we analyzed the behavior of these processes for finiteKand finite times, “interpolating"

between the two limiting regimes just mentioned. In the present work, we are mainly interested in the following question: Observing a realization of the process, can we determine the so-called engineering resilience? To answer this question, we establish two relations which intermingle the resilience, which is a macroscopic quantity defined for the dynamical system, and the fluctuations of the process, which are microscopic quantities. Analogous relations are well known in nonequilibrium statistical mechanics. To exploit these relations, we need to introduce several estimators which we control for times betweenlogK(time scale to converge to the qsd) and exp(K)(time scale of mean time to extinction).

Résumé(Distributions quasi-stationnaires et résilience : que peut-on obtenir des données ?) Nous étudions une classe de processus de naissance et mort avec plusieurs espèces dans la situation où l’extinction est certaine et la distribution quasi-stationnaire est unique. Si on fixe un intervalle de temps fini et qu’on normalise les réalisations d’un tel processus par un paramètre d’échelleK, elles deviennent arbitrairement proches, dans la limiteK+∞, des trajectoires d’un certain système dynamique dont le champ de vecteurs est défini à partir des taux de naissance et mort. Quand le système dynamique admet un seul point fixe attractif, nous avons précédemment analysé le comportement du processus pour des valeurs deKfinies et pour des temps finis, c’est-à-dire le comportement intermédiaire entre les deux comportements limites évoqués ci-dessus. La question principale qui nous intéresse est la suivante : si on observe une réalisation du processus, pouvons-nous estimer la résilience au sens de l’ingénieur (engineering resilience) ? Pour répondre à cette question, nous démontrons deux relations entremêlant la résilience, qui est une quantité macroscopique définie pour le système dynamique sous-jacent, et les fluctuations du processus, qui sont, elles, des quantités microscopiques. De tels genres de relations sont bien connus en mécanique statistique hors d’équilibre. Afin d’exploiter ces relations nous introduisons plusieurs estimateurs empiriques que nous parvenons à contrôler pour des temps entre logK, qui est l’échelle de temps pour observer la convergence vers la distribution quasi-stationnaire, etexp(K), qui est l’échelle du temps moyen d’extinction.

2020Mathematics Subject Classification. — 60J28, 92D25.

Keywords. — Birth-and-death process, dynamical system, engineering resilience, quasi-stationary distribution, fluctuation-dissipation relation, empirical estimators.

S. M. has been supported by the Chair “Modélisation Mathématique et Biodiversité” of Veolia Environnement-École Polytechnique-Museum national d’Histoire naturelle-Fondation X. P. C. and S. M. warmly thank the Basal Conicyt CMM AFD170001 project. J.-R. C. and P. C. also acknowl- edge the hospitality of the Instituto de Física de San Luis Potosí.

(3)

Contents

1. Introduction and main results. . . 944

2. Time evolution of moments of the process and moments of the QSD. . . 951

3. Controlling time averages of the estimators. . . 959

4. Fluctuation and correlation relations. . . 964

5. Variance estimates for the estimators. . . 966

Appendix A. Proof of the two variance estimates. . . 970

Appendix B. Counting the number of births. . . 976

Appendix C. Gaussian limit for the rescaled qsd . . . 977

References. . . 979

1. Introduction and main results

1.1. Context and setting. — The ability of an ecosystem to return to its reference state after a perturbation stress is given by its resilience, a concept pioneered by Holling. Resilience has several faces and multiple definitions [5]. In the traditional theoretical setting of dynamical systems, that is, differential equations, one of them is the so-calledengineering resilience. It is concerned with what happens in the vicinity of a fixed point (equilibrium state) of the system, and is given by minus the real part of the dominant eigenvalue of the Jacobian matrix evaluated at the fixed point. It can also be defined as the reciprocal of the characteristic return time to the fixed point after a (small) perturbation. In this paper, we are interested in how to determine the engineering resilience from the data. But which data? The drawback of the notion of engineering resilience is that we do not observe population densities governed by differential equations. Instead, we count individuals which are subject to stochastic fluctuations. Can we nevertheless infer the resilience? The subject of this paper is to show that this is possible in the framework of birth-and-death processes which are, in a sense made precise below, close to the solutions of a corresponding differential equation, at certain time and population size scales.

Let us now describe our framework. We consider a population made of dspecies interacting with one another. Suppose that the state of the process at timet, which we denote byNK(t) = (N1K(t), . . . , NdK(t)), isn= (n1, . . . , ni, . . . , nd)∈Zd+, whereni is the number of individuals of the ith species. Then the rate at which the population increases (respectively decreases) by one individual of the jth species is KBj(n/K) (respectively KDj(n/K)), where K is a scaling parameter. Under the assumptions we will make, the process goes extinct, i.e., 0is an absorbing state, with probability one. There are two limiting regimes for the behavior of this process. The first one is to fix K and let t tend to infinity, which leads inevitably to extinction. The second one consists in fixing a time horizon and lettingK tend to+∞, after having rescaled the process byK. In this limit, the behavior of the rescaled process is governed by a certain differential equation. More precisely, given any 0< tH <+∞ and anyε >0

(4)

andx0∈Rd+, we have

K→+∞lim PbKx0c

sup

06t6tH

dist NK(t)/K, x(t)

> ε

= 0,

where dist(·,·)is the Euclidean distance inRd+, and x(t)is the solution of the differ- ential equation inRd+

dx

dt =B(x)−D(x)

with initial condition x0. We refer to [4, Chap. 11] for a proof. We use the notations x= (x1, . . . , xd),B(x) = (B1(x), . . . , Bd(x)), and so on and so forth. We will make further assumptions (see Subsection 1.4) on the birth and death rates to be in the following situation. The vector field

X =B−D

has a unique attracting fixed pointx(lying in the interior ofRd+). We denote byM its differential evaluated atx, namely

M=DX(x).

We then define the (engineering) resilience as

ρ=−sup{Re(z) :z∈Sp(M)}, where Sp M

denotes the spectrum (set of eigenvalues) of the matrix M. Under our assumptions, we haveρ>0.

We can now formulate more precisely the goal of this paper. Given afinite-length realization of the process(NK(t), t6T), with large, but finite K, we want to build an estimator for ρ. To this end, we need a good understanding of the behavior of the Markov process(NK(t))in an intermediate regime between the two limiting regimes described above. This was done in a previous work of ours [3], and this can be roughly summarized as follows. All statesn6= 0are transient and 0is absorbing, hence the only stationary distribution is the Dirac measure sitting at 0. The mean time to extinction behaves like exp(Θ(K)). (We recall Bachmann-Landau notations below.) If we start in the vicinity of the staten=bKxc, that is, if the initial state has its coordinates of size of order K, then either the process wanders aroundn or it gets absorbed at0. More precisely, there is a unique quasi-stationary distribution (qsd, for short) νK which describes the statistics of the process conditioned to be non-extinct before timet. Without this conditioning, the law of the process at timet is well approximated by a mixture of the Dirac measure at 0 and the qsd νK, for times t ∈ [cKlogK,exp(Θ(K))], where c > 0, in the sense that the total variation distance between them is exponentially small inK, provided thatK is large enough.

We will rely on these results that will be recalled precisely later on. We will also need to prove further properties.

(5)

1.2. Main results. — To estimate the engineering resilienceρ, we will establish a matrix relation involvingM. LetµK = (µK1, . . . , µKd )be the vector of species sizes averaged with respect toνK, that is,

(1.1) µKp =

Z

npK(n), p= 1, . . . , d.

For each τ>0, define the matrix ΣKp,q(τ) =EνK

NpK(τ)−µKp

NqK(0)−µKq

, p, q∈ {1, . . . , d}.

In Section 4.1, we will prove the following result.

Theorem1.1. — For all τ >0 we have

(1.2) ΣK(τ) = eτ MΣK(0) +O √ K

.

Some comments are in order. Ifτ is equal to, say,1/K then the estimate becomes useless. More generally, ifτ is too small theneτ M is too close to the identity matrix.

Moreover, we will show later on thatµKandΣK(τ)are of orderK. Hence the estimate becomes irrelevant if τ becomes proportional to logK. Indeed, without knowing the constant of proportionality,eτ MΣK(0)can be of the same order than the error term.

Before proceeding further, we recall the following classical Bachmann-Landau no- tations.

Notations. — Given a ∈ R, the symbol O(Ka) stands for any real-valued func- tion f(K) such that there exists C > 0 and K0 > 0 such that, for any K > K0,

|f(K)|6CKa. Note in particular thatO(1)will always mean a strictly positive con- stant that we don’t want to specify. Sometimes, we will also use the symbol Θ(Ka) stands for any real-valued functionf(K)such that there existC1, C2>0andK0>0 such that, for any K > K0, C1Ka 6f(K) 6C2Ka. One can naturally generalize Θ(Ka)to vector-valued functions. For instance, for n∈ Rd+ we write n= Θ(Ka)if ni= Θ(Ka)fori= 1, . . . , d.

Relation (1.2) allows to determineM. Indeed, we have (1.3) eτ M= ΣK(τ) ΣK(0)−1+O 1/√

K .

This formula suggests that in order to estimate M, we need estimators for ΣK(0) andΣK(τ). Given a finite-length realization of NK(t),06t6T

up to some time T > 0, we define estimators for µKp and ΣKp,q(τ), for 0 6 τ < T, p, q ∈ {1, . . . , d}, K∈N by

(1.4) Spµ(T, K) = 1

T Z T

0

NpK(s) ds and

(1.5) Sp,qC (T, τ, K) = 1 T−τ

Z T−τ

0

NpK(s+τ)−Spµ(T, K)

NqK(s)−Sqµ(T, K) ds.

(6)

Under suitable conditions on n, K and T, Sµ(T, K) well approximates µK. More precisely, we will prove an estimate of the following form (see Theorem 3.4 for a precise statement)

(1.6) En

Spµ(T, K)

−µKp

6C K+knk1

1 + logK

T + e−c(knk1∧K)+Tec0K for every n ∈ Zd+, p = 1, . . . , d, where c, c0, C are positive constants. We use the notationknk1 =Pd

i=1ni. Let us comment on this bound. Roughly speaking, it can only be useful if T is much smaller than exp(O(1)K) if n is, say, of order K. For instance, suppose that, forKlarge enough, we want the bound to beΘ(K−a), for some a >0. One can check that this is possible ifn= Θ(K)andT = Θ(Ka+1logK). (Note in particular that, in this situation, we have a consistent estimator whenK→+∞.) However, when T becomes exp(O(1)K) or larger, we know thatEn

Spµ(T, K)

≈0, because with high probability, at this time scale the process is absorbed at0. This is the manifestation of the fact that the only stationary distribution is the Dirac measure at0. Consistently, our bound becomes very bad in that regime.

An estimate of the same kind holds forSC(T, τ, K)which well approximatesΣK(τ) in the appropriate ranges.

Remark1.1. — It is possible to use discrete time instead of continuous time in the above averages. Indeed the key results (in particular Proposition 3.3) are obtained for discrete times.

We can now define the empirical matrixMemp (T, τ, K)by eτ Memp (T ,τ,K)=SC(T, τ, K)SC(T,0, K)−1.

We will see later on that, in appropriate regimes, SC(T,0, K) is near ΣK(0) and SC(T, τ, K)is nearΣK(τ)(see Propositions 5.4 and 5.6). The matrixΣK(0)is invert- ible as a covariance matrix of a non-constant vector and isΘ(K)(see Proposition 2.9).

Then (1.2) implies thatΣK(τ)is invertible and the same holds forSC(T, τ, K). These remarks imply that the matrixMemp is well defined.

We define the empirical resilience by ρemp(T, τ, K) =−sup

Re(z) :z∈Sp Memp (T, τ, K) . Our main result (Theorem 5.7) is then the following.

Theorem. — For τ = Θ(1), n = Θ(K) (initial state) and 0 < T exp(Θ(1)K), andK large enough, we have

ρemp(T, τ, K)−ρ

6O(1)K2

√T + 1

√K

with a probability larger than 1−1/K. In particular, ifT >K5, we have ρemp(T, τ, K)−ρ

6O(1)/

√ K.

(7)

Several comments are in order. The dependence on the initial state n is some- what hidden and involved in the fact that the estimates hold “with a probability larger than 1−1/K”. Indeed, the estimate of this probability results from Cheby- shev inequality and variance estimates in which the process is started inn. What the symbol precisely means is not mathematically defined. It means that we need to consider T “much smaller than something exponentially big inK”. Indeed, since we do not control explicitly the various constants appearing in exponential terms inK, we have to considerT which varies on a scale smaller thanexp(Θ(1)K), for instance exp Θ(1)√

K

. The reader is invited to step through the proof of Theorem 5.7 for the more precise, but cumbersome bound we obtain.

1.3. A “fluctuation-dissipation” approach. — The above estimator for the engineer- ing resilience, based on (1.3), is valid for anyd. In the case d= 1(only one species), we have another, simpler, estimator based on a “fluctuation-dissipation relation”. This relation is in fact true for any d and of independent interest. Let DK be the d×d diagonal matrix given by

DKp,p=KBp(x) =KDp(x).

We have the following result. We writeΣK instead ofΣK(0), and the transpose of a matrixM is denoted byM|.

Theorem1.2. — We have

(1.7) MΣK+ ΣKM∗|+ 2DK=O √ K

.

This relation is proved in Section 4.2. For background on fluctuation-dissipation relations in Statistical Physics, we refer to [7, §§2-3]. Note that the matrix ΣK is symmetrical, but in general the matrixMis not (see [3]). Note also that each term in the left-hand side of (1.7) is of orderK, as we will see below.

IfΣK andDK are known, the matrixM is not uniquely defined, except ford= 1 (see for example [8]). For d = 1, (1.7) easily gives the resilience since it becomes a scalar equation:

ρ=K(B(x) +D(x))

K +O 1/√ K

.

Remark 1.2. — The quantity K(B(x) +D(x)) = 2KB(x) is the average total jump rate KνK(B(n/K) +D(n/K)) up to O(1) terms. This follows from a Taylor expansion of B(n/K) +D(n/K)aroundx, Theorem 2.6 and Proposition 2.7.

In the cased= 1, an estimator for DK is

(1.8) SD(T, K) = 1

T (number of births up to timeT).

In Section 5, we establish a bound for

En

SD(T, K)

−KB(x)

(8)

which depends onT, K andknk1, and is small in the relevant regimes. The estimator we use forΣK is

(1.9) SΣ(T, K) = 1 T

Z T

0

NK(s)−Sµ(T, K)

NK(s)−Sµ(T, K) ds.

Again, we can control how well this estimator approximatesΣK. This provides another estimator forρ, with a controlled error.

1.4. Standing assumptions. — The two (regular) vector fields B(x) and D(x) are given inRd+. We assume that their components have second partial derivatives which are polynomially bounded. Obviously, we suppose thatBj(x)>0andDj(x)>0 for allj= 1, . . . , dandx∈Rd+. A dynamical system inRd+ is defined by the vector field X(x) =B(x)−D(x), namely

dx

dt =B(x)−D(x) =X(x).

Forx∈Rd+, we use the following standard norms:

kxk1=

d

X

j=1

xj, kxk2= q

Pd j=1x2j. We now state our hypotheses.

(H.1) The vectors fieldsB andD vanish only at0.

(H.2) There existsxbelonging to the interior ofRd+ (fixed point ofX) such that B(x)−D(x) =X(x) = 0.

(H.3) Attracting fixed point: there exist β >0 andR >0 such thatkxk2 < R, and for allx∈Rd+ withkxk2< R,

(1.10) hX(x),(x−x)i6−βkxk2kx−xk22.

(H.4) The fixed point0of the vector fieldX is repelling (locally unstable). More- over, on the boundary of Rd+, the vector fieldX points toward the interior (except at0).

(H.5) Define

B(y) = supb

kxk1=y d

X

j=1

Bj(x), D(y) =b inf

kxk1=y d

X

j=1

Dj(x) and fory >0, let

F(y) = B(y)b D(y)b . We assume that there exists 0< L < R such that

sup

y>L

F(y)<1/2 and lim

y→+∞F(y) = 0.

(H.6) There exists y0 > 0 such that R

y0 D(y)b −1dy < +∞ and y 7→ D(y)b is increasing on[y0,+∞[.

(9)

(H.7) There existsξ >0such that

(H7) inf

x∈Rd+

inf

16j6d

Dj(x) sup16`6dx`

> ξ.

(H.8) Finally, we assume that

(H8) inf

16j6dxjBj(0)>0.

(By∂xj we mean the partial derivative with respect toxj.)

Assumptions (H.5) and (H.6) ensure that the time for “coming down from infinity”

for the dynamical system is finite. Together with (H.3), this also implies thatx is a globally attracting stable fixed point. More comments on these assumptions can be found in [3].

1.5. A numerical example. — We consider the two-dimensional vector fields B(x1, x2) =

a x1+b x2

e x1+f x2

and D(x1, x2) =

x1 c x1+d x2 x2 g x1+h x2

,

where all the coefficients are positive. This is a model of competition between two species of Lotka-Volterra type. We have taken

a= 0.4569, b= 0.2959, e= 0.5920, f = 0.6449 c= 0.9263, d= 0.9157, g= 0.9971, h= 0.2905.

Assumptions (H.1) and (H.4) are easily verified numerically. Assumptions (H.5) and (H.6) are satisfied becauseB(y)b 6(a+b+e+f)yandD(y)b >(c∧h)y2/4. Concerning (H.2), we checked numerically that there is a unique fixed point inside the positive quadrant, namelyx= (0.3567,1.4855). It remains to check (H.3), namely that

−β = sup{R(x) :x∈R2+}<0, where

R(x) =hX(x),(x−x)i kxk2kx−xk22 .

We first checked that the numeratorN(x) =hX(x),(x−x)iis negative and vanishes only at0 andx. It is easy to check thatN(x)<0 forkxk2 large enough. We have verified numerically that the only solutions of the equations∂x1N =∂x2N = 0in the closed positive quadrant arex andz= (0.1739,0.4361), withN(z) =−0.2852, thus this is a negative local minimum. This implies that N(x)<0 in the closed positive quadrant, except at0andx where it vanishes. This implies thatR60in the closed positive quadrant. It is easy to check that

lim sup

kxk2→+∞

R(x)6−(c∧h)/√ 2.

This implies that R < 0 except perhaps at 0 and x. Near 0 we have by Taylor expansion

R(x) =−hDX(0)x, xi

kxk2kxk22 1 +O(kxk2)

=−hx, D|B(0)xi

kxk2kxk22 1 +O(kxk2)

(10)

and, since the vectorD|B(0)xhas positive components, there exists% >0such that for allx∈R2+

hx, D|B(0)xi>%kxk2.

Ify=x−x is small, we have by Taylor expansion (sinceX(x) = 0) R(x) = hMy, yi

kxk2kyk22 1 +O(kyk2)

=

y,12 M∗|+M y

kxk2kyk22 1 +O(kyk2) . One can check numerically that the two real eigenvalues of the symmetric matrix

M∗|+M

are strictly negative, the largest being numerically equal to−0.786. This completes the verification of hypothesis (H.3).

Illustrating standard experiments on populations of cells or bacteria, we have cho- senK = 105 and simulated a unique realization of the process withT = 100 which contains about5.107 jumps (cell divisions or deaths). The resilience computed from the vector field is numerically equal to 0.547. We have computed ρemp(100,1,105).

The relative error, that is|ρemp(100,1,105)−ρ|/ρ, is equal to0.022.

Note that the situation we are interested in is completely different from standard statistical approach where one can repeat the experiments.

1.6. Organization of the paper. — In Section 2, we will study the time evolution of the moments of the process and we will prove moment estimates for the qsd. In Sec- tion 3, we will obtain control on the large time behavior of averages for the process.

In Section 4, we will prove the relations (1.2) and (1.7). In Section 5, we will apply these relations to obtain approximate expressions of the engineering resilience in terms of the covariance matrices for the qsd. From the results of Section 3, we will deduce variance bounds for the estimators (1.4), (1.5) and (1.8), starting either in the qsd or from an initial condition of orderK.

Acknowledgements. — We thank the two anonymous referees for fruitful comments and suggestions.

2. Time evolution of moments of the process and moments of the QSD 2.1. Time evolution of moments starting from anywhere. — The generatorLK of the birth and death processNK= (NK(t), t>0)is defined by

(2.1) LKf(n)

=K

d

X

`=1

B`(n/K) f(n+e(`))−f(n) +K

d

X

`=1

D`(n/K) f(n−e(`))−f(n) ,

where e(`) = (0, . . . ,0,1,0, . . . ,0), the 1 being at the `-th position, and f :Zd+ →R is a function with bounded support. We denote by(StK, t>0) the semigroup of the processNK acting on bounded functions, that is, forf :Zd+→R, we have

SKt f(n) =E

f(NK(t))

NK(0) =n

=En

f(NK(t)) .

(11)

ForA >1, let

(2.2) TA= inf{t >0 :kNK(t)k1> A}.

Notice that we will use eitherk·k1ork·k2. They are of course equivalent but one can be more convenient than the other, depending on the context. We have the following result.

Theorem2.1. — There exists a constantC(2.1)>0such that forK large enough, the operator groupS1K extends to exponentially bounded functions and

sup

nZd+

S1K ek·k1

(n)6eC(2.1)K. Proof. — Introduce the functionGK defined on[y0,+∞)by

GK(y) = Z

y

dz

D(z)b + 1 KD(y)b .

Assumption (H.6) implies thatGKis well defined and decreasing on[y0,+∞). We can define its inverse function on(0, s0]fors0>0small enough (independent ofK). Take 0< η6s0∧(1− e−1)/4. Then there is a unique positive functionyK defined by (2.3) yK(s) =GK−1(ηs), s∈(0,1].

Note thatyK(s)>y0 andlims↓0yK(s) = +∞. Let ϕK(s) = e−KyK(s)

KD(yb K(s)). Note that

lims↓0ϕK(s) = 0.

Using the Lipschitz continuity ofDb (and then its differentiability almost everywhere) and (2.3), we obtain

˙

ϕK(s) = dϕK

ds (s) =−e−KyK(s)

D(yb K(s))+e−KyK(s)Db0(yK(s)) KD(yb K(s))2

dyK

ds (s) =ηe−KyK(s). We now consider the function

fK(t, n) =ϕK(t) eknk1 to which we apply Itô’s formula at timet∧TA. We get

En

h

ϕK t∧TA

ekNK(t∧TA)k1i

=En

Z t∧TA

0

tfK+LKfK

(s, NK(s)) ds

. We have

tfK(t, n) +LKfK(t, n) = ˙ϕK(t) eknk1 +KϕK(t) eknk1

(e−1)

d

X

`=1

B`(n/K) + (e−1−1)

d

X

`=1

D`(n/K)

.

(12)

Note that

tfK(t, n) +LKfK(t, n) 6eknk1

˙

ϕK(t) +KϕK(t) (e−1)Bb(knk1/K)−(1−e−1)Db(knk1/K) 6eknk1

˙

ϕK(t)−KϕK(t)(1−e−1)Db(knk1/K) 1−eF(knk1/K) . It follows from (H.5) that there exists a number ζ > y0 such that if y > ζ, then F(y)<(2e)−1.

Ifknk1< ζK we get

tfK(t, n) +LKfK(t, n)

6O(1) eζK ϕ˙K(t) +KϕK(t) . Forknk1>K(ζ∨yK(t))we have

tfK(t, n) +LKfK(t, n)60 sinceϕ˙K(t) =ηKD(yb K(t))ϕK(t)andD(knkb 1/K)>D(yb K(t)).

Finally, forζK6knk1< KyK(t)we get ∂tfK(t, n) +LKfK(t, n)

6eKyK(t)ϕ˙K(t) =η.

We deduce that En

h

ϕK 1∧TA

ekNK(1∧TA)k1i

6O(1) eζK.

The result follows by lettingAtend to infinity and by monotonicity.

We deduce moment estimates for the process which are uniform in the starting state, and in time, for times larger than1.

Corollary2.2. — For allt>1, the semi-group(St)maps functions of polynomially bounded modulus in bounded functions. In particular, for all q∈N, we have

(2.4) sup

t>1

sup

nZd+

En

kNK(t)kq1

6qqe−qKqeC(2.1). Proof. — We have

En

kNK(1)kq1

=KqEn

hkNK(1)kq1

Kq e−kNK(1)k1/KekNK(1)k1/Ki 6Kqqqe−qEn

h

ekNK(1)k1/Ki

since for allx>0,xqe−x6qqe−q. Inequality (2.4) follows from Hölder’s inequality and Theorem 2.1. Let us now considert >1. From the Markov property and by using the previous inequality, we deduce that

En

NK(t)

q 1

=En

h

ENK(t−1)

NK(1)

q 1

i

6qqe−qKqeC(2.1).

The proof is finished.

For timestless than 1, the moment estimates depends on the initial state.

(13)

Proposition2.3. — For each integerq, there exists a constant cq >0 such that for all K >1,t>0 andn∈Zd+

En

NK(t)

q 2

6cqKq+knkq21{t<1}.

Proof. — We have only to study the caset <1, the other case being given in (2.4).

We prove the result for q even, namely q = 2q0. The result for q odd follows from Cauchy-Schwarz inequality. Letting

fq0(n) =knk2q2 0 we have

LKfq0(n) =K

d

X

`=1

B`(n/K)

knk22+ 2n`+ 1q0

− knk2q20 +K

d

X

`=1

D`(n/K)

knk22−2n`+ 1q0

− knk2q2 0 .

Using (H.5) and the equivalence of the norms, we see that there exists a constant cq0 >0 such that ifknk2> cq0K

LKfq0(n)<0.

Moreover, we can takecq0 large enough such that for alln LKfq0(n)6cq0K2q0.

Applying Itô’s formula tofq0 we get as in the proof of Theorem 2.1 En

hkNK(t∧TA)k2q2 0i

6knk2q2 0+En

Z t∧TA

0

cq0K2q0ds

6knk2q2 0+t cq0K2q0.

(Recall thatTAis defined in (2.2).) The result follows by lettingAtend to infinity.

2.2. Moments estimates for the qsd. — Let us first recall (cf. [3]) that, under the assumptions of Section 1.4, there exists a unique qsd νK with support Zd+r{0}.

Further, starting from the qsd, the extinction time is distributed according to an exponential law with parameterλ0(K)satisfying ([3, Th. 3.2])

(2.5) e−d1K0(K)6e−d2K,

whered1> d2>0 are constants independent ofK. Recall also that for allt >0, (2.6) PνK NK(t)∈, T0> t

= e−λ0(K)tνK ), where

T0= inf{t >0 :NK(t) = 0}.

Finally, for allf in the domain of the generator

(2.7) LKνK(f) =νK(LKf) =−λ0(K)νK(f)

(14)

with the notation

νK(f) = Z

f(n) dνK(n).

We use several notations from [3] that we now recall. Let n=bKxc.

Forx∈Rd+ and r >0, B(x, r)is the ball of centerxand radius r. We consider the sets

(2.8) ∆ =B n, ρ(4.2)√ K

, D=B n,(minjnj)/2

∩Zd+,

whereρ(4.2)>0is a constant defined in [3, Cor. 4.2]. Note that sincenis of orderK, we have ∆ ⊂D forK large enough. The first entrance time in ∆ (resp.D) will be denoted byT (resp.TD).

We first prove that the support of the qsd is, for large K, almost included in D. (This will be important to control moments later on.)

Proposition2.4. — There exists a constantc(2.4)>0such that for allKlarge enough νK Dc

6e−c(2.4)K.

Proof. — We first recall two results from [3]. From [3, Lem. 1.5], there exist γ > 0 andδ∈(0,1)such that for allKlarge enough

(2.9) sup

n∈∆cr0

Pn T> γlogK, T0> T 6δ.

By [3, Sublem. 5.8], there exist two constants C >0 and c > 0 such that for all K large enough, and for allt >0

(2.10) sup

n∈∆Pn TDc < t

6C 1 +t e−cK. Now, forq∈N r{0}define

tq =qγlogK.

We will first estimatesupnPn NK(tq)∈Dc, T0> tq

. Note thatNK(tq)∈Dcimplies TDc6tq. We distinguish two cases according to whethern∈∆ orn∈∆cr{0}.

Letn∈∆. It follows from (2.10) that Pn NK(tq)∈Dc

6C 1 +tq e−cK. Now letn∈∆cr{0}. We have

Pn NK(tq)∈Dcr{0}

=Pn NK(tq)∈Dcr{0}, T6tq

+Pn NK(tq)∈Dcr{0}, T> tq . Using the strong Markov property at timeT and (2.10) we obtain

Pn NK(tq)∈Dcr{0}, T6tq

=En

h

1{T6tq}PNK(T) NK(tq−T)∈Dcr{0}i 6C(1 +tq) e−cK.

(15)

We bound the second term recursively in q.

Pn T> tq, T0> T

=En

h

1{T>tq−1}1{T0>T}PNK(tq−1) T> t1, T0> T

i

6δ sup

n∈∆cr{0}Pn T> tq−1, T0> T ,

where we used the strong Markov property at timetq−1 and (2.9). This implies sup

n∈∆cr{0}Pn NK(tq)∈Dcr{0}, T> tq

6 sup

n∈∆cr{0}Pn T> tq, T0> T

q. Therefore

sup

n6=0Pn NK(tq)∈Dcr{0}

6C 1 +tq

e−cKq.

Takingq=bKcwe conclude that there exists a constantc0 >0such that forKlarge enough

sup

n6=0Pn NK(tbKc)∈Dcr{0}

6e−c0K. This implies

PνK NK(tbKc)∈Dc, T0> tbKc

6e−c0K but by (2.6)

PνK NK(tbKc)∈Dc, T0> tbKc

= e−λ0(K)tbKcνK Dc

and the result follows from (2.5).

Corollary 2.5. — For each q ∈ N, there exists Cq > 0 such that for all K large enough

Z

Dc

knkq1K(n)6CqKqe−c(2.4)K and Z

knkq1K(n)6CqKq. Proof. — It follows at once from (2.6) (at time1) and Theorem 2.1 that (2.11)

Z

eknk1K(n)6eλ0(K)eC(2.1)K 62 eC(2.1)K forK large enough. We have

Z

Dc

knkq1K(n) =Kq Z

Dc

(knk1/K)qe−knk1/Keknk1/KK(n) 6Kqqqe−q

Z

eknk1/K1Dc(n) dνK(n).

We use Hölder inequality to get Z

Dc

knkq1K(n)6Kqqqe−q Z

eknk1K(n)

1/KZ

1Dc(n) dνK(n) 1−1/K

. The first result follows from (2.11) and Proposition 2.4. The second estimate follows from the first one, and the boundsupn∈Dknk16O(1)K.

We now estimate centered moments.

(16)

Theorem 2.6. — For each q ∈ Z+, there exists Cq > 0 such that for all K large enough

Z

kn−Kxk2q2K(n)6CqKq.

Proof. — The proof consists in a recursion overq. The bound is trivial forq= 0. For q∈Ndefine the function

fq(n) =kn−Kxk2q2 1D1(n), where

D1=B Kx,(2K/3) minjxj

∩Zd+.

Recall thate(j)is the vector with1at thejth coordinate and0elsewhere. From the trivial identity

(2.12) kn−Kx±e(j)k22=kn−Kxk22±2(nj−Kxj) + 1 it follows that

kn−Kx±e(j)k2q2 − kn−Kxk2q2 ±2q(nj−Kxj)kn−Kxk2q−22

63q2q(1 +kn−Kxk2q−22 ).

Indeed, applying the trinomial expansion to (2.12), we obtain kn−Kx±e(j)k2q2 − kn−Kxk2q2 ±2q(nj−Kxj)kn−Kxk2q−22

6q! X

p16q−2 p1+p2+p3=q

kn−Kxk2p2 1(2kn−Kxk2)p2

p1!p2!p3! +qkn−Kxk2q−22 . Observe that if p1 6 q−2, p1 +p2+p3 = q and then 2p1+p2 = p1+q−p3 6 2q−2−p362q−2, sincep3>0. This implies that

kn−Kxk2p2 1(2kn−Kxk2)p2 62q(1 +kn−Kxk2q−22 ).

It follows that

(2.13) LKfq(n) = 2qK

d

X

j=1

Xj(n/K) (nj−Kxj)kn−Kxk2q−22 1D1(n) +Rq(n), where

(2.14) |Rq(n)|6O(1) K6q(1 +kn−Kxk2q−22 )1D1(n) +qK2q+11Dc(n) To get this bound, we used the fact that

sup

j=1,...,d

|1D1(n±e(j))−1D1(n)|61Dc(n).

Using (1.10) we get (2.15) K

d

X

j=1

Xj(n/K) (nj−Kxj)kn−Kxk2q−22 1D1(n)

6−β0kn−Kxk2q2 1D1(n) =−β0fq(n),

(17)

where

β0= β 3min

j xj.

Integrating the equation (2.13) with respect toνK and using (2.7), (2.14), (2.15) and Proposition 2.4, we obtain

(2qβ0−λ0(K))νK(fq)6O(1) K6q(1 +νK(fq−1)) + 6qK2q+1e−c(2.4)K . Observing that νK(f0) 6 1, it follows by recursion over q that, for each integer q, there exists Cq0 >0 such that, for all K large enough, νK(fq) 6Cq0Kq. Finally we have

Z

kn−Kxk2q2K(n) =νK(fq) + Z

kn−Kxk2q2 1D1c(n) dνK(n) 6νK(fq) +

Z

kn−Kxk2q−22 1Dc(n) dνK(n)

sinceD⊂D1. The result follows using the previous estimate and Corollary 2.5.

The next result gives a more precise estimate for the average of n(instead of an error of order√

K).

Proposition2.7. — We have

µK−Kx=O(1),

whereµK is defined in (1.1). Moreover, since kn−Kxk2=O(1), we have

(2.16) µK−n=O(1).

Proof. — Define the functions

gj(n) =hn−Kx, e(j)i, 16j6d.

By Taylor expansion and the polynomial bounds onB andD we get LKgj(n) =K Bj(n/K)−Dj(n/K)

=

d

X

m=1

mBj(x)−∂mDj(x)

gm(n)1D(n) +O(1)kn−Kxk22 K 1D(n) +O(1) (Kp+knkp2)1Dc(n) for some positive integerpindependent ofK. Using Cauchy-Schwarz inequality, iden- tity (2.7), Corollary 2.5 and Proposition 2.4 we get

Z

1 +knkp2

1Dc(n) dνK(n) =o(1).

From Proposition 2.4, Theorem 2.6 and (2.5) we get

d

X

m=1

mBj(x)−∂mDj(x)

νK(gm) =O(1).

The result follows from the invertibility of thed×dmatrix(∂mBj(x)−∂mDj(x)) which follows from (H.3). The other inequalities follow immediately.

(18)

Corollary2.8. — For all K >0, we have kΣKk6

Z

n−µK

2

2K(n) = Z

kn−Kxk22K(n) +O(1)6O(1)K.

Proof. — Combine Proposition 2.7 and Theorem 2.6.

We now show thatΣK is indeed of orderK.

Proposition2.9. — There exist two strictly positive constants c(2.9) and c0(2.9) such that for allK large enough, the matrix ΣK satisfies

ΣK >c(2.9)KId

for the order among positive definite matrices, Id being the identity matrix, and, in particular,

Z

n−µK

2

2K(n)>c0(2.9)K.

Proof. — We denote byΣeK the positive definite matrix ΣeKp,q=

Z

np−np

nq−nq

K(n).

By (2.16) we have

(2.17)

K−ΣK

2=O(1).

Letv be a unit vector inRd. We have hv,ΣeKvi=

Z

hv,(n−n)i2K(n)>

Z

hv,(n−n)i2K(n).

From [3, Lem. 5.3] there exists a constantc >0such that for allK large enough and alln∈∆,

νK({n})>c U({n}), whereU is the uniform distribution on∆. Therefore

hv,ΣeKvi>c Z

hv,(n−n)i2dU(n) and we get

hv,ΣeKvi>c(2.9)Kkvk22.

The result follows.

3. Controlling time averages of the estimators ForT >0, we define the time average of a functionf :Zd+→Rby

(3.1) Sf(T, K) = 1

T Z T

0

f(NK(s)) ds.

The goal of this section is to obtain a control of |Sf(T, K)−νK(f)| for a suitable class of functions.

We recall the following result from [3, Th. 3.1].

(19)

Theorem3.1([3]). — There exista > 0,K0>1 such that, for all t>0 and for all K>K0, we have

(3.2) sup

n∈Zd+r{0}

Pn(NK(t)∈, t < T0)−Pn(t < T0K()

TV 62 e−at/logK. It is also proved in [3] that, for a time much larger thanlogK and much smaller than the extinction time (which is of order exp(Θ(1)K)), the law of the process at timet is close to the qsd. The accuracy of the approximation depends on the initial condition. This suggests to study the distance between the law of the process at timet and the qsd as a function of the initial condition,Kandt. This will result from (3.2) ifPn T06t

can be estimated. In fact we prove a more general result.

Lemma3.2. — For γ > 0, define τγ = inf

t > 0 : kNK(t)k1 6 γK . There exist δ >0,α >0 andC >0 such that for alln∈Zd+,K >1,06γ 61∧α/kxk1 and t>0, we have

(3.3) Pn τγ 6t 6C

exp (−δ(ζ((knk1/K)∧ α)−γkxk1)K)

+ texp (−δ(α−γkxk1)K) , where

(3.4) ζ= min

16j6d xj >0.

Takingγ= 0in (3.3), we get (3.5) Pn T06t

6C exp (−δ(ζ(knk1/K)∧ α)K) + texp(−α δK) .

Proof. — It follows from (H.1) and (H.3) (using Taylor’s expansion ofX(x)near0) that there exists α0 ∈ (0, R) (where R was introduced in Assumption (H.3)) such that for allx∈Rd+ satisfyingkxk20 we have

hX(x), xi>βkxk2kxk2−2βkxk2hx, xi+βkxk32+hX(x), xi

>βkxk2kxk2+O(1)kxk22> βkxk22 2 kxk2. (3.6)

Forα∈(0, α0]andδ >0 to be chosen later on, we define ψ(n) = e−δ(hn,xi∧αK). It is easy to verify that ifhn, xi> α K+kxk2 we have

LKψ(n) = 0.

IfαK− kxk26hn, xi6αK+kxk2 we have

LKψ(n)

6O(K) e−αδK.

Forhn, xi6αK− kxk2, we haveknk16hn, xi/ζ 6αK/ζ, whereζ is defined in (3.4), and

LKψ(n) =Kg(δ, n/K) e−δhn,xi,

Références

Documents relatifs

In the model considered in the present article, due to the (stochastic) evolution of population size, even when the studied diploid population is at Hardy-Weinberg equilibrium

We will show that the process Y preserves the absolutely continuity with respect to µ and that when the process starts from any point measure after any positive time the

In contrast, we will explain how the limiting distribution of the process conditioned to not being absorbed when it is observed can explain some complex behavior, as the

The rate of extinction θ(α) is the highest in this case, equal to 0.100. In fact, the process reaches the upper bound 100 very rarely, so that the behavior of the process is

Indeed, the estimate of this probability results from Chebychev inequality and variance estimates in which the process is started in n.. What the symbol precisely means is

As a continuation of a recent paper, dealing with finite Markov chains, this paper proposes and analyzes a recursive algorithm for the approximation of the quasi-stationary

We provide non-asymptotic estimates on the deadlock time and the state of the system at deadlock for discrete models (Section 4.1) and asymptotic results which hold true for any

Présenté dans quatre salles et trois cabinets, le nouvel accrochage du MAH met en évidence la place de la peinture italienne et espagnole dans l’histoire des collections