Polymer dynamics in the depinned phase: metastability with logarithmic barriers

(1)

HAL Id: hal-00655963

https://hal.archives-ouvertes.fr/hal-00655963

Submitted on 3 Jan 2012

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

with logarithmic barriers

Pietro Caputo, Hubert Lacoin, Fabio Martinelli, François Simenhaus, Fabio

Toninelli

To cite this version:

Pietro Caputo, Hubert Lacoin, Fabio Martinelli, François Simenhaus, Fabio Toninelli. Polymer dy-namics in the depinned phase: metastability with logarithmic barriers. Probability Theory and Re-lated Fields, Springer Verlag, 2011, pp.Online first. �10.1007/s00440-011-0355-6�. �hal-00655963�

(2)

WITH LOGARITHMIC BARRIERS

PIETRO CAPUTO, HUBERT LACOIN, FABIO MARTINELLI, FRANC¸ OIS SIMENHAUS, AND FABIO LUCIO TONINELLI

Abstract. We consider the stochastic evolution of a (1 + 1)-dimensional polymer in the de-pinned regime. At equilibrium the system exhibits a double well structure: the polymer lies (essentially) either above or below the repulsive line. As a consequence, one expects a metastable behavior with rare jumps between the two phases combined with a fast thermalization inside each phase. However, the energy barrier between these two phases is only logarithmic in the system size L and therefore the two relevant time scales are only polynomial in L with no clear-cut separation between them. The whole evolution is governed by a subtle competition between the diffusive behavior inside one phase and the jumps across the energy barriers. Our main results are: (i) a proof that the mixing time of the system lies between L52 _{and L}52+2_{; (ii)}

the identification of two regions associated with the positive and negative phase of the polymer together with the proof of the asymptotic exponentiality of the tunneling time between them with rate equal to a half of the spectral gap.

2000 Mathematics Subject Classification: 60K35, 82C20

Keywords: reversible Markov chains, polymer pinning model, metastability, spectral gap, mixing time, coupling, quasi-stationary distribution.

1. Introduction, model and results

Random polymers are commonly used in statistical mechanics to model a variety of inter-esting physical phenomena. A rich class of models with a non-trivial behavior is obtained by considering a simple random walk path interacting with a defect line in the thermodynamic limit when the length of the path tends to inﬁnity. The equilibrium of these so-called polymer pinning models has been studied in depth in the mathematical literature, and the associated localization/delocalization phase transition is, nowadays, a well understood phenomenon, even in the presence of non-homogeneous interactions; see [9] for a recent survey.

Markovian stochastic dynamics of random pinned polymers, on the other hand, have received much less attention from a mathematical point of view. Besides its importance in bio-physical applications (see e.g. [5, 6] and references therein), the stochastic evolution of polymer models poses new challenging probabilistic problems from many points of view and the connection between the equilibrium and dynamical properties of the model is still largely unexplored. In particular, we feel that the problem of how the polymer relaxes to the stationary distribution (time scales, overcoming of energy barriers, metastability, patterns leading to equilibrium) still lacks a satisfactory solution even in the simplest homogeneous models; see [7] for some initial results in this direction.

In this paper we consider the dynamics of a homogeneous polymer model interacting with a repulsive defect line with two main motivations in mind:

(3)

(i) the repulsive regime is characterized by a relaxation to equilibrium occurring on a time scale certainly much larger [7] than the usual diﬀusive one which is typical e.g. of the neutral case1 _{[21]. The new scale is clearly the result of a subtle competition in the polymer}

evolution between diﬀusive behavior and jumps against energy barriers.

(ii) The whole relaxation mechanism should show certain typical features of metastable evo-lution but in a very atypical context2 _{in which the relevant relaxation time scales are only}

polynomial in the size L of the system (i.e. the energy barriers are only logarithmic in

L), with little separation between the mixing time inside one phase and the global mixing

time. A signature of this fact can be found in the anomalous growth with L of the global mixing time, a growth which is much more rapid than the naive guess based on the usual rule Tmix≈ exp(∆E), with ∆E the so-called activation energy. In order to appreciate the

novelty of such a situation it is useful to compare it to another well known case, namely the Glauber dynamics for the low temperature Ising model in a square box with free bound-ary [14], for which a very precise analysis of the metastable behavior was possible exactly because of a sharp separation, at an exponential level, between the two time scales. 1.1. Dynamics of the polymer pinning model. Let Ω = Ω2L denote the set of all lattice

paths (polymers) starting at 0 and ending at 0 after 2L steps, L∈ N:

Ω ={η ∈ Z2L+1: η_−L= ηL= 0 , ηx+1 = ηx± 1 , x = −L, . . . , L − 1} .

The stochastic dynamics is deﬁned by the natural spin-ﬂip continuous time Markov chain with state space Ω. Namely, sites x = −L + 1, . . . , L − 1 are equipped with independent rate 1 Poisson clocks. When site x rings, the height ηx of the polymer at x is updated according to

the rules: if ηx−1 = ηx+1 ± 2 then do nothing; if ηx−1 = ηx+1 = h, and |h| ̸= 1, then set

ηx = h± 1 with equal probabilities; if ηx−1 = ηx+1 = +1, then set ηx = 0 with probability λ

λ+1 and ηx = 2 with probability

1

λ+1; similarly, if ηx−1 = ηx+1 = −1, then set ηx = 0 with

probability _λ+1λ and ηx = −2 with probability _λ+11 . Here λ > 0 is a parameter describing the

strength of the attraction (λ > 1) or repulsion (λ < 1) between the polymer and the line η ≡ 0. The inﬁnitesimal generator of the Markov chain is given by

Lf(η) = L−1_∑ x=−L+1 rx,+(η) [ f (ηx,+)− f(η)]+ L−1_∑ x=−L+1 rx,−(η) [ f (ηx,−)− f(η)] , (1.1) where: f is a function Ω 7→ R; ηx,± denotes the conﬁguration which coincides with η at every site y ̸= x and equals ηx± 2 at site x; the rates rx,± are zero unless ηx,± ∈ Ω, while if ηx,±∈ Ω

and ηx−1 = ηx+1 = h they satisfy rx,±= 1₂ for h̸= ±1, and rx,∓= _λ+1λ = 1− rx,±, for h =±1.

The process deﬁned above is the heat bath dynamics for the homogeneous polymer pinning

model, with equilibrium measure π = π_2Lλ on Ω deﬁned by

π_2Lλ (η) = λ

N (η)

Z_2Lλ , (1.2)

where N (η) = #{x ∈ {−L + 1, . . . , L − 1} : ηx = 0} denotes the number of zeros in the path

η ∈ Ω and Z_2Lλ = ∑_η′_∈ΩλN (η′). For every λ > 0 and L ∈ N, π = π_2Lλ is the unique reversible invariant measure for the Markov chain.

1_{In the neutral case (absence of an interaction between the polymer and the line) our process is nothing but}

the usual (finite) symmetric simple exclusion model.

2_{Over the years there have been many different formulations of “metastability”; see [18, 20, 4]. We also refer}

to the recent contributions [2, 3] where, as in our case, energy barriers are only logarithmic in the characteristic size of the system. We feel however that our situation does not fit completely in any of the mentioned contexts.

(4)

1.2. Relaxation to equilibrium. The equilibrium properties of the polymer pinning model have been studied in detail, cf. e.g. [10] or [9, Section 2] for an extensive review. In particular, it is well known that, under the measure π_2Lλ , for λ > 1 the path is strongly localized with a non-vanishing density of zeros, while for λ < 1 the path is delocalized with √L height ﬂuctuations

and with the number of zeros stochastically dominated by a geometric random variable with λ-dependent parameter. The dynamical counterpart of this localization/delocalization transition has not been fully understood yet. Some progress in this direction has been reported in [7], where various bounds on the spectral gap and mixing time of the Markov chain were obtained, together with estimates on the decay of time correlations.

We recall that the spectral gap is the smallest nonzero eigenvalue of −L, and one is often interested in the relaxation time Trel= 1/gap which governs decay to equilibrium inL2(π), while

the mixing time Tmix(δ), for δ∈ (0, 1), is the smallest time t such that

max

η ∥Pt(η,·) − π∥ 6 δ , (1.3)

where Pt(η,·) denotes the distribution of the Markov chain at time t with initial state η ∈ Ω, and

∥µ − ν∥ is the usual total variation distance between two probability measures. When δ = 1 2e we

often write simply Tmix instead of Tmix(δ). With these conventions one has Trel 6 Tmix always,

and the inequality is strict in general.

A dynamical phase transition occurs when we move from the localized regime λ > 1 to the delocalized regime λ < 1. It was shown in [7], see Theorem 3.4 and Theorem 3.5 there, that for

λ> 1 one has Trel= O(L2) and Tmix = O(L2log L), while for λ < 1 one has Trel> L

5

2−ε, (1.4)

for all ε > 0, provided L is large enough.

We refer to [7] for results and conjectures concerning the localized regime λ > 1. Here, we consider the delocalized regime, i.e. in the repulsive case λ < 1. The ﬁrst question we address concerns an upper bound on the relaxation time Trel and the mixing time Tmix. It is worth

noting that even a crude polynomial bound is non-trivial. We refer to [15, 7] for polynomial bounds for the model with a horizontal wall at level zero, i.e. when lattice paths are constrained to be non-negative. On the other hand, without the wall constraint, the equilibrium measure π is known to be concentrated, as L → ∞, on configurations in which the density of monomers in the upper (lower) half plane is approximately one (zero). However, a mathematical working definition of the plus or minus phase for the polymer is not so obvious and we have been forced to introduce a mesoscopic parameter ℓ (i.e. L≫ ℓ ≫ 1) and define Ω± by

Ω+ ={η ∈ Ω : ηx> 0 , −L + ℓ < x < L − ℓ} , Ω− =−Ω+, (1.5)

where for any set A of polymer conﬁgurations −A := {η ∈ Ω : −η ∈ A}.

The presence of the two phases associated to Ω±dramatically changes the relaxation scenario, with a bottleneck at the set Ω\ (Ω+∪ Ω−). As explained in [7, Section 6], one may suspect that

Trel∼ L52 is the correct asymptotic behavior in the delocalized regime. Let us brieﬂy recall the

heuristic reasoning behind this prediction.

The time to reach equilibrium can be roughly thought of as the time needed to switch from, say, Ω− to Ω+. A point x such that ηx = 0 and ηx−1 ̸= ηx+1 is called a crossing of the polymer.

Note that any zero (and therefore any crossing) x must belong to the set EL of points in the

segment{−L, . . . , L} which have the same parity as L. Since there are typically very few zeros at equilibrium, one may consider the extreme case where at most one crossing ξ is allowed at all

(5)

L L

L −L −L

−L ξ ξ ξ

Figure 1. From left to right a snapshot sequence of the motion of a single crossing ξ which allows the system to switch from a mostly negative to a mostly positive conﬁguration.

times. In this case, the time evolution of ξ should be essentially described by a suitable birth and death process or random walk on EL; see Figure 1.

From equilibrium considerations, one knows that this random walk should have reversible invariant measure ρ roughly proportional to

ρ(x)∝ L3/2(L + x)−3/2(L− x)−3/2, x∈ EL, (1.6)

and that its relaxation time can be bounded from above and below by constant multiples of

L5/2; see Lemma 2.2 and Lemma 4.6 below for more details. Notice that, although the measure

ρ gives uniformly (in L) positive mass to the two attractors x± =±L, the drift which pushes the random walk away from the saddle x = 0 is proportional to the inverse of the distance from the attractors.

This heuristics is turned into the rigorous bound (1.4) by using a suitable test function in the variational principle that characterizes the spectral gap; see [7, Section 6]. However, it seems very hard to give a rigorous upper bound on Trel of the same order of magnitude. We obtain a

bound that can be oﬀ by at most two powers of L.

Theorem 1.1. For any λ < 1,

lim sup L→∞ log Tmix log L 6 5 2 + 2 .

The main tool for the proof of Theorem 1.1 is the analysis of an eﬀective dynamics for the crossings of the polymer. To describe this, we introduce the variable σ ∈ {−1, +1}OL_{, where}

OL={−L, . . . , L} \ ELdenotes the sites with the same parity of L + 1. If η is a conﬁguration of

the polymer, then ηx ̸= 0 at any x ∈ OL, and we deﬁne σ(η) by σx = sign(ηx). The projection

of π on S = {−1, +1}OL _{is then}

ν(σ) = ∑

η : η∼σ

π(η) , (1.7)

where the sum is over all conﬁgurations η compatible with the signs σ. The ﬁeld ν has non-trivial long range correlations. Consider the heat bath dynamics for the variables σ: sites x∈ OL are

equipped with independent rate 1 Poisson clocks; when site x rings we replace σx by σx′ where

the new sign σ_x′ is distributed according to the conditional probability ν(· | σy, y ̸= x), i.e.

the probability (1.7) conditioned on the value of σy, y̸= x. Denote by T_relS the corresponding

relaxation time. For this process, the exponent 5/2 can be shown to be optimal.

Theorem 1.2. For any λ < 1,

lim L→∞ log T_relS log L = 5 2.

The proof of Theorem 1.1 and Theorem 1.2 combines several diﬀerent tools which play a prominent role in the analysis of convergence to equilibrium of Markov chains: decomposition methods, spectral gap analysis, comparison inequalities, and coupling estimates. An outline of the main steps of the proof is given at the beginning of Section 4.

(6)

1.3. Metastability. Recall the deﬁnition (1.5) of the two sets Ω±, and deﬁne the associated phases as the restricted equilibrium measures π± := π(· | Ω±), so that (cf. Section 2)

π −1

2(π

+_{+ π}−₎ _{= o(1) .} _(1.8)

The notation o(1) refers to asymptotics as L → ∞. In the thermodynamic limit, we expect relaxation to equilibrium within each phase to occur on time scales T_rel± such that Trel ≫ T_rel±,

while on a time scale proportional to Trel one should see the system jump from one phase to the

other according to i.i.d. exponentially distributed times. A strong indication of this metastable behavior comes from the following theorem. Below, we use η(t) to denote the state at time t of the Markov chain with generator (1.1).

Theorem 1.3. There exists a set S+ ⊂ Ω+ such that π(S+) = 1/2 + o(1), and that uniformly

in η ∈ S+ and uniformly in t > 0:

Pη(_τ−_{> t})_{= e}−t/(2Trel) _{+ o(1) ,}

where

τ−= inf{t > 0 : η(t) ∈ S−} , S−=−S+.

Here Pη stands for the law of the process with initial state η ∈ Ω. By symmetry, Theorem 1.3 also implies that uniformly in η∈ S−=−S+ and uniformly in t > 0,

Pη(_τ+_{> t})_{= e}−t/(2Trel) _{+ o(1) ,}

where τ+= inf{t > 0 : η(t) ∈ S+}.

Remark 1.4. From the proof of Theorem 1.3 it will be clear that the set S+ is increasing w.r.t. the natural partial order among polymer configurations defined in Section 2.5, so that in particular the maximal configuration (in the sequel denoted by ∧) is in S+.

If we deﬁne the renormalized process

ωs= 1 ( η(sTrel)∈ Ω+ ) − 1(η(sTrel)∈ Ω− ) ,

we expect that, starting from any conﬁguration in Ω±, {ωs, s > 0} converges to the simple

two-state Markov chain with switching rate (from ±1 to ∓1) equal to 1/2, whose spectral gap equals 1. Such a strong uniform result seems very hard to obtain for our model. The diﬃculty is that, in contrast with familiar metastability results [18], here there is no clear-cut separation of time scales: while (1.4) and Proposition 2.6 below imply Trel ≫ Trel+, the ratio Trel/Trel+ is only

polynomially large in L. However, we do have a detailed description of the renormalized process when the initial condition is the maximal conﬁguration. Namely, deﬁne the maximal element of Ω as ηmax=∧, i.e. ∧x = x + L for x6 0 and ∧x= L− x for x > 0, and let Tmix∧ (ε) denote the

ﬁrst time t such that ∥Pt(∧, ·) − π∥ 6 ε.

Theorem 1.5. For any δ > 0, uniformly in t> L2+δ

Pt(∧, ·) − [ _{1 + e}−t/Trel 2 π +₊1− e−t/Trel 2 π −] _{= o(1),} _(1.9) and uniformly in t> 0 νπ+ t − [ _{1 + e}−t/Trel 2 π +₊1− e−t/Trel 2 π −] _{= o(1)} _(1.10)

(7)

where ν_tπ+ denotes the law at time t of the process started from the initial distribution π+. Moreover, for any ε∈ (0, 1/2) one has

T_mix∧ (ε) = Trellog

( 1 2ε

)

(1 + o(1)). (1.11)

Remark 1.6. Theorem 1.5 shows in particular that, when the dynamics is started from either

π+ or ∧, there is no cut-off phenomenon [13], i.e. the variation distance from equilibrium does not fall abruptly to zero, but rather does so smoothly (on the timescale Trel). That is another

signature of the metastable behavior of our system and it is in contrast with what one expects for the neutral or attractive case λ> 1.

One of the key features of metastability is that, once the system decides to jump from e.g.

S+ to S−, then it does so very quickly on the time scale of the mixing time. We verify that this is indeed the case for most starting conﬁgurations inside S+∪ S−.

Let T denote the random time spent outside S+∪ S− up to the hitting time of S−:

T :=

∫ τ−

0

1_{η(s)∈(S+_∪S−)c_}ds. (1.12) From Theorems 1.3 and 1.5 one easily deduces that, for most initial conditions in S+_{, τ}−_{≫ T :}

Corollary 1.7. There exists a subset ˜S+ of the set S+ of Theorem 1.3 satisfying π( ˜S+) = 1/2 + o(1) such that, uniformly on η∈ ˜S+, T = o(τ−) in probability, i.e. there exists a sequence

δL tending to zero as L→ ∞ such that for every η ∈ ˜S+,

Pη[_{T > δ} Lτ−

]

6 δL. (1.13)

Along the same lines of the proof of the Corollary, one can establish the weak convergence of the renormalized process ωsto the two-state Markov chain, provided that the initial conﬁguration

is inside a suitable subset of ˜S+∪ ˜S− with almost full measure. We decided to omit details for shortness.

1.4. Organization of the paper. The rest of the paper consists of three sections. Section 2 starts with standard material and then proceeds with the introduction of some essential tools to be used in the proof of the main results, including general results for monotone systems that can be of independent interest. This section contains also some new results concerning the relaxation within one phase and the properties of the principal eigenfunction of the generator. The metastability results are discussed in Section 3. Here, we start with the proof of Theorem 1.5. In later subsections we develop the construction needed for the proof of Theorem 1.3. Finally, Section 4 proves Theorem 1.1 and Theorem 1.2. This section is broken into several subsections corresponding to the various steps of the proof. A high level description of the arguments involved is given at the beginning of the section.

Notational conventions. Whenever we write o(Lp) or O(Lp) for some p∈ R it is understood that this refers to the thermodynamic limit L → ∞. Also, we use the notation f(L) = Ω(Lp₎

when there exists a constant c > 0 such that f (L)> c Lp for all suﬃciently large L. For positive functions f, g, we use the notation f (L) ≫ g(L) whenever lim infL→∞f (L)/g(L) = +∞, and

f (L)∼ g(L) when limL→∞f (L)/g(L) = 1. Also, we write f ≍ g if there exists some constant

(8)

2. Some tools

We begin with some generalities about reversible Markov chains. Then, we recall the deﬁnition of the polymer dynamics and derive some consequences of monotonicity. Next, we give some estimates on convergence to equilibrium in the “plus” phase. Finally, we characterize in detail an eigenfunction of L with eigenvalue −gap.

2.1. Preliminaries. We will consider reversible continuous time Markov chains with finite state space X, defined by the infinitesimal generator L acting on functions f : X 7→ R,

[Lf](x) = ∑

y∈X

c(x, y)[f (y)− f(x)] , (2.1) where c(·, ·) is a bounded non-negative function on X × X satisfying π(x)c(x, y) = π(y)c(y, x), for a probability measure π on X. In the applications below, the rates c(x, y) will always be such that the Markov chain is irreducible and the reversible invariant measure π is positive on

X. We refer e.g. to [1, 13] for more details on reversible Markov chains.

Let νx

t =P(vtx ∈ ·) denote the law of the state vtx of the Markov chain at time t with initial

condition x∈ X. We shall investigate the rate of convergence of ν_txto π. If the initial condition x is distributed according to a probability µ on X, we write ν_tµ=∑_x_∈Xµ(x)ν_txfor the distribution at time t. As usual, one can associate a semi-group {Pt, t > 0} to the generator L in such a

way that [Ptf ](x) = [etLf ](x) =

∑

y∈Xνtx(y)f (y). We also use the notation Pt(x, y) = νtx(y),

and ν_tµ= µPt.

The mixing time of the Markov chain is deﬁned by

Tmix(ε) = inf { t > 0 , max x∈X ∥ν x t − π∥ 6 ε } , (2.2) where ∥µ − ν∥ = 1 2 ∑ x |µ(x) − ν(x)|

is the total variation distance. We shall use the convention that Tmix stands for Tmix(_2e1). It is

well known that with this notation one has

∥νx

t − π∥ 6 e−⌊t/Tmix⌋, (2.3)

for all t > 0, where ⌊a⌋ denotes the integer part of a > 0. The spectral gap and the relaxation time of the process are deﬁned by

gap = min f :X7→R E(f, f) Varπ(f ) , Trel= 1 gap, (2.4) where for f : X 7→ R, E(f, f) = ∑ x∈X π(x)f (x)[−Lf](x) = 1 2 ∑ x,y∈X π(x)c(x, y)[f (y)− f(x)]2 (2.5) is the quadratic form of the generator, a.k.a. the Dirichlet form, while Varπ(f ) stands for the

variance π(f2)−π(f)2. Thus, gap is the lowest non-zero eigenvalue of−L. The following bound relating total variation distance and relaxation time is an immediate consequence of reversibility and Schwarz’ inequality:

∥νµ t − π∥ 6 1 2e −t/Trel√_Var π(f ) , (2.6)

(9)

where f (σ) = µ(σ)/π(σ) and µ is a probability on X. Another standard relation between total variation and relaxation time is the identity

gap =− lim

t→∞

1

t log maxx,y ∥ν x t − ν

y

t∥. (2.7)

Combining (2.3), (2.7) and (2.6), one can obtain the following well known relations:

Trel6 Tmix6

(

1− log π_∗)Trel, where π∗= min

x∈Xπ(x) . (2.8)

2.2. A general decomposition bound on the spectral gap. We shall need a continuous time version of a general decomposition bound obtained by Jerrum et al. [11]. Consider the continuous time reversible Markov chain deﬁned by (2.1). Suppose the space X is partitioned in the disjoint union of subspaces X1, . . . , Xm, for some m∈ N and deﬁne the generators

[Lif ](x) =

∑

y∈X

ci(x, y)[f (y)− f(x)] , ci(x, y) = c(x, y)1(y∈ Xi) , x∈ Xi.

Then Li is the generator of the Markov chain restricted to Xi, its reversible invariant measure

being given by πi = π(· | Xi). Let λmin denote the minimum of the spectral gaps of the Markov

chains generated byLi, i = 1, . . . , m. Next, letL denote the inﬁnitesimal generator deﬁned by

[Lφ](i) = m ∑ j=1 ¯ c(i, j)[φ(j)− φ(i)] , for φ∈ Rm, where ¯ c(i, j) = ∑ x∈Xi, y∈Xj π(x| Xi) c(x, y) .

This deﬁnes a continuous time Markov chain on {1, . . . , m} with reversible invariant measure ¯

π(i) = π(Xi). Let ¯λ denote the gap of this chain. A straightforward adaptation of [11, Theorem

1] yields the following estimate.

Proposition 2.1. Define γ = maximaxx∈Xi ∑

y∈X\Xic(x, y) . Then, with the notation of (2.4), gap> min { ¯_λ 3, ¯ λ λmin ¯ λ + 3γ } . (2.9)

2.3. Killed process and quasi-stationary distribution. Here we recall some standard facts about killed processes, their generators and quasi-stationary distributions for reversible Markov chains; we refer to [1] for an introduction. Given a reversible Markov chain with generator L as above and a subset Γ⊂ X, we consider the process killed upon entering Γ, with sub-probability law deﬁned by

ν_tx,Γ(B) =Px(vt∈ B ; τΓ > t) , x∈ Γc, (2.10)

where B⊂ X, vt denotes the state of the Markov chain with generator L at time t, Px denotes

the law of the process started at x, and τΓ denotes the hitting time of the set Γ. The associated

semi-group PΓ

t is given by

[P_tΓf ](x) = [etLΓf ](x) = ∑

y∈Γc

ν_tx,Γ(y)f (y) , x∈ Γc, (2.11) where the killed generator LΓ satisﬁes, for every x∈ Γc:

[LΓf ](x) = [L(f1Γc)](x) = [Lf](x) − ∑

y∈Γ

(10)

We assume that P_tΓis irreducible. ThenLΓ is a negative deﬁnite, self-adjoint operator inL2(π),

and its top eigenvalue −γΓ is characterized by

γΓ= min f :X7→R, f 1Γ=0 ⟨ −LΓ_{f, f}⟩ π π(f2₎ = min_{f :X}_7→R, f 1Γ=0 E(f, f) π(f2₎ , (2.13)

where we use ⟨·, ·⟩_π for the scalar product inL2(π), and E(f, f) is deﬁned by (2.5).

Let gΓ denote the (unique, positive on Γc) eigenfunction ofLΓ associated to −γΓ. Extending gΓ to all x∈ X by setting gΓ(x) = 0 for x∈ Γ, one deﬁnes the quasi-stationary distribution νΓ, i.e. the probability on X given by

νΓ(y) = π(y)gΓ(y)

π(gΓ) , y∈ X . (2.14)

An equivalent characterization of νΓ is as the limit νΓ(B) = lim

t→∞P x_(v

t∈ B | τΓ> t) , (2.15)

where B ⊂ X, and the chosen initial point x ∈ Γc is arbitrary. The fundamental property of the quasi-stationary distribution is that, starting from νΓ, the hitting time τΓ is exponentially

distributed with parameter γΓ:

PνΓ_(τ

Γ> t) = e−γΓt, (2.16)

where PνΓ _{stands for the law of the process when the initial state is distributed according to}

νΓ. Another way of expressing quasi-stationarity is νΓP_tΓ = e−γΓt_νΓ_{, for all t} > 0. A general

property of γΓ (cf. Lemma 3.1 below) is that γΓ> π(Γ) gap.

2.4. Polymer model. Let Ω = Ω2L stand for the space of all lattice paths deﬁned in the

introduction. A partial order in Ω is given by

η6 η′ ⇐⇒ ηx 6 ηx′ , x =−L, . . . , L . (2.17)

Given ζ, ξ ∈ Ω such that ζ 6 ξ we deﬁne the restricted space Ωζ,ξ of all paths η ∈ Ω such that

ζ 6 η 6 ξ. The dynamics is deﬁned by the continuous time Markov chain with state space Ωζ,ξ, with inﬁnitesimal generator Lζ,ξ given by (1.1) where the rates rx,±(η) are replaced by

r_x,ζ,ξ_±(η) = rx,±(η)1(ηx,±∈ Ωζ,ξ) (2.18)

This process is the heat bath dynamics associated to the probability measure πλ,ζ,ξ_2L on Ωζ,ξ deﬁned as in (1.2) with the normalization now given by

Z_2Lλ,ζ,ξ = ∑

η′∈Ωζ,ξ

λN (η′). (2.19)

Equivalently, πλ,ζ,ξ_2L = πλ_2L(· | Ωζ,ξ). This is referred to as the polymer model with top/bottom constraints (ζ is the bottom, ξ is the top). For simplicity, when no confusion arises, we often omit the superscripts λ, ζ, ξ and the subscript L from our notation in what follows. We write

v_tη for the state of the Markov chain at time t when the initial conﬁguration is some η, and let

ν_tη denote its distribution. When the initial condition η is distributed according to a probability measure µ on Ω we write ν_tµ as in Section 2.1.

Note that the generatorLζ,ξcan be written in the form (2.1) by setting c(η, η′) = rζ,ξ_x,_±(η)1(η′ =

ηx,±), and π = πλ,ζ,ξ_2L is reversible. While this holds for every value λ > 0 of the parameter describing the strength of the interaction, we will only consider the case λ < 1 below, which corresponds to a strictly delocalized regime for the polymer.

(11)

The minimal path ∨ and maximal path ∧ for the order (2.17) are deﬁned by ∨x =−x − L

for x 6 0, ∨x = −L + x for x > 0, and ∧ = −∨. Clearly, if ζ = ∨ and ξ = ∧, then Ωζ,ξ = Ω.

This case is referred to as the polymer model with no top/bottom constraint.

The following well known estimates will be often used in our proofs. We refer e.g. to [9, Section 2] for the proof of Lemma 2.2 below, as well as for other known properties of the delocalized equilibrium measure. Let Z2L = Z2Lλ denote the partition function (2.19) with no top/bottom

boundaries and write Z_2L+ = Z_2L+,λ for the partition function (2.19) with ξ = ∧ and ζ given by the minimal non-negative element of Ω, i.e. ζx = 0 if x∈ EL (x has the same parity as L) and

ζx = 1 if x∈ OL (x has opposite parity w.r.t. L); Z2L+ is the partition function of the polymer

with a horizontal wall at height zero. Recall that N = N (η) stands for the number of zeros in the path η lying strictly between −L and L. Considering reﬂections of the path between consecutive zeros one obtains

2Z_2L+,λ= Z_2Lλ/2. (2.20)

Lemma 2.2. Consider the polymer with no top/bottom constraint with λ∈ (0, 1). There exist

constants ci = ci(λ) > 0, i = 1, 2 such that

2−2LZ_2Lλ ∼ c1L−3/2, (2.21)

and

π(N (η) > k)6 c2e−k/c2, ∀ k ∈ N . (2.22)

An immediate implication of (2.20) and (2.21) is that

2−2LZ_2L+,λ∼ c+L−3/2, (2.23)

for some constant c+ > 0 as soon as λ < 2. Moreover, (2.21) and (2.23) imply the bounds π (ηy > 0 ∀y ∈ {−L, . . . , x} , and ηx = 0) ≍ π (ηx = 0)

≍ L3/2_{(L + x)}−3/2_(L_{− x)}−3/2_, _(2.24)

for every x∈ EL.

2.5. Monotonicity. An important property satisﬁed by the Markov chains introduced above is the monotonicity with respect to the partial order (2.17). A convenient way of stating the monotonicity property is that there exists a coupling P of the trajectories of the Markov chains corresponding to distinct initial conditions such that if η6 η′ thenP almost surely vη_t 6 vη_t′ for all t > 0. More generally, one can deﬁne a coupling P of trajectories corresponding to distinct top/bottom constraints and distinct initial conditions such that if ζ 6 ζ′, ξ 6 ξ′, and η 6 η′, thenP almost surely vη;ζ,ξ_t 6 v_tη′;ζ′,ξ′ for all t> 0. Recall that a function f : Ω 7→ R is said to be increasing if f (η)6 f(η′) whenever η 6 η′. An event A is increasing if the indicator function 1A

is increasing. The monotonicity property of the dynamics implies the so-called FKG property of the equilibrium measures π = π_2Lλ,ζ,ξ: for every pair of increasing functions f, g : Ω 7→ R, one has π(f g)> π(f)π(g). We refer to [7, Section 2] for a more detailed discussion of the monotone coupling and the consequences of monotonicity.

Lemma 2.3. Let µ be a probability on Ω and write f (η) = µ(η)/π(η), and ft(η) = νtµ(η)/π(η),

t > 0. If f is increasing then, for every t > 0, ft is increasing. As a consequence, there exists

an increasing event A such that ∥νµ

t − π∥ = ν µ

(12)

Proof. Write ν_tµ(η) = ∑_η₀_∈Ωµ(η0)Pt(η0, η), where Pt(·, ·) stands for the kernel of the Markov

chain. Reversibility then gives

ft(η) = ∑ η0∈Ω f (η0)π(η0)Pt(η0, η)/π(η) = ∑ η0∈Ω f (η0)Pt(η, η0) . (2.26)

Next, let P denote the monotone coupling introduced above and let E denote expectation w.r.t. P. Then, (2.26) coincides with E[f(vη

t)], and if η6 η′, ft(η′)− ft(η) =E[f(vη ′ t )− f(v η t)] =E[f(v η′ t )− f(v η t) ; v η′ t > v η t] .

Thus, ft is increasing whenever f is. Finally, it is well known that the total variation distance

can be written in the form (2.25) where A = {η : ν_tµ(A) > π(A)}. Since A = {ft > 1}, A is

increasing whenever f is.

Lemma 2.4 compares arbitrary initial conditions to the extremal initial conditions. Lemma 2.5 states a useful sub-multiplicativity property satisﬁed by extremal evolutions. For lightness of notation, we state these results only in the case of no top/bottom boundaries, i.e. ζ =∨, ξ = ∧, but the same applies for general ζ, ξ with exactly the same proof.

Lemma 2.4. For any t > 0 and any η, η′ ∈ Ω:

∥νη t − ν η′ t ∥ 6 4L2∥νt∧− νt∨∥ . As a consequence, gap =− lim t→∞ 1 tlog∥ν ∧ t − νt∨∥ .

Proof. Let P denote the monotone coupling as above. Then, ∥νη t − ν η′ t ∥ 6 P(v η t ̸= v η′ t )6 P(v∧t ̸= vt∨) 6 L−1_∑ x=−L+1 L−1_∑ h=−L [P((v_t∧)x > h)− P((vt∨)x> h)] 6 4L2_∥ν∧ t − νt∨∥ .

The second point follows from the ﬁrst one and the classical characterization (2.7) of the spectral

gap.

Lemma 2.5. For any s, t> 0,

∥νt+s∧ − νt+s∨ ∥ 6 ∥νt∧− νt∨∥ ∥νs∧− νs∨∥ .

Proof. With the same argument of Lemma 2.3, for some increasing event A ∥νt+s∧ − νt+s∨ ∥ = νt+s∧ (A)− νt+s∨ (A) .

Let ρ be a coupling beween ν_t∧ and ν_t∨ at ﬁxed time t> 0. Then

ν_t+s∧ (A)− ν_t+s∨ (A) = ∫ (ν_sη(A)− ν_sσ(A))dρ(η, σ) = ∫ (ν_sη(A)− ν_sσ(A))1(σ̸= η)dρ(η, σ) 6 (νs∧(A)− νs∨(A))ρ(σ̸= η) 6 ∥νs∧− νs∨|| ρ(σ ̸= η) .

(13)

2.6. Relaxation in one phase. Here we obtain results concerning the polymer dynamics in the phase π+= π(·|Ω+) with Ω+ defined after (1.5); cf. Proposition 2.6 below. Then, we show that the polymer started at the maximal configuration ∧ relaxes first to the restricted equilibrium

π+ in a time O(L2+δ) for arbitrarily small δ > 0, while it takes much longer to reach the full equilibrium π; cf. Lemma 2.7 and Lemma 2.8 below.

Recall the definition (1.5) of the subspace Ω+ ⊂ Ω, where L ≫ ℓ, and ℓ diverges as L → ∞; see (2.29) below. The corresponding restricted equilibrium is given by π+= π(·|Ω+). Note that this is a particular instance of the polymer equilibrium πλ,ζ,ξ_2L with top/bottom boundaries: the top is ξ = ∧ while the bottom ζ = ζ(Ω+) is the lowest element of Ω+. Similarly, one defines Ω− = −Ω+, i.e. use (1.5) with ηx > 0 replaced by ηx < 0, and the equilibrium π− is defined

accordingly.

Since λ < 1, the equilibrium bounds (2.24) imply that

π(Ω+) = π(Ω−) = 1 2+ O(ℓ

−1/2_{) ;} _(2.27)

see e.g. [7, Section 2]. In particular, if ℓ diverges as L→ ∞, then π−1

2(π

+_{+ π}−₎ _{= o(1) .} _(2.28)

What follows depends only marginally on the precise dependence of ℓ on L, provided that

L≫ ℓ ≫ 1. For the sake of simplicity we shall ﬁx its value as

ℓ(L) =⌊(log L)14⌋ . (2.29)

This choice turns out to be convenient in the proof of Proposition 2.6 below, but we point out that any choice of the form ℓ(L) = O(Lε) for small ε > 0 would be suﬃcient to obtain the same conclusion with a little more work.

We start by establishing a mixing time upper bound for the dynamics constrained to stay in Ω+, i.e. the process evolving with bottom boundary given by ζ = ζ(Ω+). To avoid confusion we shall write µη_t (instead of ν_tη) for the law at time t of this Markov chain with state space Ω+_and

initial condition η. We write L+ for its generator and gap+ for the associated spectral gap.

Proposition 2.6. For every ε > 0, there exists L0 = L0(ε) such that for all L > L0, for all t> 0 and all initial conditions η ∈ Ω+:

∥µη

t − π+∥ 6 4L2exp

(

− t/L2+ε)_. _(2.30)

In particular, gap+> L−2−ε.

Proof. The last statement follows from (2.30) and (2.7). To prove (2.30) we establish that for

every ε > 0, there is a constant L0 = L0(ε) > 0 such that, taking T = L2+ε, we have ∥µ∧T − µ

ζ

T∥ 6 1 − L−ε, (2.31)

for all L > L0(ε), where ζ stands for the minimal element ζ = ζ(Ω+) of Ω+. Once (2.31) is

available, we obtain (2.30) (with a new value of ε) from Lemma 2.4, since Lemma 2.5 (which is also valid for the restricted dynamic) and (2.31) imply

∥µ∧t − µ ζ t∥ 6 ( ∥µ∧T − µ ζ T∥ )_{⌊t/T ⌋} 6 exp(−⌊t/T ⌋L−ε₎)_{6 2 exp}(_{− t/L}2+2ε)_,

for any L large enough.

To prove (2.31), we divide the sites x∈ {−L, . . . , L} in three overlapping regions:

(14)

where ℓ = ℓ(L) is given by (2.29). Let T2 = L2+ε1, T1 = Lε1 with some ε1 > 0 such that T = L2+ε > T′ := 2T2+ T1. We shall prove (2.31) with T replaced by T′ (this implies the

claim since by Lemma 2.4 the left hand side of (2.31) is monotone as a function of T ). Call

µη0,c

T′ the law of the “censored” process obtained as follows. Start from η0 at time 0 and, for

time t ∈ [0, T2] reject all the updates involving x /∈ I2. For time t ∈ (T2, T2 + T1] reject all

updates involving x /∈ I1 ∪ I3, and for time t∈ (T2+ T1, 2T2+ T1 = T′] reject all the updates

involving x /∈ I2. From the Peres-Winkler censoring inequality [19, Theorem 16.5] one has that

µ∧,c_T′ stochastically dominates µ∧T′. Similarly, µ ζ,c

T′ is stochastically dominated by µ ζ

T′. On the

other hand, as in Lemma 2.3, one has

∥µ∧T′ − µ ζ

T′∥ = µ∧T′(A)− µ ζ T′(A) ,

where A⊂ Ω+ is an increasing event. Therefore,

∥µ∧ T′− µ ζ T′∥ 6 µ∧,cT′ (A)− µ ζ,c T′(A)6 ∥µ∧,cT′ − µ ζ,c T′∥ ,

and the lemma follows once we show that

∥µ∧,c_T′ − µζ,c_T′∥ 6 1 − L−ε. (2.32) To prove (2.32) we shall couple the two conﬁgurations η_T∧,c′ , η_Tζ,c′ with law µ∧,c_T′ , µζ,c_T′ respectively. From the analysis of the polymer with the wall [7, Section 4], it is not hard to infer that uniformly in the boundary values at −L + ℓ − 1 and L − ℓ + 1 the system evolving in the region I2 has a mixing time O(L2log L). Therefore, after a time T2 = L2+ε1, up to O(L−p) corrections for a

large constant p > 0, for any event E, µ∧,c_T

2(E) coincides with the equilibrium probability of E

in I2 with boundary conditions ∧ at −L + ℓ − 1 and L − ℓ + 1. The same applies to µζ,c_T₂(E)

provided the equilibrium is taken with boundary conditions ζ at−L+ℓ−1 and L−ℓ+1. Choose the event E that the conﬁguration is minimal (in Ω+) at both−L + ℓ2− 1 and L − ℓ2+ 1 (i.e.

η_−L+ℓ2₋₁ = η_L_−ℓ2₊₁ = 0 if L− ℓ2 is odd, and η_−L+ℓ2₋₁ = η_L_−ℓ2₊₁ = 1 if L− ℓ2 is even). From

known equilibrium estimates [9, Section 2], it is not diﬃcult to show that at equilibrium, with either of the two boundary conditions considered above, the probability of E is bounded below by

c1ℓ−6 for some constant c1> 0 depending only on λ. Therefore, using an independent coupling

in the time-lag [0, T2], we have that the event E occurs for both η_T∧,c₂ , η_Tζ,c₂ with probability at

least c ℓ−12.

Next, conditioned on the event E we see that from time T2 up to time T2 + T1 the two

processes evolve (in the regions I1 and I3 only) with the same boundary conditions (equal to the

minimal conﬁguration at both −L + ℓ2 and L− ℓ2). Since T1 = Lε and the mixing time of the

system in the regions I1 and I3 (which evolve independently) is certainly at most eO(ℓ

2₎

, with probability 1 + O(L−p) (conditionally on event E) we have that the two conﬁgurations coincide in both regions I1, I2 at time T2+ T1. Therefore if we let the system run only in I2 now for an

additional time T2 we have a probability close to 1 to have the two conﬁgurations coinciding

everywhere. It follows that, conditionally on the event E, there is a probability of, say, at least 1/2 of no discrepancy between η_T∧,c′ and ηTζ,c′. Therefore, lettingP denote the coupling described

above, ∥µ∧,c_T′ − µζ,c_T′∥ 6 P(η∧,c_T′ ̸= η_Tζ,c′) 6 P(η∧,cT′ ̸= η ζ,c T′ | E)P(E) + 1 − P(E) 6 1 −1 2P(E) .

(15)

We now go back to the model with no top/bottom boundaries, that is the law ν_tη corresponds to the evolution with ξ =∧, ζ = ∨. The next result is crucially based on estimates obtained in [7, Section 6] for the delocalized regime λ < 1.

Lemma 2.7. Uniformly in t6 L5/2(log L)−9,

ν_t∧(Ω+) = 1 + o(1) .

Proof. Deﬁne the event A ={ ∑L_x=_−Lηx < L3/2(log L)−3 }. Proposition 6.2 in [7] proves that

ν_t∧(A) = o(1) uniformly in t6 L5/2(log L)−9. Since Ω−⊂ A we have

ν_t∧(Ω−) = o(1) , t6 L5/2(log L)−9. (2.33) Next, let us check that

ν_t∧(Ω+| (Ω−)c)> π(Ω+| (Ω−)c) , t> 0 . (2.34) To this end, observe that since (Ω−)c is increasing, using Lemma 2.3 the function

f (σ) = 1(Ω−)c(σ) ν_t∧(σ| (Ω−)c) π(σ| (Ω−)c₎ = 1(Ω−)c(σ) ν_t∧(σ) π(σ) π((Ω−)c) ν_t∧((Ω−)c₎,

is increasing. Since Ω+⊂ (Ω−)c is increasing, with the FKG property for π, this implies (2.34). From (2.33) and (2.34) we obtain

ν_t∧(Ω+)> (1 + o(1))ν_t∧(Ω+| (Ω−)c)> (1 + o(1))π(Ω+| (Ω−)c) = 1 + o(1) ,

where the last bound follows from (2.27).

Lemma 2.8. For any ε > 0, uniformly in t∈ [L2+ε_{, L}5/2_{(log L)}−9_{] :} ∥νt∧− π+∥ = o(1) ,

where π+ is defined by π+= π(· | Ω+).

Proof. Using Lemma 2.7 it is enough to prove

∥νt∧(· | Ω+)− π+∥ = o(1) ,

uniformly in t∈ [L2+ε, L5/2(log L)−9]. Consider the function f : Ω7→ R given by

f (σ) = 1Ω+(σ) ν_t∧(σ| Ω+) π+_(σ) = 1Ω+(σ) ν_t∧(σ) π(σ) π(Ω+) ν_t∧(Ω+₎.

Since Ω+ is increasing, Lemma 2.3 shows that f is increasing. Therefore, the event

A ={σ ∈ Ω+ : ν_t∧(σ| Ω+) > π+(σ)} is increasing. Using monotonicity we have

ν_t∧(A| Ω+) = ν_t∧(A)/ν_t∧(Ω+)6 µ∧_t(A)/ν_t∧(Ω+) ,

where µ∧_t denotes the evolution constrained to stay in Ω+_{; see Proposition 2.6. Therefore,} ∥ν∧

t(· | Ω+)− π+∥ = νt∧(A| Ω+)− π+(A)6

µ∧_t(A)

ν_t∧(Ω+₎ − π +_{(A) .}

The conclusion now follows from Lemma 2.7 and Proposition 2.6. The full power of Lemma 2.8 will be seen in the next sections. One of its consequences is the fact that the mixing time Tmix can be bounded in terms of the relaxation time via

(16)

for some constant c > 0. Indeed, (2.35) follows quite easily from Lemma 2.4, Lemma 2.8 and (2.6); see Lemma 4.1 below for a more subtle application of the same reasoning. Note that, since

Trel≫ L5/2−ε, the bound (2.35) improves considerably the standard estimate (2.8) by replacing the factor− log π_∗ = O(L) with a factor O(log L).

2.7. Characterization of the principal eigenfunction. What follows refers to the model with no top/bottom boundaries. Recall that ∧, (resp. ∨) denotes the maximal (resp. minimal) conﬁguration in Ω. A function g : Ω7→ R is called antisymmetric if g(−η) = −g(η) for all η ∈ Ω. The following result gives a precise characterization of one eigenfunction corresponding to−gap.

Proposition 2.9. There exists an increasing antisymmetric eigenfunction g of L, such that

∥g∥_L2(π)= 1. It satisfies

Lg = −gap g. (2.36)

Moreover, when L tends to infinity

g(∧) = ∥g∥L_∞ = 1 + o(1)

and

∥g − (1Ω+ − 1Ω−)∥_L

1(π)= o(1). (2.37)

Proof. By decomposing 1_∧− 1_∨ on a basis of eigenfunctions of L one sees that

g := lim

t→∞

Pt(1∧− 1∨)

∥Pt(1∧− 1∨)∥_L2(π)

is an eigenfunction with unit L2 norm. It is increasing and antisymmetric as Pt(1∧− 1∨) is

antisymmetric and increasing for all t (Pt preserves monotonicity and symmetries). To prove

(2.36), it suﬃces to show that the projection of 1_∧− 1_∨ on the eigenspace of L associated to

−gap is non-zero. To do so, ﬁrst observe that by reversibility,

1

π(∧)∥Pt(1∧− 1∨)∥L1(π)= 2∥ν

∧

t − νt∨∥.

Then, by the second point of Lemma 2.4 lim t→∞ 1 tlog∥Pt(1∧− 1∨)∥L2(π)= lim_t_→∞ 1 t log∥Pt(1∧− 1∨)∥L1(π) = lim t→∞ 1 t log∥ν ∧ t − νt∨∥ = −gap ,

(where we used equivalence of the norms in ﬁnite dimensional spaces).

We now estimate the L_∞ norm of g. Let ε > 0 be small and ﬁxed, and let t0 be such that L2+ε < t0 < L5/2−ε. The function g is an eigenfunction for Pt0 = exp(t0L), with eigenvalue

e−t0gap_{. Therefore} e−t0gap_g(∧) = E[_g(v∧ t0) ] 6 π+_{(g) + 2g(}_∧)∥ν∧ t0 − π +_∥,

where the last inequality follows from the fact that for any two measures µ, ν and any function

f ,

|µ(f) − ν(f)| 6 2∥f∥L∞∥µ − ν∥ . Hence, by Lemma 2.8 and the fact that gap−1 ≫ t0 (cf. (1.4)):

g(∧) 6 π +_(g) e−t0gap− 2∥ν∧ t0− π +_∥ = π +_{(g)(1 + o(1)).} _(2.38)

(17)

Moreover, by symmetry and Jensen’s inequality 1 = π(g2)> 2 ∑ η∈Ω+ π(η)g(η)2= 2π(Ω+)π+(g2)> 2π(Ω+)π+(g)2, so that π+(g)6 (2π(Ω+))−1/2= 1 + o(1).

Therefore g(∧) 6 1 + o(1) (and it is trivial to notice that ∥g∥_L_∞ > ∥g∥_L₂_(π)= 1).

We turn to the proof of (2.37). First notice that by (2.38) one has π+(g)> (1+o(1))g(∧) > 1+

o(1) so that

π+(g) = 1 + o(1). (2.39)

Next, we prove that the variation of g within Ω+ is small:

Var_π+(g) = o(1). (2.40)

Indeed, Var_π+(g) = π+(g2)− π+(g)2 6 g(∧)2− π+(g)2 and the claim follows from (2.38). Next,

∥g − 1Ω++ 1_Ω−∥_L

1(π)6 ∥1Ω+(g− 1)∥L1(π)+∥1Ω−(g + 1)∥L1(π)+∥g1Ω\(Ω+∪Ω−)∥L1(π).

The ﬁrst two terms of the right-hand side are equal by symmetry. Adding and subtracting

π+(g), and using Schwarz’ inequality,

∥1Ω+(g− 1)∥_L 1(π)6 π(Ω +₎[_|π+_(g)_{− 1| +}√_Var π+(g) ] = o(1) ,

where the conclusion follows from (2.39) and (2.40). The third term ∥g1Ω_\(Ω+_∪Ω−)∥_L₁(π) is

smaller than ∥g∥L_∞π(Ω\ (Ω+∪ Ω−)) = o(1).

3. Metastability

In this section we ﬁrst prove Theorem 1.5, which is mainly a consequence of the technical lemmas of the previous section and then move to the proof of Theorem 1.3 and its corollary. 3.1. Proof of Theorem 1.5. We use the notation T = L2+δ. Equation (1.11) is an easy consequence of (1.9) and (2.28). Indeed, assuming (1.9), for t> T , one has

∥Pt(∧, ·) − π∥ = [_{1 + e}−t/Trel 2 π + ₊ 1− e−t/Trel 2 π −]_{− π} _{+ o(1) =} 1 2e −t/Trel _{+ o(1).} _(3.1)

To prove (1.9), one writes νt∧− [_{1 + e}−t/Trel 2 π +₊1− e−t/Trel 2 π −] 6 ν_T∧Pt−T − π+Pt−T + π+Pt−T − [_{1 + e}−t/Trel 2 π +₊1− e−t/Trel 2 π −] _. _(3.2)

This is just triangular inequality, combined with the observation that ν_T∧Pt−T = νt∧. The ﬁrst

term on the right hand side is smaller than ∥ν_T∧− π+_{∥ (as P}

t−T contracts the norm) which is

itself small, by Lemma 2.8 and the deﬁnition of T . It remains to estimate the second term i.e. to prove (1.10).

To do this, we use the fact that the density of π+ w.r.t. π is very close to g + 1, where g is the eigenfunction described in Proposition 2.9, so that the density of π+Pt must be close to

(18)

Pt(g + 1). Using reversibility, one can express the densities as follows: dπ +_P t dπ = Pt dπ+ dπ . Then we

rewrite the second term in (3.2) as an L1 norm (omitting a harmless factor 1/2)

Pt−T dπ+ dπ − 1 2π(Ω+₎(1Ω++ 1Ω−)− 1 2π(Ω+₎e−t/T rel₍₁ Ω+− 1Ω−) L1(π) 6 Pt−T dπ+ dπ − 1 2π(Ω+₎(1Ω+ + 1Ω−)− 1 2π(Ω+₎e−t/T rel_g L1(π) + 1 2π(Ω+₎∥e −t/Trel₍₁ Ω+ − 1Ω−− g) ∥_L₁(π).

The last term above is small by Proposition 2.9. From (2.27) we know that 2π(Ω+_{) = 1 + o(1).}

One can then estimate the ﬁrst term Pt−Tdπ + dπ − 1 2π(Ω+₎(1Ω++ 1Ω−)− 1 2π(Ω+₎e−t/T rel_g L1(π) 6 ∥ (1Ω++ 1_Ω−)− 1∥_L₁(π)+ Pt (_dπ+ dπ − 1 − g ) L1(π) + o(1) , (3.3) where we used the triangular inequality, the fact that Pt1 = 1, and

Pt−Tg = e−(t−T )/Trelg = (e−t/Trel + o(1))g ,

which follows from T = o(Trel). On the right hand-side of (3.3), the ﬁrst term is small by (2.28)

and the second is bounded by ∥dπ_dπ+ − 1 − g∥_L₁_(π), which is small by Proposition 2.9. 3.2. Proof of Theorem 1.3. Theorem 1.5 gives some intuition on why the result should be true, and it will be used to determine the time of the jump from one state to the other. However, one needs another key ingredient to get the result, namely the description of the quasi-stationary distribution. The reason for this is that starting from the quasi-stationary distribution, a killed process dies exactly at exponential rate; see Section 2.3. Therefore, most of our eﬀort will fo-cus on stochastic comparison with the quasi-stationary distribution. Let us ﬁrst give a brief roadmap to help the reader through the proof of Theorem 1.3.

Step 1. The sets S± of Theorem 1.3 for which we have the desired exponential hitting time description are constructed by successively refining a first attempt. One first defines S0,±as the sets of polymer configurations where the eigenfunction g in Proposition 2.9 is positive (negative) and one verifies that their equilibrium probability is 1₂ + o(1). Then one examines the Dirichlet problem associated to the process killed in S0,− (S0,+) and one proves that the corresponding eigenvalue γ0is of the same order as the spectral gap apart from a crucial unspecified

multiplica-tive factor in [1/2, 1]. Similarly one veriﬁes that the corresponding quasi-stationary measure is very close to the equilibrium measure π conditioned to be in S0,+(S0,−). In this way we get the exponentiality of the hitting time of e.g. S0,− starting from π+ with a rate which is, modulo a multiplicative factor in [1/2, 1], the spectral gap (see Lemma 3.3).

Step 2. Next one appropriately deﬁnes new sets S1,±⊂ S0,±in order to guarantee that this time the corresponding Dirichlet eigenvalue γ1 is equal to (1₂+ o(1))gap, and that the hitting time of S1,∓ starting from equilibrium conditioned to S1,± is exponential (with the correct rate). Again one of the key points is to show that π+ is close to the quasi-stationary distribution associated the process killed on entering S1,−, and that the equilibrium probability of S1,± is still 1₂+ o(1).

(19)

Step 3. Finally, one deﬁnes the sets S2,± ⊂ S1,± in such a way that: a) the hitting time of

S2,∓ starting from any conﬁguration in S2,± (and not just from the conditional equilibrium) is also exponential with the correct rate 1₂gap; b) the equilibrium probability of S2,±is still 1₂+o(1). It is now time to begin the implementation of the above strategy. Let

S0,+:={η ∈ Ω, g(η) > 0} , (3.4) where g is the eigenfunction deﬁned by Proposition 2.9 and set S0,− =−S0,+. From Proposi-tion 2.9,

∥1S0,+− 1Ω+∥_L

1(π)= o(1). (3.5)

In particular, π(S0,±_{) = 1/2 + o(1). Let S}− _{⊂ S}0,− _{be a decreasing event. We consider the}

quasi-stationary distribution ν+ := νS− of the process killed when it hits S−. Let Pt∗ = PS

−

t ,

resp. L∗ = LS−, denote the semi-group, resp. the generator, associated to this process (see Section 2.3), −γS− =−γ be the largest eigenvalue of L∗ and τ− = τS− be the hitting time of

S−. From (2.16):

Pν+

(τ−> t) = e−γt. (3.6) Our ﬁrst step is to prove that if S− has non-negligible measure, then γ is of the same order of the gap. More precisely:

Lemma 3.1. For any S−⊂ S0,−, one has π(S−)6 γ Trel 6 1.

Proof. The bound π(S−) 6 γ Trel is rather standard, but we include its proof for the sake of

completeness. Let f0 = gS− denote the minimizer in the variational principle deﬁning γ = γS−;

see (2.13). Then Varπ(f0) =⟨f0, f0⟩_π− ⟨ f0, 1_(S−₎c ⟩2 π > ⟨f0, f0⟩ππ(S−) ,

where we used the Cauchy-Schwarz inequality for ⟨f0, 1(S−)c ⟩2 π. Therefore γ = E(f0, f0) π(f₀2) > π(S −₎E(f0, f0) Varπ(f0) > π(S −_{) gap .}

As for the bound γ Trel 6 1, γS− being a non-decreasing function of S− (for the inclusion),

it is suﬃcient to prove the result for the maximal case S− = S0,−. Let g be the eigenfunction deﬁned in Proposition 2.9. From (2.12), for all η∈ (S0,−)c

−(L∗g_|

(S0,−)c)(η) =−(Lg)(η) +

∑

η′∈S0,−

c(η, η′)g(η′)6 − (Lg)(η) = gap g(η) ,

where we use the fact that g(η′) < 0 for η′ ∈ S0,−. Plugging this into (2.13), and using

g_| (S0,−)c > 0, one gets γ 6 ⟨ −L∗_g |(S0,−)c, g|(S0,−)c ⟩ π π(g2 |(S0,−)c ) 6 gap . Next, we prove that the quasi-stationary distribution ν+ for the process killed on S− is very close to π+ if S− has probability close to 1/2.

Lemma 3.2. Uniformly for all decreasing events S−⊂ S0,−, ∥ν+_{− π}+_{∥ 6 (2 − 4π(S}−_{)) + o(1) .}

(20)

Proof. We use triangular inequality to get

∥ν+_{− π}+_{∥ 6 ∥ν}+_{− π(· |(S}−₎c₎_{∥ + ∥π}+_{− π(· |(S}−₎c₎_∥. _(3.7)

We start with the ﬁrst term. First, from (2.15) one has the characterization

ν+= lim

t→∞

δ_∧P_t∗ δ_∧P_t∗(Ω).

Since the operator P_t∗preserves monotonicity (S−is decreasing), arguing as in [19, Lemma 16.6], the density d[δ∧Pt∗]

dπ is seen to be an increasing function for every ﬁxed t> 0. Hence, passing to

the limit t→ ∞, dν+/dπ is an increasing function. Therefore, A := { η∈ (S−)c, such that ν +_(η)π((S−₎c₎ π(η) > 1 }

is an increasing event. From standard properties of the total variation distance

∥ν+_{− π(·|(S}−₎c_{∥ = ν}+_(A)_{− π(A |(S}−₎c_).

We shall prove that ν+(A) is smaller than π+(A) + o(1) by the use of monotonicity and a chain of comparisons. Recall the notation T = L2+δ (δ ∈ (0, 1/4)). We ﬁrst compare ν+ to ν+PT:

remark that

ν+PT = ν+PT∗+ ν+(PT − PT∗)

where the two terms of the decomposition are positive measures. From quasi-stationarity one has ν+P_T∗ = e−γTν+ and therefore the total mass of the second term above is 1− e−γT. Hence

∥ν+_P T − ν+∥ = 1 2 d[ν+(PT − PT∗)] dπ − dν+ dπ (1− e −γT₎ L1(π) 6 1 − e−γ T = o(1). (3.8) The last equality comes from Lemma 3.1 and the fact that Trel≫ T . Next, δ∧PT stochastically

dominates ν+PT so that [ν+PT](A)6 [δ∧PT](A). Hence, from Lemma 2.8 and (3.8):

ν+(A)6 ν+PT(A) + o(1)6 δ∧PT(A) + o(1)

6 π+_{(A) +}_∥δ

∧PT − π+∥ + o(1) = π+(A) + o(1).

Therefore, going back to (3.7)

∥ν+_{− π}+_{∥ 6 ν}+_(A)_{− π(A |(S}−₎c_{) +}_∥π+_{− π(· |(S}−₎c₎_∥ _(3.9)

6 π+_(A)_{− π(A |(S}−₎c_{) +}_∥π+_{− π(· |(S}−₎c₎_{∥ + o(1) 6 2∥π}+_{− π(· |(S}−₎c₎_{∥ + o(1) .}

To estimate the right-hand side of (3.9), notice that

∥π+_{− π(· |(S}−₎c₎_{∥ 6 ∥π}+_{− π(· |(S}0,−₎c₎_{∥ + ∥π(· |(S}0,−₎c₎_{− π(· |(S}−₎c₎_{∥ ,} _(3.10)

and the ﬁrst term is o(1) by Proposition 2.9. Moreover, since S−⊂ S0,−

∥π(· |(S0,−₎c₎_{− π(· |(S}−₎c₎_{∥ =} π((S−)c)− π((S0,−)c) π((S−)c₎ = 1/2− π(S −_{) + o(1)} 1− π(S−) 6 1 − 2π(S −_{) + o(1) .} _(3.11)

Combining (3.9), (3.10) and (3.11), the desired result follows. Now one uses the fact that ν+ and π+ are close in total variation distance to estimate the jumping time to S− starting from either∧ or from π+. For the rest of this section, one deﬁnes, in analogy with τ−, the hitting times τi,− (resp. τi,+), (i = 0, 1, 2) of the sets Si,− (resp. Si,+) to be deﬁned.