HAL Id: hal-03319835
https://hal.archives-ouvertes.fr/hal-03319835
Preprint submitted on 13 Aug 2021
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
large deviation prefactors for non-equilibrium dynamics through matrix Riccati equations
Freddy Bouchet, Julien Reygner
To cite this version:
Freddy Bouchet, Julien Reygner. Path integral derivation and numerical computation of large devia- tion prefactors for non-equilibrium dynamics through matrix Riccati equations. 2021. �hal-03319835�
Freddy Bouchet · Julien Reygner
Path integral derivation and numerical
computation of large deviation prefactors for non-equilibrium dynamics through matrix Riccati equations
Abstract For many non-equilibrium dynamics driven by small noise, in physics, chemistry, biology, or economy, rare events do matter. Large deviation theory then explains that the leading order term of the main statistical quantities have an ex- ponential behavior. The exponential rate is often obtained as the infimum of an action, which is minimized along an instanton. In this paper, we consider the computation of the next order sub-exponential prefactors, which are crucial for a large number of applications. Following a path integral approach, we derive the dynamics of the Gaussian fluctuations around the instanton and compute from it the sub-exponential prefactors. As might be expected, the formalism leads to the computation of functional determinants and matrix Riccati equations. By contrast with the cases of equilibrium dynamics with detailed balance or generalized de- tailed balance, we stress the specific non locality of the solutions of the Riccati equation: the prefactors depend on fluctuations all along the instanton and not just at its starting and ending points. We explain how to numerically compute the pref- actors. The case of statistically stationary quantities requires considerations of non trivial initial conditions for the matrix Riccati equation.
Keywords Non-equilibrium statistical physics·Rare events·Large deviation theory·Arrhenius law ·Sub-exponential prefactors·Stochastic differential equations
F. Bouchet
Univ Lyon, Ens de Lyon, Univ Claude Bernard, CNRS, Laboratoire de Physique, Lyon, France E-mail: [email protected]
J. Reygner
CERMICS, Ecole des Ponts, Marne-la-Vall´ee, France E-mail: [email protected]
1 Introduction
1.1 Rare events, instantons and sub-exponential prefactors
Many systems in physics, chemistry, economics or biology can be described by stochastic differential equations with small noise. In such cases many statisti- cal quantities, for instance the invariant distribution, first exit times, mean first passage times or transition probabilities have asymptotic exponential behavior Cεexp(−I/ε), whereI is the exponential rate,ε is the noise amplitude, andCε is a sub-exponential prefactor (see below for a more precise definition).
This mathematical remark has profound consequences in physics. The most classical examples are the definitions of thermodynamic potentials which are rates in this sense, or the Arrhenius law. Besides static properties, from a dynamical perspective, when conditioned on the occurence of a rare event, path probabilities often concentrate close to a predictable path, called instanton. This is a key and fascinating property for the dynamics of rare events and of their impact, which was first observed in statistical physics, for the nucleation of a classical supersatu- rated vapor [27]. Soon after, a similar concentration of path probabilities has been studied in gauge field theories [10, 40], for instance for the Yang–Mills theory. In- stantons continue to have number of applications in modern statistical physics, for instance to describe excitation chains at the glass transition [26], reaction paths in chemistry [24], escape of Brownian particles in soft matter [39].
The computation of the rate I is the subject of classical techniques using Laplace asymptotics, for instance in classical or path integrals [10, 40, 22]. At the mathematical level, this is the subject of large deviation theory, see for instance the Freidlin–Wentzell theory for small noise large deviations [17]. However, for most genuine applications, computing the rateIis not sufficient and a proper estimation of the sub-exponential prefactorCε, or of its asymptotic behavior in the limit of small noiseε↓0, is required. From a field theory perspective, such computations require the estimation of the path integrals at next to leading order. Such com- putations are very classical in the field theory context and involve the estimation of expectations over Gaussian processes, for which solutions of Riccati equations are needed. Several classical tools and approaches have been devised in quantum field theory and for equilibrium problems, often on a case by case basis (see for instance [8, 40] as examples among many others).
Recently, rare events, instantons, transitions rates, have been studied in far from equilibrium systems and non-equilibrium steady states, were one starts from dynamics without detailed balance. The statistical mechanics approaches have then be extended to scientific fields so far unexpected. For instance rare events, instantons, and related concepts have been used in turbulence [18, 28, 19, 6, 11], atmosphere dynamics [6, 36], climate dynamics [33], astronomy [38, 1], among many other examples. Moreover, a large effort has been pursued to develop dedi- cated numerical approches to compute the related instantons [21].
For all of these cases, it is essential to go beyond the computation of the expo- nential rates and to compute the sub-exponential prefactors. Then, although for- mally classical ideas still apply, several simplifications related to equilibrium dy- namics no more occur. Often, the extent of possible analytic simplifications is limited, and one has to rely more on numerical simulations for actual computa-
tions. The aim of this paper is to develop the theoretical analysis at a level were it will be useful for devising numerical algorithms. We will thus consider stochastic differential equations with small noise, and develop the formalism and show the potential for numerical computations. The numerical computation will be illus- trated on a simple example.
At a technical level, we will derive the needed matrix Riccati equations, dis- cuss their initial conditions which are not trivial, for instance in the case of sta- tistically stationary quantities, and explain the relation between the matrix Riccati equations and the quantities of interest.
1.2 Setting
In this article, we consider systems described by the stochastic differential equa- tion
dXsε=b(Xsε)ds+√
2εdWs, s≥0, (1)
inRd, whereb:Rd→Rdis a smooth vector field,Wis ad-dimensional Brownian motion, andε>0 can be interpreted as a temperature parameter. In general, the diffusion processXεdefined by (1) is not reversible, so that it serves as a model for physical systems driven out of equilibrium. In particular, when the process pos- sesses a stationary distributionPε — called thenon-equilibrium steady state, the latter may not be explicit. Still, the Freidlin–Wentzell theory [17] asserts that in theε↓0 limit and under suitable assumptions,Pε satisfies a large deviation prin- ciple, with a rate functionV called thequasipotential. We shall formally denote by
limε↓0−εlogPε(x) =V(x) (2) this large deviation principle, and we refer to [12] for standard material on large deviation theory. The quasipotential is known to play the role of an entropy func- tion for non-equilibrium models [17, 22, 3].
Let us define the large deviationprefactor Cε(x)to the non-equilibrium steady statePεby the identity
Pε(x) =Cε(x)exp
−V(x) ε
. (3)
An equivalent formulation of (2) is the assertion that limε↓0εlogCε(x) =0, so that no precise information on the prefactor can be obtained merely from large deviation theory. However, combining notions from this theory with a WKB ap- proximation, we derived an asymptotic equivalent of the prefactor in [5], recalled in Equation (17) below.
The purpose of the present article is to continue our study of the prefactor, by proposing an alternative method to obtain (17), based on the path integral for- mulation, and presenting a numerical method to effectively compute the terms appearing in this formula. Both tasks rely on the study of matrix-valued Riccati equations satisfied by quantities related to the fluctuations of the diffusion process Xεaround given deterministic paths.
1.3 Organization of the paper
In Section 2, we state the asymptotic equivalent of the prefactor obtained in [5] and recall the notions of large deviation theory involved in this formula. In Section 3, we describe an alternative method leading to the same formula, which is based on the path integral formulation of the non-equilibrium steady state and establishes a connection between the prefactor and the fluctuations of the processXε around its most probable path. The relation with sharp asymptotics for mean exit times is discussed in Section 4. Section 5 is dedicated to the numerical computation of the asymptotic equivalent of the prefactor, which we apply on an illustrative example in Section 6.
1.4 Notations
The Euclidian norm and inner product ofRdare respectively denoted byk · kand h·,·i. We writeIdfor the identity matrix of sized×d. Given a smooth functionf : Rd→R(typically,Vorbk,ℓk), we denote by∇f and∇2fthe gradient and Hessian matrix of f. Similarly, given a smooth vector fieldF:Rd→Rd(typicallyborℓ),
∇F(x)refers to the matrix with coefficient∂jFi(x)at thei-th row and j-th column, and∇2F(x)(y,y)refers to the vector whosei-th coordinate ishy,∇2Fi(x)yi. The symbols∆and div refer to the usual Laplacian and divergence operators of scalar functions. The closure of an open setD⊂Rdis denoted by ¯D.
2 Prefactor for the stationary distribution
In this section, we first precise the assumptions under which we shall work. We then provide a very brief summary of the main notions from the Freidlin–Wentzell theory on which we shall rely. Finally, we provide an asymptotic equivalent for the prefactorCε(x), obtained in [5] through a WKB approximation.
2.1 Assumptions on the vector fieldb
Throughout this section, we make the following assumptions:
(A1) the deterministic system ˙x=b(x)possesses a unique equilibrium point ¯x∈Rd, which attracts all the trajectories,
(A2) for allε>0, the SDE (1) possesses a unique stationary measurePε.
Remark 1 Assumption (A1) allows to streamline the exposition of our results. In the more general case where the deterministic system ˙x=b(x)possesses several (isolated) equilibrium points, our arguments can be adapted and produce state- ments that are valid in the neighborhood of the equilibrium points.
2.2 Action functional and quasipotential
We recall a few notions from the Freidlin–Wentzell theory [17, Chapter 4] that will be useful in the paper.
Theaction functionalfor the stochastic differential equation (1) is defined for a trajectoryφ= (φs)t1≤s≤t2on the time interval[t1,t2]by
At
1,t2[φ] = (Rt
s=t21L(φs,φ˙s)ds ifφ is absolutely continuous,
+∞ otherwise, (4)
where the LagrangianL writes
∀(x,v)∈Rd×Rd, L(x,v) =1
4kv−b(x)k2. (5) It describes the large deviations of the trajectory(Xsε)s∈[t1,t2]whenε↓0 [17, The- orem 1.1, p. 86].
Thequasipotentialwith respect to ¯xis defined by the variational formula V(x) =inf{At
1,t2[φ],φt1=x,¯ φt2=x,t1<t2}, (6) where the infimum runs over all finite time intervals[t1,t2]and trajectoriesφ= (φs)t1≤s≤t2. The quasipotential is nonnegative. We shall furthermore assume that it solves the Hamilton–Jacobi equation
∀x∈Rd, h∇V(x),∇V(x)i+hb(x),∇V(x)i=0, (7) see the discussion in [17, pp. 100–101]. Equivalently, the vector fieldℓdefined by
b(x) =−∇V(x) +ℓ(x) (8)
satisfies
∀x∈Rd, h∇V(x), ℓ(x)i=0. (9) Then, the quasipotential satisfies the identity
V(x) = Z 0
s=−∞
L(ϕsx,ϕ˙sx)ds=A−∞,0[ϕx], (10) where ϕx = (ϕsx)s≤0 is the minimum action path joining ¯x tox, defined as the unique solution to the backward Cauchy problem
(ϕ˙sx=∇V(ϕsx) +ℓ(ϕsx), s≤0, ϕ0x=x, lim
s→−∞ϕsx=x.¯ (11)
In the sequel of the paper, we shall callϕxthefluctuation path, as it describes the most probable path followed by a fluctuation of the diffusion processXεjoining ¯x tox.
Remark 2 Assuming that the quasipotentialV is smooth enough and differentiat- ing the equalityh∇V, ℓi=0 twice, we get the identity
d k=1
∑
(∂k∇2V)ℓk+∇2V∇ℓ+∇ℓ⊤∇2V+
d k=1
∑
∂kV∇2ℓk=0, (12) which shall be useful in the course of the paper.
2.3 WKB derivation of the formula for the prefactor
Injecting both the decomposition (8) of the vector fieldband the ansatz (3) in the stationary Fokker–Planck equation
ε∆Pε−div(bPε) =0 (13)
shows thatPεis equal to the Gibbs measure PGibbsε (x) = 1
Zεexp
−V(x) ε
, Zε=
Z
x∈Rdexp
−V(x) ε
dx, (14) if and only if the condition
∀x∈Rd, divℓ(x) =0 (15)
holds. In this case, the prefactorCεdoes not depend onxand the Laplace approx- imation forZε provides the asymptotic equivalence
Cεε≃
↓0
sdet∇2V(x)¯
(2πε)d (16)
as soon as∇2V(x)¯ is positive-definite.
If the condition (15) does not hold, an expansion in powers ofε can be per- formed in (13), following the usual WKB approximation method. This leads to the prefactor equivalent
Cε(x)ε≃
↓0
s
det∇2V(x)¯ (2πε)d exp
− Z0
s=−∞divℓ(ϕsx)ds
, (17)
which was derived in [5, Section 3], see also previous results in [9, 31, 32, 35]. In accordance with the discussion above, the exponential term somehow describes the accumulation of ‘non-Gibbsianness’ of the stationary measure along the fluc- tuation pathϕx. We shall call this term thenon-Gibbsianness correction.
3 Derivation of the formula from the path integral formulation
In this section, we still work under Assumptions (A1) and (A2), and describe an alternative method to the WKB approximation in order to derive the formula (17) for the prefactor to the stationary distribution Pε. The method is based on the path integral formulation ofPε, in which a Laplace approximation is performed, leading to the computation of an infinite-dimensional Gaussian integral. The lat- ter involves the fluctuations of the processXε around the pathϕx, and thanks to the Feynman–Kac formula, it is computed by solving a backward matrix Riccati equation.
Although it is certainly less straightforward than the direct WKB approxima- tion sketched in the previous section, the method presented here has the conceptual advantage to establish a connection between the prefactor, written under the form of a functional determinant, and the process of fluctuations ofXεaround the path ϕx. The latter process was recently studied in the mathematical literature [30, 34].
3.1 Laplace approximation in the path integral formulation
When X0ε =x0∈Rd, the probability forXtε to be close to x∈Rd writes in the formal path integral formulation
Px0[Xtε≃x] = Z φt=x
φ0=x0
D[φ]exp
−A0,t[φ]
ε
, (18)
where we used Ito’s convention to approximate the path integral. As a conse- quence, the stationary distributionPε writes
Pε(x) = lim
t→+∞
Z φt=x φ0=x0
D[φ]exp
−A0,t[φ] ε
= lim
t→−∞ Z φ0=x
φt=x0
D[φ]exp
−At,0[φ] ε
,
(19)
where at the second line we considered trajectories defined on[t,0],t<0 rather than on[0,t],t>0. Givent<0 andx∈Rd, we therefore introduce the notation
pεt(x) = Z φ0=x
φt=x0
D[φ]exp
−At,0[φ] ε
. (20)
Notice that thet→ −∞limit of this quantity should not depend onx0, hence from now on we fixx0=x.¯
The second-order expansion of the action functional on[t,0]around the fluc- tuation path defined by (11) writes
At,0[φ]≃At,0[ϕx] +δAt,0
δ φ [ϕx](φ−ϕx) +1 2
δ2At,0
δ φ2 [ϕx](φ−ϕx,φ−ϕx). (21) In thet→ −∞limit, the first-order term vanishes because of the optimality con- dition onϕx. The second-order term rewrites
1 2
δ2At,0
δ φ2 [ϕx](δ φ,δ φ)
=1 4
Z0
s=t kδφ˙s−∇b(ϕsx)δ φsk2− hϕ˙sx−b(ϕsx),∇2b(ϕsx)(δ φs,δ φs)i ds
=1 4
Z0
s=t kδφ˙s+Qxsδ φsk2+2hδ φs,Rxsδ φsi ds,
(22)
where the matricesQxsandRxsare defined, fors≤0, by Qxs=−∇b(ϕsx), Rxs=−
d k=1
∑
∂kV(ϕsx)∇2bk(ϕsx). (23) At this stage, let us anticipate on the numerical discussion of Section 5, and point out that rewriting
∇V(ϕsx) = 1
2(ϕ˙sx−b(ϕsx)), (24)
thanks to (11), shows that the matricesQxsandRxscan be computed from the mere knowledge of the vector fieldbwith its space derivatives, and the fluctuation path ϕx together with its time derivative. In particular, it is not necessary to compute neither the quasipotential nor its space derivatives along the fluctuation path.
As a consequence of the second-order expansion of the action functional, the Laplace approximation yields, forε>0 small but fixed,
pεt(x)t→−≃∞exp
−At,0[ϕx] ε
× Z δ φ0=0
δ φt=x¯−ϕtx
D[δ φ]exp
− 1 4ε
Z 0
s=t kδφ˙s+Qxsδ φsk2+2hδ φs,Rxsδ φsi ds
. (25) By (10), in the t → −∞ limit, At,0[ϕx] converges to the quasipotentialV(x), while (11) shows that ¯x−ϕtxvanishes. Therefore the prefactorCε(x)defined by (3) is equivalent, whenε↓0, to thet→ −∞limit of the path integral
Z δ φ0=0 δ φt=0
D[δ φ]exp
− 1 4ε
Z 0
s=t kδφ˙s+Qxsδ φsk2+2hδ φs,Rxsδ φsi ds
= 1 εd/2
Zδ φ0=0 δ φt=0
D[δ φ]exp
−1 4
Z 0
s=t kδφ˙s+Qxsδ φsk2+2hδ φs,Rxsδ φsi ds
(26) after rescalingδ φ by the factor√ε. Since the term appearing in the time inte- gral above is quadratic inδ φ, the path integral is an infinite-dimensional Gaussian integral, the computation of which usually reduces to the computation of a func- tional determinant. In the next paragraph, we adopt a different strategy to compute its value, based on the use of the Feynman–Kac formula.
3.2 Feynman–Kac formula and backward matrix Riccati equation Let us define, for allt≤0 andx,y∈Rd,
u(x;t,y) = Z δ φ0=0
δ φt=y
D[δ φ]exp
−1 4
Z 0
s=t kδφ˙s+Qxsδ φsk2+2hδ φs,Rxsδ φsi ds
, (27) so that
Cε(x)≃
ε↓0
1 εd/2 lim
t→−∞u(x;t,0). (28)
The path integral in (27) writes as the expectation u(x;t,y) =Et,y
exp
−1 2
Z 0
s=thYsx,RxsYsxids
δ0(Y0ε)
, (29)
where(Ysx)s≤0is the diffusion process defined by dYsx=−QxsYsxds+√
2dWs, s≤0, (30)
andEt,y[·]refers to the expectation under whichYtx=ywhileδ0denotes the Dirac distribution in 0. The linear process(Ysx)s≤0describes the scaled Gaussian fluctu- ations of the original process around(ϕsx)s≤0.
By the Feynman–Kac formula, the functionu(x;t,y)is the solution to the back- ward parabolic problem
−∂tu(x;t,y) =∆yu(x;t,y)− hQxty,∇yu(x;t,y)i −1
2hy,Rxtyiu(x;t,y), t<0, u(x; 0,y) =δ0(y).
(31) Since the diffusion process (Ysx)s≤0 describes Gaussian fluctuations around the fluctuation path, we shall look for a solution to (31) with the Gaussian ansatz
u(x;t,y) = 1
p(4π)dηtxexp
−hy,Ktxyi 4
, (32)
where, for allt <0, ηtx>0 and the matrixKtx is symmetric. Furthermore, the conditionu(x; 0,y) =δ0(y)implies the limit behavior
limt↑0(Ktx)−1=0, lim
t↑0ηtx=0, lim
t↑0ηtxdetKtx=1. (33) Injecting the ansatz (32) into the problem (31), we get the system of ordinary differential equations
∀t<0,
(η˙tx=−ηtxtrKtx,
K˙tx= (Ktx)2+Qtx⊤Ktx+KtxQtx−2Rtx. (34) The second equation is a backward matrix Riccati equation, it is solved in Ap- pendix A.2. We obtain the explicit result
∀t<0, Ktx=−2∇2V(ϕtx) + (Ztx)−1, (35) with
Ztx= Z0
s=t
exp
Zt
r=sAxrdr exp Zt
r=sAxrdr ⊤
ds, Axr=∇2V(ϕrx) +∇ℓ(ϕrx),
(36) and⌊exp(·)⌋denotes the time ordered exponential of matrices, the definition and a few properties of which are recalled in Appendix A.1. From (35) we deduce in Appendix A.3 that
t→−lim∞ηtx= 1
2ddet(∇2V(x))¯ exp
2 Z 0
s=−∞divℓ(ϕsx)ds
. (37)
Combining this result with the identities (28) and (32) allows to recover the for- mula (17). As a conclusive statement, we write the asymptotic equivalence for the non-equilibrium steady state
Pε(x)ε≃
↓0
s
det∇2V(x)¯ (2πε)d exp
−V(x) ε −
Z 0
s=−∞divℓ(ϕsx)ds
. (38)
4 Application to mean exit times
LetD⊂Rd be a domain containing an equilibrium point ¯xof the deterministic system ˙x=b(x). Let us define theexit timefromDby
τDε =inf{s≥0 :Xsε6∈D}. (39) Under suitable assumptions onD, the Freidlin–Wentzell theory asserts that
limε↓0εlogEx¯[τDε] = inf
y∈∂DV(y), (40)
where the subscript ¯xindicates thatX0ε=x, and¯ V still refers to the quasipotential with respect to ¯xdefined by (6).
When the deterministic system ˙x=b(x)possesses several stable equilibrium points and D denotes the basin of attraction of one of these points ¯x, then the diffusion process(Xsε)s≥0is calledmetastable, and the logarithmic equivalent (40) is called theArrhenius law.
In any case, the associatedprefactor LεDto the mean exit time fromDis defined by
Ex¯[τDε] =LεDexp 1
ε inf
y∈∂DV(y)
. (41)
In this section, we express this prefactor in terms of the functionCε(x)computed in (17).
4.1 Domain with a noncharacteristic boundary
In this subsection, we assume that Dis an open, smooth and connected subset ofRd satisfying the following conditions, where we denote byn(y)the exterior normal vector aty∈∂D.
(B1) The deterministic system ˙x=b(x)possesses a unique equilibrium point ¯xin D, which attracts all the trajectories started fromD, andhb(y),n(y)i<0 for ally∈∂D.
(B2) The functionV isC1inD, andh∇V(y),n(y)i>0 for ally∈∂D.
Under these assumptions, we proceed as in Subsection 2.2 and define the vector fieldℓ: ¯D→Rdby the identityb=−∇V+ℓ. Thenℓstill satisfies the orthogonal- ity relation (9).
(B3) The minimum ofV over∂Dis reached at a single pointy∗, at which
µ∗=h∇V(y∗) +ℓ(y∗),n(y∗)i>0, (42) and the quadratic formh∗:ξ7→ hξ,∇2V(y∗)ξihas positive eigenvalues on the hyperplanen(y∗)⊥={ξ∈Rd:hξ,n(y∗)i=0}.
By Assumption (B1), [17, Theorem 4.1, p. 106] applies and yields (40). In [5, Eq. (4.25)], the integral formula
λDε = Z
y∈∂Dh∇V(y) +ℓ(y),n(y)iCε(y)exp
−V(y) ε
dy (43)
was derived for theexit rateλDε=Ex¯[τDε]−1. Using the second-order expansion of V in the neighborhood ofy∗in this formula, we obtain the equivalent
LεDε≃
↓0
1 Cε(y∗)µ∗
s deth∗ (2πε)d−1
ε≃↓0
1 µ∗
s2πεdeth∗ det∇2V(x)¯ exp
Z0
s=−∞divℓ(ϕsy∗)ds
(44)
for the prefactor to the mean exit time fromD.
4.2 The Eyring–Kramers formula for metastable states
In this subsection, we assume that the deterministic system ˙x=b(x)possesses two stable equilibrium points ¯x1and ¯x2, whose basins of attractions are separated by a smooth hypersurfaceS.
We call D the basin of attraction of ¯x1 and formulate the following set of assumptions.
(C1) All the trajectories of the deterministic system ˙x=b(x)started onSremain inSand converge to a single equilibrium point x∗∈S; besides, the matrix
∇b(x∗)possessesd−1 eigenvalues with negative real part and a single positive eigenvalueλ∗.
(C2) Denoting byV the quasipotential with respect to ¯x1still defined by (6), there exists a unique (up to time shift) trajectoryρ = (ρt)t∈R⊂Dsuch that
t→−lim∞ρt=x¯1, lim
t→+∞ρt=x∗, and V(x∗) =A−∞,+∞[ρ]. (45) (C3) V is smooth in the neighborhood of(ρt)t∈R, and the vector fieldℓdefined by
the identityb=−∇V+ℓsatisfies the orthogonality relation (9).
In this context,V reaches its minimum onSat the pointx∗, and we refer to [2] for a proof of the Arrhenius law (40) based on the Freidlin–Wentzell theory. Further- more, the pathρ is called theinstantonand it satisfies
∀t∈R, ρ˙t=∇V(ρt) +ℓ(ρt). (46) As a consequence, for anyt∈R, the fluctuation path(ϕsx)s≤0joining ¯xtox=ρt coincides with the instanton, in the sense that
∀s≤0, ϕsx=ρs+t. (47) In order to describe the prefactorLεDin this case, we formulate the following supplementary assumption.
(C4) The matrixH∗=limt→+∞∇2V(ρt)exists and hasd−1 positive eigenvalues and 1 negative eigenvalue.
Under Assumptions (C1–4), a formula was obtained in [5, Eq. (1.10)] to de- scribe the sharp asymptotics of the expected time taken by the process to reach the neighborhood of ¯x2, which is the contents of the so-calledEyring–Kramers for- mulain the context of reversible diffusion processes [7]. Large deviation theory shows that with overwhelming probability, the path taken by the process to reach
¯
x2passes close tox∗. At this point, sinceb(x∗) =0, the process has a probability close to 1/2 to turn back intoDand a probability close to 1/2 to reach the neigh- bourood of ¯x2. Therefore, the expected time taken by the process to exitDis half the time described by the Eyring–Kramers formula. As a consequence, dividing the right-hand side of [5, Eq. (1.10)] by 2, we get the estimate
LεDε≃
↓0
π λ∗
s |detH∗| det∇2V(x¯1)exp
Z +∞
t=−∞divℓ(ρt)dt
(48) for the prefactor to the mean exit time fromD.
Remark 3 Notice that in the case addressed in Subsection 4.1,LεDis proportional to√ε, while in the present case,LεDdoes not depend onε.
4.3 Comments on the non-Gibbsianness correction
In the formula obtained forLεD in the cases of both Subsections 4.1 and 4.2, the non-Gibbsianness correctiondiscussed at the end of Section 2 appears. In partic- ular for the Eyring–Kramers formula, the identity (47) allows to relate the integral term of (48) to fluctuation paths by the remark that
Z +∞
t=−∞divℓ(ρt)dt= lim
t→+∞
Z 0
s=−∞divℓ(ϕsρt)ds. (49)
As a consequence, in the sequel of this paper, we focus on the numerical evaluation of this term.
Recent rigorous derivations of the Eyring–Kramers formula for nonreversible diffusion processes have been obtained in [25, 29], but to our knowledge, they are restricted to the case of processes whose invariant distribution is given by the Gibbs measure (14) and therefore do not include the non-Gibbsianness correction.
5 Effective computation of the prefactor
In Section 4, we showed that sharp asymptotics for mean exit times essentially follow from an accurate computation of the prefactorCε to the stationary distri- bution, which was the object of Sections 2 and 3. In this section, we therefore come back to the framework of Assumptions (A1–2) and focus on the numerical evaluation of this prefactor, whose equivalent is given by (17).
For a givenx∈Rd, the evaluation of the right-hand side of (17) requires the computation of the following quantities:
– the fluctuation path(ϕsx)s≤0,
– the divergence ofℓalong the fluctuation path, – the determinant of∇2V(x).¯
The main difficulty to access these quantities is that in general, the transverse decomposition (8) of the vector fieldbis not explicit, and therefore neither∇V norℓare straightforward to obtain.
As a first step, one can remark that thanks to (8),
∀s≤0, divℓ(ϕsx) =divb(ϕsx) +∆V(ϕsx), (50) so that computing the Hessian matrix∇2V(ϕsx) of the quasipotential along the fluctuation path is sufficient to obtain divℓ(ϕsx)and det∇2V(x).¯
Motivated by the large deviation principle (2), specific methods have been de- veloped in the computational physics community to evaluate the quasipotentialV. Thegeometric Minimum Action Method (gMAM) [37, 23] is a numerical proce- dure which computes the fluctuation pathϕxand returns the value ofV(x)given by (10). Once we are supplied withϕx, we could virtually iterate the method de- scribed above in order to compute the values of V in the neighborhood of the fluctuation path, and hence approximate∇2V(ϕsx)for selected pointsson a time grid. However, computing a fluctuation path for each evaluation ofV is too costly and we shall look for an algorithm that avoids doing so. Our method relies on the fact that∇2V(ϕtx)satisfies a forward matrix Riccati equation, which was already observed in [31, 32].
Proposition 1 Let x∈Rd. If V is smooth, then the family of matrices(Htx)t≤0
defined by
Htx=∇2V(ϕtx) (51) satisfies theforwardmatrix Riccati equation
H˙tx=−2(Htx)2+Qxt⊤Htx+HtxQxt+Rxt, (52) complemented with the limit condition
t→−lim∞Htx=∇2V(x).¯ (53)
Proof By (11), the time derivative ofHtxwrites H˙tx=
d k=1
∑
(∂kV(ϕtx) +ℓk(ϕtx))∂k∇2V(ϕtx). (54) On the one hand, using (8) and (23) yields
d k=1
∑
∂kV(ϕtx)∂k∇2V(ϕtx) =
d k=1
∑
∂kV(ϕtx)∇2(−bk(ϕtx) +ℓk(ϕtx))
=Rtx+
d k=1
∑
∂kV(ϕtx)∇2ℓk(ϕtx),
(55)
while on the other hand, the identity (12) yields
d k=1
∑
ℓk(ϕtx)∂k∇2V(ϕtx)
=−∇2V(ϕtx)∇ℓ(ϕtx)−∇ℓ(ϕtx)⊤∇2V(ϕtx)−
d k=1
∑
∂kV(ϕtx)∇2ℓk(ϕtx),
(56)
so that
H˙tx=−∇2V(ϕtx)∇ℓ(ϕtx)−∇ℓ(ϕtx)⊤∇2V(ϕtx) +Rxt. (57) We now compute
Qtx⊤Htx+HtxQxt =2(Htx)2−∇2V(ϕtx)∇ℓ(ϕtx)−∇ℓ(ϕtx)⊤∇2V(ϕtx), (58) which yields (52) and completes the proof.
Remark 4 The backward matrix Riccati equation (34) and the forward Riccati equation (52) are related by the fact that ifHtis a solution to (52), thenKt=−2Ht
is a solution to (34). This remark is employed in Appendix A.2 to solve (34) by quadrature.
As was already noted in Section 3.1, the coefficientsQtx and Rxt of the for- ward matrix Riccati equation (52) can be computed from the mere knowledge ofbandϕx. As a consequence,∇2V(ϕtx)can be obtained by numerical integra- tion of (52). Therefore we are left with two tasks: computing the limit condition
∇2V(x)¯ for (52), and integrating this equation on(−∞,0]. These tasks are dis- cussed in the respective Sections 5.1 and 5.2. The whole method is then summa- rized in Section 5.3.
5.1 Determination of the limit condition Thet→ −∞limit condition forHtxis given by
H¯ :=∇2V(x),¯ (59) which we assume to be positive-definite. This matrix satisfies the stationary ver- sion of (52), which writes
2 ¯H2=Q¯⊤H¯+H¯Q,¯ (60) with ¯Q:=−∇b(x). In the context of optimal control, this equation is referred to¯ as acontinuous time algebraic Riccati equation (CARE). Alternatively, since ¯xis a stable equilibrium point of the dynamical system ˙x=b(x), the eigenvalues of Q¯have nonnegative real parts. Let us assume that these eigenvalues have positive real parts. Then ¯H−1solves the continuous Lyapunov equation
2Id=H¯−1Q¯⊤+Q¯H¯−1, (61) so that
H¯−1=2 Z+∞
t=0 exp(−tQ)¯ exp(−tQ¯⊤)dt. (62)
5.2 Reparametrization and integration of the system
The system (52-53) is defined on the time interval(−∞,0], with a limit condition int=−∞. To facilitate its numerical integration, we first introduce a reparametriza- tion of time by the length of the fluctuation path, in the spirit of gMAM [23, 37].
The length of the fluctuation path is defined by Lx:=
Z 0
s=−∞kϕ˙sxkds= Z 0
s=−∞kb(ϕsx)kds, (63)
where the second identity follows from the orthogonality relation (9). For allt≤0, we denote
σt= Zt
s=−∞kϕ˙sxkds= Z t
s=−∞kb(ϕsx)kds∈(0,Lx]. (64) The reparametrized fluctuation path is the trajectory(ϕ˜σx)σ∈[0,Lx] defined by the identity
∀t≤0, ϕtx=ϕ˜σxt, (65) and the continuous extension ˜ϕ0x=x. It is easily observed that, for all¯ σ ∈(0,Lx],
d
dσϕ˜σx =∇V(ϕ˜σx) +ℓ(ϕ˜σx)
kb(ϕ˜σx)k . (66) We now define
H˜σx =∇2V(ϕ˜σx), (67) so that ˜Hσxt=Htxfor allt≤0. Then the family(H˜σx)σ∈[0,Lx]satisfies the problem
d
dσH˜σx=kb(ϕ˜σx)k−1
−2(H˜σx)2+ (Q˜xσ)⊤H˜σx+H˜σxQ˜xσ+R˜xσ
, σ ∈(0,Lx],
H˜0x=H,¯
(68) with
Q˜xσt=Qtx, R˜xσt=Rxt. (69) Notice that whenσ↓0, bothkb(ϕ˜σx)kand−2(H˜σx)2+(Q˜xσ)⊤H˜σx+H˜σxQ˜xσ+R˜xσ vanish. Therefore in order to integrate this system starting fromσ=0, we have to provide ana prioriestimate of theσ↓0 limit of
d dσH˜σx =
d l=1
∑
d
dσϕ˜l,σx ∂l∇2V(ϕ˜σx). (70) First, a first-order expansion in (66) yields
d dσϕ˜σxσ≃
↓0
∇2V(x) +¯ ∇ℓ(x)¯
(ϕ˜σx−x)¯
k∇b(x) (¯ ϕ˜σx−x)¯k =(2 ¯H+∇b(¯x)) (ϕ˜σx−x)¯
k∇b(¯x) (ϕ˜σx−x)¯ k (71) thanks to (8). Each individual term in the right-hand side is computable, so that one can evaluate the quantitiesdσdϕ˜l,σx
|σ=0for alll∈ {1, . . . ,d}.
It remains to compute the third derivatives ofV at ¯xin order to evaluate the matrices∂l∇2V(x),¯ l∈ {1, . . . ,d}. To this aim, we take the derivative of (12) with respect to thel-th coordinate, and evaluate the result at ¯x. Using the fact thatℓ(¯x) and∇V(x)¯ vanish, we get the matrix identity
0=
d k=1
∑
∂k∇2V(x)¯
(∂lℓk(x)) +¯ ∂l∇2V(x)¯
(∇ℓ(x)) +¯ ∇2V(x)¯
(∂l∇ℓ(x))¯ + (∂l∇ℓ(x))¯ ⊤ ∇2V(x)¯
+ (∇ℓ(x))¯ ⊤ ∂l∇2V(x)¯ +
d k=1
∑
(∂klV(x))¯ ∇2ℓk(x)¯ . (72) Substituting the derivatives ofℓwith those ofb+∇V, and introducing the nota- tions
vi jl=∂i jlV(x),¯ hi j=∂i jV(x),¯ βi,j=∂jbi(x),¯ γi,jl=∂jlbi(x),¯ (73) we finally obtain that, for alli,j,l∈ {1, . . . ,d},
0=
d k=1
∑
vi jk(βk,l+hkl) +vikl(βk,j+hjk) +hik(γk,jl+vjkl) + (γk,il+vikl)hjk+ (βk,i+hik)vjkl+hkl(γk,i j+vi jk).
(74)
Since the coefficientshi j,βi,j andγi,jlare known, the system of equations above inducesd3linear relations between thed3unknown coefficientsvi jl— more pre- cisely, since both the left-hand side of (74) and the value ofvi jl are invariant by permutation of the indicesi, jandl, the number of independent linear relations and unknown coefficients is reduced tod(d+1)(d+2)/6. The resolution of this system allows to reconstruct the matrices∂l∇2V(x)¯ and completes the computa- tion of (70). Notice that these matrices also possess an explicit formulation as an integral along the fluctuation path associated with the linearized stochastic differ- ential equation
d ˜Xsε=∇b(x)¯ X˜sε−x¯ ds+√
2εdWs, s≥0, (75)
see [4, Section 3.3 and Equation (3.39)].
5.3 Conclusion
Givenx∈Rdandε>0, the numerical procedure sketched above to compute the right-hand side of (17) can be summarized in the following steps.
1. Compute the fluctuation pathϕx, for example using gMAM [23, 37]. From this step, the value ofV(x)along the fluctuation path can be deduced thanks to (10), which allows to evaluate the right-hand side of (2).
2. Solve the stationary matrix Riccati equation (60) to get ¯H=∇2V(x). This can¯ be done either:
– by computing the integral (62), which solves the Lyapunov equation (61), and inverting the result;
– or by using a numerical solver for the CARE (60) directly.
3. ComputeLxand for a given number of time stepsN≫1, compute times 0= tN>tN−1>···>t1>t0=−∞such that
∀n∈ {1, . . . ,N−1},
Ztn+1 s=tn
kb(ϕsx)kds=θ, (76) withθ=Lx/N.
4. Using (71), compute d dσϕ˜σx
|σ=0≃(2 ¯H+∇b(x))¯ ϕtx1−x¯ ∇b(x)¯ ϕtx1−x¯
, (77)
and solve (74) to get the matrices∂l∇2V(x),¯ l∈ {1, . . . ,d}.
5. Compute the approximation ˜H[n]of ˜Hnθx =Htnby integrating the matrix-valued differential equation
d
dσH˜σx =−2(H˜σx)2+ (Q˜xσ)⊤H˜σx+H˜σxQ˜xσ+R˜xσ
kb(ϕ˜σx)k (78) on the gridσ ∈ {θ,2θ, . . . ,Nθ}, with initial conditions
H˜[0]=H,¯ H˜[1]=H˜[0]+θ
d l=1
∑
d dσϕ˜l,σx
|σ=0∂l∇2V(x).¯ (79) Many schemes can be employed for this numerical integration; in the example of Section 6, we use a first-order implicit Euler scheme, which is observed to have satisfying stability properties.
6. Deduce from (50) that Z0
s=−∞
divℓ(ϕsx)ds≃
N−1 n=0
∑
(tn+1−tn)
divb(ϕtxn) +tr ˜H[n]
, (80)
where we recall that the timest0, . . . ,tNare chosen so that the length of the instanton on each time interval(tn,tn+1)be equal toθ.
6 Numerical illustration
In this section, we apply the method devised in Section 5 to a two-dimensional process which exhibitsbistability, in the sense that the associated vector fieldb: R2→R2possesses two stable equilibrium points. We are therefore in the context of Subsection 4.2.
We shall fix one equilibrium point ¯xand first compute the instanton(ρt)t∈R. By (47), the integral term involved in the computation of the prefactor to the sta- tionary distribution at the pointxwrites
Z0
s=−∞divℓ(ϕsx)ds= Z t
s=−∞divℓ(ρs)ds=J(t). (81)
Hence, computing J(t)for anyt ∈R amounts to computing a whole family of prefactors, at pointsx=ρt. In thet→+∞limit, we shall also obtain the value of the prefactor to the mean exit time from the basin of attraction of ¯x1, as is described in Subsection 4.2.
6.1 Presentation of the example
Forα>0, we consider the potential function onR2defined by V(x1,x2) =v1(x1) +v2(x2), v1(x1) = x41
4 −x21
2, v2(x2) =αx22
2, (82) the critical points of which are(−1,0),(0,0)and(1,0). It is easily checked that the first and third points are stable equilibria of the dynamical system ˙x=−∇V(x), while the second point is stable in the direction ofx2but unstable in the direction ofx1.
For a smooth scalar field c:R2→R to be chosen below, let us define the vector fieldbby
b=−∇V+ℓ, ℓ(x1,x2) =c(x1,x2)
−v′2(x2) v′1(x1)
. (83)
For any choice ofc, the Hamilton–Jacobi equation (7) is satisfied, the vector field bvanishes at the same points as−∇V, and the points(−1,0)and(1,0)are stable equilibria of the dynamical system ˙x=b(x). Thesaddle-point(0,0)is stable in one direction and unstable in one direction, but these directions may differ from the canonical vectors ofR2.
Let us denote ¯x= (−1,0)andx∗= (0,0). Theinstanton(ρt)t∈Ris the hetero- clinic orbit of the dynamical system
˙
x=∇V(x) +ℓ(x) (84)
joining ¯xint=−∞tox∗int= +∞. The instanton, the level lines ofV and the field lines ofbare plotted on Figure 1 for the choice
c(x1,x2) =βx1, β >0. (85) The numerical illustrations of this section are plotted forα=0.5 andβ =3.
6.2 Stationary matrix Riccati equation
With our definition of the vector fieldband the choice (85) for the scalar fieldc, the matrix ¯Q=−∇b(x)¯ appearing in (60) writes
Q¯=
v′′1(x¯1) c(x)v¯ ′′2(x¯2)
−c(x)v¯ ′′1(x¯1) v′′2(x¯2)
=
2 −β α
2β α
, (86)
where we have used the fact that∇V(x) =¯ 0 for the first identity. The numerical resolution of (60) yields the expected result
H¯= 2 0
0α
. (87)
0
−1 1
−1.4 −1.2 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8
0
−1 1
−0.8
−0.6
−0.4
−0.2 0.2 0.4 0.6 0.8
Fig. 1 The level lines ofV(pale blue), some field lines associated withb(red) and the instanton (thick blue) joining ¯x= (−1,0)tox∗= (0,0).
6.3 Third derivatives ofV at ¯x
In order to determine the initial condition for the computation of ˜H[n], we write the coefficients appearing in (74):
– the computation of ¯Hperformed above yields
h11=2, h12=0, h22=α; (88) – the computation of∇b(x)¯ yields
β1,1=−2, β1,2=β α, β2,1=−β, β2,2=−α; (89) – the computation of the second derivatives ofbyields
γ1,11=6, γ1,12=−β α, γ1,22=0,
γ2,11=10β, γ2,12=0, γ2,22=0. (90) The system of linear equations (74) contains 4 equations, corresponding to the choices{i,j,l}={1,1,1},{1,1,2},{1,2,2},{2,2,2}, which write
0=36+6v111−3βv112,
0=6β α+β αv111+ (4+α)v112−2βv122, 0=2β αv112+2(1+α)v122−βv222, 0=3β αv122+3αv222.
(91)
The unique solution of this system is
v111=−6, v112=v122=v222=0, (92)