Free energy difference ∆ F(z)Free energy difference ∆ F(z)

(1)

Free energy computations

Monte Carlo methods in molecular dynamics Tony Leli `evre

CERMICS, Ecole Nationale des Ponts et Chauss ´ees, France, MicMac team, INRIA, France.

http://cermics.enpc.fr/∼lelievre

(2)

Outline

1 Free energy and metastability, 2 Constrained dynamics,

2.1 Thermodynamic integration, 2.2 Non-equilibrium dynamics, 3 Adaptive methods,

3.1 Adaptive methods: algorithms, 3.2 Adaptive methods: convergence, 3.3 Multiple replicas implementations, 3.4 Application to Bayesian statistics.

(3)

1 Free energy and metastability

The aim of molecular dynamics computations is to evaluate numerically macroscopic quantities from models at the microscopic scale.

Some examples of macroscopic quantities:

• thermodynamics quantities: stress, heat capacity, free energy;

• dynamical quantities: diffusion coefficients, viscosity, transition rates.

Many applications in various fields: biology, physics, chemistry, materials science. Molecular dynamics computations consume today a lot of CPU time.

(4)

1 Free energy and metastability

A molecular dynamics model amounts essentially in choosing a potential V which associates to a

configuration (x₁, ..., x_N) = x ∈ R^3N an energy

V (x₁, ..., x_N).

In the NVT ensemble, configurations are distributed according to the Boltzmann-Gibbs probability

measure:

dµ(x) = Z⁻¹ exp(−βV (x)) dx,

where Z = R

exp(−βV (x)) dx is the partition function and β = (k_BT)⁻¹ is proportional to the inverse of the temperature.

Aim: compute averages with respect to µ.

(5)

1 Free energy and metastability

Examples of quantities of interest:

• specific heat

C ∝ hV ²i^µ − hV i²µ

• pressure

P ∝ −hq · ∇V (q)i^µ

(6)

1 Free energy and metastability

Typically, V is a sum of potentials modelling interaction between two particles, three particles and four

particles:

V = X

i<j

V₁(x_i, x_j) + X

i<j<k

V₂(x_i, x_j, x_k) + X

i<j<k<l

V₃(x_i, x_j, x_k, x_l).

For example, V₁(x_i, x_j) = V_LJ(|x_i − x_j|) where

V_LJ(r) = 4ǫ

σ r

12

− ^σ_r 6

is the Lennard-Jones potential.

Difficulties: (i) high-dimensional problem (N ≫ 1) ; (ii) µ is a multimodal measure.

(7)

1 Free energy and metastability

To sample µ, Markov Chain Monte Carlo methods are used.

A typical example is the over-damped Langevin (or gradient) dynamics:

dX_t = −∇V (X_t) dt + p

2β⁻¹dW_t.

Under suitable assumption, we have the ergodic property: for µ-a.e. X₀,

Tlim→∞

1 T

Z _T

0

φ(X_t)dt = Z

φ(x)dµ(x).

(8)

1 Free energy and metastability

Probabilistic insert (1): discretization of SDEs.

The discretization of (GD) by the Euler scheme is (for a fixed timestep ∆t):

X_n+1 = X_n − ∇V (X_n) ∆t + p

2β⁻¹∆tG_n

where (Gⁱ_n)_{1≤i≤3N,n≥0} are i.i.d. random variables with law _N (0, 1). Indeed,

(W_(n+1)∆t − W_n∆t)_n≥0 =^L √

∆t(G_n)_n≥0.

In practice, a sequence of i.i.d. random variables with law _N (0, 1) may be obtained from a sequence of i.i.d.

random variables with law _U((0, 1)).

(9)

1 Free energy and metastability

Proof (invariant measure): One needs to show that if the law of X₀ is µ, then the law of X_t is also µ. Let us denote X^x_t the solution to (GD) such that X₀ = x. Let us consider the function u(t, x) solution to:

( ∂_tu(t, x) = −∇V (x) · ∇u(t, x) + β⁻¹∆u(t, x), u(0, x) = φ(x)(+ assumptions on decay at infinity),

then, u(t, x) = E(φ(X^x_t )). Thus, the measure µ is invariant:

d dt

Z

E(φ(X^x_t ))dµ(x) = Z⁻¹ Z

∂_tu(t, x) exp(−βV (x))dx

= Z⁻¹ Z

−∇V · ∇u + β⁻¹∆u

exp(−βV )= 0.

Therefore,

Z

E(φ(X^x_t ))dµ(x) = Z

φ(x)dµ(x).

(10)

1 Free energy and metastability

Probabilistic insert (2): Feynman-Kac formula.

Why u(t, x) = E(φ(X^x_t )) ? For 0 < s < t, we have (characteristic method):

du(t − s, X^x_s ) = −∂_tu(t − s, X^x_s ) ds + ∇u(t − s, X^x_s ) · dX^x_s +β⁻¹∆u(t − s, X^x_s ) ds,

=

− ∂_tu(t − s, X^x_s ) − ∇V (X^x_s ) · ∇u(t − s, X^x_s ) + β⁻¹∆u(t − s, X^x_s )

ds + p

2β⁻¹∇u(t − s, X^x_s ) · dW _s.

Thus, integrating over s ∈ (0, t) and taking the expectation:

E(u(0, X^x_t )) − E(u(t, X^x₀ )) = p

2β⁻¹E

Z _t

0 ∇u(t − s, X^x_s ) · dW_s

= 0.

(11)

1 Free energy and metastability

Probabilistic insert (3): It ˆo’s calculus. ^{(in 1d.)}

Where does the term ∆u come from ? Starting from the discretization:

X_n+1 = X_n − V ^′(X_n) ∆t + p

2β⁻¹∆tG_n,

we have (for a time-independent function u):

u(X_n+1) = u

X_n − V ^′(X_n) ∆t + p

2β⁻¹∆tG_n ,

= u(X_n) − u^′(X_n)V ^′(X_n) ∆t + p

2β⁻¹∆tu^′(X_n)G_n +β⁻¹(G_n)²u^′′(X_n)∆t + o(∆t).

Thus, summing over n ∈ [0...t/∆t] and taking the limit ∆t → 0,

u(X_t) = u(X₀) −

Z _t

0

V ^′(X_s)u^′(X_s) ds + p

2β⁻¹

Z _t

0

u^′(X_s)dW_s

+β⁻¹

Z _t

0

u^′′(X_s) ds. T. Leli `evre, Cornell University, February 2010 – p. 11

(12)

1 Free energy and metastability

In practice, (GD) is discretized in time, and Cesaro means are computed: lim_N_T_→∞ _N¹

T

P_N_T

n=1 φ(X_n). Remark: Practitioners do not use over-damped

Langevin dynamics but rather Langevin dynamics:

( dX_t = M⁻¹P _t dt,

dP_t = −∇V (X_t) dt − γM⁻¹P _t dt + p

2γβ⁻¹dW _t,

where _M is the mass tensor and _γ is the friction coefficient. In the following, we mainly consider over-damped Langevin dynamics.

(13)

1 Free energy and metastability

Problem: In practice, X_t is a metastable process, so that the convergence of the ergodic limit is very slow.

A bi-dimensional example: X_t¹ is a slow variable of the system.

x₁ x₂

V (x₁, x₂)

X_t¹

t

(14)

1 Free energy and metastability

A more realistic example (Dellago, Geissler): Influence of the solvation on a dimer conformation.

00000000 00000000 00000000 0000

11111111 11111111 11111111 1111

000000 000000 000000 111111 111111 111111

00000000 00000000 00000000 0000

11111111 11111111 11111111 1111 00000000 00000000 00000000 11111111 11111111 11111111

00000000 00000000 00000000 11111111 11111111 11111111

000000 000000 000000 000

111111 111111 111111 111

00000000 00000000 00000000 11111111 11111111 11111111

000000 000000 000000 111111 111111

111111 0000000000000000000000000000 11111111 11111111 11111111 1111 000000 000000 000000 111111 111111 111111 000000 000000 000000 111111 111111 111111

000000 000000 000000 000

111111 111111 111111 111

00000000 00000000 00000000 0000

11111111 11111111 11111111 1111

000000 000000 000000 111111 111111 111111

000000 000000 000000 000

111111 111111 111111 111

000000 000000 000000 111111 111111 111111

000000 000000 000000 000

111111 111111 111111 111

000000 000000 000000 000

111111 111111 111111 111

00000000 00000000 00000000 11111111 11111111 11111111 000000 000000 000000 111111 111111 111111

00000000 00000000 00000000 0000

11111111 11111111 11111111 1111

000000 000000 000000 000

111111 111111 111111 111

000000 000000 000000 111111 111111 111111

00000000 00000000 00000000 0000

11111111 11111111 11111111 1111 00000000 00000000 00000000 0000

11111111 11111111 11111111 1111 000000 000000 000000 111111 111111 111111

Left: compact state ^(ξ ⁼ ^d0). Right: stretched state ^(ξ ⁼ ^d1). A slow variable is ξ(X_t) where ξ(x) = |x₁ − x₂| is a so-called reaction coordinate.

(15)

1 Free energy and metastability

A “real” example: ions canal in a cell membrane.

(C. Chipot).

(16)

1 Free energy and metastability

Metastability: How to quantify this bad behaviour ? 1. Escape time from a potential well.

2. Asymptotice variance of the estimator.

3. “Decorrelation time”.

4. Rate of convergence of the law of X_t to µ. In the following we use the fourth criterium.

(17)

1 Free energy and metastability

The PDE point of view: convergence of the pdf ψ(t, x)

of X_t to _ψ_∞₍x) = Z⁻¹e^−βV ⁽^x⁾. _ψ satisfies the Fokker-Planck equation

∂_tψ = div (∇V ψ + β⁻¹∇ψ),

which can be rewritten as _∂_t_ψ ₌ _β⁻¹_div _ψ_∞_∇ _ψ^ψ

∞

. Let us introduce the entropy

E(t) = H(ψ(t, ·)|ψ_∞) = Z

ln

ψ ψ_∞

ψ.

We have (Csiszár-Kullback inequality):

kψ(t, ·) − ψ_∞kL¹ ≤ p

2E(t).

(18)

1 Free energy and metastability

dE

dt =

Z

ln

ψ ψ_∞

∂_tψ

= β⁻¹ Z

ln

ψ ψ_∞

div

ψ_∞∇

ψ ψ_∞

= −β⁻¹

Z

∇ ln

ψ ψ_∞

2

ψ =: −β⁻¹I(ψ(t, ·)|ψ_∞).

If V is such that the following Logarithmic Sobolev inequality (LSI(R)) holds: _∀ψ pdf,

H(ψ|ψ_∞) ≤ 1

2RI(ψ|ψ_∞)

then E(t) ≤ C exp(−2β⁻¹Rt) and thus ψ converges to

ψ_∞ exponentially fast with rate β⁻¹R.

Metastability _⇐⇒ small R

(19)

1 Free energy and metastability

Metastability: How to attack this problem ?

We suppose in the following that the slow variable is of dimension 1 and known: ξ(X_t), where ξ : Rⁿ → T.

Functionals to be averaged are typically functions of this slow variable.

Let us introduce the free energy A which is such that the image of the measure µ by ξ is Z⁻¹ exp(−βA(z)) dz. From the co-area formula, one gets:

A(z) = −β⁻¹ ln

Z

Σ(z)

e^−βV |∇ξ|⁻¹dσ_Σ(z₎

! ,

where Σ(z) = {x, ξ(x) = z} is a (smooth) submanifold of _Rⁿ, and σ_Σ(z) is the Lebesgue measure on Σ(z).

(20)

1 Free energy and metastability

Co-area formula: Let X be a random variable with law

ψ(x) dx in _Rⁿ. Then ξ(X) has law ^R_Σ(z) ψ |∇ξ|⁻¹ dσ_Σ(z) dz,

and the law of X conditioned to a fixed value z of ξ(X)

is dµ_Σ(z) = ^R ^ψ ^|∇ξ|⁻¹ ^dσ^Σ(z)

Σ(z) ψ |∇ξ|⁻¹ dσ_Σ(z).

Indeed, for any bounded functions f and g,

E(f(ξ(X))g(X)) = Z

Rⁿ

f(ξ(x))g(x)ψ(x) dx,

= Z

R^p

Z

Σ(z)

f ◦ ξ g ψ |∇ξ|⁻¹dσ_Σ(z₎ dz,

= Z

R^p

f(z) R

Σ(z) g ψ |∇ξ|⁻¹dσ_Σ(z₎ R

Σ(z) ψ |∇ξ|⁻¹dσ_Σ(z) Z

Σ(z)

ψ |∇ξ|⁻¹dσ_Σ(z₎ dz

(21)

1 Free energy and metastability

Remarks:

- The measure _|∇ξ|⁻¹dσ_Σ(z₎ is sometimes denoted

δ_ξ(x)−z in the literature.

- A is the free energy associated with the reaction coordinate or collective variable ξ (angle, length, ...).

A is defined up to an additive constant, so that it is enough to compute free energy differences, or the derivative of A (the mean force).

- A(z) = −β⁻¹ ln Z_Σ(z₎ and Z_Σ(z) is the partition function associated with the conditioned probability measures:

µ_Σ(z) = Z_Σ(z)⁻¹ e^−βV |∇ξ|⁻¹dσ_Σ(z).

(22)

1 Free energy and metastability

Example of a free energy profile (solvation of a dimer)

(Profiles computed using TI)

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

−0.6

−0.4

−0.2 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8

Parameter z

Free energy difference ∆ F(z)

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

−0.6

−0.4

−0.2 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8

Parameter z

Free energy difference ∆ F(z)

The density of the solvent molecules is lower on the left than on the right. At high (resp.

low) density, the compact state is more (resp. less) likely. The “free energy barrier” is higher at high density than at low density. Related question: interpretation of the free energy barrier in terms of dynamics ?

(23)

1 Free energy and metastability

Some direct numerical simulations...

Remark: Free energy is not energy !

−3 −2 −1 0 1 2 3

0 0.5 1 1.5 2 2.5 3

x coordinate

y coordinate

−3 −2 −1 0 1 2 3

−2.5

−2

−1.5

−1

−0.5 0 0.5 1 1.5

x coordinate

Free energy

Left: The potential is 0 in the region enclosed by the curve, and +∞ outside.

Right: Associated free energy profile when the x

coordinate is the reaction coordinate (β = 1).

(24)

1 Free energy and metastability

Examples of methods to compute free energy differences A(z₂) − A(z₁):

• Thermodynamic integration (Kirkwood) (homogeneous Markov process),

• Perturbation methods (Zwanzig) and histogram methods,

• Out of equilibrium dynamics (Jarzynski) (non-homogeneous Markov process),

• Adaptive methods (ABF, metadynamics) (non-homogeneous and non-linear Markov process).

Numerically, this amounts to: (i) sampling efficiently a multi-modal measure in high dimension, (ii) computing the marginal law of such a measure along a given

low-dimensional function.

(25)

1 Free energy and metastability

(a) Thermodynamic integration. (b) Histogram method.

(c) Out of equilibrium dynamics. (d) Adaptive dynamics.

(26)

2 Constrained dynamics

• Thermodynamic integration (Kirkwood)

• Perturbation methods (Zwanzig) and histogram methods,

• Out of equilibrium dynamics (Jarzynski),

• Adaptive methods (ABF, metadynamics).

(27)

2.1 Thermodynamic integration

Thermodynamic integration

(28)

2.1 Thermodynamic integration

Thermodynamic integration is based on two remarks:

(1) The derivative A^′(z) can be obtained by sampling the conditioned probability measure µ_Σ(z₎ (Sprik, Ciccotti, Kapral, Vanden-Eijnden, E, den Otter, ...)

A^′(z) = Z_Σ(z⁻¹ ₎

Z

∇V · ∇ξ

|∇ξ|² − β⁻¹div

∇ξ

|∇ξ|²

exp(−βV )|∇ξ|⁻¹dσ_Σ(z)

= Z_Σ(z⁻¹ ₎

Z ∇ξ

|∇ξ|² ·

∇V˜ + β⁻¹H

exp(−βV˜ )dσ_Σ(z₎,

= Z

f dµ_Σ(z),

where V^˜ = V + β⁻¹ ln |∇ξ|, f = ^∇V_|∇ξ|^·∇ξ₂ − β⁻¹div

∇ξ

|∇ξ|²

and H = −∇ ·

∇ξ

|∇ξ|

∇ξ

|∇ξ| is the mean curvature vector.

(29)

2.1 Thermodynamic integration

Proof: (based on the co-area formula)

Z Z

exp(−βV˜ )dσ_Σ(z) ′

φ(z) dz = −

Z Z

exp(−βV˜ )dσ_Σ(z)φ^′ dz,

= −

Z Z

exp(−βV˜ )φ^′ ◦ ξ dσ_Σ(z₎ dz,

= −

Z

exp(−βV˜ )φ^′ ◦ ξ|∇ξ|dx,

= −

Z

exp(−βV˜ )∇(φ ◦ ξ) · ∇ξ

|∇ξ|² |∇ξ|dx,

=

Z

∇ ·

exp(−βV˜ ) ∇ξ

|∇ξ|

φ ◦ ξ dx,

=

Z Z

−β ∇V˜ · ∇ξ

|∇ξ|² + |∇ξ|⁻¹∇ ·

∇ξ

|∇ξ|

!

exp(−βV˜)dσ_Σ(z₎φ(z) dz.

(30)

2.1 Thermodynamic integration

(2) It is possible to sample the conditioned probability measure µ_Σ(z) = Z_Σ(z)⁻¹ exp(−βV˜ )dσ_Σ(z) by considering the following constrained dynamics:

(RCD)

( dX_t = −∇V˜(X_t) dt + p

2β⁻¹dW _t + ∇ξ(X_t)dΛ_t, dΛ_t such that ξ(X_t) = z.

Thus, A^′(z) = lim_T_→∞ _T¹ R _T

0 f(X_t) dt.

The free energy profile is then obtained by thermodynamic integration:

A(z) − A(0) =

Z _z

0

A^′(z) dz ≃

K

X

i=0

ω_iA^′(z_i).

(31)

2.1 Thermodynamic integration

Notice that there is actually no need to compute f in practice since the mean force may be obtained by averaging the Lagrange multipliers.

Indeed, we have dΛ_t = dΛ^m_t + dΛ^f_t, with

dΛ^m_t = −p

2β⁻¹ _|∇ξ|^∇ξ₂ (X_t) · dW_t and

dΛ^f_t =_|∇ξ|^∇ξ₂ ·

∇V˜ + β⁻¹H

(X_t) dt = f(X_t) dt so that

A^′(z) = lim

T→∞

1 T

Z _T

0

dΛ_t = lim

T→∞

1 T

Z _T

0

dΛ^f_t.

Of course, this comes at a price: essentially, we are using the fact that

Mlim→∞ lim

∆t→0

1 M∆t

XM

m=1

h ξ“

q + √

∆t G^m”

− 2ξ(q) + ξ“

q − √

∆t G^m”i

= ∆ξ(q),

and this estimator has a non zero variance.

(32)

2.1 Thermodynamic integration

More explicitly, the rigidly constrained dynamics writes:

(RCD) dX_t = P (X_t)

−∇V˜ (X_t) dt + p

2β⁻¹dW _t

+ β⁻¹H(X_t) dt,

where P(x) is the orthogonal projection operator:

P (x) = Id ₋ n(x) ⊗ n(x),

with n the unit normal vector: n(x) = ∇ξ

|∇ξ|(x).

(RCD) can also be written using the Stratonovitch product: dX_t = −P(X_t)∇V˜ (X_t) dt+ p

2β⁻¹P (X_t)◦dW _t. It is easy to check that ξ(X_t) = ξ(X₀) = z for X_t

solution to (RCD).

(33)

2.1 Thermodynamic integration

[G. Ciccotti, TL, E. Vanden-Einjden, 2008] Assume wlg that z = 0. The probability µ_Σ(0) is the unique invariant measure with support in Σ(0) for (RCD).

Proposition: Let X_t be the solution to (RCD) such that the law of X₀ is µ_Σ(0). Then, for all smooth function φ

and for all time t > 0,

E(φ(X_t)) = Z

φ(x)dµ_Σ(0)(x).

Proof: Introduce the infinitesimal generator and apply the divergence theorem on submanifolds : ∀φ ∈ C¹(R^3N,R^3N),

Z

div _Σ(0)(φ)dσ_Σ(0) = − Z

H · φdσ_Σ(0), where div _Σ(0)(φ) = tr(P∇φ).

(34)

2.1 Thermodynamic integration

Discretization: These two schemes are consistent with (RCD):

(S1)

( X_n+1 = X_n − ∇V˜ (X_n)∆t + p

2β⁻¹∆W _n + λ_n∇ξ(X_n+1),

with λ_n ∈ R such that ξ(X_n+1) = 0, (S2)

( X_n+1 = X_n − ∇V˜ (X_n)∆t + p

2β⁻¹∆W _n + λ_n∇ξ(X_n),

with λ_n ∈ R such that ξ(X_n+1) = 0,

where ∆W _n = W_(n+1)∆t − W_n∆t. The constraint is

exactly satisfied (important for longtime computations).

The discretization of A^′(0) = lim_T_→∞ _T¹ R _T

0 dΛ_t is:

Tlim→∞ lim

∆t→0

1 T

T /∆t

X

n=1

λ_n = A^′(0).

(35)

2.1 Thermodynamic integration

In practice, the following variance reduction scheme may be used:

( X_n+1 = X_n − ∇V˜ (X_n)∆t+p

2β⁻¹∆W_n + λ∇ξ(X_n+1),

with λ ∈ R such that ξ(X_n+1) = 0, ( X_∗ = X_n − ∇V˜(X_n)∆t−p

2β⁻¹∆W_n + λ_∗∇ξ(X_∗),

with λ_∗ ∈ R such that ξ(X_∗) = 0,

and _λ_n _{= (λ} ₊ _λ_∗_)/2.

The martingale part _dΛ^m_t (i.e. the most fluctuating part) of the Lagrange multiplier is removed.

(36)

2.1 Thermodynamic integration

An over-simplified illustration: in dimension 2,

V (x) = ^β₂⁻¹ |x|² and ξ(x) = ^x_a²¹2 + ^x_b2²² − 1.

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35

-3 -2 -1 0 1 2 3 4

mes_int mes_non_int dyn_int_proj_1 dyn_int_proj_2 dyn_non_int_proj_1 dyn_non_int_proj_2

Measures samples theoretically and numerically ^{(as a}

function of the angle θ), with β = 1, a = 2, b = 1, ∆t = 0.01, and 50 000 000 timesteps.

(37)

2.1 Thermodynamic integration

Computation of the mean force: β = 1, a = 2, b = 1. The exact value is: 0.9868348150. The numerical result

(with ∆t = 0.001, M = 50000) is: [0.940613 ; 1.03204].

The variance reduction method reduces the variance by a factor 100. The result (with ∆t = 0.001, M = 50000) is: [0.984019 ; 0.993421].

(38)

2.1 Thermodynamic integration

App. mean force as a function of ∆t and M = T /∆t:

1e-05

1e-04

0.001

0.01

0.1 1000

10000

100000

1e+06

1e+07 0.86

0.88 0.9 0.92 0.94 0.96 0.98 1

dt

M

A balance needs to be find between the discretization error (∆t → 0) and the convergence in the ergodic limit (T → ∞).

(39)

2.1 Thermodynamic integration

Error analysis [Faou,TL, Mathematics of Computation, 2010]: Using classical technics (Talay-Tubaro like proof), one can check that the ergodic measure µ^∆t_Σ(0) sampled by the Markov chain (X_n) is an approximation of order one of

µ_Σ(0): for all smooth functions g : Σ(0) → R,

Z

Σ(0)

gdµ^∆t_Σ(0) − Z

Σ(0)

gdµ_Σ(0)

≤ C∆t.

(40)

2.1 Thermodynamic integration

Metastability issue: Using TI, we have to sample the conditional measures µ_Σ(z) rather than the original Gibbs measure µ. The long-time behaviour of the

constrained dynamics (RCD) will be essentially limited by the LSI contant ρ(z) of the conditional measures

µ_Σ(z) (to be compared with the LSI constant R of the original measure µ). For well-chosen ξ, ρ(z) ≫ R,

which explains the efficiency of the whole procedure.

(41)

2.1 Thermodynamic integration

Remarks:

- There are many ways to constrain the dynamics

(GD). We chose one which is simple to discretize. We may also have used, for example (for z = 0)

dX^η_t = −∇V (X^η_t ) dt − 1

2η∇(ξ²)(X^η_t ) dt + p

2β⁻¹dW _t,

where the constraint is penalized. One can show that

lim_η→0 X^η_t = X_t ⁽ⁱⁿ ^L^∞_t∈[0,T_](L²_ω)-norm) where X_t satisfies (RCD). Notice that we used V and not V^˜ in the penalized dynamics.

(42)

2.1 Thermodynamic integration

The statistics associated with the dynamics where the constraints are rigidly imposed and the dynamics

where the constraints are softly imposed through

penalization are different: “a stiff spring ₆= a rigid rod”

(van Kampen, Hinch,...).

(43)

2.1 Thermodynamic integration

- TI yields a way to compute ^R φ(x)dµ(x):

Z

φ(x)dµ(x) = Z⁻¹ Z

φ(x)e^−βV ⁽^x⁾dx,

= Z⁻¹ Z

z

Z

Σ(z)

φe^−βV |∇ξ|⁻¹dσ_Σ(z₎ dz, (co-area formula)

= Z⁻¹ Z

z

R

Σ(z) φe^−βV |∇ξ|⁻¹dσ_Σ(z₎ R

Σ(z) e^−βV |∇ξ|⁻¹dσ_Σ(z₎ Z

Σ(z)

e^−βV |∇ξ|⁻¹dσ_Σ(z) dz,

=

Z

z

e^−βA(z⁾ dz

−1 Z

z

Z

Σ(z)

φdµ_Σ(z)

!

e^−βA(z) dz.

with Σ(z) = {x, ξ(x) = z},

A(z) = −β⁻¹ ln R

Σ(z)e^−βV |∇ξ|⁻¹dσ_Σ(z₎

and

µ_Σ(z₎ = e^−βV |∇ξ|⁻¹dσ_Σ(z)/ R

Σ(z)e^−βV |∇ξ|⁻¹dσ_Σ(z₎.

T. Leli `evre, Cornell University, February 2010 – p. 43

(44)

2.1 Thermodynamic integration

- [C. Le Bris, TL, E. Vanden-Einjden, CRAS 2008] For a general SDE (with a non isotropic diffusion), the following diagram does not commute:

P_cont

P_disc Projected discretized process

?

Discretized process Projected continuous process

Continuous process

Discretized projected continuous process

∆t

(45)

2.1 Thermodynamic integration

Generalization to Langevin dynamics. Interests:

(i) Newton’s equations of motion are more “natural”;

(ii) leads to numerical schemes which sample the

constrained measure without time discretization error.







dq_t = M⁻¹p_t dt,

dp_t = −∇V (q_t) dt − γM⁻¹p_t dt + p

2γβ⁻¹dW_t + ∇ξ(q_t) dλ_t, ξ(q_t) = z.

The probability measure sampled by this dynamics is

µ_T^∗_Σ(z₎(dqdp) = Z⁻¹ exp(−βH(q, p))σ_T^∗_Σ(z₎(dqdp),

where H(q, p) = V (q) + ¹₂p^T M⁻¹p.