Reversible distributions of multi-allelic Gillespie-Sato diffusion models

(1)

www.elsevier.com/locate/anihpb

Reversible distributions of multi-allelic Gillespie–Sato diffusion models

Kenji Handa

¹

Department of Mathematics, Saga University, Saga 840-8502, Japan

Received 23 August 2002; received in revised form 14 August 2003; accepted 13 January 2004 Available online 12 May 2004

Dedicated to Professor Tokuzo Shiga on the occasion of his 60th birthday

Abstract

We consider multi-allelic Gillespie–Sato diffusion models in population genetics. The case where they have reversible distributions is completely determined in terms of mutation rates and selection intensity. In such cases we give an explicit expression of the reversible distributions, which turn out to be mutually absolutely continuous with respect to some Dirichlet distributions.

Résumé

On considère les modèles de diffusion multi-alleles de Gillepsie–Sato, en génétique des populations. Le cas réversible est complètement déterminé en termes de taux de mutation et d’intensité de sélection. On obtient une expression explicite des distributions réversibles qui se trouvent être absolument continues par rapport à certaines distributions de Dirichlet.

MSC: 60J60; 92D10

Keywords: Gillespie–Sato diffusion models; Wright–Fisher diffusion models; Reversibility; Quasi-invariance; Dirichlet distributions

1. Introduction and the main result

In population genetics theory, stochastic methods such as diffusion approximations had been exploited extensively, yielding rich results which are of interest from both genetical and mathematical view points. Among a number of quantities associated with the diffusion models, it is of particular importance to study their stationary distributions. In general, it is quite difficult to give them in explicit way, and stationary distributions which had been found explicitly are usually shown to exhibit a stronger property called reversibility, i.e., at stationarity the process has the same distribution as its time reversal.

E-mail address: [email protected] (K. Handa).

1 Partially supported by The Sumitomo Foundation.

doi:10.1016/j.anihpb.2003.08.002

(2)

Besides analytical importance of this property, we should mention about a special role of reversibility in the context of population genetics theory. For this purpose, it seems best to cite an account given by Ewens from [5, p. 87] concerning the prospective and retrospective aspects of the processes. “For reversible processes these two aspects have many properties in common, and information about the prospective behavior normally yields almost immediately useful information about the retrospective behavior”. See also e.g. [28,29,15,20,17] and [4, §8] for various applications of reversibility or time reversal in population genetics models.

In this paper we discuss multi-allelic Gillespie–Sato diffusion models (hereafter G-S diffusions), which were introduced heuristically by Gillespie [7] in a di-allelic case and rigorously derived by Sato [19] in a multi-allelic case. In fact, they did not take effect of mutation into consideration, and what we actually study here are diffusion approximations obtained by Shiga [22,23]. (See also [24] for further development.) He proved not only the well- posedness of the processes in a countably infinite-allelic case but also derived certain measure-valued diffusion processes in a continuum limit of the space of alleles. However, our attempt will be made only for finitely-many- allelic cases because of technical difficulties, and we try to identify the case where the multi-allelic G-S diffusions have stationary reversible distributions, and to find explicit expressions of them.

According to [22], diffusion processes we will be concerned with are described as follows. Letd be an integer greater than 1 and R₊be the set of positive numbers. Suppose thatβ₁, . . . , βd∈R₊andγ₁, . . . , γd∈R are given.

We also need ad×d-matrix(λij)such thatλij0(i=j )andd

j=1λij=0. Thed-allele G-S diffusion has state space

Kd=

x=(x1, . . . , xd−1):x10, . . . , xd−10, xd:=1−x1− · · · −xd−10

, (1.1)

wherestands for the transpose. It is prescribed by the generator L=1

2

d−1

i,j=1

aij(x) ∂²

∂x_i∂x_j +

d−1

i=1

bi(x) ∂

∂x_i (1.2)

with coefficients

a_ij(x)=δ_ijβ_ix_i+x_ix_j _d

k=1

β_kx_k−β_i−β_j

(1.3) and

b_i(x)= d

j=1

λ_{j i}x_j+x_i

γ_i− d

j=1

γ_jx_j

, (1.4)

whereδ_ij denotes the Kronecker’s delta. Eachλ_ij represents the rate of mutation from theith allele, sayA_i, to thejth alleleA_j, andγ_i involves the effect of natural selection. It is convenient to introduce notation of the scalar product on R^d

ξ,η = d

i=1

ξ_iη_i, ξ=(ξ₁, . . . , ξ_d), η=(η₁, . . . , η_d)∈R^d.

Putβ=(β₁, . . . , β_d) andγ =(γ₁, . . . , γ_d). Letting x¯=(x₁, . . . , x_d₋₁, x_d)∈R^d forx∈K_d, we have rather simple expression of the above coefficients:

a_ij(x)=δ_ijβ_ix_i+x_ix_j

¯x,β −β_i−β_j , b_i(x)=

d

j=1

λ_{j i}x_j+x_i

γ_i− ¯x,γ .

(3)

Let 1=(1, . . . ,1)∈R^d. Ifβ=C1 for some C >0, then the G-S diffusion is nothing but the Wright–Fisher diffusion model (hereafter W-F diffusion), more precisely, the diffusion approximation for Wright–Fisher models.

(See e.g. [5,3].) Genetically speaking, βi’s come from the difference among alleles in variances of offspring numbers. More precisely, the diffusion approximations studied in [7,19,22,23] are based on certain multi-type branching processes, in the definition of which the variance of the offspring distribution of alleleA_i isβ_i plus a term negligible in large population limit. In the absence of mutation, Gillespie [7] discussed effects of variance in offspring numbers on the fitness of a genotype and on the probability of fixation.

In the present paper, mechanism of mutation is necessary for the process to have a nontrivial equilibrium state.

Actually, Shiga proved ([21], Theorem 3.2 and Remark 3.1 combined with the main result of [22]), under certain irreducibility condition we will also assume for the mutation rates, that the G-S diffusion has a unique stationary distribution and is ergodic. As mentioned above, main purpose of this paper is to find an explicit expression for the stationary distribution. But we do not intend to consider all of them since, even for the W-F diffusion case, only reversible stationary distributions are known explicitly. So what we have to do first is to identify the case where the G-S diffusion has a reversible distribution and then we shall compute it in the reversible case. Here is the main result of this paper.

Theorem 1.1. Suppose thatβ=C1 for anyC >0 and that(λij)₁i,jdis irreducible in the sense that for everyi andj there exist a chaini0, i1, . . . , imin{1, . . . , d}such thati0=i, im=j andλi_n₋₁i_n>0(n=1, . . . , m). Then the G-S diffusion has a reversible distribution if and only if the mutation rates are of uniform type, i.e.,

λ_ij=λ_kj(=:q_j/2), for alli, k∈ {1, . . . , d} \ {j}andj ∈ {1, . . . , d} (1.5) and

γ=Cβ+C1 for some constantsCandC. (1.6)

In the case where both(1.5)and(1.6)are satisfied, the unique stationary (reversible) distribution is given by ¯x,β⁻¹⁻^q,β⁻¹⁻^2C⁻¹

d

i=1

x_i^qⁱ^βⁱ⁻¹⁻¹dx₁· · ·dx_d₋₁/Z_β,q,C, (1.7) whereβ⁻¹=(β1−1, . . . , βd−1),q=(q1, . . . , qd)is given in(1.5), andZβ,q,Cis a positive finite constant that makes(1.7)a probability distribution onKd.

Remarks. (i) We should note difference between the G-S diffusion case and the W-F diffusion case. The latter is covered by a theorem of Li, Shiga and Yao [16]. They proved, under the same irreducibility assumption as above, that reversibility of the W-F diffusion is equivalent to only the condition (1.5). Thus no condition onγis required in this case. In addition, we can assumeβ=1 without loss of generality, and the stationary distribution is known [30] as

e²^¯^x,γ

d

i=1

x_i^qⁱ⁻¹dx₁· · ·dx_d₋₁/normalization.

Note also that distributions of this form are recovered from (1.7) by settingβ=1+C⁻¹γ orγ=Cβ−C1, which satisfies (1.6), and then lettingC→ ∞.

(ii) Clearly (1.7) generalizes Dirichlet distributions, that correspond to the case whereβ=C₁1 for someC₁>0.

Since (1.7) is absolutely continuous with respect to the Dirichlet distribution with parameter(q₁β₁⁻¹, . . . , q_dβ_d⁻¹) and the density is bounded above and uniformly positive, normalizability in (1.7) is obvious.

(iii) WhenC=0 in (1.7), an explicit expression of the normalizationZ_β,q,0will be given in Lemma 3.1 below.

(iv) In the case ofd=2, (1.5) and (1.6) are always satisfied, and the stationary distribution (1.7) is derived directly by using integration by parts in one dimension.

(4)

Heuristics behind the proof of Theorem 1.1 is as follows. In view of general facts (e.g. [14,18], [13, Chapter V, Theorem 4.6]) on symmetrizability of nondegenerate operators of the form (1.1), it seems natural to guess that reversibility of the process we consider would be equivalent to the condition that the “drift term” is of the form

b(x):=(b₁(x), . . . , bd−1(x))= −1

2a(x)∇H (x) for some functionH (x), (1.8) wherea(x)=(a_ij(x))₁_i,j_d₋₁and∇H (x)=(∂H /∂x₁, . . . , ∂H /∂x_d₋₁).Furthermore, this reduces to symme- tries

d−1

k=1

aik(x)∂b_j(x)

∂x_k =

d−1

k=1

aj k(x)∂b_i(x)

∂x_k , i, j∈ {1, . . . , d−1}. (1.9)

Regarding (1.9) as a family of identities between polynomials in x₁, . . . , x_d₋₁, one arrives at (1.5) and (1.6).

Moreover, after multiplying both sides of (1.8) by the inverse matrixa(x)⁻¹, which exists wheneverxbelongs to K_d⁺:=

x=(x₁, . . . , xd−1)∈Kd: x₁>0, . . . , xd−1>0, xd>0 (cf. (1.12) below), a functionH satisfying (1.8) onK_d⁺is found as

H (x)= d

i=1

q_iβ_i⁻¹log(x_i⁻¹)+

q,β⁻¹ +2C

log ¯x,β⁻¹. (1.10)

Lastly, the stationary distribution would be simply given by e⁻^{H (x)}

deta(x)₋1

dx₁· · ·dx_d₋₁/normalization. (1.11) Thus, we need to compute deta(x) and a(x)⁻¹. Such calculations are rather lengthy and summarized in Appendix A. In particular, it will be shown that

deta(x)=(β1x1)· · ·(βdxd) ¯x,β⁻¹. (1.12) Verification of (1.8) and (1.11) for the W-F diffusion can be found, for example, in [1, Appendix F].

In actual proof of Theorem 1.1, we take a strategy similar to [11], in which the same kind of problems are solved for a class of measure-valued diffusion processes of Fleming–Viot’s type. This class contains the W-F diffusions as finite-dimensional cases. The strategy allows one to avoid problems which would be caused by degeneracy ofa(x) on the boundary ofK_d(in R^d⁻¹) and is based on a transformation group{S_f: f ∈R^d⁻¹}onK_dwith property

d

duS_ufx=a(S_ufx)f, f =(f₁, . . . , f_d₋₁)∈R^d⁻¹, x∈K_d. (1.13) Technicalities regarding this group are collected in Appendix B. In this context, reversible distributions (if exist) are interpreted as distributions with certain quasi-invariance property (see Theorem 2.1 below), and some measure- theoretic considerations will yield (1.8), a key in the above heuristics. It is worth noting and observed in Appendix B that aσ-finite measure

deta(x)₋1

dx₁· · ·dx_d₋₁ onK_d⁺ (1.14)

which appeared in (1.11) is invariant under{Sf}, and that such invariant measures are unique up to multiplicative constants. We also emphasize that existence of the transformation group (1.13) crucially relies on a special structure of the diffusion matrix, i.e.,

a(x)⁻¹

ij= ∂²U

∂x_i∂x_j, x∈K_d⁺ (1.15)

(5)

for some functionU on K_d⁺. See Corollary A.1 in Appendix A for an explicit form ofU. It seems that such structure characterizes a class of diffusion models for which appropriate modification of methods in the present paper are available.

The rest of the paper is organized as follows. In the next section, we will prove Theorem 1.1 by means of results shown in Appendices A and B. Such results would be interesting in their own rights and useful in some other situations. As the last part (besides appendices) of this paper, we discuss some aspects of the reversible distributions obtained in Theorem 1.1, especially logarithmic Sobolev inequalities for the reversible G-S diffusions. This kind of inequalities is known as a powerful tool to study ergodic behaviors of the process and asymptotic stability of the equilibrium distribution. (See e.g. [8] for general accounts and various examples.) In our case these are shown to hold as a direct consequence of logarithmic Sobolev inequalities for the reversible W-F diffusions proved by Stannat [25] combined with remark (ii) after Theorem 1.1.

2. Quasi-invariance and reversibility

As for the scalar product on R^d⁻¹, we use notation (f, g)=

d−1

i=1

f_ig_i, f =(f₁, . . . , f_d₋₁), g=(g₁, . . . , g_d₋₁)∈R^d⁻¹.

Set also f ₁= |f₁| + · · · + |f_d₋₁|. As mentioned in the previous section, one of main tools in this section is the transformation group{S_f: f ∈R^d⁻¹}onK_dconstructed in Appendix B. For eachf ∈R^d⁻¹, the image ofx∈K_d byS_f is denoted as

S_fx=

(S_fx)₁, . . . , (S_fx)_d₋₁

∈K_d.

Properties of{S_f}we employ here are the following. The proofs are found in Appendix B.

(S.1) For eachi∈ {1, . . . , d},(S_fx)_i=0 wheneverx_i=0.

(S.2) S₀x=xandS_f(S_gx)=S_f₊_gx. In particular,S₋_f =(S_f)⁻¹. (S.3) S_fx∈K_d⁺if and only ifx∈K_d⁺.

(S.4) For anyx, y∈K_d⁺, there exists a uniquef∈R^d⁻¹such thaty=S_fx. (S.5) ^∂(S_∂f^f^x)ⁱ

j =aij(Sfx), i, j∈ {1, . . . , d−1}. (S.6) S_fxis continuously differentiable inx, and

D_x(S_fx)a(x)=a(S_fx), whereD_x(S_fx)=

∂(S_fx)_i

∂xj

1i,jd−1

. (S.7) There exist constantsC1andC2such that for allx, y∈Kd, f, g∈R^d⁻¹

S_fx−S_gy ₁

x−y ₁+C₁ f−g ₁ exp

C₂min{ f ₁, g ₁} . (S.8) For any nonnegative Borel functionF onK_d⁺

R^d⁻¹

F (Sfx) df1· · ·dfd−1=

K_d⁺

F (y)m(dy)=

K_d⁺

F (Sgy)m(dy),

wherex∈K_d⁺andg∈R^d⁻¹are arbitrary andm(dy)=dy1· · ·dyd−1/deta(y).

(6)

Denote byM1(Kd)the set of Borel probability measures onKd. Given a measurable functionΛon R^d⁻¹×Kd, we say that aν∈M1(Kd)is quasi-invariant under{Sf}with cocycleΛ if for everyf ∈R^d⁻¹,ν andν◦Sf

(=ν◦(S₋f)⁻¹)are mutually absolutely continuous with density given by d(ν◦S_f)

dν (x)=e^Λ(f,x), ν-a.s., (2.1)

which is equivalent to the condition that

K_d⁺

F (S₋_fx)ν(dx)=

K_d⁺

F (x)e^Λ(f,x)ν(dx), f ∈R^d⁻¹

holds for any nonnegative Borel functionF onK_d⁺. In this case, the chain rule, (S.2) and (2.1) together imply that forf, g∈R^d⁻¹

Λ(f+g, x)=Λ(f, Sgx)+Λ(g, x), ν-a.s. (2.2)

This is referred to as cocycle identity. Putb(x)=(b1(x), . . . , bd−1(x)).

Theorem 2.1. Letν∈M1(Kd). Thenνis a reversible distribution of the G-S diffusion if and only ifνis quasi- invariant under{Sf}with cocycle

Λ(f, x)=2 1 0

b(S_ufx), f

du. (2.3)

We prepare an equality which plays a key role in proving Theorem 2.1.

Lemma 2.1. LetΛbe given by(2.3). Fix an arbitraryf ∈R^d⁻¹. For anyG∈C¹(Kd), define Gt(x)=G(S₋tfx)exp

−Λ(tf, S₋tfx)

, t∈R.

ThenG_t∈C¹(K_d)and d

dtG_t(x)= −2

b(x), f

G_t(x)−

a(x)f,∇G_t(x)

. (2.4)

Proof. DefineVt(x)=Λ(tf, S₋tfx). We first compute∇Gt as

∇G_t(x)=e⁻^V^t^(x)∇(G◦S₋_tf)(x)−G_t(x)∇V_t(x). (2.5)

Moreover, by (S.6)

a(x)f,∇(G◦S₋_tf)(x)

=

a(x)f, D_x(S₋_tfx)∇G(S₋_tfx)

=

Dx(S₋tfx)a(x)f,∇G(S₋tfx)

=

a(S₋_tfx)f,∇G(S₋_tfx)

= −d

dtG(S₋_tfx), (2.6)

where (S.5) was used to show the last equality. Observing that Vt(x)=Λ(tf, S₋tfx)=2

t

0

(b(S₋ufx), f ) du, (2.7)

(7)

we see from (S.6) and (2.5) thatGt∈C¹(Kd). Applying calculations (2.6) toW (x):=(b(x), f )instead ofG(x), we get

a(x)f,∇V_t(x)

=2 t

0

a(x)f,∇(W◦S₋_uf)(x) du

= −2 t

0

d

duW (S₋_uf) du

= −2

b(S₋_tfx), f

−

b(x), f

= −d

dtV_t(x)+2 b(x), f

. (2.8)

Combining (2.5) with (2.6) and (2.8) yields a(x)f,∇G_t(x)

= −e⁻^V^t^(x)d

dtG(S₋_tfx)−G_t(x)

−d

dtV_t(x)+2(b(x), f )

= −d

dtG_t(x)−2

b(x), f G_t(x).

This proves (2.4). 2

Proof of Theorem 2.1. Firstly, we recall a fundamental fact on reversibility which is implied by Theorem 2.3 of Fukushima and Stroock [6]. Namely,ν∈M1(K_d)is a reversible distribution of the G-S diffusion if and only if the following symmetry holds:

(LF )(x)G(x)ν(dx)=

F (x)(LG)(x)ν(dx), F, G∈C²(Kd).

(All integrals are taken overK_d.) Note that this particularly implies that for anyF ∈C²(K_d),

(LF )(x)ν(dx)=0.

Since direct computation shows that (LF )(x)G(x)+F (x)(LG)(x)−

L(F G)

(x)= −

a(x)∇F (x),∇G(x)

, (2.9)

the above symmetry is equivalent to that

−

(LF )(x)G(x)ν(dx)=1

2 a(x)∇F (x),∇G(x)

ν(dx) (2.10)

holds for anyF, G∈C²(Kd). Moreover, by approximation, this can be replaced by the condition that (2.10) holds for anyF ∈C²(Kd)andG∈C¹(Kd).

Once (2.4) has been established, the following argument is standard (cf. [11], proof of Theorem 2.1). We shall describe it, using the same notation as in Lemma 2.1. TakingG∈C¹(Kd)andf ∈R^d⁻¹arbitrarily and integrating both sides of (2.4) with respect to aν∈M1(Kd), we have

b(x), f

G_t(x)ν(dx)+1

2 a(x)f,∇G_t(x)

ν(dx)= −1 2

d

dtG_t(x)ν(dx).

DefineF (x)=(x, f )to get a more suggesting form

(LF )(x)Gt(x)ν(dx)+1

2 a(x)∇F (x),∇Gt(x)

ν(dx)= −1 2

d dt

Gt(x)ν(dx). (2.11)

(8)

HereGt∈C¹(Kd)by Lemma 2.1. Ifν is a reversible distribution of the G-S diffusion, then the left-hand side of (2.11) vanishes by (2.10) and in particular

G₁(x)ν(dx)=

G₀(x)ν(dx)or

G(S₋fx)exp

−Λ(f, S₋fx)

ν(dx)=

G(x)ν(dx).

This shows the quasi-invariant property with desired density (2.1). Conversely, ifν is quasi-invariant under{Sf} with cocycleΛ, then the right-hand side of (2.11) vanishes. Thus (2.10) holds for any G∈C¹(Kd)andF such thatF (x)=(x, f )for somef ∈R^d⁻¹. Furthermore, an inductive argument shows that (2.10) can extend to all functionsF (x)=(x, f⁽¹⁾)· · ·(x, f⁽ⁿ⁾)withf⁽¹⁾, . . . , f⁽ⁿ⁾∈R^d⁻¹. Indeed, assuming that both (F =)F1andF2

satisfy (2.10) for anyG∈C¹(Kd), we have by (2.9)

(L(F1F2))(x)G(x)ν(dx)

=

(LF₁)(x)F₂(x)G(x)ν(dx)+

F₁(x)(LF₂)(x)G(x)ν(dx)+ a(x)∇F₁(x),∇F₂(x)

G(x)ν(dx)

= −1

2 a(x)∇F₁(x),∇(F₂G)(x)

ν(dx)−1

2 a(x)∇F₂(x),∇(F₁G)(x) ν(dx) + a(x)∇F1(x),∇F2(x)

G(x)ν(dx)

= −1 2

F₂(x)

a(x)∇F₁(x),∇G(x)

ν(dx)−1 2

F₁(x)

a(x)∇F₂(x),∇G(x) ν(dx)

= −1

2 a(x)∇(F1F2)(x),∇G(x) ν(dx).

Therefore, it follows from linearity of (2.10) in F that (2.10) holds true for all polynomials F (x). A suitable approximation procedure (see e.g. Appendix 7 in [3]) concludes that (2.10) are valid for allF ∈C²(K_d). This implies reversibility ofν. 2

In the next lemma, assuming existence and certain support property, we give a concrete expression of quasi- invariant distributions under{S_f}in terms of continuous cocycleΛ.

Lemma 2.2. (i) LetΛ:R^d⁻¹×Kd→R be continuous. Suppose that there exists aν∈M1(Kd)which is quasi- invariant under{Sf}with cocycleΛ. Ifν(K_d⁺)=1, then

Z(x):=

R^d−1

e^Λ(f,x)df₁· · ·dfd−1<∞ for allx∈K_d⁺, (2.12)

and

ν(dx)=Z(x)⁻¹m(dx). (2.13)

Moreover, for anyf ∈R^d⁻¹andx∈K_d⁺

Λ(f, x)=logZ(x)−logZ(Sfx). (2.14)

(ii) IfHis a continuous function onK_d⁺such that Z:=

K_d⁺

e⁻^{H (x)}m(dx) <∞,

(9)

thenν(dx):=e⁻^{H (x)}m(dx)/Zis a unique distribution onK_d⁺which is quasi-invariant under{Sf}with cocycle Λ(f, x)=H (x)−H (Sfx).

Proof. (i) For any nonnegative Borel functionF onK_d⁺, we have the following equalities by the quasi-invariance supposed.

K_d⁺

F (S₋_fx)ν(dx)=

K_d⁺

F (x)e^Λ(f,x)ν(dx), f ∈R^d⁻¹. (2.15)

By integrating both sides with respect todf1· · ·dfd−1 over R^d⁻¹, using Fubini’s theorem and then (S.8), and noting thatν(K_d⁺)=1, (2.15) becomes

K_d⁺

F (y)m(dy)=

K_d⁺

F (x)Z(x)ν(dx). (2.16)

Since the left-hand side is finite forF (x)=deta(x),Z(x) <∞, ν-a.e. Therefore, (2.16) proves (2.13). Especially, the support suppνofνcoincides withK_d. It is obvious from (S.4) that (2.12) is implied by (2.14). This equality is shown by virtue of cocycle identities (2.2) which hold for allx∈K_dandf, g∈R^d⁻¹since suppν=K_dandΛis continuous. Indeed, for anyx∈K_d⁺andf ∈R^d⁻¹

Z(S_fx)=

R^d−1

e^Λ(g,S^f^x)dg₁· · ·dg_d₋₁=e⁻^Λ(f,x)

R^d⁻¹

e^Λ(g⁺^f,x)dg₁· · ·dg_d₋₁=e⁻^Λ(f,x)Z(x), which proves (2.14).

(ii) LetF be an arbitrary nonnegative Borel function onK_d⁺. By the last equality of (S.8)

K_d⁺

F (x)e⁻^{H (x)}m(dx)=

K_d⁺

F (Sfx)e⁻^{H (S}^f^x)m(dx), f∈R^d⁻¹

or equivalently

K_d⁺

F (x)ν(dx)=

K_d⁺

F (S_fx)e^{H (x)}⁻^{H (S}^f^x)ν(dx), f∈R^d⁻¹.

ReplacingF byF◦S₋_f, we get (2.15) withΛ(f, x)=H (x)−H (S_fx). This shows the required quasi-invariance property. SinceHis continuous, uniqueness follows from the assertion (i). 2

For Λ given by (2.3), we shall verify the condition ν(K_d⁺)=1 under the irreducibility assumption of Theorem 1.1. SetI= {1, . . . , d}. DefineB=(λ_ij)_i,j_∈_I and decompose(b(x), f )into

b(x), f

= ¯x,Bfˆ + ζ(x),fˆ, f =(f₁, . . . , f_d₋₁)∈R^d⁻¹, (2.17) wherefˆ=(f₁, . . . , f_d₋₁,0)∈R^dandζ(x)=(ζ₁(x), . . . , ζ_d(x))is defined by

ζ_i(x)=x_i

γ_i− ¯x,γ

, i∈I.

Givenξ,η∈R^d, defineξ η∈R^dby ξ η=(ξ₁η₁, . . . , ξ_dη_d).

Observe that for allx∈K_dandξ∈R^d ζ(x),ξ

= ¯x,γ ξ − ¯x,γ ¯x,ξ, (2.18)

(10)

and that

ζ(x),ξω(γ)

¯ x,|ξ|

, (2.19)

whereω(γ)=max{|γ_i −γ_j|;i, j ∈I} and|ξ| =(|ξ₁|, . . . ,|ξ_d|) ∈R^d. Define ξˇ =(ˇξ₁, . . . ,ξˇ_d₋₁)∈R^d⁻¹ by ξˇ_i=ξ_i−ξ_d(i=1, . . . , d−1). It is obvious that

ξ= ˆ(ξˇ)+ξ_d1, ξ∈R^d. (2.20)

Lemma 2.3. Suppose thatB=(λij)i,j∈I is irreducible in the same sense as in Theorem 1.1. Ifν∈M1(Kd)is quasi-invariant under{S_f}with cocycleΛgiven by (2.3), thenν(K_d⁺)=1.

Proof. Letf ∈R^d⁻¹be given. Putρ=ω(γ). Define F (x)=exp

−2(1+ρ) 1

0

S_ufx,fˆdu−2 1 0

ζ(Sufx),fˆ du

. (2.21)

Then by (2.3) and (2.17) F (x)expΛ(f, x)=exp

−2 1 0

Sufx, (1+ρ)fˆ−Bfˆ du

. So the quasi-invariance (2.1) implies that

Kd

exp

−2 1

0

S_ufx, (1+ρ)fˆ−Bfˆ du

ν(dx)

=

K_d

exp

−2(1+ρ) 1 0

S₋ufx,fˆdu−2 1 0

ζ(S₋ufx),fˆ du

ν(dx). (2.22)

Since (2.22) holds true iffˆis replaced byfˆ+C1=:ξwithC∈R being arbitrary, we have also by (2.20)

Kd

exp

−2 1

0

S_uˇ_ξx, (1+ρ)ξ−Bξ du

ν(dx)

=

Kd

exp

−2(1+ρ) 1 0

S₋_uˇ_ξx,ξdu−2 1 0

ζ(S₋_uˇ_ξx),ξ du

ν(dx) (2.23)

for anyξ∈R^d. In the case whereξ_i0(i∈I ), it follows from (2.19) andρ=ω(γ)that

Kd

exp

−2 1

0

S_uˇ_ξx, (1+ρ)ξ−Bξ du

ν(dx)

Kd

exp

−2 1

0

S₋_u_ξ_ˇx,ξdu

ν(dx). (2.24)

Letk∈I be arbitrary and putε^(k)=(δik)i∈I∈R^d. For anyc >0, considerξ=cηwith η=

∞

0

e⁻⁽¹⁺^ρ)te^tBε^(k)dt and e^tB= ∞ n=0

tⁿ n!Bⁿ.

(11)

Then(1+ρ)η−Bη=ε^(k)and hence (2.24) becomes

K_d

exp

−2c 1 0

(S_uc_η_ˇx)kdu

ν(dx)

K_d

exp

−2c 1 0

S₋_uc_η_ˇx,ηdu

ν(dx). (2.25)

Here the irreducibility ofB implies that ηi >0 for eachi∈I. Therefore the right-hand side of (2.25) tends to 0 asc→ ∞, while it follows from (S.1) that the left-hand side is bounded from below byν({x∈Kd: xk =0}).

Consequently,xk>0, ν-a.s. Sincek∈I is arbitrary,ν(K_d⁺)=1 as required. 2

Before proving Theorem 1.1, we show a simple lemma. Definea(x)¯ =(a_ij(x))_i,j_∈_I, wherea_ij(x)are the same as in (1.3) even fori=dorj=d.

Lemma 2.4. Letξ∈R^dandg∈R^d⁻¹. Then d

dtStgx,ξ t=0

=

¯ a(x)g,ˆ ξ

. (2.26)

Proof. Noting that the left-hand side of (2.26) does not change its value after replacementξ →ξ+C1, we use (S.5) to get

d

dtS_tgx,ξ t=0

= d

dt(S_tgx,ξˇ) t=0

=

a(x)g,ξˇ

=g, a(x)ˇξ .

Hence (2.26) is easily shown by observing that(a(x)ξˇ)_i=(a(x)ξ)¯ i, i∈ {1, . . . , d−1}. 2

Proof of Theorem 1.1. First suppose that there exists a reversible distributionν∈M1(Kd)of the G-S diffusion.

According to remark (iv) after Theorem 1.1, we assume also thatd 3. By Theorem 2.1 ν is quasi-invariant under{Sf}with cocycleΛgiven by (2.3). Since we have both irreducibility ofB=(λij)and continuity ofΛ, Lemmas 2.2(i) and 2.3 together imply that there exists a functionH (x)(:=logZ(x))onK_d⁺such that

H (x)−H (S_tfx)=Λ(tf, x)=2 t

0

b(S_ufx), f

du, f ∈R^d⁻¹ and therefore

b(x), f

= −1 2

d

dtH (S_tfx) t=0

, f∈R^d⁻¹. (2.27)

We claim thatH is sufficiently smooth onK_d⁺. Fixingy∈K_d⁺arbitrarily, consider G(f ):=H (S_fy)=H (y)−2

1 0

b(S_ufy), f du

as a function off ∈R^d⁻¹. Then G is smooth by (2.17) and (S.5). By (S.4) the equation S_{Ψ (x)}y =x defines a mapΨ:K_d⁺→R^d⁻¹. This is nothing but the inverse of R^d⁻¹f →Sfy∈K_d⁺, whose Jacobian matrix is nondegenerate and smooth by (S.5). Hence the inverse function theorem implies smoothness ofΨ and accordingly of the compositionH=G◦Ψ.

Consequently (2.27) yields symmetry of the form

(12)

d

dt(b(Stgx), f ) t=0

= −1 2

∂²

∂t∂uH (Suf+tgx) t=0,u=0

= −1 2

∂²

∂t∂uH (Stf+ugx) t=0,u=0

= d dt

b(S_tfx), g

t=0

, f, g∈R^d⁻¹. (2.28)

Making use of (2.17), (2.18), and (2.26), one can compute the most right-hand side of (2.28) as d

dt

b(S_tfx), g t=0

= d

dtS_tfx,Bgˆ t=0

+ d

dtS_tfx,γgˆ t=0

− d dt

S_tfx,γS_tfx,gˆ t=0

=

¯

a(x)f ,ˆ Bgˆ +

¯

a(x)f ,ˆ γgˆ

−

¯

a(x)f ,ˆ γ

¯x,gˆ − ¯x,γ

¯

a(x)f ,ˆ gˆ

=

¯

a(x)f ,ˆ Bgˆ +

¯

a(x)f ,ˆ γ ˆ

g− ¯x,gˆ1

− ¯x,γ

¯

a(x)f ,ˆ gˆ

. (2.29)

Noting that this quantity does not change the value when replacingf andgbyfˆ+C₁1=:ξ andgˆ+C₂1=:η respectively and that the last term of (2.29) is symmetric inf andg, we see that (2.28) is equivalent to

J₁(x)[ξ,η] := ¯a(x)ξ,Bη +

¯

a(x)ξ,γ

η− ¯x,η1

=J₁(x)[η,ξ], ξ,η∈R^d. (2.30) Moreover, rewriting the bilinear formJ₁(x)in terms ofβby using Lemma A.1 in Appendix A and then removing a symmetric part, we obtain from (2.30)

J₂(x)[ξ,η] =J₂(x)[η,ξ], ξ,η∈R^d, (2.31)

where

J₂(x)[ξ,η] =

¯ x,β

ξ− ¯x,ξ1

Bη− ¯x,Bη1

−

¯ x,β

ξ− ¯x,ξ1

¯ x,γ

η− ¯x,η1

=

¯

x,βξ(Bη)

− ¯x,ξ

¯

x,β(Bη)

− ¯x,βξ ¯x,Bη + ¯x,β ¯x,ξ ¯x,Bη

−

¯x,βξ − ¯x,β ¯x,ξ

¯x,γ η − ¯x,γ ¯x,η

. (2.32)

Letε^(k), k∈I be the unit vectors in the proof of Lemma 2.3 and consider equalities

J₂(x)[ε⁽ⁱ⁾,ε^{(j )}] =J₂(x)[ε^{(j )},ε⁽ⁱ⁾], x∈K_d⁺, i, j∈I. (2.33) Clearly

J₂(x)[ε⁽ⁱ⁾,ε^{(j )}] =x_iβ_i(Bε^{(j )})_i−x_i ¯x,β·Bε^{(j )} −x_iβ_i ¯x,Bε^{(j )} +x_i ¯x,β ¯x,Bε^{(j )} −x_i

β_i− ¯x,β x_j

γ_j− ¯x,γ

. (2.34)

Given distincti, j∈I, take an arbitraryk∈I\ {i, j}. This is possible becaused3. We regard (2.33) as identities between polynomials withd−1 independent variables{x_l: l∈I, l=k}. Note that

¯x,β =β_k+

l∈I\{k}

x_l(β_l−β_k) (2.35)

and similarly

¯x,Bε^{(j )} =(Bε^{(j )})k+

l∈I\{k}

xl

(Bε^{(j )})l−(Bε^{(j )})k

=λkj +

l∈I\{k}

xl(λlj−λkj).

Comparing coefficients of a monomialx_iin (2.33), we get βi(λij−λkj)=0, i=j=k.

This makes it possible to defineqj=2λij (i=j )for eachj ∈I, and (1.5) has been derived. It follows from the irreducibility of(λij)i,j∈Ithatq:=(q1, . . . , qd)∈R^d₊.