New scaling of Itzykson-Zuber integrals

(1)

www.elsevier.com/locate/anihpb

New scaling of Itzykson–Zuber integrals

Benoît Collins

^a,^∗

, Piotr ´Sniady

^b

aResearch Institute for Mathematical Sciences, Kyoto university, Kyoto 606-8502, Japan bInstitute of Mathematics, University of Wroclaw, pl. Grunwaldzki 2/4, 50-384 Wroclaw, Poland Received 31 May 2005; received in revised form 26 September 2005; accepted 6 December 2005

Available online 17 January 2007

Abstract

We study asymptotics of the Itzykson–Zuber integrals in the scaling when one of the matrices has a small rank compared to the full rank. We show that the result is basically the same as in the case when one of the matrices has a fixed rank. In this way we extend the recent results of Guionnet and Maïda who showed that for the fixed rank scaling, the Itzykson–Zuber integral is given in terms of the Voiculescu’sR-transform of the full rank matrix.

Résumé

Nous étudions les asymptotiques de l’intégrale de Itzykson–Zuber dans le cas où l’une des matrices est de rang petit. Nous prouvons que l’asymptotique obtenue est la même que lorsque la matrice est de rang fixe. De cette manière, nous étendons les récents résultats de Guionnet et Maïda, qui prouvent que en rang fixe, l’intégrale de Itzykson–Zuber s’exprime à l’aide de la R-transformée de Voiculescu de la matrice en rang plein.

MSC:46L54; 15A52

Keywords:Itzykson–Zuber integral;R-transform

1. Introduction and the main result

For diagonal matricesA_N, B_N∈M_N(C)we consider the Itzykson–Zuber integral I_N^(β)(AN, BN)=

e^N^Tr^{U A}^N^U^B^Ndm^β_N(U ),

wherem^β_Ndenotes the Haar measure on the orthogonal groupONwhenβ=1, on the unitary groupUNwhenβ=2, and on the symplectic group Sp(N/2)whenβ=4 (in the latter case,Nis even). Usually one is interested in the study

* Corresponding author.

E-mail addresses:collins@math.kyoto-u.ac.jp (B. Collins), Piotr.Sniady@math.uni.wroc.pl (P. ´Sniady).

doi:10.1016/j.anihpb.2005.12.003

(2)

of the asymptotics of the Itzykson–Zuber integral as the sizeN of the matrices tends to infinity and for this reason one would like to have an insight into the limit

Nlim→∞

1

N²logI_N^(β)(A_N, B_N)=: ˜I^(β)(μ_A, μ_B) (1)

when the spectral measuresμ_A_N, μ_B_N converge weakly towards some probability measures onR. Existence of the limit of (1) was proved by Guionnet and Zeitouni [4].

In [1], the study of another scaling of Itzykson–Zuber integrals was initiated, namely when the rankM of the matrixA_N is small compared to the full rankN. In order to obtain a non-trivial limit one should consider rather the limit

Nlim→∞

1

MN logI_N^(β)(A_N, B_N). (2)

In this paper we shall say that a family of matricesB_N∈M_N(C)converges in distributionif and only if for all integerk,N⁻¹Tr(B_N^k)tends towards a finite limit asN→ ∞.

The first author proved that ifA_N=diag(t,0,0, . . .)has rank one and spectral measures ofB_Nconverge to some probability measureμ_Bthen for any integerk,

Nlim→∞

1 N

∂^k

∂t^klogI_N⁽²⁾(AN, BN) t=0

= d^k dt^k

t 0

Rμ_B(s)ds t=0

.

The functionRis called Voiculescu’sR-transform and is of central importance in free probability theory. We recall its definition in Section 2.

In this paper, the following hypothesis and notation about the sequence of matricesA_N ∈M_N(C)will be used frequently.

Hypothesis 1.A_N is diagonal and it has rankM(N )=o(N ). Its non-zero eigenvalues are denoted bya_1,N· · · a_M,N.

Using the results presented in our previous article [2] it is possible to prove that the following quite general statement holds true:

Theorem 2.Assume thatB_Nhas a limiting distributionμ_Band thatA_Nis uniformly bounded and satisfies Hypoth- esis1. Then forβ=1and2,

1 N M(N )

∂^k

∂t^klogI_N^(β)(t A_N, B_N) t=0

− β 2M(N )

M(N )

i=1

d^k dt^k

2t /β

0

R_μ(t a_i,N)ds t=0

=o(1).

Ifβ=4then

1 N M(N )

∂^k

∂t^klogI_N^(β)(t A_N, B_N) t=0

+ 1 M(N )

M(N )

i=1

d^k dt^k

t 0

R_μ(−t a_i,N)ds t=0

=o(1).

Observe that it is convenient to separate the caseβ=4 from the casesβ=1,2.

We shall not prove this theorem since it is an immediate consequence of Theorem 5.5 of [2]. On the other hand, it has been proved recently in Theorem 7 of [3], that if theL^∞-norm ofAN is bounded by a constant depending onBN

(see Theorem 4 in this paper), under Hypothesis 3 forBN, and provided thatM(N )=O(N^1/2⁻^ε)for someε >0, one has forβ=1,2:

1

N M(N )logI_N^(β)(A_N, B_N)− β 2M(N )

M(N )

i=1 2t /β

0

R_μ(sa_i,N)ds

=o(1). (3)

(3)

The main result of this paper is Theorem 6. It shows that Eq. (3) still holds true if one replaces the restriction M(N )=O(N^1/2⁻^ε)by Hypothesis 1, and that under additional assumption onBN, it is enough to assume that the L^∞-norm ofA_Nis uniformly bounded by any constant, independently onB_N.

The techniques used in this paper are elementary, and substantially simplify the proofs of [3]. Better, it becomes possible to express theR-transform as a limit of Guionnet–Zeitouni’s limits obtained in [4].

Letb_1,N· · ·b_N,N be the real eigenvalues of the diagonal matrixB_N and letμ_B_N be the probability measure N⁻¹N

i=1δb_i,N. We make the following hypothesis on the family of matrices(BN)N∈NwithBN∈MN(C):

Hypothesis 3. The probability measure μ_B_N converges weakly towards a compactly supported probability measure μB and that b1,N (resp. bN,N) converge towards the upper bound λmax (resp. the lower bound λmin) of the support ofμ_B.

The Harish–Chandra formula in the case whenβ=2 and when the eigenvalues have no multiplicity tells that I_N⁽²⁾(AN, BN)=det(e^N^·^a^i,N^b^j,N)^N_i,j₌₁

Δ(a_i,N)Δ(b_i,N) ,

whereΔdenotes Vandermonde determinant. In particular it implies that the value ofI_N⁽²⁾depends only on the eigenvalues ofA_N, B_Ntherefore the assumption thatA_N, B_N are diagonal can be removed in the caseβ=2. However a priori it cannot be removed in the other cases (although it is an open question to the authors’ knowledge).

We recall some definitions that can be found in [3]. For a compactly supported probability measureμ, callH_μits Hilbert transform

H_μ(z)= 1

z−λdμ(λ).

It is invertible under composition in a complex neighborhood of infinity; and let us callz⁻¹+R_μits inverse.R_μis called theR-transformof Voiculescu.

LetH_min=limz↑λminH_μ_B(z)andH_max=limz↓λmaxH_μ_B(z).

Definition.Forβ∈ {1,2,4}, a compactly supported probability measureμand a real parametert we definefμ^(β)(t ) as follows.

Ifβ=1,2, the function is f_μ^(β)(t )=t v(t )−β

2

log

1+2

βt v(t )− 2 βt λ

dμ(λ) with

v(t )=

⎧⎪

⎨

⎪⎩

Rμ(_β²t ) ifHmin^2t_β Hmax, λ_max−_2t^β if ^2t_β > H_max, λmin−_2t^β if ^2t_β < Hmin.

Observe that in the particular caset∈ [H_min, H_max], f_μ^(β)(t )=

2t /β

0

R_μ(st )ds.

In the caseβ=4 the function is defined by f_μ⁽⁴⁾(t )= −f_μ⁽²⁾(−t ).

One can prove that this function is continuous with respect tot and also with respect toμ_B for the metric on the measures given by

(4)

d(μ, μ)= |λmax−λ_max| + |λmin−λ_min| +sup

f (dμ−dμ), f _∞=1 andf of Lipschitz constant 1

. (4)

The following result of [3] will be of fundamental importance for us.

Theorem 4.([3], Theorems 1.2 and 1.6) Assume that the family of matricesB_N satisfies Hypothesis3. Forβ=1,2, for any real numbert,

Nlim→∞N⁻¹logI_N

B_N,diag(t,0, . . . ,0)

=f_μ^(β)

B (t ).

Note that although Guionnet and Maïda in [3] do not handle the caseβ=4, it is easy to extend their results to this case and we will take these results for granted.

At some points, we will need to make the following assumption on the matrixBN: Hypothesis 5.There exists a constantc >0 such that

b_i₊_1,N+ 1

cN b_i,N.

This assumption implies that the distribution ofμBhas an absolutely continuous part with respect to the Lebesgue measure whose density iscalmost everywhere on its support, and the support should be an interval.

With these preliminaries, we are ready to state our main result:

Theorem 6.Letβ∈ {1,2,4}.

(1) LetA_Nbe a sequence of matrices satisfying Hypothesis1, and H_min<lim inf

N a_N,Nlim sup

N

a_1,N< H_max.

Assume thatBNsatisfies Hypothesis3. Then 1

M(N )

i=1

f_μ^(β)

B(a_i,N) =o(1).

(2) LetANbe a sequence of matrices satisfying Hypothesis1. Assume thatBN satisfies Hypotheses3and5. Then 1

M(N )

i=1

f_μ^(β)

B(a_i,N) =o(1).

(3) Let μ, ν be compactly supported real probability measures. Let ν be supported in a compact subset of (Hmin, Hmax). Fora∈ [0,1], letνa=aν+(1−a)δ0. Then

alim→0a⁻¹I˜^(β)(ν_a, μ)=

t∈R

f_μ^(β)(t )dν(t ).

(4) Letμ, νbe compactly supported real probability measures. Assume thatμhas a connected support[λmin, λmax], and that there exists a constantc >0such that

c·dx1_[λmin,λmax]μ.

Then

alim→0a⁻¹I˜^(β)(ν_a, μ)=

t∈R

f_μ^(β)(t )dν(t ).

(5)

Remark.In Theorem 6, we do not handle the limit lima→0a⁻¹I˜^(β)(νa, μ)under the most general assumption that μ, νare compactly supported real probability measures. It is tempting to conjecture that Hypothesis 5 is irrelevant but we could not settle this question.

The next section is devoted to proving Theorem 6.

2. Proof of the main result

I_N^(β)satisfies the following obvious translation invariance property:

I_N^(β)(A, B)=e⁻^{N x}^Tr^AI_N^(β)(A, B+xId), (5)

therefore there is no loss of generality in assuming thatB_Nis positive.

We shall need the following technical results:

Lemma 7.LetB∈M_N(C)be Hermitian, with eigenvaluesb₁· · ·b_d0. LetΠ be a projector of rankN−1 andB=Π BΠand letb₁· · ·b_d=0its eigenvalues. Then one has

b₁b₁ b₂· · ·b_db_d=0.

Proof. LetV be the set of Hermitian projections of rankn+1−iandVbe the subset of projections ofV dominating 1−Π. According to the “minimax” theorem,b_i =minπ∈V π Bπ , andb_i₋₁=minπ∈V π Bπ . SinceV⊂V this implies thatb_i₊₁b_i. ReplacingBby B Id−Bshows thatb_ib_i, which concludes the proof. 2

The following lemma is the keystone towards the proof of Theorem 6:

Lemma 8.One has

M(N )

i=1

Cinf_i,NIN+1−i

N

N+1−idiag(ai,N,0, . . . ,0), Ci,N

IN(AN, BN)

M(N )

i=1

sup

Ci,N

IN+1−i

N

N+1−idiag(ai,N,0, . . . ,0), Ci,N

, (6)

where theinfandsupare taken over all diagonal matrices Ci,N=diag

c₁⁽ⁱ⁾, . . . , c_N⁽ⁱ⁾₊₁₋_i

such thatbkc⁽ⁱ⁾_k bk+i−1. (7)

Proof. We make this proof by induction onM(N ). ForM(N )=1 there is nothing to prove.

Suppose that the result holds at the levelM(N )−1. In the caseβ=2, we identifyUN−1with a subgroup ofUN

via the morphism U_N₋₁→

1 0 0 U_N₋₁

⊂U_N.

In the caseβ=1 we consider instead the embedding ofON−1intoON

O_N₋₁→

1 0 0 O_N₋₁

⊂O_N,

and in the caseβ=4, the embedding SPN−1into SPN

SPN−1→

Id2 0 0 SPN−1

⊂SPN.

We shall only deal with the caseβ=2, the casesβ=1,4 being similar.

(6)

Through the above identification, and using the invariance of Haar measure by convolution, we have I_N^(β)(A_N, B_N)=

U∈UN

V∈UN−1

exp

NTrA_NV U B_NU^∗V^∗

dm^β_N(U )dm^β_N₋₁(V ). (8) In the remainder of the proof, N is fixed, so in order to lighten the notation we omit the subscript N for the matricesANandBN, and replaceai,N byai (resp.bi,Nbybi).

Since a unitary conjugation ofAby a unitary leavesI_N^(β)(A, B)invariant, one can assume without loss of generality that

A=diag(a1, . . . , a_N)=

a₁ 0 0 A₋

withaiaj ifi < j. We shall also adopt the following notation:

B˜=U BU^∗=

b˜₁₁ B˜₁₋ B˜₋₁ B˜₋₋

.

Observe thatB˜₋₋∈M_N₋₁(C)andb˜₁₁∈Care random variables depending on the matrix random variableU. With these notations we have

I_N^(β)(A, B)=

U∈UN

V∈UN−1

exp

NTrAVBV˜ ^∗

dm^β_N(U )dm^β_N₋₁(V )

=

U∈U_N

V∈U_N−1

exp

N a₁b˜₁₁+NTrA₋VB˜₋₋V^∗

=

U∈U_N

exp N a1b˜11

V∈U_N−1

exp

NTrA₋VB˜₋₋V^∗

=

U∈UN

exp

N a₁b˜₁₁ I_N^(β)₋₁

N

N−1A₋,B˜₋₋

dm^β_N(U ). (9)

We are in position to apply the recursion hypothesis to I_N^(β)₋₁

N

N−1A₋,B˜₋₋

.

Lemma 7 implies that

M(N )

i=2 Cinfi,N

I_N₊₁₋_i N

N+1−idiag(ai,0, . . . ,0), Ci,N

I_N^(β)₋₁ N

N−1A₋,B˜₋₋

M(N )

i=2

sup

Ci,N

IN+1−i

N

N+1−idiag(ai,0, . . . ,0), Ci,N

, (10)

where the inf and sup are taken over the same set of matricesC_i,N as in (7).

In addition, it is obvious that

U∈U_N

exp N a1b˜11

dm^β_N(U )=I_N^(β)

diag(a1,0, . . . ,0), B .

The above remark and the inequality (10) plugged into Eq. (9) validate the recursion hypothesis at rankM(N )and complete the proof of the lemma. 2

We rephrase a continuity statement obtained by [3] suited for our purposes

(7)

Lemma 9.LetKbe a compact interval contained in(Hmin, Hmax). There exists a functiong:R+→R+continuous around0and such thatg(0)=0satisfying the following property:

For allε >0, for allt∈K, and for all Hermitian matricesB_N, B_N whose distributions have distanced (defined at Eq.(4))less thanεtoμ,

N⁻¹logI_N

B_N,diag(t,0, . . . ,0)

−N⁻¹logI_N

B_N ,diag(t,0, . . . ,0)g(ε).

Now we are in position to prove the first two statements of Theorem 6.

Proof of Theorem 6, statements (1) and (3). Under Hypothesis 1, one observes that d(μC_N,i, μ)→0,

wheredis the distance defined at Eq. (4) andC_N,iis any sequence of matrices fulfilling the condition (7).

Taking the logarithm in the inequality of Lemma 8, and using the continuity result quoted in Lemma 9 concludes the proof of statement (1).

A minor modification of the above reasoning also shows statement (3). 2

Remark.Lemma 8 and the strategy used in the above proof allow us to state and prove a version of Lemma 9 where one replaces diag(t,0, . . . ,0))by a matrixA_Nsatisfying Hypothesis 1. We skip these details.

The following lemma is a direct observation from the definitions, and we skip its proof.

Lemma 10.LetC_N,iandC_N,i be a pair of matrices fulfilling the condition(7). Under Hypothesis5, one has C_N,i−Id· i

cN C_N,i C_N,i+Id· i cN.

Now we are in position to prove statements involving Hypothesis 5.

Proof of Theorem 6, statements (2) and (4).

We prove statement (4), statement (2) being a straightforward adaptation. Fix a in the open interval (0,1)and choose sequences of random matricesA_N, B_N such thatμ_A_N→ν_aandμ_B_N→μ.

In agreement with statement (4) of Theorem 6, it is possible to choseB_Nsuch that it satisfies Hypotheses 3 and 5.

We choseA_N such that it satisfies Hypothesis 3, and such that its rankM(N )∼aN. Taking the logarithm of the inequality of Lemma 8 yields:

1 MN

M(N )

i=1

Cinf_N,ilogI_N₊₁₋_i N

N+1−idiag(ai,0, . . . ,0), CN,i

1

MNlogIN(A, B) 1 MN

M(N )

i=1

sup

CN,i

logIN+1−i

N

N+1−idiag(ai,0, . . . ,0), CN,i

, (11)

where inf and sup are taken overC_N,ias in (7).

Let C_i,N⁺ =sup{C, C∈C_i,N} and C_i,N⁻ =inf{C, C ∈C_i,N}. The notions of sup and inf are well-defined be- causeCi,Nconsists of diagonal matrices. Consider the function

φ_i,N:C→logI_N₊₁₋_i N

N+1−idiag(ai,0, . . . ,0), CN,i

.

Assume thatai >0. It is straightforward to check thatφi,Nis increasing onCi,N for anyi, Nin the sense thatC > C implies thatφ(C)φ(C). Therefore its sup is reached onC_i,N⁺ and its inf onC_i,N⁻ . Ifai<0, equivalent statements hold if one replaces “increasing” by “decreasing” and sup by inf.

Let us introduce C⁺_N= max

i:iM(N+i)C_N⁺₊_i,i and C_N⁻= min

i:iM(N+i)C⁻_N₊_i,i,

(8)

where the maximum is taken pointwise (the matrices are diagonal and the index set is finite).

It is a simple observation thatC_N⁺andC_N⁻have a limiting distribution and that this distribution can be expressed in simple terms as a function ofμanda. Let us call respectivelyμ⁺_a andμ⁻_a these distributions.

According to Theorem 4, the function t→N⁻¹logI_N

diag(t,0, . . . ,0), C_N⁺ is increasing and converges pointwise towardsf_μ+

a asN tends towards∞. Dini’s lemma shows that this convergence is uniform and Cesaro’s theorem proves that one has

I˜^(β)(ν_a, μ)

t∈R+

f^(β)

μ⁺_a (t )dνa(t )+

t∈R−

f^(β)

μ⁻_a (t )dνa(t ).

In the same way,

t∈R+

f^(β)

μ⁻_a (t )dνa(t )+

t∈R−

f^(β)

μ⁺_a (t )dνa(t )I˜^(β)(ν_a, μ).

In addition, Lemma 10 imply that there exists a constantK depending only on ν andμ such that for allt in the spectrum ofν,

f_μ₊

a(t )−f_μ−

a(t )caK

This concludes the proof of statement (4). 2 3. Concluding remarks

In comparison to [3], our method uses much weaker assumptions on the rank and norm of the matrices. Better, it allows us to state a continuity result about the limit ofI Zin a new scaling and thus validate an “inversion of limit”

phenomenon.

Last, it shows that Hypotheses 3 and 5 are relevant one to perform computations outside the phase transition zone (that is, to say something about the limit outside the open interval (Hmin, H_max)). This is a substantial improvement to the paper [3] in which nothing was proved at “low temperature” for the finite scaling of rankM >1.

Unfortunately, our approach heavily relies on real-number inequalities and we believe that finer estimates, as well as complex valued estimates of [3] cannot be established with our methods.

Acknowledgements

The authors benefited from an excellent working atmosphere in Banff free probability meeting in October 2004.

They also thank Alice Guionnet for stimulating discussions. Both authors gratefully acknowledge the referees’ careful reading and their improvement suggestions.

The research of P. ´S. was supported by State Committee for Scientific Research (KBN) grant 2 P03A 007 23, RTN network: QP-Applications contract No. HPRN-CT-2002-00279, and KBN-DAAD project 36/2003/2004. This author is a holder of a scholarship of European Post-Doctoral Institute for Mathematical Sciences. The research of B.C. was partly supported by a JSPS postdoctoral fellowship and by a RIMS COE postdoctoral fellowship.

References

[1] B. Collins, Moments and cumulants of polynomial random variables on unitary groups, the Itzykson–Zuber integral, and free probability, Int.

Math. Res. Not. 17 (2003) 953–982.

[2] B. Collins, P. ´Sniady, Integration with respect to the Haar measure on unitary, orthogonal and symplectic groups, Comm. Math. Phys., in press.

Preprint, arXiv:math-ph/0402073.

[3] A. Guionnet, M. Maïda, A Fourier view on theR-transform and related asymptotics of spherical integrals, J. Funct. Anal. 222 (2) (2005) 435–490.

[4] A. Guionnet, O. Zeitouni, Large deviations asymptotics for spherical integrals, J. Funct. Anal. 188 (2) (2002) 461–515.