• Aucun résultat trouvé

USEFUL BOUNDS ON THE EXTREME EIGENVALUES AND VECTORS OF MATRICES FOR HARPER'S OPERATORS

N/A
N/A
Protected

Academic year: 2021

Partager "USEFUL BOUNDS ON THE EXTREME EIGENVALUES AND VECTORS OF MATRICES FOR HARPER'S OPERATORS"

Copied!
28
0
0

Texte intégral

(1)

HAL Id: hal-02022492

https://hal.archives-ouvertes.fr/hal-02022492

Submitted on 18 Feb 2019

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

USEFUL BOUNDS ON THE EXTREME

EIGENVALUES AND VECTORS OF MATRICES FOR HARPER’S OPERATORS

Daniel Bump, Persi Diaconis, Angela Hicks, Laurent Miclo, Harold Widom

To cite this version:

Daniel Bump, Persi Diaconis, Angela Hicks, Laurent Miclo, Harold Widom. USEFUL BOUNDS ON THE EXTREME EIGENVALUES AND VECTORS OF MATRICES FOR HARPER’S OPERA- TORS. Large truncated Toeplitz matrices, Toeplitz operators, and related topics, 2017. �hal-02022492�

(2)

VECTORS OF MATRICES FOR HARPER’S OPERATORS

DANIEL BUMP, PERSI DIACONIS, ANGELA HICKS, LAURENT MICLO, AND HAROLD WIDOM

Abstract. In analyzing a simple random walk on the Heisenberg group we en- counter the problem of bounding the extreme eigenvalues of ann×nmatrix of the formM=C+DwhereCis a circulant andDa diagonal matrix. The dis- crete Schr¨odinger operators are an interesting special case. The Weyl and Horn bounds are not useful here. This paper develops three different approaches to getting good bounds. The first uses the geometry of the eigenspaces ofCand D, applying a discrete version of the uncertainty principle. The second shows that, in a useful limit, the matrixMtends to the harmonic oscillator onL2(R) and the known eigenstructure can be transferred back. The third approach is purely probabilistic, extendingM to an absorbing Markov chain and using hitting time arguments to bound the Dirichlet eigenvalues. The approaches allow generalization to other walks on other groups.

October 1, 2018 1. Introduction Consider then×nmatrix

(1) Mn =1

4

2 1 1

1

1 1

1

1 1

2 cos 2πjn

, 0jn1

As explained in [5] and summarized in Section 2, this matrix arises as the Fourier transform of a simple random walk on the Heisenberg group, as a discrete approx- imation to Harper’s operator in solid state physics and in understanding the Fast Fourier Transform. WriteM =C+D withCa circulant, (having 14 on the diago- nals just above and below the main diagonal and in the corners) andD a diagonal matrix (with diagonal entries 12cos 2πjn

for 0j n1). The Weyl bounds [20]

and Horn’s extensions [2] yield that the largest eigenvalueλ1(M)λ1(C) +λ1(D).

1991Mathematics Subject Classification. 60B15; 20P05.

Key words and phrases. Heisenberg group, almost Mathieu operator, Fourier analysis, random walk.

The first, second, third, and fifth authors would like to acknowledge partial support from NSF grants DMS 1001079, DMS 08-04324, DMS 1303761, and DMS 1400248 (respectively). The remaining author would like to acknowledge partial support from ANR grant number ANR-12- BS01-0019.

1

arXiv:1508.05986v2 [math.PR] 7 Nov 2015

(3)

Here λ1(C) =λ1(D) = 12 giving λ1(M)1. This was not useful in our applica- tion; in particular, we needλ1(M)1constn . This paper presents three different approaches to proving such bounds. The first approach uses the geometry of the eigenvectors and a discrete version of the Heisenberg uncertainty principle. It works for general Hermitian circulants:

Theorem 1. Let C be an n×n Hermitian circulant with eigenvalues λ1(C)

· · · ≥ λn(C). Let D be an n×n real diagonal matrix with eigenvalues λ1(D)

· · · ≥λn(D). Ifk, k0 satisfy 1k, k0n, kk0< n, then λ1(C+D)λ1(C) +λ1(D)

1

2min{λ1(D)λk(D), λ1(C)λk0(C)} 1 rkk0

n

!2 . Example 1. For the matrix Mn in (1), the eigenvalues of C andD are real and equal to 1

2cos 2πjn

0≤j≤n−1. For simplicity, take n odd. Then, writing λj = λj(C) =λj(D),λ1 = 12,λ2 =λ3 = 12cos n

, and λ2j+1 =λ2j = 12cos 2πjn for 1j n−12 . Choosek=k0=bc

ncfor a fixed 0< c <1. Then λk =1

2

11 2

πc n1/2

2 +O

1 n3/2

, 1 rkk0

n

!2

(1c)2 and the bound in Theorem 1 becomes

λ1(Mn)1π2 8

c2(1c)2

n +O

1 n3/2

.

The choice c= 12 gives the best result. Very sharp inequalities for the largest and smallest eigenvalues ofMnfollow from [3]. They get better constants than we have in this example. Their techniques make sustained careful use of the exact form of the matrix entries while the techniques in Theorem 1 work for general circulants.

The second approach passes to the largenlimit, showing that the largest eigen- values ofMnfrom (1) tend to suitably scaled eigenvalues of the harmonic oscillator L=14dxd22 +π2x2.

Theorem 2. For a fixedk1, the kth largest eigenvalue ofMn equals 1µk

n +o 1

n

withµk = (2k−1)π2 , the kth smallest eigenvalue ofL.

Theorem 2 gets higher eigenvalues with sharp constants for a restricted family of matrices. The argument also gives a useful approximation to thekth eigenvector.

Similar results (with very different proofs) are in [28].

There are many techniques available for bounding the eigenvalues of stochastic matrices ([24], [13], and [7]). We initially thought that some of these would adapt to Mn. However,Mnis far from stochastic: the row sums ofMn are not constant and the entries are sometimes negative. Our third approach is to letMn0 = 13I+23Mn. This is substochastic (having non-negative entries and row sums at most 1). If ai= 1P

jMn0(i, j), consider the (n+ 1)×(n+ 1) stochastic matrix:

(4)

(2) Mn00=

1 0 0 . . . 0 a1

a2

...

an .

M

n0

This has the interpretation of an absorbing Markov chain (0 is the absorbing state) and the Dirichlet eigenvalues ofMn00(namely those whose eigenvalues vanish at 0) are the eigenvalues ofMn0. In [5] path and other geometric techniques are used to bound these Dirichlet eigenvalues. This results in bounds of the form 1−const.n4/3 for λ1(Mn). While sufficient for the application, it is natural to want an improvement that gets the right order. Our third approach introduces a purely probabilistic technique which works to give bounds of the right order for a variety of similar matrices.

Theorem 3. There is ac >0 such that, for all n1 andMn defined at (1), the largest eigenvalue satisfiesλ1(Mn)1nc.

Section 2 gives background and motivation. Theorems 1, 2, and 3 are proved in Sections 3, 4, and 5. Section 6 treats a simple random walk on the affine group modp. It uses the analytic bounds to show that order p2 steps are necessary and sufficient for convergence. It may be consulted now for further motivation. The final section gives the limiting distribution of the bulk of the spectrum of Mn(a) using the Kac-Murdock-Szeg¨o theorem.

2. Background

Our work in this area starts with the finite Heisenberg group:

H1(n) =

1 x z 0 1 y 0 0 1

: x, y, zZ/nZ

.

Write such a matrix as (x, y, z), so

(x, y, z)(x0, y0, z0) = (x+x0, y+y0, z+z0+xy0).

Let

S={(1,0,0),(−1,0,0),(0,1,0),(0,−1,0)}and (3)

Q(g) = (1

4 gS 0 otherwise. (4)

ThusS is a minimal symmetric generating set forH1(n) and Qis the probability measure associated with ‘pick an element inS at random and multiply.’ Repeated steps of this walk correspond to convolution. For (x, y, z)H1(n),

Qk(x, y, z) = X

(x0,y0,z0)∈H1(n)

Q(x0, y0, z0)Qk−1((x, y, z)(x0, y0, z0)−1).

(5)

When k is large, Qk converges to the uniform distribution U(x, y, z) = n13 The rate of convergence ofQk toU can be measured by the chi-squared distance:

X

(x,y,z)∈H

|Qk(x, y, z)U(x, y, z)|2/(U(x, y, z)) = X

ρ∈Hˆ1 ρ6=1

dρkQ(ρ)ˆ kk2, (5)

On the right, the sum is over nontrivial irreducible representations ofH1(n) withρ of dimensiondρ and ˆQ(ρ)k =P

(x,y,z)Qk(x, y, z)ρ(x, y, z). For background on the Fourier analysis approach to bounded convergence see [8], Chapter 3.

For simplicity, (see [5] for the general case) take n =p a prime. Then H1(p) hasp21-dimensional representationsρa,b(x, y, z) =e2πip (ax+by)fora, binZp. It has p1p-dimensional representations. These act onV ={f :ZpC} via

ρa(x, y, z)f(w) =e2πiap (yw+z)f(x+w), 0ap1.

The Fourier transform ofQat ρa is the matrixMn(a) as in (1) with cos2πj

p

replaced by cos2πaj

p

for 0jp1.

The chi-squared norm in (5) is the sum of the (2k)th power of the eigenvalues so proceeding needs bounds on these. The details are carried out in [5]. The main results show thatk of ordern2 steps are necessary and sufficient for convergence.

That paper also summarizes other appearances of the matricesMn(a). They occur in discrete approximations of the ‘almost Mathieu’ operator in solid state physics.

In particular, see [31], [3], and [1]. If Fn is the discrete Fourier transform matrix (Fn)jk= 1ne2πijkn

; it is easy to see that FnMn(1) = Mn(1)Fn. Diagonalizing Fnhas engineering applications and having a ‘nice’ commuting matrix should help.

For this reason, there is engineering interest in the eigenvalues and vectors ofMn(1).

See [14] and [25].

3. Proof of Theorem 1

Throughout this section C is an n×n Hermitian circulant with eigenvalues λ1(C) λ2(C) ≥ · · · ≥ λn(C) and D is a real diagonal matrix with eigenvalues λ1(D)λ2(D)≥ · · · ≥λn(D). Let xbe an eigenvector of C+D corresponding to λ1(C+D). Recall that

(Fn)jk= 1ne2πijkn

for j, k Z/nZ. This has rows or columns which simultaneously diagonalize all circulants. Write ˆx=Fnxandxh for the conjugate transpose. We usekxk2=xhx.

Our aim is to prove that forkk0< n, λ1(C+D)λ1(C) +λ1(D) (6)

1

2min{λ1(D)λk+1(D), λ1(C)λk0+1(C)} 1 rkk0

n

!2

.

The first step is to write xhCx in terms of a Fourier transform pair ˆx = Fnx.

A subtle point is that although Fn diagonalizes C, the resulting diagonal matrix does not necessarily have entries in decreasing order, necessitating a permutation indexing in the following lemma.

(6)

Lemma 1. Define a permutationσsuch that e2πinσj−1b

n , 0bn1 is the eigenvector corresponding toλj(C). Then

xhCx= ˆxhD0xˆ (7)

withD0 =diag(λσ1(C), . . . , λσn(C)).

Proof. SinceC is diagonalized byFnh,FnCFnh=D0. Thus xhCx=xhFnhFnCFnhFnx= ˆxhD0x.ˆ

A key tool is the Donoho-Stark [15] version of the Heisenberg Uncertainty Prin- ciple. For this, call a vector y ‘-concentrated on a set S [n]’ if |xi| < for i /S.

Theorem 4(Donoho-Stark). Lety,yˆbe a unit norm Fourier Transform pair with y S-concentrated onS andy ˆ T-concentrated on T. Then

(8) |S||T| ≥n(1(S+T))2.

Let (y)S be the projection onto the subspace vanishing offS:

((y)S)i=

(yi iS 0 otherwise. A simple consequence of the bound (8) is

Corollary 1. Ifkk0 < n,z,zˆa unit norm Fourier transform pair andS andT are sets of sizek,k0, then

k(z)Sck2+k(ˆz)Tck2 1 2 1

rkk0 n

!2

.

Proof. Let S = k(z)Sck and T = k(ˆz)Tck. Then kz(z)Sk = k(z)Sck and zz)Tk=k(ˆz)Tck. Thusz isS concentrated onS and ˆzisT concentrated on T. Thus (8) gives

|S||T| n = kk0

n (1(S+t))2or (S+T)1 rkk0

n , so ifkk0n

k(z)Sck2+k(ˆz)Tck2= (2S+2T)1

2(s+T)2 1 2 1

rkk0 n

!2

Proof of Theorem 1. With notation as above,

λ1(C+D) =xh(C+D)x=xhCx+xhDx= ˆxhD0xˆ+xhDx=:∗.

LetD=Dλ1(D)I andD0=D0λ1(C)I. Then

= ˆxhλ1(C)Iˆx+ ˆxhD0xˆ+xhλ1(D)Ix+xhDx

=λ1(C) + ˆxhD0xˆ+λ1(D) +xhDx

(7)

NowDandD0have non-positive eigenvalues so our improvement over the Weyl bounds will follow by showing thatxor ˆxhave support on suitably negative entries ofD0 orD.

LetSandT correspond to the largestk,k0 entries ofD,D0, respectively. Then x= (x)S+ (x)Sc, ˆx= (ˆx)T+ (ˆx)Tc. Each of those decomposition is into orthogonal pieces. Multiplying any of the four pieces by an arbitrary diagonal matrix preserves this orthogonality. Thus

=λ1(C) +λ1(D) + (ˆx)hTD0x)T+ (ˆx)hTcD0x)Tc+ (x)hSD(x)S+ (x)hScD(x)Sc. For the last four terms on the right, terms 1 and 3 are bounded above by zero and 2 and 4 contribute with the following bounds:

∗ ≤λ1(C) +λ1(D) + (λk+1(D)λ1(D))k(x)Sck2+ (λk0+1(C)λ1(C)k(ˆx)Tck2

λ1(C) +λ1(D)

+ min{(λk+1(D)λ1(D)),k0+1(C)λ1(C)}

1 2 1

rkk0 n

!2

where the last line follows from the corollary.

Remarks. (1) These arguments work to give the smallest eigenvalue as well, so in fact we also have forll0< n:

λn(C+D)λn(C) +λn(D) (9)

+1

2min{λn−l(D)λn(D), λn−l0(C)λn(C)} 1 rll0

n

!2 . (2) Our thanks to a thoughtful anonymous reviewer, who pointed out that

Corollary 1 can be improved using Cauchy- Schwartz to show that for 0<

a, b1, 1

rkk0 n

!2

(s+T)2(a2S+b2T) 1

a+1 b

.

Setting a=λk+1(D)λ1(D) and b=λk+1(D)λ1(D), one can improve the previous theorem:

λ1(C+D)λ1(C) +λ1(D)

1(D)λk+1(D))(λ1(C)λk0+1(C)) 1(D)λk+1(D)) + (λ1(C)λk0+1(C)) 1

rkk0 n

!2 . In our case, the result is the same, since the eigenvalues of C and D are identical.

(3) Donoho and Stark [15] give many variations on their uncertainty principle suitable for other transforms. The techniques above should generalize, at least to theG-circulants of [9].

(4) There should be similar theorems withCandDreplaced by general Hermit- ian matrices and perhaps extensions to higher Weyl and Horn inequalities (see [2] and [18]).

(5) Further applications/examples are in Section 6.

(8)

4. The Harmonic oscillator as a limit.

We prove Theorem 2, that fork1 thekth largest eigenvalue ofMn is equal to 1µk/n+o(1/n) and thekth smallest eigenvalue ofMn is equal to−1 +µk/n+ o(1/n), whereµk is thekth smallest eigenvalue of

L=1 4

d2

dx2+π2x2

on (−∞,∞). By a classical computation (see [19]),µk = (2k1)/(2π).

Then×nmatrixMn hasj, k-entry 1

4[δ(jk1) +δ(jk+ 1)] +1

2cos(2πk/n)δ(jk), wherej, kZn=Z/nZ.

We define

M˜n =n(IMn).

This hasj, kentrym1(j, k) +m2(j, k), where (10) m1(j, k) = n

2

δ(jk)1

2[δ(jk1) +δ(jk+ 1)]

(11) m2(j, k) = n

2(1cos(2πk/n))δ(jk).

We will show first that if µ is any limit of eigenvalues of M˜n then µ is an eigenvalue ofL; and, second, that any eigenvalue µof Lhas a neighborhood that contains exactly one eigenvalue, counting multiplicity, of ˜Mnfornsufficiently large.

These imply the stated result.

These will be accomplished as follows. Give each point of Zn measure 1/ n, so the total measure equals

n. We then define an isometry T from L2(Zn) to L2(−

n/2,

n/2) (thought of as a subspace ofL2(R) with Lebesgue measure) for which the following hold:

Proposition 1. Suppose{un} is a sequence of functions of norm one inL2(Zn) such that the sequence {( ˜Mnun, un)} of inner products is bounded. Then {T un} has a strongly (i.e., in norm) convergent subsequence.

Proposition 2. Ifφ is a Schwartz function onRthenTM˜nTφstrongly.1 These will easily give the desired results. (See Propositions 3 and 4 near the end.) The final section 4.2 treats the smallest eigenvalues.

4.1. Proofs for the largest eigenvalues. We use two transforms (with, confus- ingly, the same notation). First, forφinL2(−

n/2,

n/2) we define

φ(`) =b Z

n/2

n/2

e−2πi`x/

nφ(x)dx, (`Z), and we have by Parseval (after making the substitutionx7→x

nin the integral)

(12) kφkb =n1/4kφk.

Herekφkb 2=P

`∈Z|φ(`)|ˆ 2.

1The operatorTacts onL2(− n/2,

n/2), soφis first to be restricted to (− n/2,

n/2).

(9)

ForuL2(Zn) we have its finite Fourier transform u(`) =b X

k

e−2πi`k/nu(k), (`Zn), and we compute below that

(13) kbuk=n3/4kuk.

Here

kbuk2=X

|u(`)|b 2,

the sum over any integer interval of lengthn. To show (13), we have

|u(`)|b 2=X

j,k

e−2πi`(j−k)/nu(j)u(k).

Sincee−2πi(j−k)/nis annth root of unity, equal to 1 only whenj=kinZn, we get kbuk2=X

`

|bu(`)|2=nX

k

|u(k)|2=n3/2kuk2.

Now we define the operatorT. LetJ be an interval of integers of lengthn(which later will be specified further) and set

Dn(x) =X

`∈J

e2πi`x.

ThenT is defined by

(T u)(x) = 1 n

X

k

Dn x

nk n

u(k).

ThusT has kernel

T(x, k) = 1 nDn

x

nk n

. By the definition of the inner product onL2(Zn) we find that

T:L2(− n/2,

n/2)L2(Zn) has kernel

T(k, x) = 1

nDn k

n x

n

. In terms of the transforms we have the following:

Lemma 2. (a) ForuL2(Zn),

T u(`) =c

1

nu(`)b if `J, 0 if `6∈J.

(b) ForφL2(− n/2,

n/2), Tdφ(`) =

nφ(`) whenb `J.

(10)

Proof. For (a), we have T u(`) =c 1

n

Z X

k, `0∈J

e−2πi`x/

ne2πi`0(x/

n−k/n)u(k)dx

= 1 n

Z X

`0∈J

e2πi(`0−`)x/

n

bu(`0)dx.

The result follows.

For (b), we have when`J, Tdφ(`) = 1

n X

k, `0∈J

Z

e−2πi`k/ne2πi`0(k/n−x/

n)φ(x)dx

= n

Z

e−2πi`x/

nφ(x)dx= nφ(`).b

We show two things aboutT. For the second we shall assume now and hereafter that the end-points of J are ±n/2 +O(1), although this is a lot stronger than necessary.

Lemma 3. (a)TT =I. (b)T TI strongly asn→ ∞.

Proof. By Lemma 2b,

T\T u(`) =

nT u(`c 0),

where`0J and`0`nZ. By Lemma 2a this in turn equalsbu(`0),which equals u(`) sinceb ubisn-periodic. This gives (a).

For (b) observe that T T is self-adjoint. Since (T T)2 =T TT T=T T, it is a (nonzero) projection and so has norm one. Therefore it suffices to show that if φis a Schwartz function thenT Tφφ. We have from Lemma 2a that

T T\φ(`) = 1

nTdφ(`)

if` J and equals zero otherwise. If` J then by Lemma 2b it equals φ(`). Itb follows that

kT T\φφkb 2=X

`6∈J

|φ(`)|b 2.

Integrating by parts shows that

φ(`) =b O( n/`),

and so, by our assumption onJ, the sum on the right side isO(1). Then by (12) we get

kT Tφφk=O(n−1/4).

Now the work begins. First, an identity. We introduce the notations

C(ξ) = 1cos(2πξ), S(ξ) = sin(πξ), and observe thatC(ξ) = 2S(ξ)2.

(11)

Lemma 4. ForuL2(Zn),

( ˜Mnu, u) =n−1/2kS(`/n)bu(`)k2+nkS(k/n)u(k)k2.

Note.Here and below we display “k” as the variable in the ambient space Zn and

“`” as the variable in the space Zn of the Fourier transform. We abuse notation and, for example, the “u(k)” above denotes the functionku(k).

Proof. We consider first the contribution of (10) to the inner product. If we define the operatorsA± by

(A±u)(k) =u(k±1), we see that the contribution to the inner product is

n

2(u[A+u+Au]/2, u).

Now

Ad±u(`) =e±2πi`/nu(`),b so if we use (13) we see that the above is equal to

1

2n−1/2(C(`/n)u,b bu(`)) =n−1/2kS(`/n)buk2.

To complete the proof of the lemma we note that the contribution to the inner product of (11) is clearly

n

2(C(k/n)u(k), u(k)) =nkS(k/n)u(k)k2.

Lemma 5. Supposeun satisfy ( ˜Mnun, un) =O(1). Then (a) kx T un(x)k=O(1), and (b)k(T un)0k=O(1).

Proof of (a). We have

ubn(`)bun(`+ 1) =X

k

e−2π`k/n(1e−2πik/n)un(k), the finite Fourier transform of (1e−2πik/n)un(k). We have,

|(1e−2πik/n)un(k)|= 2|S(k/n)un(k)|.

Therefore from (13) and

kS(k/n)un(k)k=O(n−1/2), which follows from Lemma 4, we get

kbun(`)bun(`+ 1)k=O(n1/4).

It follows from Lemma 2a thatT udn(`) =T udn(`+ 1) = 0 if both`, `+ 16∈J and

(14) X

`, `+1∈J

|T udn(`)T udn(`+ 1)|2=O(n−1/2).

If` J but `+ 16∈J thenT udn(`+ 1) = 0 and ` is the right end-point of J and thereforen/2 +O(1). From

(15) kS(`/n)bun(`)k=O(n1/4),

Références

Documents relatifs

We investigate multiplicity and symmetry properties of higher eigenvalues and eigenfunctions of the p-Laplacian under homogeneous Dirichlet boundary conditions on certain

The word-fnal accent pattern, accompanied by the /+h/ afx and word-fnal aspiration, displays a shift in accent from the non-fnal to the fnal syllable1. When the /+h/ afx is

Our inequalities extend those obtained by Niu and Zhang [26] for the Dirichlet eigenvalues of the sub-Laplacian on a bounded domain in the Heisenberg group and are in the spirit of

This work was partially supported by the strategic project POS- DRU 107/1.5/S/77265, inside POSDRU Romania 2007-2013 co-financed by the Euro- pean Social Fund-Investing in

A recent such example can be found in [8] where the Lipscomb’s space – which was an important example in dimension theory – can be obtain as an attractor of an IIFS defined in a

In this work, there is not only a general demonstration of the Kac-Rice formula for the number of crossings of a Gaussian processes, but also formulas for the factorial moments of

To study the well definiteness of the Fr´echet mean on a manifold, two facts must be taken into account: non uniqueness of geodesics from one point to another (existence of a cut

Key words: elliptic operators, divergence form, semigroup, L p estimates, Calder´on-Zygmund theory, good lambda inequalities, hypercontractivity, Riesz transforms,