HAL Id: hal-02919770
https://hal.archives-ouvertes.fr/hal-02919770v2
Preprint submitted on 3 Dec 2021
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Analysis of the Feshbach-Schur method for the Fourier Spectral discretizations of Schrödinger operators
Geneviève Dusson, Israel Sigal, Benjamin Stamm
To cite this version:
Geneviève Dusson, Israel Sigal, Benjamin Stamm. Analysis of the Feshbach-Schur method for the
Fourier Spectral discretizations of Schrödinger operators. 2021. �hal-02919770v2�
FOURIER SPECTRAL DISCRETIZATIONS OF SCHR ¨ ODINGER OPERATORS
GENEVI` EVE DUSSON, ISRAEL MICHAEL SIGAL, AND BENJAMIN STAMM
Abstract.
In this article, we propose a new numerical method and its analysis to solve eigenvalue problems for self-adjoint Schr¨ odinger operators, by combin- ing the Feshbach–Schur perturbation theory with the spectral Fourier discretiza- tion. In order to analyze the method, we establish an abstract framework of Feshbach–Schur perturbation theory with minimal regularity assumptions on the potential that is then applied to the setting of the new spectral Fourier dis- cretization method. Finally, we present some numerical results that underline the theoretical findings.
1. Introduction
In this article, we address the problem of the computation of eigenvalues of self-adjoint Schr¨ odinger operators (quantum Hamiltonians) of the form
H = − ∆ + V.
Our main results are a priori error estimates for the approximation error of the eigenvalue and eigenfunction for a new method allowing irregular potentials be- yond the regularity assumptions within the standard variational setting [2, 1, 12].
The main ingredient of this method and its analysis is a reduction of this infinite- dimensional problem to a finite-dimensional one in a fully controlled way with an effective estimate of the error terms, using the Feshbach–Schur map (FSM) method. This method originated in works of I. Schur on the Dirichlet problem in planar domains and H. Feshbach, on resonances in nuclear physics, and was then developed independently in numerical analysis, computational quantum chemistry and mathematical physics, see [16, 18] with the original techniques called variously the Feshbach projection and Schur complements methods.
We combine this approach with spectral Fourier discretizations which are widely used in numerical methods in electronic structure calculation, especially for con- densed matter simulations and in materials science, and known in this community as planewave discretization. Electronic structure calculation is indeed one of the problems we have in our sight. And one particular very useful aspect of planewaves is that they are eigenfunctions of the Laplace operator, entering the Hamiltonian H = − ∆ + V that needs to be diagonalized in order to determine the electronic structure of the system.
1
Our analysis is thus relying on non-variational perturbative techniques. Perturbation- based approaches have a long history in quantum mechanics. For example, dif- ferent perturbation methods have been proposed, such as [4, 22], traditionally to introduce more physical details e.g. many-particle interactions in a given approx- imation. The mathematical justification has been provided by the seminal work of Kato [21]. Perturbation methods have also been used to study van der Waals interactions between two hydrogen atoms in the dissociation limit in a mathemat- ically rigorous way in [7, 11]. More recently, a post-processing strategy has been proposed by some of the authors for planewave discretizations for non-linear eigen- value problems [8, 9, 10, 15], which considers the exact solution as a perturbation of the discrete (using the planewave basis) approximation.
This is in spirit not so far from so-called two-grid methods, where a first problem is solved on a coarse basis, i.e. in a small discretization space, and a small problem is solved on a fine basis. In the case of eigenvalue problems using a same Hamil- tonian H = − ∆ + V as in this article, two-grid and three-grid methods have been proposed e.g. in [14, 27] within a variational approximation. A two-grid method has also been proposed for nonlinear eigenvalue problems of a Gross–Pitaevskii type equation in [5].
As emphasized above, we extend in this article the FSM-method to establish finite-dimensional approximations based on the spectral Fourier basis to solve the Schr¨ odinger eigenvalue problem with controlled errors on the eigenvalues and eigenvectors. To be a little more concrete, we define a new problem in a coarse, finite-dimensional, subspace X
M⊂ X spanned by Fourier modes yielding the exact eigenvalue one would obtain when computing it in the infinite-dimensional space X.
Indeed, our contribution follows a new Ansatz based on the question: Can we find a discrete Hamiltonian acting on the finite-dimensional space X
Mthat has the exact eigenvalue λ
?of the original Hamiltonian (acting on X) as eigenvalue? It turns that the answer is yes, but that the discrete Hamiltonian depends itself on λ
?, through the Feshbach–Schur (FS) -map, leading to an eigenvalue problem in X
Mthat is nonlinear in the spectral parameter. Not surprisingly, the map cannot be computed exactly but only be approximated through a fast decaying series, that is truncated based on a parameter K , and which requires computations in a larger space X
Nwith X
M⊂ X
N⊂ X. However, this defines a new numerical method that is defined by the three parameters σ = (N, M, K) and that can be rigorously analysed. In this work we quantify the error introduced due to the discretization parameters σ = (N, M, K).
This article is organized as follows. In Section 2 we present the problem and numerical method that is used to find approximations thereof, as well as the main approximation result of the article and the error bounds on the eigenvalues.
Section 3 provides the above-mentioned abstract framework of Feshbach–Schur
perturbation theory based on the regularized version of form-boundedness whereas
Section 4 contains some technical results needed to prove the main result which
follows in Section 5. Finally, we present in Section 6 some numerical results to illustrate the convergence as well as the error bounds, and we conclude with some perspectives in Section 7.
2. Set-up and results
2.1. Problem statement. In order to simplify the notation, we consider a cubic lattice R = L Z
d(L > 0, d = 1, 2, 3), but all our arguments straightforwardly apply to the general case of any Bravais lattice. In this paper we are interested in the spectral theory of the self-adjoint Schr¨ odinger operators (quantum Hamiltonians)
H := − ∆ + V,
with reasonably regular, R -periodic potentials V , acting on the Hilbert space L
2per:=
u ∈ L
2loc( R
d)
u is R -periodic , endowed with the scalar product h u, v i = R
Ω
u(r) v(r) dr and the induced norm k · k , where Ω = [0, L)
dis the chosen fundamental cell of the lattice R = L Z
d. Note that V acts as a multiplicative operator, whereas V is either a function (in the more regular case) or a distribution (in the less regular case), whose regularity will be discussed shortly.
Specifically, we would like to solve the eigenvalue problem
H ϕ = λϕ, (1)
in a space X ⊂ H
1per. Here, H
1peris the Sobolev space of index 1 of periodic functions on Ω, which is defined in precise terms later on and equation (1) is considered in the weak sense.
To this end we use the Feshbach–Schur method to reduce the problem to a finite dimensional one. To simplify the exposition, we will assume that the eigenvalue of interest λ
?is isolated, which is true for the smallest eigenvalue under fairly general assumptions of V , see Theorems XIII.46 - XIII.48 of the textbook [25]. We denote by k · k the operator norm on L (L
2per), the space of bounded linear operators on L
2per. To formulate our condition on the potential V , we introduce the following norm measuring its regularity
k V k
r:= k ( − ∆ + Id)
−1/2+r/2V ( − ∆ + Id)
−1/2+r/2k ,
where the operator ( − ∆ + Id)
sis defined by the Fourier transform (cf. Appendix A). In what follows, we thus assume that the potential V satisfies the following condition.
Assumption 1. The potential V is real, R -periodic and satisfies k V k
r< ∞ for some r > 0.
Assumption 1 implies that V is ∆-form bounded [13, 24], which corresponds
to r = 0. The latter, weaker property implies that H (a) is self-adjoint; (b) is
bounded below and (c) has purely discrete spectrum (see e.g. [13, 24, 25, 20]).
Moreover, potentials V belonging to the Sobolev spaces, H
sper:= ( − ∆ + 1)
s/2L
2persatisfy this assumption as shown in Appendix A, Lemma 13 for r ≤ s + 1 and r < 1 +
s2−
d4. In terms of Sobolev spaces, Assumption 1 states that V , as an operator, maps H
1−rperinto H
−1+rper.
As an example, we note that the Coulomb potential V (x) =
|x|Zin three dimen- sions (d = 3) satisfies V ∈ H
sper, s < 1/2. Thus, in view of the above discussion we conclude that Assumption 1 is satisfied for any r < 1/2. For potentials V ∈ L
2per, thus with s = 0, the above discussion yields that Assumption 1 is satisfied for any r < 1 −
d4.
2.2. Approach. In our approach, we reduce the exact infinite dimensional eigen- value problem to a finite dimensional one in a controlled way for fairly irregular potentials. Of course, we have to pay a price for this, which is that at one point we solve a one-dimensional fixed point problem that can be equivalently seen as a non-linear eigenvalue problem. A key ingredient of our method is the finite di- mensional space and the corresponding orthogonal projection onto which we map the original problem to obtain a reduced, finite-dimensional one.
Let X
Mdenote the subspace of L
2perspanned by the eigenfunctions of − ∆ on R , with eigenvalues smaller than ρ
M, as
X
M=
X
k∈R∗,|k|≤M
ˆ u
ke
k(x)
ˆ
u
∗k= ˆ u
−k, u ˆ
k∈ C
, (2)
where e
k(x) = | Ω |
−12e
ik·x(Fourier modes, also called planewaves), R
∗=
2πLZ
d, and
ρ
M:=
2πM L
2.
Let P
Mbe the L
2per-orthogonal projection onto X
Mand P
M⊥:= Id − P
M. We consider the Galerkin approximation of the linear Hamiltonian H := − ∆ + V,
H
M:= P
M( − ∆ + V )P
M.
Let ϕ denote an eigenfunction of (1), introduce the projections ϕ
M= P
Mϕ and ϕ
⊥M= P
M⊥ϕ and project the exact eigenvalue problem (1) onto the subspace X
Mand its complement X
⊥Mto obtain
P
M( H
M− λ)ϕ
M= − P
MV ϕ
⊥M, (3)
P
M⊥( H
M⊥− λ)ϕ
⊥M= − P
M⊥V ϕ
M, (4)
where H
M⊥:= P
M⊥H P
M⊥. Here and in the remainder of this article, we abuse notation
and write λ instead of λId in order to denote the multiplicative operator. Next,
in Appendix A, we prove the following
Lemma 1. Let Assumption 1 hold and define κ
M:= ρ
M− (ρ
M+ 1) ρ
−rMk V k
r. Then
H
M⊥≥ κ
Mon Ran P
M⊥. (5) Here, Ran denotes the range (or image) of the following operator.
Thus for λ < κ
M, the operator H
M⊥− λ is invertible and we can solve (4) for ϕ
⊥Mand thus ϕ
⊥M= − ( H
M⊥− λ)
−1P
M⊥V ϕ
M. Substituting the result into (3), we obtain the non-linear eigenvalue problem
H
M+ U
M(λ)
ϕ
M= λϕ
M, (6)
where we introduced the effective interaction U
M(λ) : X
M→ X
M, or a Schur complement,
U
M(λ) := − P
MV P
M⊥( H
M⊥− λ)
−1P
M⊥V P
M. (7) We then have the following proposition, which is proved in Appendix A.
Proposition 1. For each λ such that λ < κ
M, U
M(λ) is a well-defined operator as a product of three maps: P
M, V and P
M⊥( H
M⊥− λ)
−1P
M⊥between various but matching Sobolev spaces.
Now, we construct a completely computable approximation of the eigenvalue problem (6), with the operators involved being sums of products of finite matrices.
Namely, we expand the resolvent ( H
M⊥− λ)
−1|RanP⊥ M= ( − ∆ + V
M⊥− λ)
−1|RanP⊥ Min (7) in the formal Neumann series in V
M⊥:= P
M⊥V P
M⊥, then truncate this series at K ∈ N and replace the projections P
M⊥= Id − P
Mby P
NM:= P
N− P
M, with N > M.
Introducing the notation
G
MN(λ) := ( − ∆ − λ) |
−1RanPNM
(8) and V
MN:= P
NMV P
NM, we obtain the following truncated effective interaction
U
σ(λ) := − P
MV P
NMR
σ(λ)P
NMV P
M, (9) where σ := (N, M, K) and R
σ(λ) := P
Kk=0
( − 1)
kh
G
MN(λ)V
MNi
kG
MN(λ). Since all the operators involved in (9) are finite matrices, this family is well-defined and computable. Now, we define H
σ(λ) := H
M+ U
σ(λ) on X
Mand consider the eigenvalue problem: find an eigenvalue λ
σiand the corresponding eigenfunctions ϕ
σi∈ X
Msuch that
H
σ(λ
σi)ϕ
σi= λ
σiϕ
σi. (10) Next, we define the approximate ‘lifting’ operator whose origin will be become clear in the next section:
Q
σ(λ) := Id − R
σ(λ)P
NMV P
M. (11)
Note that in the case of N = M there holds P
NM= 0, U
σ(λ) = 0, and thus, equation (10) (and (6)) simply reduces to the variational approximation involving the Hamiltonian H
M.
2.3. Main results. Within this manuscript, we denote by . upper bounds in- volving constants that do not depend on the parameters σ = (N, M, K), α, r, k V k
r. Then, we have the following result, whose proof will be provided in Section 5.
Theorem 1. Let Assumption 1 hold, let λ
?be an isolated eigenvalue of H of finite multiplicity m, with eigenfunctions ϕ
i, and let γ
0denote the gap between λ
?and the rest of the spectrum of H .
Then, there exists α > 0 and M
0∈ N such that for N ≥ M ≥ M
0, problem (10) has m solutions (ϕ
σi, λ
σi) ∈ X
M× [λ
?−
γ20, λ
?+
γ20] approximating (ϕ
i, λ
?) in the following sense:
| λ
?− λ
σi| . (λ
?+ α) k V k
2rα
rε(σ, r, V ), (12) k ϕ
i− Q
σ(λ
σi)ϕ
σik . k V k
r1 + λ
◦γ
0k V k
rα
rε(σ, r, V ), (13) where λ
◦= λ
?+ γ
0+ α and
ε(σ, r, V ) := ρ
−rN+ ρ
−rM4ρ
−rMk V k
r K+1. This Theorem is subject to several remarks.
Remark 1. Note that ε is equivalent to ε(σ, r, V ) ≈ N
−2r+ M
−2rh
4
2πL2rM
−2rk V k
ri
K+1,
where the equivalence constants do not depend on the parameters σ = (N, M, K), r, α, V . Remark 2. In some cases, for instance in multi-scale problems, one might be only interested in the coarse-scale solution, i.e. the best-approximation in the coarse space X
Mgiven by P
Mϕ
i. In such cases, a useful byproduct of the proof of Theorem 1 is the following estimate
k ( − ∆ + Id)
s(P
Mϕ
i− ϕ
σi) k . λ
◦γ
0k V k
2rα
rρ
sMε(σ, r, V ), (14) for any s ≥ 0, which thus compares the eigenfunctions in the space X
M.
Remark 3. Note that convergence of the eigenvalues and the eigenfunctions can be achieved by taking the limit K, N → ∞ for fixed M ≥ M
0. For practical purposes, the idea is to set N large enough so that the error is dominated by the error introduced in K < + ∞ .
Further, note that the eigenvalue and eigenvector errors have the same rate of
convergence with respect to K . However, the error in the eigenvector depends on
the gap γ
0while the error in the eigenvalue does not.
The estimate with respect to N in Theorem 1 is not sharp in all cases, in particular for sufficiently regular potentials V . Nonetheless, our analysis has the merit of presenting the convergence result in one combined analysis based on perturbative techniques which also holds for low regularities of the potential where standard a priori convergence results of the variational approximation do not hold.
In fact, still in the low regularity regime, an estimate of the variational problem can be obtained by setting K → ∞ or M = N .
Note that we can adapt the result whenever a priori approximation results are available by employing the triangle inequality. Indeed, if the potential V belongs to the Sobolev space V ∈ H
sper, with s > d/2, we resort to a priori results in a first place to obtain a sharp bound with respect to N , see e.g., [1, 6, 3], and also [23] for a certain class of discontinuous potentials in H
1/2−εfor all ε > 0, in two dimensions.
More precisely, we consider H acting on X
Ndirectly, i.e. substituting H by H
N:= P
NH P
Nand using X = X
Nwith variational solution (ϕ
N, λ
N), assuming a simple eigenvalue for simplicity. It is important to note that problem (10) remains unchanged and thus, the result of Theorem 1 holds with
e ε(σ, r, V ) := ρ
−rM4ρ
−rMk V k
rK+1,
but where the exact solution is substituted by (ϕ
N, λ
N). Proceeding then by the triangle inequality yields
| λ
?− λ
σi| ≤ | λ
?− λ
N| + | λ
N− λ
σi| ,
k ϕ
i− Q
σ(λ
σi)ϕ
σik ≤ k ϕ
i− ϕ
Nk + k ϕ
N− Q
σ(λ
σi)ϕ
σik .
Combining then the aforementioned a priori estimates from [1, 6, 3] for the first terms of the right hand sides with Theorem 1 for the latter parts yields the fol- lowing corollary.
Corollary 2. Under the conditions of Theorem 1 and if V ∈ H
sper, s > d/2, then
| λ
?− λ
σi| ≤ C N
−(2s+2)+ e ε(σ, r, V ) , k ϕ
i− Q
σ(λ
σi)ϕ
σik ≤ C N
−(s+2)+ ε(σ, r, V e )
, for some constant C > 0 independent on σ = (N, M, K).
Remark 4. Instead of the space X
Mdefined in (2) spanned by the eigenfunctions of − ∆ on R , we could have taken another finite dimensional approximation of the space L
2( R ).
Remark 5. Given a sequence of larger and larger finite dimensional spaces, one
could apply the FS maps consequently with larger and larger projections. In par-
ticular, this could lead to a version of a multigrid technique.
2.4. Theoretical background. Let us shed light on the theoretical foundation on the eigenvalue formulation in form of (6), instead of (1). For a pair of projec- tions P and P
⊥such that P + P
⊥= Id, we define the set D
Pof operators H on a Hilbert space such that H
⊥:= P
⊥HP
⊥is invertible on Ran P
⊥and the operators HP and P H are bounded. Furthermore, for an operator H ∈ D
P, we define the bounded operator
F
P(H) := P (H − HR
⊥H)P, (15)
where R
⊥:= P
⊥(H
⊥)
−1P
⊥, acting on the subspace Ran P . Our approach is based on the following results, originally presented in [18, Theorem 11.1].
Theorem 3. Let P and P
⊥be a pair of projections such that P + P
⊥= Id.
Then F
P: H → F
P(H), considered as a map from the subspace D
Pinto the space of bounded operators acting on the subspace Ran P , is isospectral in the following sense: if an operator H and a number λ ∈ R are such that H − λ ∈ D
P, then
(a) λ ∈ σ(H) ⇐⇒ 0 ∈ σ(F
P(H − λ));
(b) Hψ = λψ ⇐⇒ F
P(H − λ) ϕ = 0;
(c) dim N ull(H − λ) = dim N ullF
P(H − λ).
Moreover, ψ and ϕ in (b) are related as ϕ = P ψ and ψ = Q
P(λ)ϕ, where Q
P(λ) := P − P
⊥(H
⊥− λ)
−1P
⊥HP.
Finally, if H is self-adjoint, then so is F
P(H).
Here, N ull denotes the null space (or kernel) of the following operator. The map F
Pon the space of operators, is called the Feshbach–Schur map. The relation ψ = Q
P(λ)ϕ allows us to reconstruct the full eigenfunction from the projected one.
By statement (a), we have
Corollary 4. Let ν
i(λ) denote the i-th eigenvalue of the operator F
P(H − λ) + λ for each λ in an interval I ⊂ R . Then, there exists a bijection between the eigenvalues of H in I and the solutions of the equation
ν
i(λ) = λ.
In the current setting of the spectral Fourier approximations, P = P
M, P
⊥= P
M⊥, Proposition 1 implies that the results of Theorem 3 apply for each choice of M ∈ N and yield
F
PM( H − λ) = H
M(λ) − λP
M, (16) where we introduced the notation
H
M(λ) := H
M+ U
M(λ). (17)
Note that H
M(λ) is exactly the operator entering (6). Thus, we have the following.
Corollary 5. Let λ ∈ C with Re λ < κ
M. Then (a) H ψ = λψ ⇐⇒ ( H
M(λ) − λ) ϕ
M= 0;
(b) dim N ull( H − λ) = dim N ull( H
M(λ) − λ).
(c) ψ and ϕ in (a) are related as ϕ
M= P
Mψ and ψ = Q
M(λ)ϕ
M, where Q
M(λ) = Id − ( H
M⊥− λ)
−1P
M⊥V P
M. (18) i.e. the corresponding eigenfunction can be reconstructed from ϕ
Mby an explicit linear map.
This result shows that the original infinite-dimensional spectral problem (1) is equivalent to the finite dimensional spectral problem (6) which is nonlinear in the spectral parameter λ. We now state a few properties of the effective inter- action U
M(λ), in order to characterize the solutions of the fixed-point problems ν
i(λ) = λ. First, we give a definition. A family of bounded, self-adjoint opera- tors T (λ) : X → X, with λ ∈ I ⊂ R , is said to be monotonically decreasing if T (λ) < T (λ
0), in the sense of quadratic forms (i.e. h u, T (λ)u i < h u, T (λ
0)u i ), for all λ > λ
0. Similarly, one can define increasing, non-increasing, and non-decreasing families of operators. If T (λ) is a weakly differentiable family, then, by the fun- damental theorem of calculus, if T
0(λ) ≤ 0 and is not identically 0, then T (λ) is monotonically decreasing.
Proposition 2. For λ ∈ R such that λ < κ
M, U
M(λ) is (i) non-positive, (ii) monotonically decreasing with λ, (iii) vanishing as λ → −∞ . For λ ∈ C such that Re λ < κ
M, U
M(λ) is (iv) complex analytic in λ and (v) symmetric.
Proof. Properties (i)-(iv) follow directly from definition (7) and Lemma 1 above.
For the last one, we use that H is self-adjoint.
Proposition 3. Denote by ν
M i(λ) the i-th eigenvalue of H
M(λ) and assume that the i-th eigenvalue of H is less than κ
M. Then, the equation ν
M i(λ) = λ has a unique solution in the interval ( −∞ , κ
M).
Proof. Since, for λ < κ
M, U
M(λ) is symmetric, the operator H
M(λ) defined by (17) is (a) self-adjoint, (b) monotonically decreasing with λ, (c) converging to H
Mas λ → −∞ , (d) is complex analytic in λ for Re λ < κ
M. We deduce from (b) that the functions ν
M iare decreasing on ( −∞ , κ
M) and thus, if the i-th eigenvalue of H is less than κ
M, the equation ν
M i(λ) = λ has a unique solution (that equals the
i-th eigenvalue of H ).
Note also that lim
λ→−∞ν
M i(λ) is the i-th eigenvalue of H
Mwhich is larger than the i-th eigenvalue of H due to the variational principle.
These considerations motivate the numerical strategies to compute solutions to (10) in the following section.
2.5. Numerical strategy. In order to find solutions to the non-linear eigenvalue problem (10), we propose two strategies:
Strategy 1: For a fixed index i = 1, . . . , M , consider the sequence of iterates λ
(j)σobtained by
λ
(j)σ: is the i-th eigenvalue of H
σ(λ
(j−1)σ). (19)
We thus introduce the notation ν
σi(λ) denoting the i-th eigenvalue (counting mul- tiplicities) of the Hamiltonian H
σ(λ) and thus have λ
(j)σ= ν
σi(λ
(j−1)σ). The limit value λ
σ:= lim
j→∞λ
(j)σthen satisfies λ
σ= ν
σi(λ
σ) and thus (10).
Strategy 2: For a given target value λ
t∈ R , consider the sequence of iterates λ
(j)σobtained by
λ
(j)σ: is the eigenvalue of H
σ(λ
(j−1)σ) closest to λ
t. (20) We thus introduce the notation ν
σt(λ) denoting the eigenvalue of the Hamiltonian H
σ(λ) closest to λ
tand thus have λ
(j)σ= ν
σt(λ
(j−1)σ). The limit value λ
σ:=
lim
j→∞λ
(j)σthen satisfies λ
σ= ν
σt(λ
σ) and thus (10).
In both cases, as outlined in the upcoming Remark 10, convergence of the fixed- point maps (19) and (20) can be guaranteed under some conditions and for N, M large enough.
Remark 6. This numerical strategy can be easily extended for eigenvalues with multiplicity higher than one. Indeed, using Proposition 3, provided that the sought eigenvalue is less than κ
M, one can iteratively compute a given number of the low- est eigenvalues of the operator H
σ(λ
σ), with λ
σupdated at each iteration, matching a close value of the multiple eigenvalue. At convergence, this eigenvalue and the multiplicity will match the exact one, up to an error given in Theorem 1. Regard- ing bands of eigenvalues, one could as well iteratively compute a given number of the lowest eigenvalues of the operators H
σ(λ
σ i), with the λ
σ iupdated at each iteration, but this multiplies the number of problems that have to be solved by the number of eigenvalues in the band. To do it in an efficient manner would re- quire to consider a density-matrix formalism, in order to consider all interesting eigenvectors together, but this goes beyond the scope of the present paper.
The complexity of the numerical method can be summarized as follows:
(1) We assume that the resolution of the non-linearity with the iterative scheme (19) or (20) requires J iterations.
(2) At each iteration j = 1, . . . , J , one requires the resolution of an eigenvalue problem with O(M
d) degrees of freedom and we assume that this problem is solved with an iterative solver using J
evpMmatrix-vector multiplications.
(3) The computation of one matrix-vector product corresponding to the Hamil- tonian H
M+ U
M(λ) = − P
M∆P
M+ P
MV P
M+ U
M(λ) contains three parts.
(i) the matrix corresponding to P
M∆P
Mis diagonal in the Fourier basis;
(ii) the application of P
MV P
Mcan be effected using the Fast Fourier Trans- form (FFT) on the grid X
Mand scales as M
dlog(M
d);
(iii) the matrix-vector product corresponding with the application of the
multiplicative potential U
σ(λ). For convenience we recall here its defini- tion:
U
σ(λ) = − P
MV P
NMK
X
k=0
( − 1)
kh
G
MN(λ)V
MNi
kG
MN(λ)
!
P
NMV P
M, with G
MN(λ) = ( − ∆ − λ) |
−1RanPNM
. The application of G
MN(λ) scales as N
d− M
dand the application of the potentials P
MV P
NM, V
MNand P
NMV P
Mcan be performed again using the FFT on the entire grid X
Nand it scales as N
dlog(N
d).
The overall complexity is thus proportional to J × J
evpM×
M
d+ M
dlog(M
d) + (K + 3)N
dlog(N
d) + (K + 2)(N
d− M
d) In contrast, following the same notation, the variational problem requires a com- plexity of
J
evpN× N
dlog(N
d)
Therefore in order to compare the methods, it depends on J × J
evpM(K + 3) versus J
evpNat comparable accuracy.
Finally, we emphasize that the focus of the present paper is to present the numerical analysis of this new method, which covers cases where the analysis of the standard variational approximation does not hold and efficiency becomes a secondary factor.
3. Perturbation estimates
In this article, we often deal with the following eigenvalue perturbation problem:
Given an operator H on a Hilbert space X of the form
H = H
0+ W, (21)
where H
0is an operator with some isolated eigenvalues and W is small in an appropriate norm, show that H has eigenvalues near those of H
0and estimate these eigenvalues and the corresponding eigenvectors. We therefore start by presenting an abstract theory which will be applied to our concrete problem in the following sections.
Specifically, we assume that H and H
0are self-adjoint and bounded from below and that W is α-form-bounded w.r.t. of H
0, in the sense that for α ∈ R such that H
0+ α is a positive operator (H
0+ α > 0), we have
k W k
H0,α:= k (H
0+ α)
−1/2W (H
0+ α)
−1/2k < ∞ , (22) where (H
0+ α)
−s, s > 0, is defined either by the spectral theory or by the explicit formula
(H
0+ α)
−s:= c
αZ
∞0
(H
0+ α + ω)
−1dω/ω
s,
where c
α:= [ R
∞0
(α + ω)
−1dω/ω
s]
−1. This notion is equivalent to that of the relative form-boundedness, but it gives an important quantification of the latter.
We also note here that, by a known result about relatively form-bounded opera- tors (see e.g. [24, 20]), if H
0is a self-adjoint, bounded below operator on X and W is symmetric and α-form-bounded w.r.t. of H
0, then H = H
0+ W is self-adjoint.
We start with a general result on the eigenvalue difference.
Proposition 4. Let H
0be a self-adjoint bounded below operator on X and W symmetric and α-form-bounded w.r.t. of H
0, and let H = H
0+ W . Let α ∈ R be such that H
0+ α > 0. Then the eigenvalues of H and H
0satisfy the estimates
| ν
i(H) − ν
i(H
0) | ≤ (ν
i(H
0) + α) k W k
H0,α, (23) where ν
i(A) denotes the i-th eigenvalue of the operator A.
Proof. Let u ∈ X be arbitrary and define v = (H
0+α)
1/2u noting that (H
0+α) >
0. Then,
h u, Hu i = h u, H
0u i + h v, (H
0+ α)
−1/2W (H
0+ α)
−1/2v i . Note that
h v, (H
0+ α)
−1/2W (H
0+ α)
−1/2v i ≤ k W k
H0,αh v, v i = k W k
H0,αh u, (H
0+ α)u i , and therefore
h u, H
0u i (1 − k W k
H0,α) − α k u k
2k W k
H0,α≤ h u, Hu i
≤ h u, H
0u i (1 + k W k
H0,α) + α k u k
2k W k
H0,α. Using the min-max principle (Courant–Fisher), there holds
ν
i(H
0) (1 − k W k
H0,α) − α k W k
H0,α≤ ν
i(H) ≤ ν
i(H
0) (1 + k W k
H0,α) + α k W k
H0,α,
which leads to the result.
Let us now assume that λ
0is an isolated eigenvalue of H
0of finite multiplicity m and let P
0be the orthogonal projection onto the span of the the eigenfunctions of H
0corresponding to the eigenvalue λ
0, and let P
0⊥:= Id − P
0. We further introduce H
0,−λ:= H
0⊥− λP
0⊥and thus H
0,α= H
0⊥+ αP
0⊥.
Let γ
0denote the gap of λ
0to its closest eigenvalue in the remaining spectrum of H
0and we introduce the spectral interval I
0= [λ
0−
12γ
0, λ
0+
12γ
0].
Our next result gives estimates on the difference of eigenvectors of H and H
0, as well as on the difference of their corresponding eigenvalues. For standard ap- proaches to the spectral perturbation theory, see [26, 21, 25, 20].
Theorem 6. Let H
0be a self-adjoint bounded below operator on X, with the
eigenvalue λ
0as above, and W symmetric and α-form-bounded w.r.t. H
0, and let
H = H
0+ W . Let α ∈ R be such that H
0+ α > 0. If k W k
H0,α≤
12λ0γ+α0, then
the self-adjoint operator H has exactly m eigenvalues (counting the multiplicities), denoted by µ
i, in the interval I
0= [λ
0−
12γ
0, λ
0+
12γ
0] which satisfy
| µ
i− λ
0| ≤ (λ
0+ α) k W k
H0,α≤ 1
2 γ
0. (24)
Further, if k W k
H0,α≤
14γλ0◦, then any normalized eigenfunction, ψ
i, of H for the eigenvalue µ
isatisfies the estimates
k H
0,α1/2(ψ
0i− ψ
i) k ≤ 4 λ
◦γ
0(λ
0+ α)
1/2k W k
H0,α, (25) k ψ
0i− ψ
ik ≤ 4 λ
◦γ
0k W k
H0,α, (26) where λ
◦= λ
0+α +γ
0and ψ
0iis an appropriate eigenfunction of H
0corresponding to the eigenvalue λ
0, namely ψ
0i:= P
0ψ
i.
Remark 7. We note that similar estimates can be obtained for normalized eigen- functions ψ e
0i:= P
0ψ
i/ k P
0ψ
ik with an additional factor 2 using the estimate k ψ e
0i− ψ
ik ≤
1 − k ψ
0ik
+ k ψ
0i− ψ
ik ≤
k ψ
ik − k ψ
0ik
+ k ψ
0i− ψ
ik ≤ 2 k ψ
0i− ψ
ik . We first develop the following preliminary results.
Lemma 2. Let α ∈ R be such that H
0+ α > 0 and λ
◦:= λ
0+ γ
0+ α. Then, for all λ ∈ I
0c:= { z ∈ C : Re z ∈ I
0} , there holds | P
0⊥− (λ + α)H
0,α−1P
0⊥| ≥
2λγ0◦.
Proof. The eigenvalues of | P
0⊥− (λ + α)H
0,α−1P
0⊥| on Ran P
0⊥are
1 − λ + α λ
0i+ α
=
λ
0i− λ λ
0i+ α ,
where λ
0idenotes the eigenvalues of H
0and the index i runs over all eigenvalues except i such that λ
0i= λ
0. For λ ∈ I
0c, we write λ = λ
r+ iλ
i, with λ
r∈ I
0, λ
i∈ R . Since for any x ∈ R , | x − λ | ≥ | x − λ
r| we have thus to study the function
f(x) =
x − λ x + α
, x ∈ K
α:= [ − α, + ∞ ) \ (λ
0− γ
0, λ
0+ γ
0), for λ ∈ I
0in order to lower bound the eigenvalues. Since
f
0(x) = x − λ
| x − λ | · α + λ (x + α)
2, if α + λ ≤ 0, there holds f
0(x) < 0 for x > − α so that
x∈K
min
αf(x) ≥ 1.
If α + λ > 0, there holds
f
0(x) < 0 for x < λ, f
0(x) > 0 for x > λ,
and thus, for λ ∈ I
0,
x∈K
min
αf (x) = min (f (λ
0− γ
0), f(λ
0+ γ
0)) = min
| λ
0− γ
0− λ |
λ
0− γ
0+ α , | λ
0+ γ
0− λ | λ
0+ γ
0+ α
≥ 1 2
γ
0λ
◦,
yielding the result.
Denote H
⊥:= P
0⊥HP
0⊥|
RanP0⊥and R
⊥(λ) := P
0⊥(H
⊥− λ)
−1P
0⊥. We have Lemma 3. Let α ∈ R be such that H
0+ α > 0 and λ
◦:= λ
0+ γ
0+ α. Let I
0c:= { z ∈ C : Re z ∈ I
0} and assume k W k
H0,α≤
14λγ0◦. Then, for λ ∈ I
0c, the following statements hold
(a) The operator H
⊥− λ is invertible on Ran P
0⊥;
(b) The inverse R
⊥(λ) := P
0⊥(H
⊥− λ)
−1P
0⊥defines a bounded, analytic operator- family;
(c) The expression
U(λ) := − P
0HR
⊥(λ)HP
0(27) defines a finite-rank, analytic operator-family and bounded as
k U(λ) k
H0,α≤ 4 [λ
◦/γ
0] k P
0W P
0⊥k
2H0,α≤ 4 [λ
◦/γ
0] k W k
2H0,α. (28) Further, U (λ) is symmetric for any λ ∈ I
0.
Proof. (a) Since H
⊥is self-adjoint, the operator H
⊥− λ is invertible for any λ ∈ C\R . For λ ∈ I
0, we argue as follows. With the notation A
⊥:= P
0⊥AP
0⊥|
RanP0⊥, we write
H
⊥= H
0⊥+ W
⊥. Now, we write
H
⊥− λP
0⊥= H
0,α1/2[P
0⊥− (λ + α)H
0,α−1+ K
λ]H
0,α1/2, (29) with K
λ= H
0,α−1/2W
⊥H
0,α−1/2. Lemma 2 yields that | P
0⊥− (λ + α)H
0,α−1P
0⊥| ≥
2λγ0◦and thus, the operator Id − (λ + α)H
0,α−1+ K
λis invertible as soon as k K
λk <
γ0
2λ◦
, which is in particular the case if k W k
H0,α≤
14γλ0◦. Then, we also have Id − (λ + α)H
0,α−1+ K
λ≥
14γλ0◦. Hence the operator H
⊥− λ is a product of three invertible operators and therefore is invertible itself on Ran P
0⊥.
For (b), since H
⊥− λ is invertible on Ran P
0⊥, the interval I
0is contained in the resolvent set, ρ(H
⊥|
RanP0⊥), of H
⊥|
RanP0⊥and therefore, since H
⊥|
RanP0⊥is self-adjoint, I
0c⊂ ρ(H
⊥|
RanP0⊥). Since
R
⊥(λ) := P
0⊥(H
⊥− λ)
−1P
0⊥(30) is the resolvent of the operator H
⊥− λ restricted to Ran P
0⊥, it is analytic on its resolvent set and in particular on I
0c.
To prove statement (c), we note that the operators R
⊥(λ), HP
0and P
0H =
(HP
0)
∗are bounded and R
⊥(λ) is symmetric for λ ∈ I
0. Hence so is U (λ). The
analyticity of U(λ) follows from the analyticity of R
⊥(λ). It it clear that U(λ) is of finite rank due to its definition.
Finally, to prove estimate (28), we first show that the operator R
⊥(λ) :=
P
0⊥(H
⊥− λ)
−1P
0⊥, λ ∈ I
0c, satisfies
k H
0,α1/2R
⊥(λ)H
0,α1/2k ≤ 4 λ
◦γ
0. (31)
To this end, we invert (29) on Ran P
0⊥and use that
Id − (λ + α)H
0,α−1+ K
λ≥
14λγ0◦to obtain (31) for λ ∈ I
0c.
Finally, we prove inequality (28). Since P
0H
0= H
0P
0and P
0P
0⊥= 0, we have P
0HP
0⊥= P
0W P
0⊥, P
0⊥HP
0= P
0⊥W P
0.
These relations and definition (27) yield
U (λ) = − P
0W R
⊥(λ)W P
0. (32)
Combining (31) and (32), we obtain (28).
Remark 8. In addition, we have the estimate k U (λ) k ≤ 4 λ
2◦γ
0k P
0W P
0⊥k
2H0,α. (33) Indeed, since H
0P
0= λ
0P
0, we have k (H
0+ α)
1/2P
0k
2= λ
0+ α, which implies the estimate
k P
0AP
0k = (λ
0+ α) k P
0AP
0k
H0,α, (34) which, together with estimate (28), yields (33).
Hence, under the conditions of Lemma 3 and for λ ∈ I
0the following Hamil- tonian is well defined
H(λ) := P
0HP
0+ U (λ). (35) Note that P
0HP
0= λ
0P
0. Lemma 3 above implies
Corollary 7. The operator family H(λ) is (i) self-adjoint for λ ∈ I
0and (ii) complex analytic in λ ∈ I
0c.
In what follows, we label the eigenvalue families ν
i(λ), i = 1, . . . , m, of H(λ) in the order of their increase and so that
ν
1(λ) ≤ . . . ≤ ν
m(λ). (36) Note that the eigenvalue branches ν
i(λ) can also be of higher multiplicity. On a subinterval I
i⊂ I
0, we say that the branch ν
i(λ) is isolated on I
iif each other branch ν
j(λ), with λ ∈ I
i, either i) coincides with ν
i(λ) or ii) satisfies
min
λ∈Ii| ν
i(λ) − ν
j(λ) | ≥ γ
i> 0. (37)
16 GENEVI `EVE DUSSON, ISRAEL MICHAEL SIGAL, AND BENJAMIN STAMM
0 1
2 0 0+12 0
0
0 1
2 0 0 0+12 0
⌫1( )
⌫2( )
⌫3( ) f(x) =x
01
2 0 0+12 0
0
01
2 0 0+1
0 2 0
⌫1( )
⌫2( )
⌫3( ) f(x) =x
01
2 0 0+12 0
0
01
2 0 0+1
0 2 0
⌫1( ) f(x) =x
0,1 0,2 0,3 0,4 0,5
Figure 1. (Left) Schematic illustration of the eigenvalues ν
i(λ) of H(λ) in the neighborhood of λ
0for the case of m = 3. (Right) Illustration of the spectrum of H
0consisting of five eigenvalues λ
0,1. . . , λ
0,5of multiplicity m
0,1= 1, m
0,2= 2, m
0,3= 4, m
0,4= 2, m
0,5= 1 and the corresponding situation when zooming in close to λ
0= λ
0,i.
Further, we have the following result.
Proposition 5. Let α ∈ R be such that H
0+ α > 0 and let I
i⊂ I
0be such that the branch ν
i(λ) is isolated on I
i. For λ ∈ I
i, (i) the eigenvalues ν
i(λ) of H(λ) are continuously differentiable; (ii) the derivative ν
i0(λ) is non-positive; (iii) the solutions to the equations ν
i(λ) = λ are unique if λ ∈ I
i; (iv) if k W k
H0,α≤
14λγ0◦, the derivatives ν
i0(λ), λ ∈ I
00:= [λ
0−
14γ
0, λ
0+
14γ
0] ∩ I
i, are bounded as
| ν
i0(λ) | ≤ 8 π
(λ
0+ α)λ
◦γ
02k P
0W P
0⊥k
2H0,α. where λ
◦:= λ
0+ γ
0+ α.
Proof. Proof of (i) of a simple eigenvalue λ
0, i.e., m = 1. In such a case, P
0is a rank-one projector on the space spanned by the eigenvector ϕ
0of H
0corresponding to the eigenvalue λ
0and therefore Eq. (35) implies that H(λ) = ν
1(λ)P
0, with
ν
1(λ) := h ϕ
0, H (λ)ϕ
0i . (38) This and Corollary 7 show that the eigenvalue ν
1(λ) is analytic.
We now prove (i) in the general case. First, we claim the following well-known formula
ν
i0(λ) = h χ
i(λ), U
0(λ)χ
i(λ) i , (39)
for λ ∈ I
i, where χ
i(λ) are well-chosen normalized eigenfunction of H(λ) corre- sponding to the eigenvalue ν
i(λ), namely that they are differentiable in λ. To this end, we observe that for each µ ∈ I
i, we can find a local neighborhood I
µ⊂ I
iof µ such that
[
j6=i
{ ν
j(λ) | λ ∈ I
µ} ∩ { ν
i(λ) | λ ∈ I
µ} = ∅ , (40) due to the isolated branch property, i.e., γ
i> 0 in (37). Second, since H(λ) is self-adjoint for λ ∈ I
0and analytic (say, in the resolvent sense) in λ ∈ I
0c, the Riesz projection, corresponding to the eigenvalue ν
i(λ):
P
i(λ) := 1 2πi
I
Γi(µ)
(H(λ) − z)
−1dz, (41)
where Γ
i(µ) is a closed curve in the resolvent set of H(λ) surrounding the eigen- value branch { ν
i(λ) : λ ∈ I
µ} , is also self-adjoint and analytic in λ ∈ I
µcand therefore in λ ∈ I
0c(see [25, 20]), condition (40) guarantees that we can choose such a closed curve which contains no other points of σ(H(µ)) on I
µ, and that, combining all neighborhoods of µ for µ ∈ I
i, there holds that P
i(λ) is analytic in I
i. From [25, Theorem XII.12], there exists an analytic family of unitary operators V
i(λ) such that P
i(λ) = V
i(λ)P
i(λ
0)[V
i(λ)]
−1, λ
0being possibly replaced by some arbitrary µ ∈ I
iif λ
0does not belong to I
i. We then define
χ
i(λ) = V
i(λ)ψ
0i,
where ψ
0iis an eigenvector of H
0corresponding to the eigenvalue λ
0. Since V
i(λ) is analytic, χ
i(λ) is also analytic in I
i, so in particular differentiable, and one can easily check that χ
i(λ) is of norm 1 and that P
i(λ)χ
i(λ) = χ
i(λ), which guarantees that χ
i(λ) is a normalized eigenfunction of H(λ). Now, we use that
h χ
0i(λ), H(λ)χ
i(λ) i + h χ
i(λ), H (λ)χ
0i(λ) i = ν
i(λ)( h χ
0i(λ), χ
i(λ) i + h χ
i(λ), χ
0i(λ) i )
= h χ
i(λ), χ
i(λ) i
0= 0
to obtain ν
i0(λ) = h χ
i(λ), H
0(λ)χ
i(λ) i , which gives (39). The differentiability of χ
i(λ) and the analyticity of H(λ) then implies the differentiability of ν
iin each neighborhood of λ.
In order to prove (ii), note that U
0(λ) ≤ 0, as follows by the explicit formula U
0(λ) := − P W P
0⊥(H
⊥− λ)
−2P
0⊥W P ≤ 0. (42) Hence, ν
i0(λ) < 0 by (39). The monotonicity of ν
i(λ) also implies the well- posedness of the equations ν
i(λ) = λ under the condition that λ ∈ I
i, thus statement (iii).
We now aim to prove (iv). Starting from (39), we estimate ν
i0(λ) with
| ν
i0(λ) | ≤ k (H
0+ α)
1/2P
0k
2k U
0(λ) k
H0,α. (43) The first factor on the right hand side is exactly known as
k (H
0+ α)
1/2P
0k
2= (λ
0+ α). (44)
To investigate the second factor on the r.h.s. of (43), we use the analyticity U (λ) and the estimate (28). Indeed, by the Cauchy integral formula, we have
k U
0(λ) k
H0,α≤ 1
2πR sup
µ∈C,
|µ−λ|=R