Quantitative quantum chemistry:
Introduction to wave-function methods
Julien Toulouse Laboratoire de Chimie Th´eorique
Sorbonne Universit´ e and CNRS, Paris, France Institut Universitaire de France
Course 5C208 of Master 2 Chemistry Sorbonne Universit´e, Paris
www.lct.jussieu.fr/pagesperso/toulouse/enseignement/presentation_wft_19.pdf
Why and how learning wave-function methods?
Understanding the different quantum-chemistry wave-function methods is important for:
improving them and developing new ones;
using them in applications in an appropriate way.
Recommended books:
J.-L. Rivail, El´ ´ ements de chimie quantique ` a l’usage des chimistes, EDP Sciences / CNRS Editions, 1999.
G. Berthier, N´ ecessaire de chimie th´ eorique, Ellipses, 2009.
A. Szabo and N. S. Ostlund, Modern Quantum Chemistry, Dover Publications Inc., NY, 1996.
T. Helgaker, P. Jørgensen, J. Olsen, Molecular Electronic-Structure Theory, Wiley, 2002.
My lecture notes:
www.lct.jussieu.fr/pagesperso/toulouse/enseignement/introduction_pt_cc.pdf
2/58
Outline
1 Overview of wave-function methods The quantum many-electron problem The Hartree-Fock approximation
Electron correlation and post-Hartree-Fock methods
2 Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3 Coupled-cluster theory The exponential ansatz
Truncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4 Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
3/58
Outline
1 Overview of wave-function methods The quantum many-electron problem The Hartree-Fock approximation
Electron correlation and post-Hartree-Fock methods
2
Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3
Coupled-cluster theory
The exponential ansatzTruncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4
Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
4/58
The Hamiltonian and the many-electron wave function
We consider a N-electron system (atom, molecule, solid) in the Born-Oppenheimer and non-relativistic approximations.
The electronic Hamiltonian in the position representation is, in atomic units, H(r
1,r
2, ...,r
N) =
N
X
i
h(r
i) + 1 2
N
X
i N
X
i6=j
1
| r
i− r
j| where h(r
i) = − (1/2) ∇
2ri+ v
ne(r
i) is the one-electron contribution and v
ne(r
i) = −
Pα
Z
α/| r
i− R
α| is the nuclei-electron interaction.
Stationary states satisfy the time-independent Schr¨ odinger equation H(r
1,r
2, ...,r
N)Ψ(x
1,x
2, ...,x
N) = EΨ(x
1,x
2, ...,x
N) where Ψ(x
1,x
2, ...,x
N) is a wave function written with space-spin coordinates x
i= (r
i, ωi) (with r
i∈
R3and
ωi∈ {↑ , ↓} ) which is antisymmetric with respect to the exchange of two coordinates, and E is the associated energy.
Using Dirac notations (representation-independent formalism):
H ˆ | Ψ i = E | Ψ i
5/58
The ground-state electronic energy and other properties
The ground-state electronic energy E
0can be expressed with the variational principle E
0= min
Ψ
h Ψ | H ˆ | Ψ i
where the minimization is over all N-electron antisymmetric wave functions Ψ, normalized to unity h Ψ | Ψ i = 1.
Properties (such as dipole moments or optical spectra) can be essentially obtained as derivatives of the ground-state energy with respect to a perturbation.
For systems containing heavy atoms, we need to include relativistic effects, e.g. using the four-component Dirac Hamiltonian.
For photochemistry studies, we sometimes need to go beyond the Born-Oppenheimer approximation, e.g. calculating non-adiabatic couplings (or vibronic couplings) between the electronic and nuclear vibrational states.
We aim at about 1% accuracy on energy differences and other properties.
6/58
Outline
1 Overview of wave-function methods The quantum many-electron problem The Hartree-Fock approximation
Electron correlation and post-Hartree-Fock methods
2
Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3
Coupled-cluster theory
The exponential ansatzTruncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4
Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
7/58
The Hartree-Fock approximation (1/2)
The (unrestricted) Hartree-Fock (HF) method consists in approximating the ground-state wave function as a single Slater determinant, Ψ
0≈ Φ
0:
Φ
0(x
1,x
2, ...,x
N) = 1
√ N!
χ1
(x
1)
χ2(x
1) · · ·
χN(x
1)
χ1(x
2)
χ2(x
2) · · ·
χN(x
2)
.. . .. . . .. .. .
χ1(x
N)
χ2(x
N) · · ·
χN(x
N)
where
χi(x) are orthonormal spin orbitals.
The HF electronic energy is written in terms of integrals over these spin orbitals (using Slater’s rules for calculating expectation values over Slater determinants)
E
HF[ {
χa} ] = h Φ
0| H ˆ | Φ
0i =
occ
X
a
h
aa+ 1 2
occ
X
a,b
h ab || ab i
where a and b refer to occupied spin orbitals, h
aaare the one-electron integrals h
aa=
Z
dx
χ∗a(x)h(r)χ
a(x)
and h ab || ab i = h ab | ab i − h ab | ba i are the antisymmetrized two-electron integrals with h ij | kl i =
xdx
1dx
2χ∗i
(x
1)χ
∗j(x
2)χ
k(x
1)χ
l(x
2)
| r
1− r
2| (in physicists’ notation)
8/58
The Hartree-Fock approximation (2/2)
The spin orbitals are determined by minimizing the HF energy subject to the spin-orbital orthonormalization constraints h
χa|
χbi =
δa,bE
HF= min
{χa}
E
HF[ {
χa} ]
This minimization leads to the (canonical) HF eigenvalue equations f (x)χ
i(x) =
εiχi(x)
giving both the occupied and virtual spin orbitals
χi(x) and associated orbital energies
εi.
The one-electron HF Hamiltonian (or Fock Hamiltonian) is
f (x) = h(r) + v
HF(x) where v
HF(x) is the one-electron HF potential operator
v
HF(x) =
occ
X
a
J
a(x) − K
a(x)
composed of a Coulomb (or Hartree) operator and an exchange (or Fock) operator J
a(x
1) =
Z
dx
2χ∗a
(x
2)χ
a(x
2)
| r
1− r
2| and K
a(x
1)χ
i(x
1) =
Zdx
2χ∗a
(x
2)χ
i(x
2)
| r
1− r
2|
χa(x
1)
9/58
Hartree-Fock calculations in a basis set
The spin orbitals are written as the product a spatial orbital
ψi(r) and a spin function
α(ω) orβ(ω)χi
(x) =
ψi(r)α(ω) or
χi(x) =
ψi(r)β(ω) with
α(↑ ) = 1,
α(↓ ) = 0 and
β(↑ ) = 0,
β(↓ ) = 1.
The spatial orbitals are expanded on a basis set {
φµ(r) }
µ=1,...,Kψi
(r) =
K
X
µ=1
C
µiφµ(r)
In the basis, the HF equations take the form a self-consistent generalized eigenvalue matrix equation
F C
i=
εiS C
iwhere F
µν=
Rφ∗µ
(r)f (x)φ
ν(r)dr are the elements of the Fock matrix and S
µν=
Rφ∗µ
(r)φ
ν(r)dr are the elements of the overlap matrix.
In practice, we use atom-centered Gaussian-type orbital (GTO) basis sets
φµ(r) =
Xp
d
pµg
p(r) with g
p(r) = N
pr
ℓpe
−αpr2Y
ℓmpp
(θ, φ) using spherical coordinates r = (r, θ, φ) around each nucleus.
In a straightforward implementation, the computational time of HF scales as O (K
4).
10/58
Outline
1 Overview of wave-function methods The quantum many-electron problem The Hartree-Fock approximation
Electron correlation and post-Hartree-Fock methods
2
Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3
Coupled-cluster theory
The exponential ansatzTruncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4
Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
11/58
The electron correlation energy
The HF potential v
HF(x) is an one-electron mean-field potential approximating the effect of the two-electron interaction (1/2)
PNi
PN
i6=j
1/ | r
i− r
j| . In other words, the HF approximation only accounts for the electron-electron interaction in an averaged, mean-field way.
The effect of the electron-electron interaction beyond the HF approximation is called electron correlation. The difference between the exact ground-state energy E
0and the HF energy E
HFis called the correlation energy
E
c= E
0− E
HF
Even though E
cis usually a small percentage of the total energy, it very often makes a large and crucial contribution to energy differences (such as reaction energies, reaction barrier heights, ...). It is therefore important to go beyond the HF approximation and calculate the value of the correlation energy, which is the goal of the post-HF methods.
12/58
A straightforward post-HF method: Full configuration interaction (FCI)
In FCI, the wave function is expanded in terms of the HF determinant Φ
0, the singly excited determinants Φ
ra, the doubly excited determinants Φ
rsab, and so on up to N-fold excited determinants:
| Ψ
FCIi = c
0| Φ
0i +
occ
X
a vir
X
r
c
ar| Φ
rai +
occ
X
a<b vir
X
r<s
c
abrs| Φ
rsabi +
occ
X
a<b<c vir
X
r<s<t
c
abcrst| Φ
rstabci + · · ·
and the coefficients c = (c
0,c
ar,c
abrs,c
abcrst, ...) are found by minimizing the FCI energyE
FCI= min
c
h Ψ
FCI| H ˆ | Ψ
FCIi (with the normalization constraint h Ψ
FCI| Ψ
FCIi = 1), which corresponds to diagonalizing ˆ H in the space spanned by all determinants.
In a finite basis set, FCI gives an upper bound to the exact ground-state energy:
E
FCI≥ E
0. In the limit of a complete basis set (K → ∞ ), FCI becomes exact:
E
FCI= E
0.
The correlation energy E
c= E
0− E
HFhas a slow convergence with the basis size:
the error on E
ctypically decreases as O K
−1.
Combinatorial explosion of number of determinants: N
det=
2KN= O (2K)
N= ⇒ necessity to find low-power-scaling approximations
To develop approximations to FCI, we identify two regimes of electron correlation:
dynamic (or weak) correlation and static (or strong) correlation.
13/58
Post-HF methods for dynamic/weak correlation: single-reference methods
Correlation is called “dynamic” or “weak” if c
ar,c
abrs, ...≪ c
0= ⇒ HF is a good starting point
This corresponds to situations with a large HOMO-LUMO HF gap.
Example: He atom, H
2molecule at equilibrium distance.
B
Even if each coefficient is small, their total contribution can be large.
For dynamic correlation, one considers “single-reference” methods which are approximations to FCI assuming the predominance of the single HF determinant.
Three main families of single-reference post-HF methods:
Truncated configuration interaction (CI)
Møller-Plesset (MP) perturbation theory (PT)
Coupled-cluster (CC) theory
14/58
Single-reference truncated configuration interaction (CI)
The FCI wave function is truncated at a given excitation level, e.g. keeping only single and double excitations (CISD):
| Ψ
CISDi = c
0| Φ
0i +
occ
X
a vir
X
r
c
ar| Φ
rai +
occ
X
a<b vir
X
r<s
c
abrs| Φ
rsabi
The coefficients c = (c
0,c
ar,c
abrs) corresponding to the ground-state wave function are found by minimizing the CISD energy (with the constraint of the normalization of the wave function)
E
CISD= min
c
h Ψ
CISD| H ˆ | Ψ
CISDi which, according to the variational principle, gives E
CISD≥ E
FCI.
The CISD computational cost scales as O K
6.
Serious shortcoming of truncated CI: it is not size-consistent.
We prefer methods that satisfy the size-consistency property: the energy of a system composed of two non-interacting fragments A and B is equal to the sum of the energies of the separate fragments:
E (A · · · B) = E (A) + E (B )
This property is important in chemistry. There is also the related size-extensity property: E (N) ∝ N for N → ∞ which is important for extended systems.
15/58
Single-reference Møller-Plesset (MP) perturbation theory
Perturbation theory starting from the HF Hamiltonian: ˆ H = ˆ F + ˆ V
The first-order wave function includes only doubly excited determinants:
| Ψ
(1)i =
occ
X
a<b vir
X
r<s
c
abrs,(1)| Φ
rsabi with c
abrs,(1)= − h rs || ab i
εr+
εs−
εa−
εb
The second-order energy gives the MP2 correlation energy:
E
cMP2= h Φ
0| V ˆ | Ψ
(1)i = −
occ
X
a<b vir
X
r<s
|h ab || rs i|
2 εr+
εs−
εa−
εb
MP2 is not variational (E
MP2can be lower than E
FCI) but is size-consistent.
The MP2 computational cost scales as O K
5.
MP2 is a simple largely used post-HF method that often reasonably accounts for dynamic/weak correlation.
However, accuracy is limited by missing higher-order terms.
Including higher-order terms (MP3, MP4, etc...) can be computationally costly and the series does not generally converge!
16/58
Single-reference coupled-cluster (CC) theory
In CC, the wave function is taken as an exponential of a truncated excitation expansion, e.g. with single and double excitations (CCSD):
| Ψ
CCSDi = e
Tˆ1+ ˆT2| Φ
0i where ˆ T
1=
occ
X
a vir
X
r
t
arˆ a
†rˆ a
aand ˆ T
2=
occ
X
a<b vir
X
r<s
t
abrsˆ a
†rˆ a
†sˆ a
bˆ a
aare operators generating singly and doubly excited determinants when acting on the HF wave function | Φ
0i .
The CCSD wave function contains all excited determinants of the FCI wave
function, but the coefficients of triply, quadruply, etc.. excited determinants are (antisymmetrized) products of the coefficients of singly and doubly excited determinants.
The amplitudes t
arand t
abrsare found by projecting the Schr¨ odinger equation H ˆ | Ψ
CCSDi = E | Ψ
CCSDi onto h Φ
ra| and h Φ
rsab| .
CCSD is not variational (E
CCSDcan be lower than E
FCI) but is size-consistent.
The CCSD computational cost scales as O K
6.
CCSD is more accurate than MP2 because it contains higher-order terms.
Possibility to perturbatively add the triple-excitation operator ˆ T
3in the expansion
= ⇒ CCSD(T) which is often considered as the gold standard for dynamic/weak correlation with computational cost scaling as O K
7.
17/58
Post-HF methods for static/strong correlation: multi-reference methods
Correlation is called “static” or “strong” if there are some coefficients in the FCI expansion that are not small compared to c
0= ⇒ HF is NOT a good starting point
This corresponds to situations with a small HOMO-LUMO HF gap.
Example: Be atom, H
2molecule at dissociation, transition metals (Fe, Cu, etc...).
Single-reference methods give too much importance to the HF determinant and tend to fail for static/strong correlation (e.g., MP2 diverges for zero HOMO-LUMO HF gap).
Instead of HF, we use now the multiconfiguration self-consistent field (MCSCF) method which is a multideterminant extension of HF:
E
MCSCF= min
{cn,χi}
h Ψ
MCSCF| H ˆ | Ψ
MCSCFi with | Ψ
MCSCFi =
Xn
c
n| Φ
ni Usually, we include all determinants | Φ
ni that can be generated from a small orbital subspace, called complete active space (CAS) = ⇒ accounts for static/strong correlation but combinatorial explosion with the size of the active space!
Starting from MCSCF, remaining dynamic/weak correlation can be added with multi-reference (MR) methods:
MRCI (e.g., MRCISD): not size-consistent but used for very small systems
MRPT (e.g., CASPT2, NEVPT2): fairly used but requires large enough CAS
MRCC: several proposed methods but no consensual method yet
18/58
Outline
1
Overview of wave-function methods
The quantum many-electron problem The Hartree-Fock approximationElectron correlation and post-Hartree-Fock methods
2 Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3
Coupled-cluster theory
The exponential ansatzTruncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4
Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
19/58
General Rayleigh-Schr¨odinger perturbation theory (1/3)
Consider a Hamiltonian ˆ H
λdepending on a coupling constant
λH ˆ
λ= ˆ H
(0)+
λV ˆ
where ˆ H
(0)is a zeroth-order Hamiltonian operator and ˆ V is a perturbation operator.
The “physical” Hamiltonian corresponds to
λ= 1, i.e. ˆ H = ˆ H
λ=1.
H ˆ
(0)is chosen such that its eigenstates Φ
nand associated eigenvalues E
n(0)are known H ˆ
(0)| Φ
ni = E
n(0)| Φ
ni
and the eigenstates are chosen to be orthonormal, i.e. h Φ
n| Φ
mi =
δn,m.
We want to determine the eigenstates Ψ
λnand associated eigenvalues E
nλof the
Hamiltonian ˆ H
λ, e.g. for the ground state
H ˆ
λ| Ψ
λ0i = E
0λ| Ψ
λ0i
We assume that E
0λand Ψ
λ0can be expanded in powers of
λE
0λ= E
0(0)+
λE0(1)+
λ2E
0(2)+ · · · and | Ψ
λ0i = | Ψ
(0)0i +
λ| Ψ
(1)0i +
λ2| Ψ
(2)0i + · · · where Ψ
(0)0= Φ
0.
It is convenient to choose the so-called intermediate normalization for Ψ
λ0: h Φ
0| Ψ
λ0i = 1 for all
λ= ⇒ h Φ
0| Ψ
(i)0i = 0 for all i ≥ 1
20/58
General Rayleigh-Schr¨odinger perturbation theory (2/3)
Insertion of the expansion of Ψ
λ0and E
0λinto the eigenvalue equation gives H ˆ
(0)+
λV ˆ | Φ
0i +
λ| Ψ
(1)0i +
λ2| Ψ
(2)0i + · · ·
=
E
0(0)+
λE0(1)+
λ2E
0(2)+ · · · | Φ
0i +
λ| Ψ
(1)0i +
λ2| Ψ
(2)0i + · · ·
Looking at this equation order by order in
λ, we recover atzeroth order
H ˆ
(0)| Φ
0i = E
0(0)| Φ
0i
At first order, we obtain
H ˆ
(0)| Ψ
(1)0i + ˆ V | Φ
0i = E
0(0)| Ψ
(1)0i + E
0(1)| Φ
0i
Projecting this equation on the bra state h Φ
0| gives the first-order energy correction E
0(1)= h Φ
0| V ˆ | Φ
0i
Projecting now on the bra states h Φ
n| for n 6 = 0 gives
E
n(0)h Φ
n| Ψ
(1)0i + h Φ
n| V ˆ | Φ
0i = E
0(0)h Φ
n| Ψ
(1)0i leading to the first-order wave-function correction
h Φ
n| Ψ
(1)0i = − h Φ
n| V ˆ | Φ
0i
E
n(0)− E
0(0)= ⇒ | Ψ
(1)0i = −
Xn6=0
h Φ
n| V ˆ | Φ
0i E
n(0)− E
0(0)| Φ
ni
21/58
General Rayleigh-Schr¨odinger perturbation theory (3/3)
Similarly, at second order, we obtain
H ˆ
(0)| Ψ
(2)0i + ˆ V | Ψ
(1)0i = E
0(0)| Ψ
(2)0i + E
0(1)| Ψ
(1)0i + E
0(2)| Φ
0i
Projecting this equation on the bra state h Φ
0| gives the second-order energy correction E
0(2)= h Φ
0| V ˆ | Ψ
(1)0i = −
Xn6=0
|h Φ
0| V ˆ | Φ
ni|
2E
n(0)− E
0(0)
Note that E
0(2)diverges if there is a state Φ
n(with n 6 = 0) of energy E
n(0)equals to E
0(0), i.e. if the zeroth-order Hamiltonian ˆ H
(0)has a degenerate ground state. In this case, one must instead diagonalize the Hamiltonian in the degenerate space before applying perturbation theory, which is known as degenerate perturbation theory.
Similarly, one can show that the third-order energy correction has the following expression
E
0(3)= h Φ
0| V ˆ | Ψ
(2)0i
=
Xn,m6=0
h Φ
0| V ˆ | Φ
nih Φ
n| V ˆ | Φ
mih Φ
m| V ˆ | Φ
0i
(E
n(0)− E
0(0))(E
m(0)− E
0(0)) − E
0(1)Xn6=0
|h Φ
0| V ˆ | Φ
ni|
2(E
n(0)− E
0(0))
222/58
Outline
1
Overview of wave-function methods
The quantum many-electron problem The Hartree-Fock approximationElectron correlation and post-Hartree-Fock methods
2 Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3
Coupled-cluster theory
The exponential ansatzTruncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4
Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
23/58
Møller-Plesset perturbation theory (1/5)
Møller-Plesset (MP) perturbation theory is a particular case of Rayleigh-Schr¨odinger perturbation theory for which the zeroth-order Hamiltonian is chosen to be the Hartree-Fock (sometimes also simply called Fock) Hamiltonian
H ˆ
(0)= ˆ F
The expression of ˆ F in the position representation is
F (x
1,x
2, ...,x
N) =
N
X
i
f (x
i)
The corresponding perturbation operator ˆ V is thus the difference between the electron-electron Coulomb interaction and the HF potential
V (x
1,x
2, ...,x
N) = 1 2
N
X
i N
X
i6=j
1
| r
i− r
j| −
N
X
i
v
HF(x
i)
24/58
Møller-Plesset perturbation theory (2/5)
The zeroth-order ground-state energy is given by the sum of occupied orbital energies E
0(0)= h Φ
0| F ˆ | Φ
0i =
occ
X
a
εa
The first-order energy correction is the expectation value of the HF determinant over the perturbation operator
E
0(1)= h Φ
0| V ˆ | Φ
0i
Therefore, the sum of the zeroth-order energy and first-order energy correction just gives back the HF energy
E
0(0)+ E
0(1)= h Φ
0| F ˆ + ˆ V | Φ
0i = E
HF
The second-order energy correction, also called the second-order Møller-Plesset (MP2) correlation energy, is
E
0(2)= E
cMP2= −
Xn6=0
|h Φ
0| V ˆ | Φ
ni|
2E
n(0)− E
0(0)where Φ
ncan be a priori single, double, triple, ... excited determinants.
25/58
Møller-Plesset perturbation theory (3/5)
In fact, since ˆ V is a two-body operator, according to Slater’s rules, triple and higher excitations with respect to Φ
0give vanishing matrix elements h Φ
0| V ˆ | Φ
ni .
In addition, it turns out that single excitations only give a vanishing contribution h Φ
0| V ˆ | Φ
rai =
occ
X
b
h ab || rb i − h a | v ˆ
HF| r i =
occ
X
b
h ab || rb i −
occ
X
b
h ab || rb i = 0
It thus remains only the double excitations, Φ
n= Φ
rsab. Only the two-body part of the perturbation operator gives a non-zero matrix element
h Φ
0| V ˆ | Φ
rsabi = h ab || rs i
and the zeroth-order energy corresponding to the doubly-excited determinants Φ
rsabis E
n(0)= E
abrs,(0)= E
0(0)+
εr+
εs−
εa−
εb
We arrive at the following expression for the MP2 correlation energy E
cMP2= −
occ
X
a<b vir
X
r<s
|h ab || rs i|
2 εr+
εs−
εa−
εb= − 1 4
occ
X
a,b vir
X
r,s
|h ab || rs i|
2 εr+
εs−
εa−
εbwhere we have used the antisymmetry property of the integrals, i.e.
h ab || rs i = −h ab || sr i = −h ba || rs i .
26/58
Møller-Plesset perturbation theory (4/5)
The MP2 correlation energy is always negative.
It diverges to −∞ if one energy denominator
εr+
εs−
εa−
εbis zero. This happens for systems with zero HF HOMO-LUMO gap, in which case MP perturbation theory cannot be applied.
The MP2 energy is
E
MP2= E
HF+ E
cMP2Since it is not a variational theory, E
MP2is not necessarily above E
FCI.
Similarly, it can be shown, after much work, that the third-order Møller-Plesset (MP3) energy correction has the following expression
E
0(3)= 1 8
occ
X
a,b,c,d vir
X
r,s
h ab || rs ih rs || cd ih cd || ab i (ε
r+
εs−
εa−
εb)(ε
r+
εs−
εc−
εd)
+ 1 8
occ
X
a,b vir
X
r,s,t,u
h ab || rs ih rs || tu ih tu || ab i (ε
r+
εs−
εa−
εb)(ε
t+
εu−
εa−
εb)
+
occ
X
a,b,c vir
X
r,s,t
h ab || rs ih cs || tb ih rt || ac i (ε
r+
εs−
εa−
εb)(ε
r+
εt−
εa−
εc)
The calculation of the third- or higher-order terms is often considered as not worthwhile
in comparison with coupled-cluster methods for example.
27/58Møller-Plesset perturbation theory (5/5)
The MP2 correlation energy can be rewritten as the sum of a direct and exchange term
E
cMP2= − 1 2
occ
X
a,b vir
X
r,s
h ab | rs ih rs | ab i
εr+
εs−
εa−
εb+ 1 2
occ
X
a,b vir
X
r,s
h ab | rs ih rs | ba i
εr+
εs−
εa−
εb
These terms can be represented by Feynman diagrams (also called Goldstone diagrams in quantum chemistry)
E
cMP2= a r s b + a s r
b
When starting from an unrestricted HF calculation, MP perturbation theory is correctly size consistent at each order. In terms of diagrams, this comes from the fact that each diagram contributing to the perturbative expansion of the energy is made of a single linked piece. Unlinked diagrams do not contribute. This is known as the linked-cluster theorem.
28/58
Outline
1
Overview of wave-function methods
The quantum many-electron problem The Hartree-Fock approximationElectron correlation and post-Hartree-Fock methods
2
Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3 Coupled-cluster theory The exponential ansatz
Truncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4
Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
29/58
The exponential ansatz (1/5)
In coupled-cluster (CC) theory, one starts by an exponential ansatz for the CC wave function
| Ψ
CCi = e
Tˆ| Φ
0i
where | Φ
0i is the HF wave function, and ˆ T is the cluster (excitation) operator which is the sum of cluster operators of different excitation levels
T ˆ = ˆ T
1+ ˆ T
2+ · · · + ˆ T
N
T ˆ
1is the cluster operator for the single excitations T ˆ
1=
occ
X
a vir
X
r
t
arˆ a
†rˆ a
awhere t
raare the single-excitation cluster amplitudes to be determined.
We have used the second-quantization formalism:
ˆaais theannihilation operatorof the spin orbitala ˆa†r is thecreation operatorof the spin orbitalr
When the operator ˆa†rˆaaacts on the HF single determinant|Φ0i, it generates the single-excited determinant|Φrai= ˆa†rˆaa|Φ0i
30/58
The exponential ansatz (2/5)
Similarly, ˆ T
2is the cluster operator for the double excitations T ˆ
2=
occ
X
a<b vir
X
r<s
t
abrsˆ a
†rˆ a
†sˆ a
bˆ a
awhere t
rsabare the double-excitation cluster amplitudes to be determined.
When the operator ˆ a
†rˆ a
†sˆ a
bˆ a
aacts on the HF single determinant | Φ
0i , it generates the double-excited determinant | Φ
rsabi = ˆ a
†rˆ a
†sˆ a
bˆ a
a| Φ
0i .
And so on up to the ˆ T
Ncluster operator for N-fold excitations.
31/58
The exponential ansatz (3/5)
To understand the action of the operator e
Tˆon the HF wave function | Φ
0i , one can expand the exponential and rearrange the operators in terms of excitation levels
e
Tˆ= ˆ 1 + ˆ T + T ˆ
22! + T ˆ
33! + · · · + T ˆ
NN! = ˆ 1 + ˆ C
1+ ˆ C
2+ · · · + ˆ C
Nwhere the operator ˆ C
1generates single excitations, ˆ C
2generates double excitations, etc.
Noting that the cluster operators ˆ T
1, ˆ T
2, ..., ˆ T
Ncommute with each other, we find for example for the first four excitation operators
C ˆ
1= ˆ T
1C ˆ
2= ˆ T
2+ 1 2 T ˆ
12C ˆ
3= ˆ T
3+ ˆ T
1T ˆ
2+ 1
6 T ˆ
13C ˆ
4= ˆ T
4+ ˆ T
1T ˆ
3+ 1
2 T ˆ
22+ 1
2 T ˆ
12T ˆ
2+ 1 24 T ˆ
14and so on.
32/58
The exponential ansatz (4/5)
The CC wave function has thus the same form as the FCI wave function (with intermediate normalization, i.e. c
0= 1)
| Ψ
CCi = | Φ
0i +
occ
X
a vir
X
r
c
ar| Φ
rai +
occ
X
a<b vir
X
r<s
c
abrs| Φ
rsabi
+
occ
X
a<b<c vir
X
r<s<t
c
abcrst| Φ
rstabci +
occ
X
a<b<c<d vir
X
r<s<t<u
c
abcdrstu| Φ
rstuabcdi + · · ·
with coefficients related to the cluster amplitudes by c
ar= t
arc
abrs= t
abrs+ t
ra∗ t
bsc
abcrst= t
abcrst+ t
ar∗ t
bcst+ t
ar∗ t
bs∗ t
ctc
abcdrstu= t
abcdrstu+ t
ar∗ t
bcdstu+ t
abrs∗ t
cdtu+ t
ar∗ t
bs∗ t
cdtu+ t
ar∗ t
bs∗ t
ct∗ t
duand so on.
Here, ∗ means an antisymmetric product with respect to exchange of indices:
t
ar∗ t
bs= t
art
bs− t
brt
ast
ar∗ t
bcst= t
rat
bcst− t
brt
acst+ t
crt
abst− t
ast
bcrt+ t
bst
acrt− t
cst
abrt+ t
att
bcrs− t
tbt
acrs+ t
ctt
abrst
ar∗ t
bs∗ t
ct= t
art
bst
ct− t
art
cst
bt− t
rbt
ast
ct− t
crt
bst
at+ t
crt
ast
bt+ t
brt
cst
atand so on.
33/58
The exponential ansatz (5/5)
Thus, the CC wave function contains all excited determinants, just as the FCI wave function.
If the cluster operator ˆ T is not truncated, the CC wave function is just a nonlinear reparametrization of the FCI wave function with the same number of parameters.
Optimizing the cluster amplitudes t = (t
ar,t
abrs,t
abcrst, ...) so as to minimize the total energywould lead to the FCI wave function.
The interest of the CC approach only appears when the cluster operator is truncated.
34/58
Outline
1
Overview of wave-function methods
The quantum many-electron problem The Hartree-Fock approximationElectron correlation and post-Hartree-Fock methods
2
Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3 Coupled-cluster theory The exponential ansatz
Truncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4
Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
35/58
Truncation of the cluster operator
For example, let us consider coupled-cluster singles doubles (CCSD) T ˆ = ˆ T
1+ ˆ T
2
The expansion of e
Tˆgives e
Tˆ= ˆ 1 + ˆ T
1+
T ˆ
2+ 1
2 T ˆ
12+
T ˆ
1T ˆ
2+ 1 6 T ˆ
13
+
1 2 T ˆ
22+ 1
2 T ˆ
12T ˆ
2+ 1 24 T ˆ
14
+ · · ·
The CCSD wave function has the same form as the FCI wave function, i.e. it contains all excited determinants, but with coefficients now given by
c
ar= t
arc
abrs= t
abrs+ t
ar∗ t
bsc
abcrst= t
ar∗ t
bcst+ t
ar∗ t
bs∗ t
ctc
abcdrstu= t
abrs∗ t
cdtu+ t
ar∗ t
bs∗ t
cdtu+ t
ar∗ t
bs∗ t
ct∗ t
duand so on.
The coefficients of the triple excitations c
abcrstare now only approximated by products of single- and double-excitation amplitudes t
arand t
bcst, and similarly for all higher-level excitations.
In comparison with the CISD wave function, the CCSD wave function has many more excited determinants but the same number of free parameters t = (t
ar,t
abrs) to optimize!
36/58
Truncation of the cluster operator: size-consistency
Contrary to truncated CI, truncated CC is size-consistent, which directly stems from the exponential form of the CC wave function.
Proof: consider a system composed of two infinitely separated (and thus non-interacting) fragments A and B :
H ˆ = ˆ H
A+ ˆ H
BIf the HF calculation is size-consistent, the HF wave function is multiplicatively separable:
|ΦA···B0 i=|ΦA0i ⊗ |ΦB0i(where⊗is the antisymmetric tensor product)
Because the orbitals of each fragments do not overlap, the cluster operator of the system is additively separable: ˆTA···B= ˆTA+ ˆTB
We can then show that theCC wave function is multiplicatively separable:
|ΨA···BCC i=eTˆA···B|ΦA···B0 i
=eTˆA+ ˆTB|ΦA0i ⊗ |ΦB0i
=eTˆA|ΦA0i ⊗eTˆB|ΦB0i
=|ΨACCi ⊗ |ΨBCCi
This implies that theCC energy is additively separable:
ECC(A· · ·B) =hΨA···BCC |HˆA+ ˆHB|ΨA···BCC i
=hΨACC|HˆA|ΨACCihΨBCC|ΨBCCi+hΨACC|ΨACCihΨBCC|HˆB|ΨBCCi
=ECC(A) +ECC(B)
37/58
Outline
1
Overview of wave-function methods
The quantum many-electron problem The Hartree-Fock approximationElectron correlation and post-Hartree-Fock methods
2
Perturbation theory
General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory
3 Coupled-cluster theory The exponential ansatz
Truncation of the cluster operator
The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles
Coupled-cluster methods in practice
4
Appendix
Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets
38/58
The coupled-cluster energy and the coupled-cluster equations (1/4)
We consider in all the following an arbitrary truncation level of the cluster operator.
The most natural way to calculate the CC amplitudes t and CC energy would be by using the variational method:
E
CCvar= min
t
h Ψ
CC| H ˆ | Ψ
CCi with the constraint h Ψ
CC| Ψ
CCi = 1.
However, the CC wave function includes all excited determinants up to N-fold excitations which contribute to the expectation value, giving too complex equations to be efficiently solved.
A more convenient approach for obtaining the CC amplitudes and CC energy is the projection method: we require that Ψ
CCsatisfies the Schr¨odinger equation
( ˆ H − E
CC) | Ψ
CCi = 0
projected onto the space spanned by h Φ
0| , h Φ
ra| , h Φ
rsab| , h Φ
rstabc| ,...
h Φ
0| ( ˆ H − E
CC) | Ψ
CCi = 0 h Φ
ra| ( ˆ H − E
CC) | Ψ
CCi = 0 h Φ
rsab| ( ˆ H − E
CC) | Ψ
CCi = 0
h Φ
rstabc| ( ˆ H − E
CC) | Ψ
CCi = 0 and so on.
39/58