Quantitative quantum chemistry: Introduction to wave-function methods

(1)

Quantitative quantum chemistry:

Introduction to wave-function methods

Julien Toulouse Laboratoire de Chimie Th´eorique

Sorbonne Universit´ e and CNRS, Paris, France Institut Universitaire de France

Course 5C208 of Master 2 Chemistry Sorbonne Universit´e, Paris

www.lct.jussieu.fr/pagesperso/toulouse/enseignement/presentation_wft_19.pdf

(2)

Why and how learning wave-function methods?

Understanding the different quantum-chemistry wave-function methods is important for:

improving them and developing new ones;

using them in applications in an appropriate way.

Recommended books:

J.-L. Rivail, El´ ´ ements de chimie quantique ` a l’usage des chimistes, EDP Sciences / CNRS Editions, 1999.

G. Berthier, N´ ecessaire de chimie th´ eorique, Ellipses, 2009.

A. Szabo and N. S. Ostlund, Modern Quantum Chemistry, Dover Publications Inc., NY, 1996.

T. Helgaker, P. Jørgensen, J. Olsen, Molecular Electronic-Structure Theory, Wiley, 2002.

My lecture notes:

www.lct.jussieu.fr/pagesperso/toulouse/enseignement/introduction_pt_cc.pdf

2/58

(3)

Outline

1 Overview of wave-function methods The quantum many-electron problem The Hartree-Fock approximation

Electron correlation and post-Hartree-Fock methods

2 Perturbation theory

General Rayleigh-Schr¨odinger perturbation theory Møller-Plesset perturbation theory

3 Coupled-cluster theory The exponential ansatz

Truncation of the cluster operator

The coupled-cluster energy and the coupled-cluster equations Example: coupled-cluster doubles

Coupled-cluster methods in practice

4 Appendix

Slater’s rules for matrix elements The formalism of second quantization Gaussian basis sets

3/58

(4)

Outline

2

Perturbation theory

3

Coupled-cluster theory

The exponential ansatz

4

Appendix

4/58

(5)

The Hamiltonian and the many-electron wave function

We consider a N-electron system (atom, molecule, solid) in the Born-Oppenheimer and non-relativistic approximations.

The electronic Hamiltonian in the position representation is, in atomic units, H(r

1,

r

2, ...,

r

N

) =

N

X

i

h(r

i

) + 1 2

N

X

i N

X

i6=j

1 | r

i

− r

j

| where h(r

i

) = − (1/2) ∇

²r_i

+ v

ne

(r

i

) is the one-electron contribution and v

_ne

(r

i

) = −

P

α

Z

α/

| r

i

− R

α

| is the nuclei-electron interaction.

Stationary states satisfy the time-independent Schr¨ odinger equation H(r

1,

r

2, ...,

r

N

)Ψ(x

1,

x

2, ...,

x

N

) = EΨ(x

1,

x

2, ...,

x

N

) where Ψ(x

1,

x

2, ...,

x

N

) is a wave function written with space-spin coordinates x

i

= (r

i, ωi

) (with r

i

∈

R³

and

ωi

∈ {↑ , ↓} ) which is antisymmetric with respect to the exchange of two coordinates, and E is the associated energy.

Using Dirac notations (representation-independent formalism):

H ˆ | Ψ i = E | Ψ i

5/58

(6)

The ground-state electronic energy and other properties

The ground-state electronic energy E

0

can be expressed with the variational principle E

₀

= min

Ψ

h Ψ | H ˆ | Ψ i

where the minimization is over all N-electron antisymmetric wave functions Ψ, normalized to unity h Ψ | Ψ i = 1.

Properties (such as dipole moments or optical spectra) can be essentially obtained as derivatives of the ground-state energy with respect to a perturbation.

For systems containing heavy atoms, we need to include relativistic effects, e.g. using the four-component Dirac Hamiltonian.

For photochemistry studies, we sometimes need to go beyond the Born-Oppenheimer approximation, e.g. calculating non-adiabatic couplings (or vibronic couplings) between the electronic and nuclear vibrational states.

We aim at about 1% accuracy on energy differences and other properties.

6/58

(7)

Outline

2

Perturbation theory

3

Coupled-cluster theory

4

Appendix

7/58

(8)

The Hartree-Fock approximation (1/2)

The (unrestricted) Hartree-Fock (HF) method consists in approximating the ground-state wave function as a single Slater determinant, Ψ

0

≈ Φ

0

:

Φ

0

(x

1,

x

2, ...,

x

N

) = 1

√ N!

χ1

(x

1

)

χ2

(x

1

) · · ·

χN

(x

1

)

χ1

(x

2

)

χ2

(x

2

) · · ·

χN

(x

2

)

.. . .. . . .. .. .

χ1

(x

N

)

χ2

(x

N

) · · ·

χN

(x

N

)

where

χi

(x) are orthonormal spin orbitals.

The HF electronic energy is written in terms of integrals over these spin orbitals (using Slater’s rules for calculating expectation values over Slater determinants)

E

_HF

[ {

χa

} ] = h Φ

0

| H ˆ | Φ

0

i =

occ

X

a

h

aa

+ 1 2

occ

X

a,b

h ab || ab i

where a and b refer to occupied spin orbitals, h

aa

are the one-electron integrals h

aa

=

Z

dx

χ^∗a

(x)h(r)χ

a

(x)

and h ab || ab i = h ab | ab i − h ab | ba i are the antisymmetrized two-electron integrals with h ij | kl i =

x

dx

1

dx

2

χ^∗_i

(x

1

)χ

^∗_j

(x

2

)χ

k

(x

1

)χ

l

(x

2

)

| r

1

− r

2

| (in physicists’ notation)

8/58

(9)

The Hartree-Fock approximation (2/2)

The spin orbitals are determined by minimizing the HF energy subject to the spin-orbital orthonormalization constraints h

χa

|

χb

i =

δa,b

E

HF

= min

{χa}

E

HF

[ {

χa

} ]

This minimization leads to the (canonical) HF eigenvalue equations f (x)χ

i

(x) =

εiχi

(x)

giving both the occupied and virtual spin orbitals

χi

(x) and associated orbital energies

εi

.

The one-electron HF Hamiltonian (or Fock Hamiltonian) is

f (x) = h(r) + v

HF

(x) where v

HF

(x) is the one-electron HF potential operator

v

_HF

(x) =

occ

X

a

J

a

(x) − K

a

(x)

composed of a Coulomb (or Hartree) operator and an exchange (or Fock) operator J

a

(x

1

) =

Z

dx

2

χ^∗a

(x

2

)χ

a

(x

2

)

| r

1

− r

2

| and K

a

(x

1

)χ

i

(x

1

) =

Z

dx

2

χ^∗a

(x

2

)χ

i

(x

2

)

| r

1

− r

2

|

χa

(x

1

)

9/58

(10)

Hartree-Fock calculations in a basis set

The spin orbitals are written as the product a spatial orbital

ψi

(r) and a spin function

α(ω) orβ(ω)

χi

(x) =

ψi

(r)α(ω) or

χi

(x) =

ψi

(r)β(ω) with

α(

↑ ) = 1,

α(

↓ ) = 0 and

β(

↑ ) = 0,

β(

↓ ) = 1.

The spatial orbitals are expanded on a basis set {

φµ

(r) }

µ=1,...,K

ψi

(r) =

K

X

µ=1

C

µiφµ

(r)

In the basis, the HF equations take the form a self-consistent generalized eigenvalue matrix equation

F C

i

=

εi

S C

i

where F

µν

=

R

φ^∗µ

(r)f (x)φ

ν

(r)dr are the elements of the Fock matrix and S

µν

=

R

φ^∗µ

(r)φ

ν

(r)dr are the elements of the overlap matrix.

In practice, we use atom-centered Gaussian-type orbital (GTO) basis sets

φµ

(r) =

X

p

d

pµ

g

p

(r) with g

p

(r) = N

p

r

^ℓ^p

e

^−α^p^r²

Y

_ℓ^m^p

p

(θ, φ) using spherical coordinates r = (r, θ, φ) around each nucleus.

In a straightforward implementation, the computational time of HF scales as O (K

⁴

).

10/58

(11)

Outline

2

Perturbation theory

3

Coupled-cluster theory

4

Appendix

11/58

(12)

The electron correlation energy

The HF potential v

_HF

(x) is an one-electron mean-field potential approximating the effect of the two-electron interaction (1/2)

PN

i

PN

i6=j

1/ | r

i

− r

j

| . In other words, the HF approximation only accounts for the electron-electron interaction in an averaged, mean-field way.

The effect of the electron-electron interaction beyond the HF approximation is called electron correlation. The difference between the exact ground-state energy E

0

and the HF energy E

_HF

is called the correlation energy

E

c

= E

0

− E

HF

Even though E

_c

is usually a small percentage of the total energy, it very often makes a large and crucial contribution to energy differences (such as reaction energies, reaction barrier heights, ...). It is therefore important to go beyond the HF approximation and calculate the value of the correlation energy, which is the goal of the post-HF methods.

12/58

(13)

A straightforward post-HF method: Full configuration interaction (FCI)

In FCI, the wave function is expanded in terms of the HF determinant Φ

0

, the singly excited determinants Φ

^ra

, the doubly excited determinants Φ

^rs_ab

, and so on up to N-fold excited determinants:

| Ψ

FCI

i = c

0

| Φ

0

i +

occ

X

a vir

X

r

c

_a^r

| Φ

^ra

i +

occ

X

a<b vir

X

r<s

c

_ab^rs

| Φ

^rsab

i +

occ

X

a<b<c vir

X

r<s<t

c

_abc^rst

| Φ

^rstabc

i + · · ·

and the coefficients c = (c

0,

c

_a^r,

c

_ab^rs,

c

_abc^rst, ...) are found by minimizing the FCI energy

E

FCI

= min

c

h Ψ

FCI

| H ˆ | Ψ

FCI

i (with the normalization constraint h Ψ

FCI

| Ψ

FCI

i = 1), which corresponds to diagonalizing ˆ H in the space spanned by all determinants.

In a finite basis set, FCI gives an upper bound to the exact ground-state energy:

E

FCI

≥ E

0

. In the limit of a complete basis set (K → ∞ ), FCI becomes exact:

E

_FCI

= E

₀

.

The correlation energy E

c

= E

0

− E

HF

has a slow convergence with the basis size:

the error on E

c

typically decreases as O K

⁻¹

.

Combinatorial explosion of number of determinants: N

_det

=

^2K_N

= O (2K)

^N

= ⇒ necessity to find low-power-scaling approximations

To develop approximations to FCI, we identify two regimes of electron correlation:

dynamic (or weak) correlation and static (or strong) correlation.

13/58

(14)

Post-HF methods for dynamic/weak correlation: single-reference methods

Correlation is called “dynamic” or “weak” if c

_a^r,

c

_ab^rs, ...

≪ c

0

= ⇒ HF is a good starting point

This corresponds to situations with a large HOMO-LUMO HF gap.

Example: He atom, H

2

molecule at equilibrium distance.

^B

Even if each coefficient is small, their total contribution can be large.

For dynamic correlation, one considers “single-reference” methods which are approximations to FCI assuming the predominance of the single HF determinant.

Three main families of single-reference post-HF methods:

Truncated configuration interaction (CI)

Møller-Plesset (MP) perturbation theory (PT)

Coupled-cluster (CC) theory

14/58

(15)

Single-reference truncated configuration interaction (CI)

The FCI wave function is truncated at a given excitation level, e.g. keeping only single and double excitations (CISD):

| Ψ

CISD

i = c

₀

| Φ

0

i +

occ

X

a vir

X

r

c

_a^r

| Φ

^ra

i +

occ

X

a<b vir

X

r<s

c

_ab^rs

| Φ

^rsab

i

The coefficients c = (c

0,

c

_a^r,

c

_ab^rs

) corresponding to the ground-state wave function are found by minimizing the CISD energy (with the constraint of the normalization of the wave function)

E

CISD

= min

c

h Ψ

CISD

| H ˆ | Ψ

CISD

i which, according to the variational principle, gives E

CISD

≥ E

FCI

.

The CISD computational cost scales as O K

⁶

.

Serious shortcoming of truncated CI: it is not size-consistent.

We prefer methods that satisfy the size-consistency property: the energy of a system composed of two non-interacting fragments A and B is equal to the sum of the energies of the separate fragments:

E (A · · · B) = E (A) + E (B )

This property is important in chemistry. There is also the related size-extensity property: E (N) ∝ N for N → ∞ which is important for extended systems.

15/58

(16)

Single-reference Møller-Plesset (MP) perturbation theory

Perturbation theory starting from the HF Hamiltonian: ˆ H = ˆ F + ˆ V

The first-order wave function includes only doubly excited determinants:

| Ψ

⁽¹⁾

i =

occ

X

a<b vir

X

r<s

c

_ab^rs,(1)

| Φ

^rsab

i with c

_ab^rs,(1)

= − h rs || ab i

εr

+

εs

−

εa

−

εb

The second-order energy gives the MP2 correlation energy:

E

_c^MP2

= h Φ

0

| V ˆ | Ψ

⁽¹⁾

i = −

occ

X

a<b vir

X

r<s

|h ab || rs i|

² εr

+

εs

−

εa

−

εb

MP2 is not variational (E

MP2

can be lower than E

_FCI

) but is size-consistent.

The MP2 computational cost scales as O K

⁵

.

MP2 is a simple largely used post-HF method that often reasonably accounts for dynamic/weak correlation.

However, accuracy is limited by missing higher-order terms.

Including higher-order terms (MP3, MP4, etc...) can be computationally costly and the series does not generally converge!

16/58

(17)

Single-reference coupled-cluster (CC) theory

In CC, the wave function is taken as an exponential of a truncated excitation expansion, e.g. with single and double excitations (CCSD):

| Ψ

CCSD

i = e

^T^ˆ¹^{+ ˆ}^T²

| Φ

0

i where ˆ T

1

=

occ

X

a vir

X

r

t

a^r

ˆ a

^†r

ˆ a

a

and ˆ T

2

=

occ

X

a<b vir

X

r<s

t

ab^rs

ˆ a

^†r

ˆ a

^†s

ˆ a

b

ˆ a

a

are operators generating singly and doubly excited determinants when acting on the HF wave function | Φ

0

i .

The CCSD wave function contains all excited determinants of the FCI wave

function, but the coefficients of triply, quadruply, etc.. excited determinants are (antisymmetrized) products of the coefficients of singly and doubly excited determinants.

The amplitudes t

a^r

and t

_ab^rs

are found by projecting the Schr¨ odinger equation H ˆ | Ψ

CCSD

i = E | Ψ

CCSD

i onto h Φ

^ra

| and h Φ

^rsab

| .

CCSD is not variational (E

CCSD

can be lower than E

FCI

) but is size-consistent.

The CCSD computational cost scales as O K

⁶

.

CCSD is more accurate than MP2 because it contains higher-order terms.

Possibility to perturbatively add the triple-excitation operator ˆ T

3

in the expansion

= ⇒ CCSD(T) which is often considered as the gold standard for dynamic/weak correlation with computational cost scaling as O K

⁷

.

17/58

(18)

Post-HF methods for static/strong correlation: multi-reference methods

Correlation is called “static” or “strong” if there are some coefficients in the FCI expansion that are not small compared to c

0

= ⇒ HF is NOT a good starting point

This corresponds to situations with a small HOMO-LUMO HF gap.

Example: Be atom, H

2

molecule at dissociation, transition metals (Fe, Cu, etc...).

Single-reference methods give too much importance to the HF determinant and tend to fail for static/strong correlation (e.g., MP2 diverges for zero HOMO-LUMO HF gap).

Instead of HF, we use now the multiconfiguration self-consistent field (MCSCF) method which is a multideterminant extension of HF:

E

_MCSCF

= min

{cn,χ_i}

h Ψ

MCSCF

| H ˆ | Ψ

MCSCF

i with | Ψ

MCSCF

i =

X

n

c

n

| Φ

n

i Usually, we include all determinants | Φ

n

i that can be generated from a small orbital subspace, called complete active space (CAS) = ⇒ accounts for static/strong correlation but combinatorial explosion with the size of the active space!

Starting from MCSCF, remaining dynamic/weak correlation can be added with multi-reference (MR) methods:

MRCI (e.g., MRCISD): not size-consistent but used for very small systems

MRPT (e.g., CASPT2, NEVPT2): fairly used but requires large enough CAS

MRCC: several proposed methods but no consensual method yet

18/58

(19)

Outline

1

Overview of wave-function methods

The quantum many-electron problem The Hartree-Fock approximation

3

Coupled-cluster theory

4

Appendix

19/58

(20)

General Rayleigh-Schr¨odinger perturbation theory (1/3)

Consider a Hamiltonian ˆ H

^λ

depending on a coupling constant

λ

H ˆ

^λ

= ˆ H

⁽⁰⁾

+

λ

V ˆ

where ˆ H

⁽⁰⁾

is a zeroth-order Hamiltonian operator and ˆ V is a perturbation operator.

The “physical” Hamiltonian corresponds to

λ

= 1, i.e. ˆ H = ˆ H

^λ=1

.

H ˆ

⁽⁰⁾

is chosen such that its eigenstates Φ

n

and associated eigenvalues E

n⁽⁰⁾

are known H ˆ

⁽⁰⁾

| Φ

n

i = E

n⁽⁰⁾

| Φ

n

i

and the eigenstates are chosen to be orthonormal, i.e. h Φ

n

| Φ

m

i =

δn,m

.

We want to determine the eigenstates Ψ

^λn

and associated eigenvalues E

_n^λ

of the

Hamiltonian ˆ H

^λ

, e.g. for the ground state

H ˆ

^λ

| Ψ

^λ₀

i = E

₀^λ

| Ψ

^λ₀

i

We assume that E

₀^λ

and Ψ

^λ₀

can be expanded in powers of

λ

E

₀^λ

= E

₀⁽⁰⁾

+

λE₀⁽¹⁾

+

λ²

E

₀⁽²⁾

+ · · · and | Ψ

^λ0

i = | Ψ

⁽⁰⁾₀

i +

λ

| Ψ

⁽¹⁾₀

i +

λ²

| Ψ

⁽²⁾₀

i + · · · where Ψ

⁽⁰⁾₀

= Φ

0

.

It is convenient to choose the so-called intermediate normalization for Ψ

^λ0

: h Φ

0

| Ψ

^λ0

i = 1 for all

λ

= ⇒ h Φ

0

| Ψ

⁽ⁱ⁾₀

i = 0 for all i ≥ 1

20/58

(21)

General Rayleigh-Schr¨odinger perturbation theory (2/3)

Insertion of the expansion of Ψ

^λ0

and E

₀^λ

into the eigenvalue equation gives H ˆ

⁽⁰⁾

+

λ

V ˆ | Φ

0

i +

λ

| Ψ

⁽¹⁾₀

i +

λ²

| Ψ

⁽²⁾₀

i + · · ·

=

E

₀⁽⁰⁾

+

λE₀⁽¹⁾

+

λ²

E

₀⁽²⁾

+ · · · | Φ

0

i +

λ

| Ψ

⁽¹⁾₀

i +

λ²

| Ψ

⁽²⁾₀

i + · · ·

Looking at this equation order by order in

λ, we recover at

zeroth order

H ˆ

⁽⁰⁾

| Φ

0

i = E

₀⁽⁰⁾

| Φ

0

i

At first order, we obtain

H ˆ

⁽⁰⁾

| Ψ

⁽¹⁾₀

i + ˆ V | Φ

0

i = E

₀⁽⁰⁾

| Ψ

⁽¹⁾₀

i + E

₀⁽¹⁾

| Φ

0

i

Projecting this equation on the bra state h Φ

0

| gives the first-order energy correction E

₀⁽¹⁾

= h Φ

0

| V ˆ | Φ

0

i

Projecting now on the bra states h Φ

n

| for n 6 = 0 gives

E

n⁽⁰⁾

h Φ

n

| Ψ

⁽¹⁾₀

i + h Φ

n

| V ˆ | Φ

0

i = E

₀⁽⁰⁾

h Φ

n

| Ψ

⁽¹⁾₀

i leading to the first-order wave-function correction

h Φ

n

| Ψ

⁽¹⁾₀

i = − h Φ

n

| V ˆ | Φ

0

i

E

n⁽⁰⁾

− E

₀⁽⁰⁾

= ⇒ | Ψ

⁽¹⁾₀

i = −

X

n6=0

h Φ

n

| V ˆ | Φ

0

i E

n⁽⁰⁾

− E

₀⁽⁰⁾

| Φ

n

i

21/58

(22)

General Rayleigh-Schr¨odinger perturbation theory (3/3)

Similarly, at second order, we obtain

H ˆ

⁽⁰⁾

| Ψ

⁽²⁾₀

i + ˆ V | Ψ

⁽¹⁾₀

i = E

₀⁽⁰⁾

| Ψ

⁽²⁾₀

i + E

₀⁽¹⁾

| Ψ

⁽¹⁾₀

i + E

₀⁽²⁾

| Φ

0

i

Projecting this equation on the bra state h Φ

0

| gives the second-order energy correction E

₀⁽²⁾

= h Φ

0

| V ˆ | Ψ

⁽¹⁾₀

i = −

X

n6=0

|h Φ

0

| V ˆ | Φ

n

i|

²

E

n⁽⁰⁾

− E

₀⁽⁰⁾

Note that E

₀⁽²⁾

diverges if there is a state Φ

n

(with n 6 = 0) of energy E

n⁽⁰⁾

equals to E

₀⁽⁰⁾

, i.e. if the zeroth-order Hamiltonian ˆ H

⁽⁰⁾

has a degenerate ground state. In this case, one must instead diagonalize the Hamiltonian in the degenerate space before applying perturbation theory, which is known as degenerate perturbation theory.

Similarly, one can show that the third-order energy correction has the following expression

E

₀⁽³⁾

= h Φ

0

| V ˆ | Ψ

⁽²⁾₀

i

=

X

n,m6=0

h Φ

0

| V ˆ | Φ

n

ih Φ

n

| V ˆ | Φ

m

ih Φ

m

| V ˆ | Φ

0

i

(E

n⁽⁰⁾

− E

₀⁽⁰⁾

)(E

m⁽⁰⁾

− E

₀⁽⁰⁾

) − E

₀⁽¹⁾X

n6=0

|h Φ

0

| V ˆ | Φ

n

i|

²

(E

n⁽⁰⁾

− E

₀⁽⁰⁾

)

²

22/58

(23)

Outline

1

Overview of wave-function methods

3

Coupled-cluster theory

4

Appendix

23/58

(24)

Møller-Plesset perturbation theory (1/5)

Møller-Plesset (MP) perturbation theory is a particular case of Rayleigh-Schr¨odinger perturbation theory for which the zeroth-order Hamiltonian is chosen to be the Hartree-Fock (sometimes also simply called Fock) Hamiltonian

H ˆ

⁽⁰⁾

= ˆ F

The expression of ˆ F in the position representation is

F (x

1,

x

2, ...,

x

N

) =

N

X

i

f (x

i

)

The corresponding perturbation operator ˆ V is thus the difference between the electron-electron Coulomb interaction and the HF potential

V (x

1,

x

2, ...,

x

N

) = 1 2

N

X

i N

X

i6=j

1 | r

i

− r

j

| −

N

X

i

v

HF

(x

i

)

24/58

(25)

Møller-Plesset perturbation theory (2/5)

The zeroth-order ground-state energy is given by the sum of occupied orbital energies E

₀⁽⁰⁾

= h Φ

0

| F ˆ | Φ

0

i =

occ

X

a

εa

The first-order energy correction is the expectation value of the HF determinant over the perturbation operator

E

₀⁽¹⁾

= h Φ

0

| V ˆ | Φ

0

i

Therefore, the sum of the zeroth-order energy and first-order energy correction just gives back the HF energy

E

₀⁽⁰⁾

+ E

₀⁽¹⁾

= h Φ

0

| F ˆ + ˆ V | Φ

0

i = E

HF

The second-order energy correction, also called the second-order Møller-Plesset (MP2) correlation energy, is

E

₀⁽²⁾

= E

_c^MP2

= −

X

n6=0

|h Φ

0

| V ˆ | Φ

n

i|

²

E

n⁽⁰⁾

− E

₀⁽⁰⁾

where Φ

n

can be a priori single, double, triple, ... excited determinants.

25/58

(26)

Møller-Plesset perturbation theory (3/5)

In fact, since ˆ V is a two-body operator, according to Slater’s rules, triple and higher excitations with respect to Φ

0

give vanishing matrix elements h Φ

0

| V ˆ | Φ

n

i .

In addition, it turns out that single excitations only give a vanishing contribution h Φ

0

| V ˆ | Φ

^ra

i =

occ

X

b

h ab || rb i − h a | v ˆ

HF

| r i =

occ

X

b

h ab || rb i −

occ

X

b

h ab || rb i = 0

It thus remains only the double excitations, Φ

n

= Φ

^rs_ab

. Only the two-body part of the perturbation operator gives a non-zero matrix element

h Φ

0

| V ˆ | Φ

^rsab

i = h ab || rs i

and the zeroth-order energy corresponding to the doubly-excited determinants Φ

^rs_ab

is E

n⁽⁰⁾

= E

_ab^rs,(0)

= E

₀⁽⁰⁾

+

εr

+

εs

−

εa

−

εb

We arrive at the following expression for the MP2 correlation energy E

_c^MP2

= −

occ

X

a<b vir

X

r<s

|h ab || rs i|

² εr

+

εs

−

εa

−

εb

= − 1 4

occ

X

a,b vir

X

r,s

|h ab || rs i|

² εr

+

εs

−

εa

−

εb

where we have used the antisymmetry property of the integrals, i.e.

h ab || rs i = −h ab || sr i = −h ba || rs i .

26/58

(27)

Møller-Plesset perturbation theory (4/5)

The MP2 correlation energy is always negative.

It diverges to −∞ if one energy denominator

εr

+

εs

−

εa

−

εb

is zero. This happens for systems with zero HF HOMO-LUMO gap, in which case MP perturbation theory cannot be applied.

The MP2 energy is

E

MP2

= E

HF

+ E

_c^MP2

Since it is not a variational theory, E

_MP2

is not necessarily above E

_FCI

.

Similarly, it can be shown, after much work, that the third-order Møller-Plesset (MP3) energy correction has the following expression

E

₀⁽³⁾

= 1 8

occ

X

a,b,c,d vir

X

r,s

h ab || rs ih rs || cd ih cd || ab i (ε

r

+

εs

−

εa

−

εb

)(ε

r

+

εs

−

εc

−

εd

)

+ 1 8

occ

X

a,b vir

X

r,s,t,u

h ab || rs ih rs || tu ih tu || ab i (ε

r

+

εs

−

εa

−

εb

)(ε

t

+

εu

−

εa

−

εb

)

+

occ

X

a,b,c vir

X

r,s,t

h ab || rs ih cs || tb ih rt || ac i (ε

r

+

εs

−

εa

−

εb

)(ε

r

+

εt

−

εa

−

εc

)

The calculation of the third- or higher-order terms is often considered as not worthwhile

in comparison with coupled-cluster methods for example.

_27/58

(28)

Møller-Plesset perturbation theory (5/5)

The MP2 correlation energy can be rewritten as the sum of a direct and exchange term

E

_c^MP2

= − 1 2

occ

X

a,b vir

X

r,s

h ab | rs ih rs | ab i

εr

+

εs

−

εa

−

εb

+ 1 2

occ

X

a,b vir

X

r,s

h ab | rs ih rs | ba i

εr

+

εs

−

εa

−

εb

These terms can be represented by Feynman diagrams (also called Goldstone diagrams in quantum chemistry)

E

_c^MP2

= a r s b + a s r

b

When starting from an unrestricted HF calculation, MP perturbation theory is correctly size consistent at each order. In terms of diagrams, this comes from the fact that each diagram contributing to the perturbative expansion of the energy is made of a single linked piece. Unlinked diagrams do not contribute. This is known as the linked-cluster theorem.

28/58

(29)

Outline

1

Overview of wave-function methods

2

Perturbation theory

4

Appendix

29/58

(30)

The exponential ansatz (1/5)

In coupled-cluster (CC) theory, one starts by an exponential ansatz for the CC wave function

| Ψ

CC

i = e

^T^ˆ

| Φ

0

i

where | Φ

0

i is the HF wave function, and ˆ T is the cluster (excitation) operator which is the sum of cluster operators of different excitation levels

T ˆ = ˆ T

₁

+ ˆ T

₂

+ · · · + ˆ T

N

T ˆ

₁

is the cluster operator for the single excitations T ˆ

1

=

occ

X

a vir

X

r

t

a^r

ˆ a

^†r

ˆ a

a

where t

^r_a

are the single-excitation cluster amplitudes to be determined.

We have used the second-quantization formalism:

ˆaais theannihilation operatorof the spin orbitala ˆa^†r is thecreation operatorof the spin orbitalr

When the operator â^†râaacts on the HF single determinant|Φ0i, it generates the single-excited determinant|Φ^r_ai= â^†_râa|Φ0i

30/58

(31)

The exponential ansatz (2/5)

Similarly, ˆ T

2

is the cluster operator for the double excitations T ˆ

2

=

occ

X

a<b vir

X

r<s

t

ab^rs

ˆ a

^†r

ˆ a

^†s

ˆ a

b

ˆ a

a

where t

^rs_ab

are the double-excitation cluster amplitudes to be determined.

When the operator ˆ a

^†r

ˆ a

^†s

ˆ a

b

ˆ a

a

acts on the HF single determinant | Φ

0

i , it generates the double-excited determinant | Φ

^rsab

i = ˆ a

^†_r

ˆ a

^†_s

ˆ a

b

ˆ a

a

| Φ

0

i .

And so on up to the ˆ T

N

cluster operator for N-fold excitations.

31/58

(32)

The exponential ansatz (3/5)

To understand the action of the operator e

^T^ˆ

on the HF wave function | Φ

0

i , one can expand the exponential and rearrange the operators in terms of excitation levels

e

^T^ˆ

= ˆ 1 + ˆ T + T ˆ

²

2! + T ˆ

³

3! + · · · + T ˆ

^N

N! = ˆ 1 + ˆ C

₁

+ ˆ C

₂

+ · · · + ˆ C

N

where the operator ˆ C

₁

generates single excitations, ˆ C

₂

generates double excitations, etc.

Noting that the cluster operators ˆ T

₁

, ˆ T

₂

, ..., ˆ T

N

commute with each other, we find for example for the first four excitation operators

C ˆ

1

= ˆ T

1

C ˆ

₂

= ˆ T

₂

+ 1 2 T ˆ

₁²

C ˆ

3

= ˆ T

3

+ ˆ T

1

T ˆ

2

+ 1

6 T ˆ

₁³

C ˆ

4

= ˆ T

4

+ ˆ T

1

T ˆ

3

+ 1

2 T ˆ

₂²

+ 1

2 T ˆ

₁²

T ˆ

2

+ 1 24 T ˆ

₁⁴

and so on.

32/58

(33)

The exponential ansatz (4/5)

The CC wave function has thus the same form as the FCI wave function (with intermediate normalization, i.e. c

0

= 1)

| Ψ

CC

i = | Φ

0

i +

occ

X

a vir

X

r

c

a^r

| Φ

^ra

i +

occ

X

a<b vir

X

r<s

c

ab^rs

| Φ

^rsab

i

+

occ

X

a<b<c vir

X

r<s<t

c

_abc^rst

| Φ

^rstabc

i +

occ

X

a<b<c<d vir

X

r<s<t<u

c

_abcd^rstu

| Φ

^rstuabcd

i + · · ·

with coefficients related to the cluster amplitudes by c

a^r

= t

a^r

c

_ab^rs

= t

_ab^rs

+ t

^r_a

∗ t

_b^s

c

abc^rst

= t

abc^rst

+ t

a^r

∗ t

bc^st

+ t

a^r

∗ t

b^s

∗ t

c^t

c

_abcd^rstu

= t

_abcd^rstu

+ t

_a^r

∗ t

_bcd^stu

+ t

_ab^rs

∗ t

_cd^tu

+ t

_a^r

∗ t

_b^s

∗ t

_cd^tu

+ t

_a^r

∗ t

_b^s

∗ t

_c^t

∗ t

_d^u

and so on.

Here, ∗ means an antisymmetric product with respect to exchange of indices:

t

_a^r

∗ t

_b^s

= t

_a^r

t

_b^s

− t

_b^r

t

_a^s

t

a^r

∗ t

bc^st

= t

^ra

t

bc^st

− t

b^r

t

ac^st

+ t

c^r

t

ab^st

− t

a^s

t

bc^rt

+ t

b^s

t

ac^rt

− t

c^s

t

ab^rt

+ t

a^t

t

bc^rs

− t

^tb

t

ac^rs

+ t

c^t

t

ab^rs

t

_a^r

∗ t

_b^s

∗ t

_c^t

= t

_a^r

t

_b^s

t

_c^t

− t

_a^r

t

_c^s

t

_b^t

− t

^r_b

t

_a^s

t

_c^t

− t

_c^r

t

_b^s

t

_a^t

+ t

_c^r

t

_a^s

t

_b^t

+ t

_b^r

t

_c^s

t

_a^t

and so on.

33/58

(34)

The exponential ansatz (5/5)

Thus, the CC wave function contains all excited determinants, just as the FCI wave function.

If the cluster operator ˆ T is not truncated, the CC wave function is just a nonlinear reparametrization of the FCI wave function with the same number of parameters.

Optimizing the cluster amplitudes t = (t

a^r,

t

_ab^rs,

t

_abc^rst, ...) so as to minimize the total energy

would lead to the FCI wave function.

The interest of the CC approach only appears when the cluster operator is truncated.

34/58

(35)

Outline

1

Overview of wave-function methods

2

Perturbation theory

4

Appendix

35/58

(36)

Truncation of the cluster operator

For example, let us consider coupled-cluster singles doubles (CCSD) T ˆ = ˆ T

1

+ ˆ T

2

The expansion of e

^T^ˆ

gives e

^T^ˆ

= ˆ 1 + ˆ T

1

+

T ˆ

2

+ 1

2 T ˆ

₁²

+

T ˆ

1

T ˆ

2

+ 1 6 T ˆ

₁³

+

1 2 T ˆ

₂²

+ 1

2 T ˆ

₁²

T ˆ

2

+ 1 24 T ˆ

₁⁴

+ · · ·

The CCSD wave function has the same form as the FCI wave function, i.e. it contains all excited determinants, but with coefficients now given by

c

a^r

= t

a^r

c

_ab^rs

= t

_ab^rs

+ t

_a^r

∗ t

_b^s

c

_abc^rst

= t

_a^r

∗ t

_bc^st

+ t

_a^r

∗ t

_b^s

∗ t

_c^t

c

_abcd^rstu

= t

_ab^rs

∗ t

_cd^tu

+ t

_a^r

∗ t

_b^s

∗ t

_cd^tu

+ t

_a^r

∗ t

_b^s

∗ t

_c^t

∗ t

_d^u

and so on.

The coefficients of the triple excitations c

_abc^rst

are now only approximated by products of single- and double-excitation amplitudes t

a^r

and t

_bc^st

, and similarly for all higher-level excitations.

In comparison with the CISD wave function, the CCSD wave function has many more excited determinants but the same number of free parameters t = (t

a^r,

t

_ab^rs

) to optimize!

36/58

(37)

Truncation of the cluster operator: size-consistency

Contrary to truncated CI, truncated CC is size-consistent, which directly stems from the exponential form of the CC wave function.

Proof: consider a system composed of two infinitely separated (and thus non-interacting) fragments A and B :

H ˆ = ˆ H

A

+ ˆ H

B

If the HF calculation is size-consistent, the HF wave function is multiplicatively separable:

|Φ^A···B₀ i=|Φ^A₀i ⊗ |Φ^B₀i(where⊗is the antisymmetric tensor product)

Because the orbitals of each fragments do not overlap, the cluster operator of the system is additively separable: ˆT^A···B= ˆT^A+ ˆT^B

We can then show that theCC wave function is multiplicatively separable:

|ΨÂ···B_CC i=e^T^ˆÂ···B|ΦÂ···B₀ i

=e^T^ˆ^A^{+ ˆ}^T^B|Φ^A₀i ⊗ |Φ^B₀i

=e^T^ˆ^A|Φ^A₀i ⊗e^T^ˆ^B|Φ^B₀i

=|Ψ^A_CCi ⊗ |Ψ^B_CCi

This implies that theCC energy is additively separable:

E_CC(A· · ·B) =hΨÂ···B_CC |HÂ+ ˆHB|ΨÂ···B_CC i

=ECC(A) +E_CC(B)

37/58

(38)

Outline

1

Overview of wave-function methods

2

Perturbation theory

4

Appendix

38/58

(39)

The coupled-cluster energy and the coupled-cluster equations (1/4)

We consider in all the following an arbitrary truncation level of the cluster operator.

The most natural way to calculate the CC amplitudes t and CC energy would be by using the variational method:

E

_CC^var

= min

t

h Ψ

CC

| H ˆ | Ψ

CC

i with the constraint h Ψ

CC

| Ψ

CC

i = 1.

However, the CC wave function includes all excited determinants up to N-fold excitations which contribute to the expectation value, giving too complex equations to be efficiently solved.

A more convenient approach for obtaining the CC amplitudes and CC energy is the projection method: we require that Ψ

CC

satisfies the Schr¨odinger equation

( ˆ H − E

CC

) | Ψ

CC

i = 0

projected onto the space spanned by h Φ

0

| , h Φ

^ra

| , h Φ

^rs_ab

| , h Φ

^rst_abc

| ,...

h Φ

0

| ( ˆ H − E

CC

) | Ψ

CC

i = 0 h Φ

^ra

| ( ˆ H − E

CC

) | Ψ

CC

i = 0 h Φ

^rsab

| ( ˆ H − E

_CC

) | Ψ

CC

i = 0

h Φ

^rstabc

| ( ˆ H − E

CC

) | Ψ

CC

i = 0 and so on.

39/58