HAL Id: jpa-00208221
https://hal.archives-ouvertes.fr/jpa-00208221
Submitted on 1 Jan 1974
HAL
is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire
HAL, estdestinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Spectral limits in the tight-binding approximation
F. Ducastelle
To cite this version:
F. Ducastelle. Spectral limits in the tight-binding approximation. Journal de Physique, 1974, 35 (12),
pp.983-988. �10.1051/jphys:019740035012098300�. �jpa-00208221�
SPECTRAL LIMITS IN THE TIGHT-BINDING APPROXIMATION
F. DUCASTELLE
Office National d’Etudes et de Recherches
Aérospatiales (ONERA)
92320
Châtillon,
France(Reçu
le 15 mai1974)
Résumé. 2014 On passe en revue les méthodes et théorèmes
principaux
qui permettent d’estimer les limites spectrales pour des systèmes décrits dansl’approximation
des liaisons fortes. On donnequelques applications concernant les métaux de transition, les alliages désordonnés et les semi- conducteurs amorphes.
Abstract. 2014 A brief review is given of some basic theorems and methods which
give
estimates of the spectral limits in systems described within thetight-binding approximation. Applications
totransition metals, disordered alloys and amorphous semiconductors are given.
Classification Physics Abstracts
8.120
1. Introduction. --- The
tight-binding approxima-
tion is
widely
used to describe systemsinvolving
electrons which are more or less localized on the atoms. In
spite
of theapproximations
made in thesimplest
schemes(the assumptions
that the atomic orbitals areorthogonal
and that the transferintegrals
are
only important
between firstneighbours),
itoften leads to very convenient model Hamiltonians.
In
particular
this kind of model has been used often in recent years to describe disorderedsystems
suchas disordered
alloys, liquids
oramorphous
semi-conductors.
It is well known that within the
tight-binding approximation
the bandwidth isfinite,
and that thiswidth is
mainly
determinedby
the value of the transferintegrals.
Whendealing
with disordered systems, the Bloch theorem cannot beused,
and it is ingeneral impossible
to calculateexactly
thedensity
of states ; it is therefore very useful todispose
of some theoremswhich
give
theregions
of the real axis where theeigenvalues
of agiven Hamiltonian
must lie. As shownby Cyrot-Lackmann [1]
andby
Khomskii[2],
two useful theorems in this context are the Perron- Frobenius theorem and the
Hadamard-Gerschgorin theorem,
but some other methods can also be used.In this
article,
we want togive
a brief review ofthe basic theorems or methods which
gives
estimatesof the
spectral
limits and which do notdepend
onthe lattice
periodicity.
Someportions
of this workare not
actually
new, but we havethought
that itmight
be useful to show that the various results obtainedrecently
in this field arestraightforward
consequences of a very limited number of
general
theorems.
In section
2,
we recall ’the Perron-Frobenius theorem. In section3,
we recall the Hadamard-Gerschgorin
theorem and wegive
an extension of it well fitted to thestudy
ofdegenerate
d bands,Section 4 is concemed with a method referred to as the scalar
product
method and section 5 is devoted to the min-max theorem of Courant and Fischer.In these two last
sections, particular
attention ispaid
to theapplications
to disorderedalloys
andamorphous
semiconductors.2. The Perron-Frobenius theorem. -- The Perron- Frobenius theorem states that if all the elements
aij
of a matrix arepositive,
then theeigenvalue Àmax
whose absolute value is maximum is
real, positive
and non
degenerate.
Moreover thecomponents
of thecorresponding eigenvector
may be taken to bepositive
andÀmax
may be characterized as follows[3], [4] :
:where x varies over all non
negative
vectors differentfrom zero. In
particular
from(2. 1)
we obtain :As shown
by Cyrot-Lackmann [1],
this theoremapplies
in aquite
natural way tonon-degenerate tight-binding
systems in which all the transfer inte-.grals
arenegative.
Itgives
the well known result that the lowest energy in the spectrum is obtained forArticle published online by EDP Sciences and available at http://dx.doi.org/10.1051/jphys:019740035012098300
984
a linear combination of atomic orbitals with
positive
coefficients
(bonding state) ;
furthermore it tells us that the absolute value of the lower bound islarger
than
(or equal to)
that of the upper bound. Forexample
let us consider aperiodic
lattice in which every atom has Z firstneighbours;
from(2.2)
wehave :
where
fl
is the transferintegral
between firstneighbours.
3. The
Gerschgorin
theorem and itsgeneralization.
- 3.1 THE GERSCHGORIN THEOREM. - The Gersch-
gorin
theorem isequivalent
to the Hadamard criterion whichgives
a condition for adeterminant
to be diffe- rent from zero. Theproof
is sosimple
that it is worthgiving
itexplicitly [5], [6]. Let t/1
be aneigenvector
ofthe matrix aij
(with
componentst/1i)
and  the corres-ponding eigenvalue ;
then :and therefore :
Let us chose i such that
1 t/1i
=max 1 j 1,
it followsj
that :
Since the
inequality (3.3)
is satisfied for at least onevalue
of i,
we may assert that is contained in the union of the discs centred on aii and with radiiFurthermore, by using continuity
arguments, it can be shown that if the union of p discs is a connected part of the totalunion,
then it containsexactly
peigenvalues.
In the case where anoff-diagonal
matrixelement aik is
larger
in modulus than the sum of the moduli of the other ones, we have also[5] :
The
Gerschgorin
criteriongeneralizes
a small part of the Perron theorem.,From(3.3)
we indeed deduce :which is to be
compared
to(2.2).
Let us notice that eq.
(3.6)
may be considered as aparticular
case of a moregeneral result, namely :
where Il A II denotes
the norm of the matrix A whichis subordinate to some vector norm. Such a norm is defined
by :
Let us also define the p-norm of a vector x :
with the
particular
cases :The
corresponding
subordinate norms are[6] :
Il A Il 2
is of course the usual norm and isindependent
of the
basis,
which is not the case of the other ones., A subordinate norm has the
particular properties :
Let us now write
that §
is aneigenvector
of A witheigenvalue :
By taking
the norm of both sidesof (3.13),
we obtain :which proves
(3.7).
Useful
applications
of theGerschgorin
theoremhave been
given by Cyrot-Lackmann [1] and
Khom-skii
[2].
We shall now show that it ispossible
toextend this criterion.
3.2 A GENERALIZATION OF THE GERSCHGORIN THEO- REM. - We shall try to
generalize
theGerschgorin
theorem in
allowing
the matrix elements aij to be square matrices. We have in mind adegenerate tight- binding
system inwhich q
atomic states are associatedwith each site. The matrix of the Hamiltonian in the basis
generated by
these states is a block matrix in which each block is a q x q matrix. LetPi
be theprojection
operator which separates out all orbitalson
site i,
and Âand Çi
aneigenvalue
of the Hamilto- nian H and itscorresponding eigenvector ;
then :Using
thesimplified
notationHij for Pi HPj,
andtaking
the norm, we have :where :
Now we take into account the
specific properties
ofour
model ;
that H is an hermitian operator and wechoose the usual norm in the Hilbert space defined from the scalar
product. Let pf
be theeigenvalues
of
Hih
it is clear that :and therefore :
We may now argue as in section
3.1 ;
we chose i insuch a way
that Il t/1i Il
=max II ; II,
j whichimplies
that :
Finally
we obtain :which is the
required generalization
of(3.4).
Let us
apply
this criterion to the case of a pure transition metal with coordination number Z. Wekeep only
transferintegrals
between firstneighbours separated by
a distance R. Then all blocksHij
are thesame,
H;;
=H(R),
and we have :Il H(R) Il
isgiven by
thelargest
transferintegral (in modulus),
that isby 1 ddu 1
in Slater-Koster’s nota- tions[7] ;
hence :This result is the natural extension of the corres-
ponding inequality
obtained for nondegenerate
bandsin eq.
(2.3).
The main difference is that because the various transferintegrals
included inHij
have differentsigns,
we are nolonger
sure that the boundZ [ ddJ 1
is
actually
attained. Infact,
in three dimensions thetrue bound is much smaller
(for example
in a FCClattice, 1 À Imax ~ 5 ) 1 ddu with
realistic values of the transferintegrals [8]).
The one dimensional case is aparticular
one because the orbitals of different sym- metries do notmix;
as a consequence the Perron theorem stillapplies
for each sub-band and the bound2 1 dda is
attained. This indicates that better bounds cannot be obtained ingeneral
without further information on the system. From(3.20)
we can alsoderive a criterion for the occurrence of a
crystal
fieldsplitting
of the bands. Let us assume thatHu
has twodifferent
eigenvalues (which
wouldcorrespond
to thet2g and eg symmetries
for d electrons in a cubic envi-ronment),
oc, and (X2, then a gap willnecessarily
appear
if 1 oc, - (X2 1 > 2 Z 1 dda [ .
Finally
it should bepointed
out that(3.21)
is still aconsequence of the
general
result(3.7) provided
wedefine a new
nonn Il H Il’ :
:which is the norm subordinate to the
following
vecto-rial norm :
4. The scalar
product
method. -- 4. 1 FORMULATIONOF THE METHOD. - When
dealing
withphysical
sys-tems, we know that the Hamiltonian H is hermitian and therefore the
spectral
radius(that
is theeigenvalue
whose absolute value is
maximum)
isequal
to theusual norm defined
by (3.llb).
Moregenerally
wemay use the
properties
of the Hilbert space. Inparti- cular,
considerations based on scalarproducts
may be fruitful asexplained
below. Forexample,
let usassume that H can be written as :
where
Ho
and V are hermitianoperators
whosespectra
are known. Let
hi and
be theeigenvalues
ofHo and V, and 1 t/J
> aneigenstate
of H associated with theeigenvalue
À. From :we obtain :
where we have assumed
that Il t/J Il
=t/J 1 > 1/2
,-- 1.But :
and
finally :
In the same way :
986
Equations (4.5)
and(4.6)
areequivalent
to the moreexplicit propositions :
with :
4.2 APPLICATION TO AMORPHOUS SEMICONDUCTORS.
- As an
application
of the criterion(4.7),
let usconsider the now well-known model hamiltonian for
amorphous
semiconductors usedby Thorpe,
Weaire[9]
and others. We shall use here the notations of Friedel and Lannoo
[10] :
. The
state 1 iJ >
is the atomichybridized sp"
orbitalpointing
from site ialong
the bond J. Thepositive promotion
energyEp - ES
isequal
to -(n
+1)
dand fl
is thenegative
transferintegral.
The
spectra of Ho
and V are verysimple.
Theeigen-
values
hi and vi
aregiven by :
Therefore
min 1 vi 1 = max 1 vi 1 = 1 P 1,
and the inter-i i
vals
Di and di
have the same bounds. As a conse-quence we realize at once that there is
always
a gap in thespectrum.
This is clear if the intervalsDl
andD2
do notoverlap (that is d 1/1 PI> 2/(n
+1)),
butfrom
(4.7b)
this is also true in the other case because the interior of the intersection of the intervals is forbidden. Fromcontinuity arguments
it is then rather obvious to count the number of states in each subband. In the atomiclimit P
--.0,
the subbands have s and p character with theweights
1 and nrespectively ;
in the other limit 4 -0,
each subbandcontains (n
+1)/2
states.Actually,
the Hamiltonian(4.9)
is sosimple
that amuch more detailed
description
of thespectrum
can be obtained rathereasily (see
references[9], [10], [11]
for
example),
but theadvantage
of our criterion is that it allowsgeneralizations
to more realisticmodels ;
for
example
we could take a distribution of transferintegrals ;
it is clear that in this case the gap will survive in the covalent limit 4 - 0provided
thatmin 1 Vi 1 =1=
0.i
Finally
let us put someemphasis
on theduality
of the
descriptions
from atomic orbitals and bond orbitals in thisproblem.
In(4.9)
the basis of atomic orbitals waschosen,
but wemight
have taken abasis of bond orbitals from which the
eigenstates
of Vare
easily
built up. This isequivalent
toexchange
the partsplayed by Ho
and V. Thisduality
isquite
clearif we rewrite H in the
following
canonical form :Now V and
Jeo
have the sameproperty :
and we can still
apply
the criterion(4.7)
whereH0 plays
the part of V andconversely :
,with :
and
d’1,2
are thecorresponding
open intervals. Ofcourse from
(4.14)
we find the same gap as before.4.3 APPLICATION ’TO DISORDERED ALLOYS. - We shall now consider the usual model hamiltonian for disordered
alloys :
This is a
simple tight-binding
Hamiltonian in which the disorder is introducedthrough
a statistical distri- bution of the atomic levels El. As we may notice from the discussion of section4.2,
the best bounds arefound when the
spectral
limits of V aresymmetric.
We assume that this is the case here. If this was not true, it would be sufficient to substract
from
Ho
and to add the samequantity
to V. Forsimplicity,
we also assume that there is no gap in the pure system describedby V,
so thatmin 1 vi 1
= 0.i
Therefore we are left with the
single
criterion(4. 7a) :
which is the well-known theorem about spectra of disordered
alloys ;
the spectrum of analloy
is contained in the union of the spectra of thealloy
components.4.4 DISCUSSION. - The
proofs given
in section 4. 2and 4.3 are not
actually
very different from those usedby
other authors. Forexample,
Schwartz and Ehrenreich[12] (for
the case ofamorphous
semi-conductors)
andKirkpatrick, Velickÿ
and Ehren- reich[13] (for
the case of disorderedalloys)
havegiven
similarproofs by using
the convergence proper- ties of seriesexpansions
of the resolvent[14].
Bothmethods are
equivalent
but the present one isperhaps simpler.
It should also be
pointed
out that from theprevious
discussion
nothing
can be said about thepossibility
of the bounds to be
actually
attained. Further infor- mation canonly
be obtained from a detailedanalysis
of the
problem
understudy.
Forexample
it can beshown that the bounds in
alloys
are attained for aperfectly
disorderedalloy [15], [16].
Theassumption
of
perfect
disorder is crucial. This is clear whenlooking
at thefollowing example.
Let us consider anAB
alloy
in which all A atoms have Z firstneighbours
of B
type
andconversely,
as in a CICstype
structure forexample.
The energy levels BAand EB
are measuredfrom the centre of the
symmetrical
spectrum :Let
PA(B)
be theprojection operator
whichseparates
out the orbitals on
A(B)
sites. The Hamiltonian Hcan then be written as
(1) :
If we notice that :
then :
so that the
eigenvalues
of H aregiven by :
We find therefore that there is
always
a ap and that the extreme bounds aregiven by
±ô2 + II Y II2
which are smaller in modulus than the
corresponding
bounds of the disordered
system.
±[ô
+il V Il].
5. The min-max theorem. -- The min-max theorem of Courant and Fischer is a very
elegant
theoremwhich
gives
a characterization of eacheigenvalue
ofan hermitian
operator
A[18], [3].
If theseeigenvalues
are labelled in
decreasing
order :then Âk
isgiven by :
where
the y’
arelinearly independent
vectors.(1) This formulation has been already used by Straley [17] in
the case of compound semiconductors.
If the
increasing order À 1
...ÀN
is chosen the theorem becomes a max-min theorem. Animportant
consequence of this theorem is that if we consider the
projection PiAPi = Ai
of the operator A into asubspace
of dimensionN - i,
then :Another consequence is that if a
non-negative
ope- rator B is added toA,
thenThis theorem has
already
been usedby
Thouless[16]
to
give
ageneral proof
of the theorem on bounds in disorderedalloys
mentioned in section 4.3. The basic idea is to notice that we can find aprojection
operator which
gives
the sameoperator
whenapplied
either to
the
disorderedalloy hamiltonian,
or to thepure metal one.
(The operators PA
andPB
defined insection 4.4 fulfil this
condition.)
We willapply
herethe same method to the
amorphous
semiconductorproblem.
Let
Ps
andPp
be theprojection operators
whichseparate
out the s and p orbitals on each site. The Hamiltonian(4.9)
can then be written as :Let us define
H2
andH3 by :
The
eigenvalues
ofHi, H2
andH3
areÂp,
lip and vprespectively.
Since nN is the dimension of the spacegenerated by the p
atomic states(N
is the number of atoms, and the dimension of the whole Hilbert space is(n
+1) N),
we obtain froID(5.2) :
But since L1
0, H2 - Hi
is nonnegative,
and(5.3)
leads to :
Finally
from(5.7)
and(5.8),
we get :The
eigenvalues
ofH2
= V are :988
and therefore
(5.9) gives :
This result is remarkable because it shows that there
are two ô functions in the spectrum
of Hl
withweight
n 2 corresponding to the énergies that
is to
- L1
± p
for the spectrum of H. For these statesÀp =’ Jlp
= vp which proves that these states arepurep states. Of course this result is not new,
[9], [10],
but the present
proof
is a clear illustration of thepossibilities
of the min-max theorem. Furthermore there is nodifficulty
inapplying
if to more elaboratemodels.
Let us now
study
the 2N remaining
states.Eq. (5 .11) gives
someinformation,
but it can becompleted
ifwe repeat the
previous
argumentsby using
the pro-jection
operator into the s-space. We therefore defineH2
andH3 by :
The
comparison
of theeigenvalues Àp, /Zp
andvp
ofHl, H2
andH3 gives :
hence :
Let us
forget
the à functions and relabel the 2 Nremaining
states. From(5 .11 )
and(5.15),
we get :where :
Therefore we have
always
a gap whose width is12 1 fl 1 - (n + 1) Id 11 .
.It is rather obvious that we
might repeat
the sameproofs by starting
from a bond orbitalrepresentation.
It is
easily
checked that the same result would be obtained once more.Although
morecomplete,
theargument would be
actually
similar to thatgiven by
Heine
[19].
We
might give
some otherapplications
of themin-max theorem. For
example,
if we consider the Slater-Kosterproblem
of asingle
localizedpotential,
eq.
(5.2) gives directly
the familiar result that theeigenvalues
of the pure systemseparate
theeigenvalues
of the
perturbed
system,except
for oneeigenvalue
which may
correspond
to a localized state.As a conclusion of this
section,
let uspoint
out thatthe min-max theorem enables us to recover all the results obtained in section 4. The min-max theorem is a little more subtle to
apply
than the scalarproduct method,
but itprovides
us with results which are moregeneral
andcomplete
as can be seen from theprevious example (see
also Thouless[16]
for the caseof disordered
alloys).
6. Conclusion. -- The main conclusion which can be drawn from the
present study
isthat,
within atight-binding approximation,
we can in fact obtain asubstantial information on the
spectrum
without any reference toperiodicity.
It can also be said that the theorems we used areparticularly
well suited to thetight-binding approximation
because in this caseadvantage
is taken of arepresentation
in real space.These theorems have allowed us to recover some well known
results,
but due to theirgenerality they
canlead to
straightforward
extensions as shown in some cases. There areprobably
many otherapplications
to be
explored,
and it ishoped
that this work hassome use in
stimulating
further researches in this field.Références [1] CYROT-LACKMANN, F., J. Phys. C. Solid State Phys. 5 (1972)
300.
[2] KHOMSKII, D. I., Sov. Phys. Solid State 8 (1966) 1264. (Fizika Tverdogo Tela 8 (1966) 1592.)
[3] BELLMANN, T., Introduction to matrix analysis (Mc Graw-Hill, New York) 1960.
[4] GANTMACHER, F. R., Théorie des matrices (Tome 2) (Dunod, Paris) 1966.
[5] PARODI, M., La localisation des valeurs caractéristiques des
matrices et ses applications (Gauthier-Villars, Paris) 1959.
[6] WILKINSON, J. H., The algebraic eigenvalue problem (Claren-
don Press, Oxford) 1965.
[7] SLATER, J. C. and KOSTER, G., Phys. Rev. 94 (1954) 1498.
[8] DUCASTELLE, F., Thèse Orsay, 1972, published in Publication ONERA n° 144.
[9] THORPE, M. F. and WEAIRE, D., Phys. Rev. B4 (1971) 2508.
Phys. Rev. B4 (1971) 3518.
[10] FRIEDEL, J. and LANNOO, M., J. Physique 34 (1973) 115.
[11] JOHN, W., Phys. Stat. Sol. (b) 55 (1973) K9.
[12] SCHWARTZ, L. and EHRENREICH, H., Phys. Rev. B6 (1972) 4088.
[13] KIRKPATRICK, S., VELICKY, B. and EHRENREICH, H. Phys. Rev.
B1 (1970) 3250.
[14] KATO, T., Perturbation theory for linear operator (Springer Verlag, Berlin) 1966.
[15] LIFSHITZ, I. M., Sov. Phys. Usp. 7 (1964) 549.
[16] THOULESS, D. J., J. Phys. C. Solid State Phys. 3 (1970) 1559.
[17] STRALEY, J. P., Phys. Rev. B6 (1972) 4086.
[18] COURANT, R., and HILBERT, D., Methods of Mathematical
Physics Vol. I (Interscience Pub., New York) 1953.
[19] HEINE, V., J. Phys. C. Solid State Phys. 4 (1971) L221.