A NNALES DE L ’I. H. P., SECTION B
O LIVIER C ATONI
Applications of sharp large deviations estimates to optimal cooling schedules
Annales de l’I. H. P., section B, tome 27, n
o4 (1991), p. 463-518
<http://www.numdam.org/item?id=AIHPB_1991__27_4_463_0>
© Gauthier-Villars, 1991, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
Applications of sharp large deviations estimates to optimal cooling schedules
Olivier CATONI
D.I.A.M., Intelligence Artificielle et Mathematiques,
Laboratoire de Mathematiques, U.A. 762 du C.N.R.S., Departement de Mathematiques et d’lnformatique, Ecole Normale Superieure, 45, rue d’Ulm, 75005 Paris, France
Vol. 27, n° 4, 1991, p. 463-518. Probabilités et Statistiques
ABSTRACT. - This is the second
part
of"Sharp large
deviations estimates for simulatedannealing algorithms".
Wegive applications
of our estimatesfocused on the
problem
ofquasi-equilibrium.
Isquasi-equilibrium
maintai-ned when the
probability
ofbeing
in theground
state at some finitelarge
time is maximized? The answer is
negative.
Nevertheless thedensity
of thelaw of the
system
withrespect
to theequilibrium
lawstays
bounded in the "initialpart"
of thealgorithm.
Thecorresponding shape
of theoptimal cooling
schedules is where d isHajek’s
criticaldepth.
The influence of variations of B on the law of thesystem
and itsdensity
withrespect
to thermalequilibrium
is studied: if B is above somecritical value the
density
has anexponential growth
and the rate ofconvergence can be made
arbitrarily
poorby increasing
B. All thesecomputations
are made in thenon-degenerate
case when there isonly
oneground
state andHajek’s
criticaldepth
is reachedonly
once.Key words : Simulated annealing, large deviations, non-stationary Markov chains, optimal cooling schedules.
RESUME. - Cet article constitue le deuxième volet de notre étude des
« Estimees
précises
degrandes
deviations pour lesalgorithmes
de recuitsimulé ». Nous donnons ici des
applications
de ces estiméesayant
trait àClassification A.M.S. : 60 F 10, 60 J 10, 93 E 25.
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques - 0246-0203
464 O.CATONI
la
question
del’équilibre thermique.
Unquasi-équilibre thermique
s’établit-il
lorsque
la suite destemperatures
est choisie defaçon
à maximiser laprobabilité
pour que lesystème
soit dans 1’etat fondamental a un instant lointain fixé ? Laréponse
estnegative.
Néanmoins la densité de la loi dusysteme
parrapport
a la loid’équilibre
reste bornée dans lapartie
initialede
l’algorithme.
La formecorrespondante
des suitesoptimales
detempera-
tures a un
développement
de la forme1 /Tn = d - i ln n + B + o ( 1 )
où d estla
profondeur critique
deHajek.
Nous étudions l’influence des variations de B sur la loi dusystème
et sur sa densité parrapport
a la loid’équilibre :
si B
dépasse
une certaine valeurcritique,
la densité a une croissanceexponentielle
avec letemps
et la vitesse de convergencepeut
etre rendue arbitrairement lente enaugmentant
B. Tous ces calculs sont menes dans le cas nondégénéré
où il existe ununique
etat fondamental et où laprofondeur
deHajek
n’est pas atteinteplus
d’une fois.CONTENTS
Introduction... , ... 464
1. Estimation of the probability of the critical cycle... 465
1.1. A lower bound for any cooling schedule ... 466
1.2. The critical feedback cooling schedule... 476
1 . 3. Other feedback relations ... 482
1.4. Stability under small perturbations ol’ the cooling schedule ... 485
2. Asymptotics of the of the system ... 493
3. Triangular cooling schedules ... 500
3.1. A rough upper bound for the convergence rate ... 500
3 . 2. Description of the optimization problem ... 500
4. The optimization problem far from the horizon ... 502
Conclusion ... 517
. INTRODUCTION
The
study
ofcooling
schedules of thetype
is thesubject
ofmost of the literature about
annealing (D.
Geman and S.Geman[9], Holley
andStroock[13], Hwang
andSheu [14], Chiang
and Chow[6]).
Weinvestigate
here for the first time the criticalcase ~ =1 /d
where d isHajek’s
criticaldepth.
We show that it is natural to introduce a secondconstant and that
quasi-equilibrium
is never maintainedAnnales de l’lnstitut Henri Poincaré - Probabilités et Statistiques
in this case, but that the distance to
quasi-equilibrium depends heavily
onthe choice ofB.
These critical schedules are not
merely
acuriosity:
when one fixes thesimulation time a
priori
the "best"cooling
schedules take thisshape
intheir initial
part.
The finalpart
ofoptimal
schedules isbeyond
the scope of this paper and will bereported
elsewhere.This paper is the second
part
of"Sharp large
deviations estimates for simulatedannealing algorithms".
We will assume that the reader isacquainted
with the results and notations of the firstpart.
This
work,
as well as thepreceding
one, would not have come to birthwithout the incentive influence of R.
Azencott,
who has directed the thesis from which it isdrawn,
let him receive my sincere thanksagain.
1. ESTIMATION OF THE PROBABILITY OF THE CRITICAL CYCLE Our aim here is to
study cooling
schedules of thetype
The introduction of the non classical constant B will be
justified
when Atakes its critical value.
We restrict ourselves to the case where there is
only
oneglobal
minimumand one
deepest cycle
notcontaining
thisglobal
minimum. We will callthis
deepest cycle
the criticlecycle,
or r. We are interested in the casewhen the
annealing algorithm
converges. The condition for convergencen
When
A H ~~’) - i ,
then~
can be madelarge and
can be made small at the same
time,
and we can consider thattemperature
is almost constant. We will prove in that case thatThe
system
remains in a state ofquasi-equilibrium during
thecooling
process. This result is not new
(cf Hwang
andSheu [14], Chiang
andChow
[6]).
When
A = ~I (~’) - ~
we can no more consider thattemperature
is almostconstant. We have to
study
the behaviour of thesystem
in the subdomainE - ~ ~ ~’~ ~ h’) where f
is the fundamental state(the global minimum).
Indeed the
probability
tostay U r) during
times m to n is oforder
466 O.CATONI I
and
H (E - { ~ , f’~ U T)) H (T).
Thusduring
its life inE - ( ~ f ~ U r)
thesystem
can be considered to be at almost constanttemperature. Studying
the
jumps
of thesystem
fromf
to r and back willgive
an estimate ofP (Xn E r).
From this estimate we will derive in a secondstep
anequivalent
of the law
ofXn-
We will show that the influence of the second term B is decisive when A = H
(F)’ ~ =
H’(E) -
and that there is a value of B for whichWe will need the
following
definitionsthroughout
the discussion:DEFINITION 1 . 1. -
Let (E, U, q)
be an energylandscape.
An initialdistribution
F0
will be calledy-uniform if
DEFINITION 1. 2. - An energy
landscape (E, U, q)
will be said to be non-degenerate if.~
2022 The energy U reaches its minimum on a
single
stateof E, equivalently
~F(E)~=l.
2022 There is in
(E, U, q)
asingle cycle of
maximaldepth, equivalently
We will call this
cycle
the criticalcycle of (E, U, q).
1. l. A lower bound for any
cooling
scheduleThe
speed
at which acycle
may beemptied
will turn to be linked with thefollowing quantity
which we will call the"difficulty"
of thecycle.
DEFINITION 1 . 3 . - Let
(E, U, q)
be anyannealing framework.
Let C bea
cycle of (E)
which ismaximal for
inclusion. Wedefine
thedifficulty of
C to be the
quantity
[Let
us remind that U(E)
= 0by convention.]
DEFINITION 1 . 4. - Let
(E, U, q)
be anon-degenerate
energylandscape
We introduce the second critical
depth
S(E)
to bewhere r is the critical
cycle of
E.Let
f
be the natural context of r(that
is the smallestcycle containing
both r
and f ).
Let 1.... , r be the naturalpartition
ofT.
Assumethat the
Gks
are indexedaccording
todecreasing depths, thus f E G 1 and
de l’Institart Henri Poincaré - Probabilités et Statistiques
G 2 = r.
Let Y be the Markov chain on the "abstract"space {1,
...,r ~
with transitions
Let 03C3 be the
stopping
timeWe
define q
to be the critical communication rateThe introduction of the
term
F(r) I - ~
1 + s~ is somewhatarbitrary.
Themeaning of q
will be clear fromequation (34)
in lemma 1.9. It is themultiplicative
constant in theexpression
of the transitions betweenf
andr viewed as two abstract states in a
large
scale of time.We define also a critical communication kernel K on the abstract space
~ 1,2~ by
Let us notice that K
(2, 2) ~ 0
and K( 1, 1 ) ~ 0.
K( 1, 2)
is theprobability
to
jump
fromf
into rknowing
that thesystem
hasjumped
out ofG1 ~ f.
PROPOSITION 1 . 5. - There are
positive
constantsTo,
oc and~i
such thatin the
cooling framework G(T0, H (I-’)), for
any initial distributionputting
b = ~(r),
and
we have
Remark. -
During
times 0 toN1
the chain X reaches a state of"partial equilibrium"
where the memory of the initial distribution is reduced to thequantity ~o (r)
in a sens which theproof
will make clear.Proof of proposition
1. 5. - It willrequire
a succession of lemmas.Let g be some state in
F (r)
and let usput E* * = E - ~ f, g ~ .
Webegin
with a formula.
468 O.CATONI
LEMMA 1. 6. - We have
Proof -
We writethen we condition each term of this difference
by
the last visitto ~ f, g ~ .
End
of
theproof of
lemma 1. 6.Let us state now to which class
belong
the GTKs of lemma 1.6. The results ofproposition
1 . 4 . 5 about the behaviour ofannealing
in restricteddomains and the
composition
lemmas areleading
to thefollowing
theorem:LEMMA 1 7. - There exist
positive
constantsTo,
a and oc such that inthe
cooling
schedule ~(To,
S(E))
~ the kernel M
(E**, h)g
isof
class~ the kernel M
(E* *, T ) f
isof
classs the kernel M
(E * *,
E -h)9,’ m
isof
class~ the kernel M
(E* *,
E - isof
class2022 for
any iEE** the kernelsM (E* *, T)E
andM (E* *, E - h)E
areof
class
We deduce from this lemma the
following proposition:
PROPOSITION 1 . 8. - For any energy
landscape
with communications(E, U, q),
there arepositive
constantsTo and 03B2
such that in G(To,
H(r)) for
anypositive
constant oc(3, for NI
= N(H (h), T1, --
a,0)
and anyAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
N2
> we haveProof of proposition
1 . 8. - Wehave
moreover
The second term in the
right
member of thisequality
islower than
and the first is
lower
than470 O. CATONI
Moreover
Let us
put T 1, - 2 oc, 0)
andN 1. 5 = N (H (T), T 1,
- cc,N 1 ) .
For any
m E [N 1 _ ~, N2] (if any)
we havebecause g is a concentration set of r.
For any
as we can see from the fact that
and the fact
that f is
a concentration set ofGland
that H’(G 1)
H(r).
Moreover
hence
For a
because as can be seen from the fact that
~ _ f ’ ~
is a concentration set of E. This is true even form - N
1small,
thiscan be seen
by starting
at time ssufficiently
remote in thepast (if
necessarywe start at a
negative time, putting TI = T
1 forl ~ 0).
Then(whatever
theinitial
distribution) P(X~==/)~1-~~
1 andP (XN 1=, f ’) >_ 1- e - ~‘~T 1,
hence
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
Hence
Hence
End
of
theproof of proposition
1. 8.We deduce from
proposition
1 . 8 thatLEMMA 1 . 9. - There is a
positive
constantTo
such that in thecooling framework G (To,
H(r)), for
any smallenough
constants 03B13a2/2 03B11/10 putting
we have
472 O. CATONI
LEMMA 1 . 10. - There are
positive
constantsTo and 03B2
such that in thecooling framework G (To,
H(r))
we haveProof of
lemma 1.1 a. - The minimum of the functionis reached for
End
of
theproof of
lemma 1 . 10.Continuation
of
theproof of proposition
1 . 5. - Let usput
and i we have
from which we deduce
proposition
I . 5 with thehelp
of lemma 1. 1 0.End
of
theproof of proposition
1. 5.PROPOSITION 1 .11. - For any energy
landscape (E, U, q),
there existpositive
constantsTa
and oc such that in F(To), for
anyG0
we havefor
any n E N
Proof of proposition
1 .11. - Let us chooseTo
as inproposition
1 . 5.With the same notations in the framework ff
(To),
ifN 1 = +00,
for any nhence
equation (39)
is satisfied in this case.If
N 1
+ oo , we have in the same way for any 1and
equation (39)
is satisfied forNow
putting
= V(H (r), TI,
it is easy to see that it isenough
to prove
equation (39)
for n = uk, We deduce from lemma 1. 9 thatAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
if,
for some n> 1then
Let us choose a2 > 0 such that
We are able now to show that
equation (39)
is true for oc = a2 and n = uk,by
induction on k. It is true fork = 0,
1according
toequation (41).
Assume that it is true for somek >__ 1,
then we candistinguish
betweentwo cases:
hence
hence
End
of
theproof of proposition
1 . I I.THEOREM 1 . I2. - For any
positive constants (3
and ’to,for
any energylandscape (E, U, q),
there existpositive
constants ac and N such that in thecooling framework
~(’to), for
any initial distribution~Q satisfying
we
have for
any474 O.CATONI I
Proof.
LEMMA 1 . 13. - For any
positive
constants Toand 03C41
such that io > il, anypositive
constant~3,
any initial distribution such that(T) >_ (3,
there is a
positive
constant ocl such thatputting
we have in the
cooling framework
iF(To)
Remark. -
Proposition
1 . 11 issharper
but worksonly
for lowtemper-
atures.
Proof of
lemma 1.13 . - As q issymmetric irreducible,
it is easy to see thatPTo
is recurrentaperiodic,
hence we can considerLet a2 =
r).
As E isfinite, a2 > o.
For any T such thatiEE
for
i,jEE
where
Hence,
ifN ~
rIf N r,
putting
we have
End
of
theproof of
lemma 1 . 13 .With the notations of lemma 1 . 13 we
put u0 = N and
un + = N
(H (r),
- a2,un)
where a2 is thepositive
constant of propo- sition1 . 5,
and where Tiis
the"To"
ofproposition
1 . 5. Hence we havewhere
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
Let us
put
and
For
n >_
1 such thatun + 1 >_ M,
wedistinguish
between two cases:in which case we deduce from
proposition
1 . 11 thathence that
in which case,
putting
x~ =a2 ,
we have2H(r)
provided
that il has been chosen smallenough. Lowering
its value ifnecessary, we can assume that a4 1.
If
M ~ N,
we deduce from 1 and 2 that for some a4 > 0If M > N, putting
we have for any n such
that n >_
R + 1476 O.CATONI
Let us recall that M is a constant
(contrary
toN).
We deduce fromequation (69)
thatWe deduce from
equation (67)
that there existpositive
constantsM2
and a 5 such that for any n such that
Now let us consider some
arbitrary n >__ M2.
Let usdistinguish
betweentwo cases:
1. In case
n >_ N,
let us consider k such that We have forsome
positive
r16 and a~2.
Thus we have
proved
that for anyn >__ M 2
hence for some
positive
a§ andM3
we haveEnd
of
theproof of
theorem 1 . 12.1.2. The critical feedback
cooling
scheduleOur aim will be now to prove that theorem 1.12 is
sharp.
We have alower bound for P
(Xn
Er)
whatever thecooling
schedule may be. We willnow be
looking
for acooling
schedule of thetype
for which Pr)
isequivalent
to this lower bound. We will thus prove that there exist values of A and B for which isasymptoti- cally minimizing P (Xn E r).
We will not
study directly
the closed form instead wewill define
1/Tn by
a feedback relation of thetype
It is easy to guess from lemma 1.9 what the
optimal
value of the constant c should be.Solving
thisoptimal
feedbackrelation,
we findrlc. Henri Pnlll(’llr’c’ - Probabilités et Statistiques
with
for nlarge
andaccordingly
Then we show that P
(Xn
Er)
is stable withrespect
to smallperturbations
of
1 /Tn
and conclude thatequation (76)
still holds whenwith the
"optimal"
B.Cooling
schedules of thetype
with nonoptimal
B are studied in the same way
through suboptimal
feedback relations.DEFINITION 1. 14. - Let
(E, U, q, lt) be a simple annealing framework.
Wedefine
the criticalfeedback annealing algorithm
to be theannealing algorithm (E, U,
q,T, X) defined by
where we have put
To
=To.
PROPOSITION 1 . 15. - For any energy
landscape (E, U, q),
there is apositive
constant T _ 1 suchthat for
anysimple cooling framework ~ (To)
such that
To _
T _ 1, there arepositive
constants N and a suchthat for
any initial distribution~o
we haveCOROLLARY. - With the notions
of
theproposition, for
any constants~i
>0,
T _ 1 >0,
there existpositive
constants N and a such thatfor
any~o,
iF
(To)
such that~o (r)
>~3
and T _ 1, we haveProof of proposition
1 . 15. - In all thisproof
we willforget
the tildeson the Ts.
For some
positive
constant a to be chosen later on, let usput uo = 0
and
We have for any
478 O. CATONI
hence
Hence there exists a
positive
constant K such that in %(To,
H(r))
wehave
We start with the
rough inequalities:
for any m for anyhence for some constant K
With the
help
of theseinequalities
we deduce from lemma 1 . 6 thatIt is easy to see that if a and
T _
1 are smallenough,
then the last term isnegligeable
before(un + 1- (r»/Tun,
indeed forAnnales de I’Institut Henri Poincaré - Probabilités et Statistiques
r1
~ (H (r) -
S(E))/2
we haveHence we
have, omitting
thenegative
term aswell,
for somepositive
constant K
Thus there is a
positive
constant k such thatThis enables us to
sharpen
ourrough inequalities
of thebeginning:
form >_ N
(H (h), Tun -1, -
2 a,1 ),
m un + 1On the other
side,
also for m >_ N(H (T),
2 a,un _ 1 ),
m un + 1 we havefor some
positive
constants k andP.
480 O.CATONI
Hence, comming
back to lemma1.6,
weget
thatNoticing
thatwe deduce from this last
inequality
that there arepositive
constantsTo
and
P
such that in G(To,
H(r))
Let us recall now that
hence we see from the
preceding inequality
thatOn the other hand we know from the
study
of the chain at constanttemperature
that for fixed smallenough To
there is some constant N suchthat
From these remarks we deduce that for any small
enough positive
choice of
To
there arepositive
constantsN, P,
such that for~c >_
Nor
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
Let us
put pn
= P(Xn
Er),
weget
that for any n > 1hence
for un >
NWe know that
for un>N
hence
choosing To
smallenough
we havehence
putting
we have
Hence for some
positive
constantK, ~i > o, putting + ~ -1 ) - c 1 + s~~
we have
but
hence
Moreover
uno _ 1 ) --
2e~H
~r) - and482 O. CATONI
for
To
smallenough.
Hencefor un large enough.
Now for have
hence there is a
positive P
such that for anylarge enough n
End
of
theproof of proposition
1 . 15.1.3. Other feedback relations
As we have
already explained
at thebeginning
of the lastsection, changing
the constants in the feedback relation leads tocooling
scheduleswith critical A and non-critical B. One
relatively
innocu-ous
change
is toplay
with constant p inIn order to
get
adecreasing
sequenceTn
we should havep > IF (r) I,
because is the
asymptotical
thermalequili-
brium
equation
forcycle
r at lowtemperatures.
For pranging
in) I F (h) ~, + oo (,
weget
with Branging
in) -
oo,Bo(
whereBo
is a critical value which can becomputed [cf. equation (135)].
A more drastic move away from thermalequilibrium
is tochange
in
(115)
thepotential U(F)
of F to some lower value h. As the influence of achange
in pcompared
with achange
of h isminor,
we chose therelation:
with
h U (r).
With thisrelation,
we findB
ranging
in)Bo,
+oo (.
The conclusion is that the thermal
quasi-equilibrium equation
forcycle
r can be
deeply
affectedby
achange
of the additive constant Bonly, eventhough
themultiplicative
constant A iskept
to the critical value.DEFINITION 1.16. - Let
(E, U,
q,20)
be an energylandscape
withcommunications. For any
positive
constantTo, for
any real numbersI
andh, 0 h U (T),
wedefine
the subcriticalannealing algo-
rithms
A1 (E, U, q, To, 20, P)
andA2 (E, U,
q,To, h) by
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
.
~1 (E, U,
q,To, 20, p)=(E, U,
~’~20, T, X)
with.
d2(E, U,
q,To, 20, h) _ (E, U,
q,20, T, X)
withPROPOSITION 1 . 17. - Let
(E, U, q)
be an energylandscape.
For anyp > ~
F(T) I,
anypositive
y and any smallenough positive To,
there existpositive
constants N and a such thatfor
any initial distribution suchthat
(I-’)
> y, theannealing algorithm satisfies
Consequently
there is N(the same)
and apositive
a such thatProof of proposition
1 . 17. - It is an easyadaptation
of theproof
ofproposition
1 . 15:putting
uo = 0 andwe have in the same way for some
positive
constants K andTo
in theframework %
(To,
H(T))
from which we deduce that there are
positive
constantsp, To
such that in~
(To, H (r)), putting
we have for n > 0
484 O. CATONI
The end of the
proof
areelementary
calculationsstemming
from thisformula and will not be detailed.
End
of
theproof of proposition
1. 17.PROPOSITION 1. 18. - Let
(E, U, q)
be an energylandscape.
For anypositive
constants y and h U(T), for
any smallenough To,
there existpositive
constants N and oc such thatfor
any initial distribution such that(I-’) >_
y, theannealing algorithm
satisfies
Consequently
there exist N(the same)
and apositive
a such thatRemarks. - The
feedbacks
~ 1 are not toofar
the critical one, in thesens that P
{~n
ET)
decreasesaccording
to the same powerof
n, butfor
thefeedbacks A2
the powerof
n ischanged
and tends to 0 when the parameter h tends to 0. This shows that in the classof cooling
schedulesof
theparametric form
the rate
of
convergenceof P (Xn E T)
towards0,
and hence the rateof
convergence
of
P(U (Xn)
= U(E))
towards 1 isstrongly dependent
on thechoice
of
the constant term B. Thisfact
does not appear in the existant literature and is worthbeing
noticed.Proof of proposition
1. 18. - It followsagain
the same lines as theproof
ofproposition
1. 15.We choose a such that
M (T, E - r)~, i E T, j E B (r),
andM (h, E - T)E
are
a-adjacent to q(r) ~ ( H ( h
We have~
Annales de l’Institut Henri Poincar0 - Probabilités et Statistiques
Hence there exists a
positive
constant K such thatfrom which we deduce estimation
(124)
as in theproof
ofproposition
1.15.
We see in the same way that there is some constant N
independent
ofG0
and somepositive P
such thator
Hence we deduce
that, putting
Pn = P(Xn
Er),
we havewith ~, ~
Hencewith
The end of the
proof
is as the end of theproof
ofproposition
1.15.End
of
theproof of proposition
1 . 18.1.4.
Stability
under smallperturbations
of thecooling
scheduleIn the
preceding
section we have built feedbackcooling
schedules withasymptotics
with
lim En
= 0 andn -~ 00
A natural
question
is then to ask for the behaviour of smallperturbations
of these
cooling schedules, i.
e. tostudy
theasymptotics
of P(X~
Er)
whenwe
modify
the sequence En. The present section isanswering
thisquestion.
PROPOSITION 1 . 19. - Let
(E, U, q)
be an energylandscape.
Let B be areal constant such that B
Bo
whereBo
isdefined by equation (135).
Let p486 O.CATONI I
be the
positive
real constantdefined by
Let
~o
be an initial distribution on E. LetT
be thecooling
schedule and letX
be the Markov chainof
thefeedback annealing algorithm
~1 (E, U,
q,p).
Let be a sequenceof
real numberssatisfying
and
is a
cooling
schedule(i.
e. isdecreasing,
infact
the reader willeasily
seethat this condition is not
necessary).
Thenfor
theannealing algorithm (E, U,
q,T, X)
we haveuniformly
in Moreoverif
there existpositive
constants N and a suchthat
then there exist
positive
constants N’and 03B2
such that for any initial distribution~o
Consequently
there existpositive
constants N and a such thatfor
any initial distribution theannealing algorithm (E, U,
q,T, X)
associated with thecooling
schedulesatisfies
PROPOSITION 1 .20. - Let
(E, U,
q, be an energylandscape
withinitial distribution
satisfying (I-’)
> 0. Let B be a real constant suchthat B >
Bo.
Let h bedefined by
Annales de l’Institut Henri Poirzcare - Probabilités et Statistiques
[which implies
that 0 h U(h).]
Let(En)n E
~~ be a sequenceof
real numbers such thatfor
somepositive
a and N and any n >_ Nand such that
is
increasing.
Let
(E, U,
q,T, X)
be thefeedback annealing
~2 (E, U,
q,To, h)
whereTo
is as inproposition
1 . 18.There exists a
positive
constant ~, such thatMoreover, if
we haveonly
we have still
Remark. - The limit ratio ~,
depends
on the sequence 8of perturbations
and on the initial distribution
~fo.
In conclusion we can say that the feedback
algorithms
oftype A1
arestable under small
perturbations
whereas the feedbackalgorithms
oftype j~2
are stableonly according
to thelogarithmic equivalent
of P(Xn
er).
Pr’oo,f’s. -
Let a > 0 be chosen such that in some framework~ (To, H(F))
for anyM (r, E -1,)i
andM (h,
are
a-adjacent F(H(0393)).
Let usput u o = 0
andLet us
put
and We have as in theproofs
of
propositions 1. 15,
1.17 and 1 . 18for some
positive
constant K.We deduce from this that for
any k >
1488 O.CATONI with
and |~ (k)
for somepositive
constantp.
In the same way we have
with
for some
positive constant P.
[We
can use the same sequence un in both cases because for somepositive
constantsKl
andK2, depending
on ~~, we haveand it is
really
all what we needabout M~.]
Let us
put
We have for/~
1and
with
for
a/~
and smallenough.
Hence
Annales de l’In.stitut Henri Pnincnr-e - Probabilités et Statistiques
with
Let us
distinguish
at thispoint
betweenproposition
1. 19 and 1.20.CASE OF PROPOSITION 1. 19. - We have
and
hence
with
In the same way
hence
with
490 O. CATONI
From
equation (160)
we deduce thatand from
and
we deduce that
Now if for some N > 0 and any n > N
then
for uk >
N we have for somepositive P
and
hence for some
P
> 0with
5(k) ~
EHence
Now there is N > 0 such that for any
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
hence for N uk un
As
(k)
is bounded we canput
We have
for
some ~ > 0.
This ends the case
of proposition
1 . 19.CASE OF PROPOSITION 1. 20. - Here for some
~3
> 0hence in case for n
large enough
and
(we
do not know any more thesign
ofdk).
Hence the infiniteproduct
is convergent towards some
strictly positive value,
492 o. CATONI
and xn
converges towardsIn case we have
only
we do not know
whether £ )
+ o0 or not. However we have therougher
estimatesbecause
On the other hand we know that
hence we have
but
with 111 (k) _ e -
henceAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
In the same way
hence
Hence
and
from which it is easy to end the
proof
ofproposition
1 . 20.End
of
theproofs of
the section.2. ASYMPTOTICS OF THE LAW OF THE SYSTEM
In this section we draw the conclusions of the last section
concerning
the law of the
system.
We consideragain converging cooling
schedules of thetype
andnon-degenerate
energylandscapes.
Westudy
the behaviour of the
system
in the domain obtainedby removing
fromthe states space the
ground
state and onepoint
in the bottom of thecritical
cycle.
In this domaintemperature
can be considered to be almost constant. Therefore it isstraightforward
to derive anequivalent
of thelaw of the
system
from anequivalent
of theprobability
to be in any one state of the bottom of the criticalcycle.
We will consider
cooling
schedules of thefollowing types
DEFINITION 2.1. - Let
A,
Band u bepositive
constants, let N be apositive integer.
Wedefine
thecooling framework
~(A, B,
a,N)
to be theset
of cooling
schedulessatisfying
thefollowing
assertion:for
any n >_ N wehave
494 O. CATONI
We will look for an
equivalent
of the law ofXn
inparametric cooling
frameworks Yf in which the
annealing algorithm
converges. We will sometimes make theasumption
that the energylandscape (E, U, q)
is non-degenerate
in the sens of definition 1. 2.This
study
can becompared
with results inHolley
and Stroock[13], Chiang
and Show[6]
andHwang
and Sheu[15].
Theorem 2. 2 isproved
in
[6]
as well as in[15].
The introduction of the constant B is new. Itplays
animportant
role in theorems 2.4 and 2. 5. These theorems show thatHwang
and Sheu’s claim that results for A > H‘(E)
extend to the caseA = H’ (E)
is wrong(cf. [15]
th.4 . 2.).
THEOREM 2.2. - Let
(E, U, q)
be any energylandscape (non necessarly
non
degenerate).
Let A and B be real constants. Assume thatLet a be any
positive
constant, letN1
be apositive integer.
Then there existpositive
constantsN2 and 03B2
such thatfor
any initial law in theannealing framework (E, U,
q,~o, ~ (A, B,
a,N1), ~’)
we havefor
any i E EProof of
theorem 2. 2. - We write for i E E and somef
E F(E)
We will need the
following
lemma:LEMMA 2 . 3. - For any energy
landscape (E, U, q), for
anyf E
F(E),
any
cycle
C cC
E ~~(E - ~ f ~),
there arepositive
constantsTo,
a such thatin the
cooling framework ~ (To,
H(E - { f ~)),
the GTKis
o f
classProof of
lemma 2.3. - Let us examine first the case whenC E ~~ (E - ~ f ~).
Let G be the smallestcycle containing f
and C.We know from the
proof
1 in theorem 1.2.25 thatM (G - ~ , f ~, C) f
is of classAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
Moreover M
(E - ~.f ~’ C)f
(204)
M (G - ~ f ~, E-G)
is of class~l (H (G), H (E - ~ f ~))
and for anyj e E - G,
the kernelM (E - ~ f ~, C)E
is of class~~ (0,
H(E - ~, f ~))
(from proposition 1.4.5),
henceby
thecomposition
lemmas~ M (G - ~ f ~, E - G) M (E - ~ f ~, C) ~ f
is of class~l (H (G),
H(E - ~ f ~)).
But
H (G) > H (C) + U (C),
henceM (E - { f ~, C) f
is itself of classl(H(C)+U(C), q (C), H (E - { f ~), a).
The case of a
general
isby
induction: Assume that the lemma is true for somecycle
G m E. We will show it is still true for any C E ~(G) (the
naturalpartition
ofG).
For this purpose we writeand for any
i E G, g being
somepoint
ofF (G),
(for i = g
there isonly
the secondterm).
This formula shows that
is of class
for some
positive
a in H(E- ~.f’~)). [The
first term isnegligea-
ble because H
(G)
> H(C)
+ U(C) - U~.]
Hence we deduce from
equation (205) (where
the first term isagain negligeable
because H(G)
+ U(G)
> H(C)
+ U(C))
thatis of class
for some
positive
contant a insome % (To, H (E- ~.f’~)).
End
of
theproof of
lemma 2 . 3.Vol. 27, n° 4-1991.
496 O. CATONI
Let us come
back
toequation (203).
Let us assume that H’(E) 1 /2.
For m ~ n we
have )
_/
Hence for n
large enough
there areL~
such thatPutting
we have
Hence
hence
for nlarge enough, putting
we have
Hence there exist
positive
constants N and~i such
that for n > NAnnales de l’lnstitut Henri Poincaré - Probabilités et Statistiques
Thus, as f is
a concentration set ofE,
we have for somepositive 03B22 and any l~ L2,
Thus there are
positive
constants a and b suchthat,
for n > 1~T and somepositive ~i~
but
for n
large enough,
henceOn the other hand for n
large enough
and somepositive a, b, ~34
anda~
End
of
theproof of
theorem 2. 2.THEOREM 2 . 4. ~ Let
(~, U, q)
be anon-degenerate
energylandscape.
Let B be a real constant
satisfying
where
Bo
isgiven by equation (135).
Let p bedefined by equation (136).
There exist
positive
constants such thatfor
anypositive
constantsN1
and a, there arepositive
constantsand 03B2
such thatfor
any initiallaw G0
in theannealing framework (E, U,
q,(H (T), B,
a,N1), X)
we have