A replica analysis of the travelling salesman problem


HAL Id: jpa-00210320

https://hal.archives-ouvertes.fr/jpa-00210320

Submitted on 1 Jan 1986

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.



M. Mézard, G. Parisi

To cite this version:

M. Mézard, G. Parisi. A replica analysis of the travelling salesman problem. Journal de Physique, 1986, 47 (8), pp.1285-1296. ⟨10.1051/jphys:019860047080128500⟩. ⟨jpa-00210320⟩


M. Mézard (*) and G. Parisi (**)

Dipartimento di Fisica, Università di Roma I, Piazzale Aldo Moro 2, 00185 Roma, Italy (Received 27 January, accepted 23 April 1986)

Abstract. We propose and analyse a replica symmetric solution for random link travelling salesman problems. This gives reasonable analytical estimates for thermodynamic quantities such as the length of the shortest path.

Classification Physics Abstracts 05.20 - 75.0 - 85.40

1. Introduction.

In trying to understand the theory of spin glasses, a number of methods and concepts have been developed which seem to have possible applications beyond statistical mechanics, particularly in the field of combinatorial optimization.

It was soon realized [1] and demonstrated [2] that the determination of the ground state of an infinite range spin glass is an NP complete problem. As this is probably the NP complete problem which has been most studied by physicists, let us briefly recall the kind of results that can be obtained in this case. The infinite range Ising spin glass is a set of $N$ spins $\sigma_i = \pm 1$ interacting through random couplings $J_{ij}$ which are independent Gaussian random variables with $\overline{J_{ij}} = 0$ and $\overline{J_{ij}^2} = 1/N$. The $J$'s are quenched variables given once for all (a given sample, i.e. a set of $J$'s, is an instance of the problem), and the problem consists in finding the ground state configuration of the spins, that is the one which minimizes the Hamiltonian

$$H = -\sum_{i<j} J_{ij}\, \sigma_i \sigma_j .$$

NP completeness means that there is no known algorithm which can solve every instance of this problem in a time growing no faster than a power of $N$, and it is unlikely that such an algorithm can be found [3]. A good heuristic (an algorithm which provides an approximate solution) is the simulated annealing method, which consists in sampling the configurations with a Boltzmann-Gibbs probability $e^{-H/T}$ with the Monte Carlo method, and smoothly decreasing the temperature [4]. This is a general and powerful approach for many complex optimization problems [4-7].
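As an illustration of the heuristic just described, the following Python sketch anneals one small sample of the infinite range model. It is a minimal sketch under stated assumptions and not the procedure of refs. [4-7]: the function names, the single-spin-flip Metropolis rule and the temperature list are choices made here for brevity.

```python
import math
import random


def sk_couplings(n, rng):
    """Symmetric Gaussian couplings J_ij with mean 0 and variance 1/n."""
    J = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            J[i][j] = J[j][i] = rng.gauss(0.0, 1.0 / math.sqrt(n))
    return J


def energy(J, s):
    """H = -sum_{i<j} J_ij s_i s_j for a spin configuration s."""
    n = len(s)
    return -sum(J[i][j] * s[i] * s[j] for i in range(n) for j in range(i + 1, n))


def anneal(J, temperatures, sweeps_per_t, rng):
    """Metropolis sampling at each temperature of a decreasing list.

    Returns the final spins, their energy, and the lowest energy visited.
    """
    n = len(J)
    s = [rng.choice((-1, 1)) for _ in range(n)]
    e = energy(J, s)
    best = e
    for t in temperatures:
        for _ in range(sweeps_per_t * n):
            i = rng.randrange(n)
            # Local field on spin i; flipping s_i changes H by 2 * s_i * h_i.
            h = sum(J[i][j] * s[j] for j in range(n) if j != i)
            delta = 2.0 * s[i] * h
            if delta <= 0 or rng.random() < math.exp(-delta / t):
                s[i] = -s[i]
                e += delta
                best = min(best, e)
    return s, e, best
```

A call such as `anneal(sk_couplings(50, rng), [2.0 - 0.1 * k for k in range(19)], 20, rng)` with `rng = random.Random(0)` returns an approximate ground state; dividing the lowest energy by $N$ gives an estimate that can be compared with the analytical value of the ground state energy density.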

On the other hand some kind of analytical information is also available. The ground state energy density $E/N$ (minimum of $H/N$) converges to a certain value $E_0$ for large $N$, independently of the sample (1).

The replica method allows one to show that there is a phase transition in this system at the critical temperature $T_c = 1$. Below $T_c$ there is a breaking of ergodicity and one needs replica symmetry breaking. A scheme for such a breaking was proposed in [8], which allows us to compute $E_0$ precisely. This kind of prediction is essentially probabilistic (for large $N$ the value of $E/N$ for a given sample is a Gaussian random variable of mean $E_0$ and of a width that vanishes as $N$ grows), but it can be of direct interest, for instance for testing heuristics.

(1) This property of "self averageness" is obtained with the replica method and well established numerically, but has not yet been demonstrated rigorously.

Article published online by EDP Sciences and available at http://dx.doi.org/10.1051/jphys:019860047080128500

Although other relevant information can be obtained from the replica solution of this model (e.g. the ultrametric topology of the space of equilibrium states below $T_c$ [9] and the correlations of the spin values in these states [10]), no algorithm has been found yet which can use this information in order to speed up the search for the ground state. Naturally the simulated annealing algorithm is the one which takes the greatest advantage of the analytical results of thermodynamics: once one knows the critical temperature $T_c$ (or a freezing temperature in the case where there is no sharp phase transition), one must perform a very slow annealing around $T_c$, while the regions far above $T_c$ (where the system thermalizes rapidly) or far below $T_c$ (where the system is frozen) can be swept more rapidly.
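The remark on annealing speed can be made concrete with a temperature schedule that takes small steps near the critical (or freezing) temperature and large steps elsewhere. The sketch below is only one possible way to do this; the function name, the step sizes and the width of the slow window are hypothetical parameters, not values taken from the literature.

```python
def annealing_schedule(t_hi, t_lo, t_c, coarse_step, fine_step, window):
    """Decreasing list of temperatures from t_hi to t_lo.

    Steps of size fine_step are used while |T - t_c| <= window,
    steps of size coarse_step everywhere else.
    """
    temps = []
    t = t_hi
    while t > t_lo:
        temps.append(t)
        step = fine_step if abs(t - t_c) <= window else coarse_step
        t -= step
    temps.append(t_lo)  # always finish exactly at the lowest temperature
    return temps
```

With equal Monte Carlo time per temperature, such a list spends most of the computing time in the window around $T_c$, which is where thermalization is slowest.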

In this paper we want to show how the replica method can be used generally to get the same kind of information as described above on other NP complete problems. We have chosen to apply it to the travelling salesman problem mainly for aesthetic reasons (it is among the easiest NP complete problems to state, and probably the most studied), and also because some numerical data were previously available. It is also a problem rather far from the spin-glass one. This will exemplify one power of the replica method: the fact that no a priori knowledge of the nature of the order and of the order parameter is required.

This method has already been used to predict the ground state energy in the matching problem [11] (a polynomial problem) and in the bipartitioning of infinitely connected random graphs, which was shown to be equivalent to the S.K. model [12].

In section 2 we describe the problem and its statistical mechanics representation.

In section 3 we propose a replica symmetric solution and analyse it. It is the most technical part of this paper and can be skipped by the reader who does not appreciate the beauty of replica computations.

The results are presented and discussed in section 4.

2. Statistical mechanics of the travelling salesman problem.

The travelling salesman problem (TSP) is the following. Given $N$ points and the distances $d_{ij}$ between any two of them, find the shortest tour, i.e. closed connected path, that goes through all the points.

A popular family of TSP's is the one where the $d_{ij}$'s are Euclidean distances between points distributed randomly in a square. The resulting correlations between the distances allow for very powerful heuristics (for instance a combination of simulated annealing and "divide and conquer" strategies [4-6]), but they are difficult to take into account in the replica method.

In the more general case the $d_{ij}$ represent complicated costs which do not possess Euclidean correlations. We shall study a family in which good numerical results are far more difficult to achieve than in the Euclidean case, the one where the $d_{ij}$'s are independent random variables with the same distribution $p(d)$ [7-13] (we keep to the symmetric case $d_{ij} = d_{ji}$). In this case the low temperature properties, and particularly the length of the shortest tour, depend only on the behaviour of $p(d)$ at small $d$. If

$$p(d) \sim \frac{d^{\,r}}{r!} \quad (d \to 0), \qquad (2.1)$$

the nearest neighbour of any point $i$ is at a distance $d \sim N^{-1/(r+1)}$, and one can show [13] that the length of the shortest tour is of order $N \times N^{-1/(r+1)}$. A tricky case is $r = 0$, in which the best known upper bound on the shortest tour is of the order of $\mathrm{Log}\, N$. This case has also been extensively studied numerically by Kirkpatrick and Toulouse [7], and for these reasons our numerical results in the next sections will be given in this case. We predict a finite length of the shortest path, and not a $\mathrm{Log}\, N$ growth.
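To fix ideas, a random link instance with $r = 0$ can be generated in a few lines. The sketch below assumes, for illustration only, that the $d_{ij}$ are uniform on $[0, 1]$ (a distribution with a finite density at $d = 0$); the function names are hypothetical. It also measures the mean nearest neighbour distance, which for $r = 0$ is expected to scale as $1/N$.

```python
import random


def random_link_instance(n, rng):
    """Symmetric matrix of independent links d_ij = d_ji, uniform on [0, 1]."""
    d = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d[i][j] = d[j][i] = rng.random()
    return d


def tour_length(d, tour):
    """Length of the closed tour visiting the cities in the given order."""
    n = len(tour)
    return sum(d[tour[k]][tour[(k + 1) % n]] for k in range(n))


def mean_nearest_neighbour(d):
    """Average over sites of the distance to the nearest other site."""
    n = len(d)
    return sum(min(d[i][j] for j in range(n) if j != i) for i in range(n)) / n
```

For a sample of a few hundred cities, `n * mean_nearest_neighbour(d)` stays of order one, in line with the scaling $d \sim N^{-1/(r+1)}$ at $r = 0$.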

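In order to extract from the tours those which contribute, it was shown in [13] that the partition function which is naturally introduced in the statistical mechanics description of optimization problems must be of the form:

$$Z_{TSP} = \sum_{\text{tours } t} \exp\left(-\beta\, N^{1/(r+1)}\, L_t\right), \qquad (2.2)$$

where $L_t$ is the length of the tour $t$ and $\beta$ is an inverse "temperature". The energy density $E = -\frac{1}{N}\, \partial\, \mathrm{Log}\, Z_{TSP} / \partial \beta$ is $N^{-r/(r+1)}$ times the average length of the tours at the temperature $T = 1/\beta$.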

The whole difficulty of the TSP lies in the fact that the sum in (2.2) is over tours, i.e. closed connected paths. This constraint of connectivity is highly non-local and difficult to implement with local variables.
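For very small $N$ the sum over tours can simply be enumerated, which makes the meaning of the partition function explicit. The sketch below assumes that each tour $t$ of length $L_t$ carries the Boltzmann weight $\exp(-\beta N^{1/(r+1)} L_t)$; this scaling, the uniform links and all function names are assumptions of the example. The enumeration costs $(N-1)!/2$ terms, which is why an analytical method is needed at large $N$.

```python
import itertools
import math
import random


def random_links(n, seed):
    """Hypothetical helper: symmetric links, uniform on [0, 1] (r = 0)."""
    rng = random.Random(seed)
    d = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d[i][j] = d[j][i] = rng.random()
    return d


def tours(n):
    """Yield each closed tour once: city 0 is fixed, mirror images dropped."""
    for perm in itertools.permutations(range(1, n)):
        if perm[0] < perm[-1]:
            yield (0,) + perm


def thermal_average_length(d, beta, r=0):
    """Average tour length with weights exp(-beta * N^(1/(r+1)) * L).

    Returns (average length, shortest length, number of tours).
    """
    n = len(d)
    scale = n ** (1.0 / (r + 1))
    lengths = [sum(d[t[k]][t[(k + 1) % n]] for k in range(n)) for t in tours(n)]
    l_min = min(lengths)
    # Subtract l_min in the exponent to avoid underflow at large beta.
    weights = [math.exp(-beta * scale * (length - l_min)) for length in lengths]
    z = sum(weights)
    average = sum(w * length for w, length in zip(weights, lengths)) / z
    return average, l_min, len(lengths)
```

At $\beta = 0$ all tours are equally weighted; as $\beta$ grows the average length decreases towards the length of the shortest tour.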

We use a procedure that has been invented in the theory of polymers [14] and introduce on each site $i$ an $m$-component spin $\mathbf{S}_i$ of fixed length $\mathbf{S}_i^2 = m$. As was shown by Orland [15], one is then led to study a generalized partition function (2.3), which reduces to $Z_{TSP}$ in the limit $m \to 0$, $\gamma \to \infty$ [15] (2.4), as can be shown using the special simplifications of integrals over $\mathbf{S}$ in the limit $m \to 0$, summarized by the formula (2.5).


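As we explained in the introduction, extensive thermodynamic quantities such as the free energy $F = -\frac{1}{\beta}\, \mathrm{Log}\, Z_{TSP}$ are self averaging, which means that $F/N$ goes for $N \to \infty$ towards a limit which is sample independent. Hence we shall try to compute the average $\overline{\mathrm{Log}\, Z_{TSP}}$, using the replica method which consists in computing $\overline{Z^n_{TSP}}$ and using:

$$\overline{\mathrm{Log}\, Z_{TSP}} = \lim_{n \to 0} \frac{\overline{Z^n_{TSP}} - 1}{n} . \qquad (2.6)$$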
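The replica identity $\overline{\mathrm{Log}\, Z} = \lim_{n \to 0} (\overline{Z^n} - 1)/n$ can be checked numerically on any ensemble of positive numbers. The sketch below uses a toy log-normal ensemble, which has nothing to do with the TSP and is chosen only because the averages are easy to sample; the function names are hypothetical. It also shows that the "annealed" quantity $\mathrm{Log}\, \overline{Z}$ is a different object from the quenched average.

```python
import math
import random


def replica_estimate(z_samples, n):
    """(mean(Z^n) - 1) / n, which tends to mean(log Z) as n -> 0."""
    mean_zn = sum(z ** n for z in z_samples) / len(z_samples)
    return (mean_zn - 1.0) / n


def quenched_average(z_samples):
    """Direct sample average of log Z."""
    return sum(math.log(z) for z in z_samples) / len(z_samples)


def annealed_average(z_samples):
    """log of the sample average of Z (always >= the quenched average)."""
    return math.log(sum(z_samples) / len(z_samples))


def lognormal_samples(count, seed):
    """Toy ensemble: Z = exp(x) with x a standard Gaussian variable."""
    rng = random.Random(seed)
    return [math.exp(rng.gauss(0.0, 1.0)) for _ in range(count)]
```

On the same set of samples, `replica_estimate(zs, 1e-4)` agrees with `quenched_average(zs)` up to corrections of order $n$, while `annealed_average(zs)` is visibly larger.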

From (2.4) one must compute $\overline{Z^n}$, which is given by (2.7), where we have introduced, on each site $i$, $n$ replicas of the original spin variable: $\mathbf{S}_i^a$, $a = 1, \ldots, n$.

Because of the statistical independence of the $d_{ij}$'s and of the introduction of replicas, the averages over each $d_{ij}$ decouple in (2.7); using the one-link average (2.8), with $g_p$ defined in (2.9), one finds the expression (2.10),

where the $a$'s, going from one to $n$, are replica indices while the $\alpha$'s, going from one to $m$, characterize the various spin components. A crucial remark is that in (2.10) the terms where at least two replica indices are equal lead to vanishing contributions in the limit of the TSP (2.4) (which must be taken before the $n \to 0$ limit), as is shown in appendix I. A Gaussian transformation decouples the various sites in (2.10), leading to (2.11), where $z$ is the one-site partition function (2.12).

In principle, formulae (2.11) and (2.12) give the free energy of the TSP in the large $N$ limit through a saddle point evaluation of the integral in (2.11). However it is well known from the theory of spin glasses that this saddle point is not easy to find in the limit $n \to 0$. Furthermore there are two other complications here: the appearance of all the order parameters $Q_a, Q_{ab}, \ldots, Q_{a_1 \ldots a_p}$ (similar to the case of the matching [11]), and the presence of a polymeric index $\alpha$ going from one to $m$, where $m \to 0$. We shall propose and analyse a solution of these saddle point equations in the next section.

Before going to this analysis, let us explain how other quantities, besides the thermodynamic functions, can be deduced from the saddle point values of the $Q$'s.

An interesting piece of information is the average distribution $P(q)$ of overlaps between the relevant tours at a given temperature. We define the overlap $q_{t,t'}$ between two tours $t$ and $t'$ as their number of common bonds, divided by $N$ [7]. The characteristic function $g(y)$ of $P(q)$ is defined by equation (2.13).
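The overlap between two tours, i.e. their number of common bonds divided by $N$, is straightforward to compute for explicit tours. The following sketch represents a tour as a list of city labels (an assumption of this example, as are the function names) and treats bonds as undirected, consistently with the symmetric case $d_{ij} = d_{ji}$.

```python
def bonds(tour):
    """Set of undirected bonds {i, j} used by a closed tour."""
    n = len(tour)
    return {frozenset((tour[k], tour[(k + 1) % n])) for k in range(n)}


def overlap(tour_a, tour_b):
    """Number of bonds common to both tours, divided by the number of cities."""
    return len(bonds(tour_a) & bonds(tour_b)) / len(tour_a)
```

A tour has overlap 1 with itself and with its mirror image, since both use the same set of bonds; two tours with no bond in common have overlap 0.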


In order to compute $g(y)$ within the present field-theoretical approach, we follow the same method as in spin glasses [16] and introduce two copies of a given instance, i.e. two travelling salesmen who must visit the same set of cities. This can be done by using two copies $\mathbf{S}_i$ and $\mathbf{S}'_i$ of the $m$-component spins, with some suitable link auxiliary variables which count their number of common links. The denominator $Z^2_{TSP}$ in (2.13) can be obtained with the replica method by introducing $n - 2$ other independent replicas of the system: they contribute a factor $Z^{n-2}_{TSP}$, which gives the correct result for $n \to 0$. This whole computation is described in some detail in Appendix II.

The final result for $P(q)$ is given by (2.14), where the $Q_{a_1 \ldots a_p}$ take their saddle point values. This result is very similar to that already found in the matching problem, the only difference being the presence of vector spin indices.

Here also, one can define generalized overlaps between $k > 2$ tours as $1/N$ times the number of bonds common to all of them. Their distribution $P^{(k)}(q)$ is then given by (2.15), where the primed sum $\sum'$ is the sum over all choices of the indices $a_{k+1}, \ldots, a_p$, each of them being distinct from all the others and from the $a_1, \ldots, a_k$.

Finally let us also describe the computation of the distribution of lengths of the links. This strictly follows the computation of the matching [11], and we include it here for completeness. The important links are those of length $\sim N^{-1/(r+1)}$, and the distribution $P(L)$ is defined by (2.16), where $\sum_{(i,j) \in t}$ denotes the sum over all the $N$ links $(i, j)$ which belong to the tour $t$. In order to compute the moments of $P(L)$ we introduce a modified TSP partition function (2.17), from which the moments follow (2.18).

This modified partition function can be computed from a generalized function similar to $Z$ in (2.3), with a simple modification of the weight of each link. The corresponding one-link integral in (2.8) is changed accordingly (2.19), so the introduction of the extra weighting proportional to $\alpha$ can be simply absorbed into a change of $g_p$ in (2.11) (2.20).


From (2.18) and (2.20), we get (2.21), from which one obtains the distribution of lengths $P(L)$ (2.22).

3. The solution within a replica symmetric ansatz.

As the full set of saddle point equations is complicated, we have made some hypotheses, compatible with these equations, on the type of saddle point we look for.

As for the replica indices we make the assumption of replica symmetry (3.1): the order parameters are independent of the values of the $a$'s.

The spin component indices $\alpha$ must be treated differently since the symmetry in this space is a rotational symmetry instead of the permutation symmetry of the replicas. The problem is more similar to that of vector spin glasses, but in the limit where the dimensionality $m$ of the spin goes to zero. We thus make the simple hypothesis of a spontaneous breaking of the rotational symmetry (for large values of $\gamma$) (3.2), where $\mathbf{u}$ is a given vector which we normalize to $\mathbf{u}^2 = 1$. (3.1) and (3.2) are two strong hypotheses. We shall postpone their discussion to the next section and first compute the free energy of the TSP within this ansatz.

First of all let us notice that the $m^n$ term necessary to ensure the convergence of (2.11) in the TSP limit appears in $\overline{Z^n}$ from the integrals over the $Q$'s. Because of the spontaneous breakdown of the rotational symmetry there is a Goldstone mode associated with arbitrary independent rotations $\mathcal{R}_1, \ldots, \mathcal{R}_n$ in each replica, where the $\mathcal{R}_a$ are rotation matrices in the $m$-dimensional space of the vectors $\mathbf{S}^a$. As the surface of an $m$-dimensional unit sphere vanishes linearly with $m$ for $m \to 0$, the integrals over the $Q$'s in (2.11) are proportional to $m^n$ for small $m$.

One must now compute the one-site partition function $z$, in which the primed sum $\sum'$ means that all the indices must be different from each other. In the TSP limit (2.4) we must pick up the term of order $\gamma^n$ in $z$ and take its limits $m \to 0$ and then $n \to 0$. In Appendix III we show that the result is (3.5) (2).

From (2.11) and (3.5) we have the free energy of the TSP, where the $Q$'s satisfy the saddle point equations (3.7). One can introduce a generating function $g$, and from (3.7) one gets an integral equation for $g$ (3.8).

(2) $Dz$ is a Gaussian integration measure and the $He$ are Hermite polynomials. These quantities are defined precisely in the appendix.

This equation is rather similar to that found in the matching problem [11], apart from the complications due to Hermite polynomials and imaginary terms. Unfortunately the behaviour of the kernel for large values of the arguments is more complex and we have not been able to find out the correct asymptotic behaviour of $g$.

In this situation it was impossible to solve the integral equation (3.8) by a simple iterative method: we have used a different method to compute $\overline{Z^n}$, which consisted in a high-temperature expansion of the order parameters.

We shall present the formalism needed to build up this expansion in the general case where $g_p = (p\beta)^{-(r+1)}$, and we shall restrict our numerical computation to the case $r = 0$.

In (3.7) we rescale $\lambda$ and introduce a set of variables $X_i$, which satisfy equations (3.12) to (3.14). The energy density is expressed in terms of these variables. Clearly in the high temperature limit $\beta \ll 1$ the function $A(z, \lambda)$ can be approximated by 1, which gives the leading behaviour.

In order to compute the high temperature expansion of $2\beta E$ up to the order $\beta^N$ (here $N$ denotes the order of the expansion), one expands each $X_i$ ($1 \le i \le N$) up to the order $\beta^{N+1-i}$. This can be systematically done from (3.12) to (3.14) by first expanding the function $A(z, \lambda)$ to order $\beta^N$, then performing the integrals (3.12), (3.13) to establish the set of $(N + 1)$ equations between the $X_i$'s, and finally solving these equations iteratively.

We have carried out this expansion numerically in the case $r = 0$ up to the order $N = 20$. The coefficients $a_k$ of the resulting series for $2\beta E$ are given in the first column of the table.

Although the $a_k$ are rational numbers we have represented them on the computer as floating point numbers. In the actual computation quadruple precision (about 30 significant digits) has been used; from various checks we know that the error due to rounding increases exponentially with $k$ and only the first 10 digits of $a_{19}$ are correct. This accuracy is more than enough for our aims; the whole computation takes a few minutes on a Vax 8600. From the expansion of the $Q$'s one can compute other interesting quantities, but let us first show in the case of the energy how one can use this high temperature expansion to obtain the low temperature properties of the system. It is convenient to perform the following manipulations. We first put aside the first two terms and write:

We then rescale $\beta$ by a factor $5^{1/3}$ and Borel transform this series, introducing the coefficients $b_k = a_k\, 5^{k/3} / k!$, which are given in the second column of the table. The alternation of three plus signs with three minus signs indicates the presence of complex singularities in $u$ in the three directions $(-1)^{1/3}$. This suggests changing variables from $u$ to $x$ through the mapping (3.22). By trial and error, we have found a good choice for this mapping, in the sense that the series expansion (3.23) of the resulting function $f(x)$ converges rather well inside the unit circle. The coefficients $c_k$ are given in the third column of the table.
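The rescaling and Borel transformation of the coefficients, $b_k = a_k\, 5^{k/3} / k!$, can be sketched as follows. The list layout (index $k$ starting at 0) and the function names are assumptions of the example, and no coefficient of the table is reproduced here. The second function extracts the sign pattern of a coefficient list, which is the diagnostic used to locate the singularities.

```python
import math


def borel_coefficients(a, scale=5.0 ** (1.0 / 3.0)):
    """Return b_k = a_k * scale^k / k! for a list a indexed by k = 0, 1, ..."""
    return [a_k * scale ** k / math.factorial(k) for k, a_k in enumerate(a)]


def sign_pattern(coefficients):
    """String of '+' and '-' giving the sign of each coefficient."""
    return "".join("+" if c >= 0 else "-" for c in coefficients)
```

As a toy example of the diagnostic, the function $(1 + u + u^2)/(1 + u^3)$ has Taylor coefficients $1, 1, 1, -1, -1, -1, \ldots$, and its singularities sit at the three cube roots of $-1$; a pattern of three plus signs followed by three minus signs is therefore the signature of singularities in these directions.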

The ground state energy (length of the shortest tour) is the zero temperature limit of $E$, which can be evaluated directly from (3.23). However the successive approximants to $f(1)$ obtained by adding new terms to its series expansion still have some oscillations. A better method is to compute from (3.19)-(3.21) the function $E(\beta)$ at finite temperature. It turns out that the series is very convergent for $\beta \le 2$ (the relative fluctuations in the values of $E(\beta)$ obtained from successive orders from 10 to 20 are less than 1 %), and the temperature $T = 0.5$ is below the freezing region, so that $E(T)$ can be extrapolated safely to $T = 0$.

4. Results and discussion

In this section we present the results obtained within the replica symmetric solution described above. As has been explained in section 3, the high temperature expansion allows for a very precise computation of the function $E(T)$ (average length of the tours found at a given temperature) for $T > 0.5$. The result is plotted in figure 1. We have also computed in the same regime the entropy $S$ (4.1), where $E$ is the function introduced in (3.19).

Fig. 1. - Average length $L$ as a function of the temperature $T$, for $r = 0$. Note that, from (2.2), $T$ is $1/N$ times the "usual" temperature. In the upper left corner (with the upper left scale), the entropy $S$ divided by $T$, as a function of $T$. The error bars indicate the fluctuations in the values of $S/T$ obtained from successive orders in the high temperature expansion, between the 10th and the 20th order. These fluctuations are negligible (less than 1 %) in $L(T)$ for $T > 0.5$.

It turns out that $S(T)/T$ is nearly constant in the low temperature range, with a value of the order of $0.33 \pm 0.05$ (see Fig. 1), the estimate of the error being quite subjective.

This suggests that the specific heat (4.2) grows linearly at low temperature and so the energy should start quadratically in $T$ (4.3). This allows us to extrapolate the curve $E(T)$ down to $T = 0$, which gives the ground state energy (4.4), where the estimation of the error is again subjective.
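The extrapolation step can be sketched as a least-squares fit. If $S(T)/T \simeq c$ at low temperature, the specific heat is $C = T\, dS/dT \simeq cT$ and the energy behaves as $E(T) \simeq E_0 + cT^2/2$, so a linear regression of $E$ against $T^2$ gives $E_0$ as the intercept. The function name is hypothetical and the sketch is only one way of carrying out such an extrapolation; it is not claimed to be the procedure used for the numbers quoted in the text.

```python
def fit_quadratic_in_t(temperatures, energies):
    """Least-squares fit of E(T) = E0 + a * T^2; returns (E0, a)."""
    xs = [t * t for t in temperatures]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(energies) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, energies))
    a = sxy / sxx
    return mean_y - a * mean_x, a
```

With data points taken at temperatures where the series is reliable, the intercept estimates the ground state energy and $2a$ can be compared with the measured value of $S/T$ as a consistency check.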

When the temperature tends to zero the entropy converges to a value near 0 (e.g. $0 \pm 0.03$), but it is of course impossible to state whether $S(0)$ is exactly zero or not.

We have also computed the distribution of overlaps between the tours from formula (2.14). Within the replica symmetric ansatz $P(q)$ is a delta function at a value $q_0$ given by (4.5).

Table I. - Results for $r = 0$. $a_k$ is the coefficient of $\beta^{k-1}$ in the high temperature expansion of $2\beta E$. $b_k = a_k\, 5^{k/3} / k!$ is the coefficient after rescaling and Borel transforming. $c_k$ are the coefficients of the new series (3.23) obtained after the change of variable (3.22). $d_k$ is the coefficient of $\beta^{k+1}$ in the high temperature expansion of $2 q_0$.

The high temperature expansion of the $Q_p$'s described in section 3 gives a high temperature expansion of $q_0$ for $r = 0$. The coefficients $d_k$ are given in the fourth column of the table. Assuming that the analytic structure of $q_0(\beta)$ is essentially the same as that of $E(\beta)$, we have performed on the series $q_0(\beta)$ exactly the same manipulations as those we did on $E(\beta)$ in section 3. Again the curve $q_0(T)$ can be obtained precisely for $T = 1/\beta > 0.6$ and is plotted in figure 2. We have extrapolated $q_0(T)$ linearly at low temperatures, using a property which is suggested by the following argument.

Fig. 2. - The overlap $q_0$ between relevant tours at temperature $T$ (see (4.5)). The curve for $T > 0.6$ is obtained from the high temperature expansion, and then extrapolated linearly.

Keeping $r = 0$, the distribution of lengths of the links in the relevant tours, $P(L)$ (2.22), reduces in our replica symmetric case to

so that