• Aucun résultat trouvé

Linear operators on correlated landscapes

N/A
N/A
Protected

Academic year: 2021

Partager "Linear operators on correlated landscapes"

Copied!
17
0
0

Texte intégral

(1)

HAL Id: jpa-00246941

https://hal.archives-ouvertes.fr/jpa-00246941

Submitted on 1 Jan 1994

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Linear operators on correlated landscapes

Peter Stadler

To cite this version:

Peter Stadler. Linear operators on correlated landscapes. Journal de Physique I, EDP Sciences, 1994,

4 (5), pp.681-696. �10.1051/jp1:1994170�. �jpa-00246941�

(2)

Classification Physics Abstracts

0250 0550

Linear operators

on

correlated landscapes

Peter F.

Stadler(*)

Santa Fe Institute, 1660 Old Pecos Trail, Santa

Fe,

NM 87501, U-S-A-

(Received

26 October 1993, received in final form 12

January

1994,

accepted

28

January 1994)

Abstract. In this contribution we consider the elfect of a class of

"averaging operators"

on

isotropic

fitness

landscapes. Expbcit expressions

for the correlation function of the

averaged landscapes

are derived. A new class of

tunably rugged landscapes,

obtained

by

iterated snlooth-

ing

of the random energy

model,

is established. The correlation structure of certain

landscapes,

among them the

Shernngton-Kirkpatrick

model, remains

unchanged

under the action of ail

averaging operators considered here.

1 Introduction.

Evolutionary adaptation

as well as combinatorial

optimization

take

place

on

"landscapes"

that result from

mapping (micro)configurations

to scalar

quantities

like fitness values or

energies,

or, more

generally,

to nonscalar

objects

hke structures. The

assumption

of a

particular

mech- anism for

mutation,

or the choice of a

specific

move-set for

interconverting configurations,

introduces trie notions of

neighborhood

and distance between

configurations. Viewing

trie set of ail

configurations, 1e.,

trie

configuration

space, as a non-directed

graph

r bas tumed out to be

particularly

useful. Each

configuration

is a vertex of

r,

and

neighbonng configurations

are

connected

by edges.

When

configurations

are sequences of

equal length

and trie

neighborhood

relations are defined

by

the number of

differing positions,

one obtains trie sequence space with the

Hamming

metric.

Among

the most

important properties

of a

landscape

is its

"ruggedness",

which influences the

speed

of

evolutionary adaptation

as well as trie

quality

of solutions of heunstic

optimization strategies.

While

ruggedness

bas never been

precisely defined,

a number of

empincal

measures bave been

proposed,

such as trie number of local

optima

or trie average

length

of up- or downhill walks

il, 2],

and the

pair-correlation

as a function of distance m

configuration

space

[3, 4].

The very notion of

"uphill"

or

"optimum" implies

a mapping to a scalar

quantity.

Trie definition of a

pair correlation, however,

con be extended to mappmgs to

arbitrary objects,

as

(*)

Address for

Correspondence:

Institute for Theoretical

Chemistry, University

of

Vienna,

Wàhringerstra17,

A-1090

Vienna,

Austria

(3)

long

as there is a metric distance measure defined for them. Extensive studies ou

folding

of RNA sequences into

secondary

structures have been based on this fact

[5,

GI.

Trie

landscapes

of a number of well known combinatorial

optimization problems

bave been

studied,

such as trie

Traveling

Salesman Problem

[7],

the

Graph Bipartitioning

Problem

[8],

and the

Graph Matching

Problem

[9].

Detailed information on the distribution of local

optima

and the statistical characteristics of downhill walks have been obtained for the uncorrelated

landscape

of the random energy model

[10-12]. Furthermore,

two

one-parameter

families of

tuuably rugged landscapes

have been studied in detail: the NÉ model aud its variauts

[2, 5, 13, 14]

aud the

p-spiu

mortels

[15, 16].

Here we will be concerned with the eifect of

averaging procedures

related to runuing averages

ou

laudscapes.

After some formol definitions

(Sect. 2)

we discuss

briefly

the Fourier

decompo-

sitiou of

laudscapes (Sect. 3)

which is used in section 4 to derive the basic

properties

of liuear operators

applied

to

landscapes.

In section 5 we

briefly

address the relation of averages over ensembles of

laudscapes

aud averages over

configurations

in

single

instances of

landscapes.

In section 6

particular

operators,

namely weighted

averages over

one-step neighborhoods,

are used

to coustruct a uovel tuuable

family

of

laudscapes.

In section 7 the behavior of

p-spin

aud

landscapes

under the iterated action of these

operators

is

iuvestigated.

2. Some definitions.

Let r be a

graph

with N < cc vertices. A

graph autoniorphism

a is a one-to-one mappmg of r onto itself such that the vertices

a(~)

aud

a(y)

are connected

by

an

edge

if and

only

if

~ and y are conuected

by

an

edge.

Two vertices of r are said to be

equivaleut

if there is an

automorphism

o such that y

=

a(~).

A

graph

is uerte~ transitive if ail vertices are

equivalent.

A vertex transitive

graph

is

always regular,

1-e-, each vertex has the same number of

neighbors.

Trie distance

d(~,y)

of two vertices ~,y E r is defined as the minimum number of

edges

that

separates

them. This distance is a metric. A

graph

is distance transitive if for any two pairs of vertices

(z, y)

and (~1,

u)

with

d(~, y)

= d(~1,

u)

there is an

automorphism

a such that

a(~)

= u

and

o(y)

= u

(see,

e-g.,

[17]).

A

graph

r is defined

by

its

adjacency

matriz

A,

with

Axy

=1 if the vertices ~ and y are connected

by

an

edge,

and

Axy

= 0 otherwise.

In the

following

we will

only

consider

graphs

that are at least vertex transitive.

Following

Bollobàs

[18]

we may then

partition

the vertex set

by

the

following procedure:

Choose an

arbitrary

vertex as reference

point,

which in the

following

will be labeled 0. Then construct

pairwise disjoint

sets of vertices

V(0)

=

(0), V(1), V(2),

..,

V(M)

such that

â~~

:=

£ Axy

is

independent

of y E

V(v). (l)

x~V(p)

In other

words,

this

partitioning

is chosen such that each vertex y m

V(v)

is

adjacent

to

â~~

vertices in

V(p).

For some

exaniples

see

figure

1 in section 3. It is mduced

by

the

symmetry

of r: the vertices in each of the sets

V(~1)

are

permutated by automorphisms

that leave 0

invariant. In the worst case each set contains a

single

vertex. In

general

we will use

greek

letters to index trie sets of such a

partition,

while roman indices are reserved for trie vertices off.

For

highly symmetric graphs

trie

collapsed adjacency

matriz

=

(â~~)

cari be much smaller than trie

adjaceucy

niatrix A. lu case of distance transitive

graphs,

for instance, trie distance classes

(1.e.,

the sets of ail vertices with commou distance from the reference vertex

0)

form a

partition fulfilling equ.(l),

and the

collapsed adjacency

matrix comcides with the intersection

(4)

matrix as defined

by Biggs [19] (see

also [20]

).

We will use d instead of

greek

letters for

mdexing

the distance classes.

The

following property

of

collapsed adjacency

matrices will be used in the

subsequent

sec-

tions.

Lemma 1.

£ (A~)xy

=

(Â~)~~

for ail ~ E

V(p),

and ail

non-negative integers

m, and y~v(~)

hence for each

polynomial

p holds

£~~~~~~lp(A)]xy

=

[p(Â)]~~, i-e-, jJ(A)

=

p(Â).

Proof. We

proceed iuductively:

The assertion is true

by

definitiou for m = 1. For m

= 0 it

holds

trivially.

Now suppose it is true for m 1; theu

£ £ (A~~~)xzAzy

"

&<)~ [ (~[[

~~~ii~i~lll z(~mi)«~Àv<

-

(Â~~«~

~ ~~~~~, ~~v~~~ v

The

geueralizatiou

to

polyuomials

is

straight

forward.

~

Definition 2. A

landscape f

r - lit is a raudom field F ou the set of vertices of r defined

by

the distribution fuuction

P(yi,

y2>

UN)

" Prob

( f(~~) §

j,

§1 § N) (2)

where N is the number of vertices of r.

Let

El-j

denote the mathematical

expectation.

For

example,

the

expected

value of the

product

of the values

f(~k)

and

f(~i)

at the vertices ~k and ~i is

Eifl~k)fl~i)i

=

/vkvidPlvi>v2>. >vN). (3)

Definition 3. A

landscape

is

isotropic

if

Ii) El f(~)]

=

f

for ail

configurations

~ E

r;

(ii)

for any two

pairs

of

configurations (~, y)

and (~1,

u)

such that there is a

graph

automor-

phism

a with

o(~)

= ~land

a(y)

= v halas

£[f(~) f(y)]

=

El f(~1) f(v)].

The second condition nleans that any two

pairs

of

configurations

that are

equivalent

in

config-

uration space have the same

pair-correlation.

The covanauce matrix of F is defined

by

Cxy

=

Eif(~)f(v)i Elf(~)iElf(v)1 14)

By

vertex

transitivity

there is an

automorphism

a such that

a(~)

= 0.

Isotropy

hence

implies C~y

=

Co,~(y)

=

c(p),

where

V(~t)

is the

partition

to which

o(y) belongs.

The variance is given

by

Varjfj

=

Cxx

=

Coo

#

c(0) (5)

The a~ltocorrelation

f~lnction

is defiued as

p(p)

=

c(~1)/c(0).

(5)

3. Fourier series on

landscapes.

Let C be the covanauce matrix of a raudom field F ou some

graph

r. Deuote

by (#~)

a

complete

set of orthonormal

eigenvectors (which

exists

by symmetry

of

C). Although (#~)

can be chosen

real,

we will admit

complex

vectors as well. Then the

landscape f

on r cari be

represented

in mean square sense as

f(~)

=

£ a(v)4y(~) (6)

(The labeling

of the

eigenvectors #y

with vertices is

arbitrary.)

This series

representation

ta known as the Karh~lnen-Loèue

expansion.

Remark. The coefficients

a(y)

are uncorrelated. For limite sets the Karhunen-Loève

expansion

coiucides with the well known

principal compouent aualysis

mtroduced

by Hotelliug [21] (see,

e-g.,

[22]).

For au

arbitrary graph

r with

adjaceucy

matrix A the

graph Laplacian

is defined

by

/h

=

D~~A E,

where D is the

diagonal

matrix of vertex

degrees (see,

e-g.,

[20])

For vertex transitive

graphs

this becomes /h

=

(1/D)A E,

where D is now the constant vertex

degree

of r. Denote

by (Ù~)

a

complete

orthonormal set of

eigenvectors

of /h. A serres

representation

of the form

f(~)

=

£ b(v)Ùy(~) (7)

y~r

is called Fourier expansion of the

landscape

[2].

Definition 4. Let

(G,o)

be a limite group, aud let 4l be a set of

geuerators

of G such that the group

identity

is flot contained in

4l,

and for each ~ E 4l the inverse group element

~~~

is also coutained in 4l.

(4l

is a set of

generators

meaus that each group element z E G can be

represented

as a truite

product

of elemeuts of

4l,

with

multiplication

defiued

by

the group

operatiou o,)

Let

r(G, 4l)

be the

graph

with vertex set G aud au

edge couuectiug

two vertices

~ aud y if and

ouly

if

~y~~

E

4l,

1-e-, two vertices are couuected if and

ouly

there is a't E 4l such that y

= ~/~.

r(G, 4l)

is called a

Cayley graph (see,

e-g., [23]

).

Remark. A

Cayley graph

is vertex transitive.

A commutative group

(G, o)

can be writteu as trie direct

product

of a finite uumber L of

cyclic

groups

Cj

of order

Nj.

A group element cau be

represeuted by

~ =

(~i,

~2,

~L)

with 0 < ~j <

Nj;

theu the group

operatiou

becomes

z=~oy -

zj=~j+yjmodNj j=1,. ,L. (8)

It is easy to check that the functions ep, p E

G,

defiued

by ep(z) IL

= exP

2«1£ (~~ (9)

~=i j

fulfill

ep(~)eq(~)

=

epoq(x).

Furthermore

they

are

orthogonal

with

respect

to the scalar

product

£~~~ eq(~)eq(~)

=

ôpq.

A

simple

cousequence of these facts is trie well knowu

Lemma 5. Let

r(G,4l)

be a

Cayley graph

of the commutative group

G,

aud let

Hi

=

h(j

o

1-1) depeud ouly

ou the "diifereuce" of the group elemeuts1 aud

j.

Theu ep as defiued

(6)

in

equatiou (9)

is au

eigeuvector

of H. In

particular,

the set

(ep)p

E

G)

forms au orthouormal basis of

eigeuvectors

of the

adjaceucy

matrix of

r(G, 4l).

Remark. Lovàsz

[24]

used this result to show that that the

eigenvalues

of the

adjacency

matrix of a

Cayley graph r(G, 4l)

with commutative group G are

giveu by

Àp

=

£~~~ ep(k).

Weiuberger [14]

showed

usiug

lemma 4 that Karhuueu-Loève expansion aud Fourier expan- sion coiucide for

isotropic landscapes

ou

Cayley graphs arisiug

from comm~ltatiue groups.

(He

used a

slightly

more restrictive defiuitiou of

isotropy, requinug

the same covanauce for ail

pairs

of

configurations

with the same

distance.)

We will show that au

aualogous

result holds for distance transitive

configuration

space.

01o 0110

Â=

3 01

Â-

2101

0 2 2

~0121

10 01

Fig.

1. Petersen's

graph

is distance transitive, but it is not a

Cayley graph (see,

e-g.,

[23]).

It has three classes of vertices: the refereuce vertex

(top),

its

ueighbors,

aud ail other vertices. Its collapsed adjacency matrix is given below the

graph.

The

trigonal

prism

graph

T is the

Cayley graph

of C2 x C3 with set of generators 4l

=

((~,e), (e,~),(e,~~~)),

with C2

#

le, ~)

and

C3

=

(e,~,~~~)

and e

denoting

the group

identity.

An

edge connecting

the two

triangles

is

clearly

not

equivalent

to an

edge

within a

triangle,

hence T is not distance transitive. There

are 4 symmetry classes of vertices: the reference vertex

(top),

its two

neighbors

m the upper

triangle,

its

neighbor

in the lower

triangle,

and the remammg two vertices, which have distance 2 from the reference.

Remark. There are

graphs

that are distance transitive but flot

Cayley graphs,

aud vice versa, there are

Cayley graphs

of commutative groups that are not distance transitive. The two most

simple examples,

the Petersen

graph

aud the

trigoual

pnsm are shown m

figure

1.

Lemma 6. Let

a(~

= if

d(1, j)

=

t,

and 0

otherwise,

for

ail1, j

E r. Denote the

corresponding

matrix

by A(~)

Let r be distance transitive. Then

A(~)

is a linear combinatiou of the first t powers of A.

Proof. The assertion is

trivially

true for t

= 0 aud t

=

1,

as

A(°)

= E aud

A(~)

= A.

Now calculate A

A(~~~)

The

triangle mequahty

assures that

only

pairs of indices

(1, j)

with

(7)

distances

t,

t

-1,

aud t 2 can have non-zero entries. Distance

transitivity

of the

graph guarantees

that ail entries

belonging

to the same distance Mass are

equal,

and heuce

A

A(£-1)

= c ~

A(£)

+ c~

A(£-1)

~

~~

~(t-2) (~~)

where c+ > 0 as

long

as t ares flot exceed the diameter of r.

~

Theorem 7. Let

f

be au

isotropic laudscape

ou a

Cayley graph r(G, 4l)

with a commutative group

G,

or on a distance transitive

graph

r. Then trie

adjacency

matrix A of r and the

covariance matrix C of the

landscape f

commute, AC

= CA.

Proof.

(a)

For

Cayley graphs

of commutative groups the

proposition

is a direct cousequeuce of leninia

5,

see

[14].

(b)

If

f

is

isotropic

aud r is distance

transitive,

theu

Cxy depeuds ouly

ou the distance of the vertices

d(~, y).

Heuce C cau be written as a hnear conibiuatiou of the matrices

A(~),

and

by

lemnia

6,

it cari be writteu as a

polyuoniial

in terms of

A,

C

=

p(A).

In both cases C aud A commute because

they

bave trie same

eigeuvectors. ~

4. Linear

operators

on

landscapes.

Consider

landscape

g obtained

by

some

weighted averaging procedure

from a

landscape f,1

e.,

g(~)

=

£~~r flxy f(y).

In the

following

we will use the obvious matrix notations g =

Qf.

Lemma 8. Let C be the covariance matrix of the

landscape f,

and let fl be a hnear

symmetric operator.

Then the covariance matrix of g = fl

f

is given

by

C~

= flcfl

(11)

Proof. We assume without

loosing generality

that

El fp]

= 0 for ail p E r.

c)~

=

Ej(n1)~(n1)~j

= E

£ n~~ i~ £ n~~ il

p q

=

L "xpElfpfqlflqy

=

(flcfl)xy

p,q

Corollary

9. Let

f

be

isotropic

and assume that the

configuration

space r is distance transi- tive or the

Cayley graph

of a commutative group.

Suppose

fl is a

polynoniial

of the

adjacency

matrix. Then we have

C~

=

(fl~)C.

Proof. From theorem 7 we know that C and the

adjacency

niatrix A of r comniute. Thus

C comniutes with ail powers of A and

consequeutly

with ail

polynomials

of A. The

corollary

follows now

imniediately

from lemma 8.

~

(8)

Theorem la. Let

f

be

isotropic

with autocorrelatiou functiou p aud let r be distance tran- sitive

(or

a

Cayley graph

of a comniutative

group).

Furthermore let fl be some

polynomial

in

A,

Q

=

P(A).

Then g

= Q

f

is agam

isotropic

on r and

p~

= c

ù~p (12)

where the constant c =

1/(ù~p)(0)

is chosen such that

p~(0)

=1.

Proof. From

corollary

9 we kuow

Cg

=

(fl~C)ox.

With

Q

"

fl~

we may wnte this as

~~ ~Î0 ~ ~ QxyCy0 ~ ÔÀpCp0

P y~V(p) P

for ail ~ E

V(À),

as a cousequence of lemma 1.

Dividing by

the variauce

completes

the

proof.~

There exists a set of

nght eigenvectors

of that are

orthogonal

with

respect

to the

weight

function

w(~t)

=

)V(~t)) (for

details see

[25]).

We may therefore

expand

the autocorrelation function with

respect

to this

basis, p(~t)

=

£~ a~p~(~t).

Corollary

ii. Uuder the

assumptions

of theorem 10 we have

p~(~t)

= c

£ a~P~(À~)p~(~t) (13)

where À~ denotes the

eigenvalue

of

belonging

to p~. In

particular,

if p

= pj for some

j,

then

we have

p~

= p.

5.

Empirical landscapes

and ensembles of

Iandscapes.

The

empirical

vanance

a~

of an

isotropic landscape f,

1-e-, of a

particular

instance of the random field F is measured

by picking

a

sample

of values at

configurations randomly

distributed on r.

Denote

by p(~t)

=

)V(~t)) IN

the

probability

that two

randomly

chosen vertices ~ and y

belong

to

partition

~t, 1-e-, that there is an

automorphism

a with

a(~)

= 0 and

a(y)

E

V(~t).

In case

of a distance transitive

configuration

space we can

mterpret p(d)

as the

probability

to

pick

at random two

configurations

with distance d. The

empirical

vanance cari be written as [5]

a~

=

jr (f(~) f(Y))~ (14)

~,y

and its

expected

value cari be calculated:

Eia21 =j ~l (2Eii(~)21 2Eii(~)i(y)1)

=varifi

LP(JL)jv)~~j L Eif(°)f(Y)i

=

varifi Ii P(~1)P(~1)j

~~~~

~ y~v~~> ~

=varlfl(i à)

(9)

Hence the

expected enipirical

variance of a

landscape f

is related to the ensemble

variance,

i e., the vanance of the randoni field F at a

particular vertex, by

E[a~]

=

Var[f](1 fl) (16)

The

parameter

p nieasures the average correlation of the

landscape.

As shown m

[25]

fl is the contribution to p

corresponding

to the

largest eigenvalue

of

A,

Ào " D.

The ensenible-autocorrelation function p and the

expectation

of the

enipincal

correlation function fi are hence related

by

P(11) =

~l~ j~ (17)

Equation (17) iniplies

that the

oninipresent 'back-ground'

correlation p has to be subtracted.

We reniark that for

practical

purposes it is the

empincally

observable correlation function fi(~1) that niatters. The autocorrelation function is said to be

self-averaging

if fi

= p

(see,

e-g.,

[26]).

Corollary

12. Under the

assumptions

of theorem 10 it follows from p

= fi that

p~

=

fi~,

1-e-,

if the autocorrelation function of

f

is

self-averaging,

then the autocorrelation function of the

landscape

Q

f

is also

self-averaging.

6. Iterated

smoothing

maps A tunable

family.

By N(z)

we will denote the set of

neighbors

of ~

plus

~

itself,

i-e-, ail vertices y E r with

d(~,y)

< 1. We derme the

smoothing

operator it

by

it =

~

(A

+

E) (18)

The snioothed

landscape

is then defined

by

g =

itf.

Let r be distance transitive. Then we obtain as an immediate consequence of theoreni 10

f a~(à)p(à+k) p~(à)

=

k=-j (ig)

La~(o)p(k)

~mo

where ah

(d)

=

(Î~)d,d+k.

The coefficients have a

siniple

combiuatorial

iuterpretatiou:

ah

(d)

is the uumber of pairs (~1,

v)

with u E

N(~),

v E

N(y)

with d(~1,

v)

= d + k where

d(~,y)

= d is

fixed.

For

particular graphs

r it is

fairly

easy to calculate the coefficients ah

(d) explicitly.

Cousider

a

geueral hypercube

of diameter n over au

alphabet

of size ~. The

configuration

space has N

= ~" vertices.

By

distance

trausitivity

we may choose

~

=(000...0000000....0000)

y

=(111.. 1111000....0000)

~~~~

with

d(~, y)

= d. We

partition

the set of pairs

lu, v)

with ~1 E

N(~),

v E

N(y)

iuto urne subsets

depending

on whether u

= ~, u is

neighbor

of ~ with the diiference

occurring

m one of the first d

positions,

or u is a

neighbor

with the diiference occurnng in the latter n d

positions,

and

(10)

the same for v. If u

= ~ aud v

= y then

d(u,v)

= d. If u diifers form ~ in oue of the latter

n d

positions

and v

= y then d(~1,

v)

= d

+1;

there are

(n d)(~c -1)

such

pairs.

If ~t differs from ~ in the first

section, however,

there are d(~c

2)

pairs with distance d aud d

pairs

with distance d

1,

the latter

ansiug

from

mutatiug

a 0 to a 1. The combinatious of mutations m

both z and y cau be treated

similarly. Collecting

ail terms

fiually yields

a~~(à)

=

à(à i)

a-i(d)

= 2d + (~c

2)(2d 1)d

(d)

= 1 + (lG

1)(n

+

2d(n d))

+ (lG

2)(lGd

+

d(d

1)(lG

2)) (21)

a+i

(d)

=

2(~ l)(n d)

+

(~ 2)(~ l)(n d)(2d

+

1) a+2(d)

= (lG

1)~(n d)(n

d

1)

Let

fo

be the random euergy mortel

[27, 28].

The autocorrelation function is

p(d)

=

ôo,à

and

fi(0)

=

1, fi(d)

=

-1/(N -1)

for d >

0, respectively,

where N denotes the number of

points

m the

configuration

space. Define

fr

= it

fr-i

Let fi(~) deuote the autocorrelation fuuctiou of

fr.

We will refer to this

family

of

laudscapes

as iterated

smoothing landscapes,

ISLS.

Numerical evaluatiou of

equation (19)

with the initial condition fi ou Boolean

Hypercubes

and their ~c-letter

generalizatious

shows that

(fr)

iudeed forms a

family

of

tuuably rugged landscapes (Fig. 2).

In

particular,

we have

Theorem 13. Let r be a

generalized hypercube

over an

alphabet

with ~c > 2 letters. Then

lim

fi(~'(d)

= 1-

Îd (22)

Ou a Boolean

Hypercube

(~c

=

2)

we have

lim

fi(~)(d)

=

~(l ~~)

+

(-1)~ (23)

Proof. The

eigenvalues

of the

adjaceucy

matrix of the

geueral hypercubes

are

[29,30]

ÀJ "

n(~t 1)

~cj

(24)

aud heuce the

eigeuvalues

of the

smoothiug operator

it are 1

j

~ for

j

=

0,

,

n(~c -1)

+ n.

The

largest eigeuvalue

of

it~, apart

from

1,

which is obtaiued from

j

=

0,

is fouud be

settiug j

= 1 for ~c > 2. In the case ~c

= 2 the second

largest eigenvalue

of

it~

is

degenerate,

the

corresponding eigenspace

is

spanned by

pi and p~. As an immediate cousequence of

equation (28)

in

[25],

the autocorrelation function of the random energy mortel con be written as

>(°) (à)

= c

f(~ i)£ (])

p~(à) (25)

tmi

By corollary ii,

the iterates of fi will converge to a vector tu the eigenspace of the

largest eigenvalue

of

Î~

that is not

orthogonal

to the initial condition. The

eigeuspace correspoudiug

to Ào has beeu omitted

by

construction of

fi. By equatiou (25) p(°),

is flot

orthogonal

to auy of the

eigenvectors

pi

through

p~. For ~ > 2

hence,

p converges to pi while for ~c

= 2 we obtaiu

a mixture of pi aud p~. The ratio of their contributions does flot

change

uuder trie action of

flÎ as a consequence of

coroiiary

II.

.

(11)

o

0.8

îl.

~ ~,

>i ~

O.G ~ ',

i1,

"~

'i

~

'~

0,4 ~ ',

', i '~

i

1,

"~,

°.~

",, ',,~ ",_~

~

oo

Î""-=.-m-ÎÎÎ-=~Î~

~ --_

0.2 '~~~"--

0.4

0.6

0.8

0

0 10 20 30 40 50 60 70 80 90 100

Hamming

Distance d

a)

0

;,

°'8 ' ',

' ',

' ',

0.6 1 ~,

[

i '~

', ' '~~

°'4

', "~

' , ',

' , ',

~ '

0'2

' '~ "~

" ~~ '~,

~~ ~,

~o ',_ ~-__ ~

6 O.O '~~'"'~~~~~~

~~ '- "~-,

-0.2

-0.4

0.6

o.8

1.o

0 10 20 30 40 50 60 70 80 90 100

Hamming

Distance d

b)

Fig.

2. Autocorrelation function of ISLS on

generahzed hypercubes

with

a)

<

= 2, and

b)

<

= 4

for with n

= 100. The number of

smoothings

are

a)

r = 25

(dotted),

r

= 50

(short dashed),

r = 75

(long dashed),

r = 100

(dot,dashed),

r

= 200

(sohd),

and

b)

r = 50

(dotted),

r = 100

(short dashed),

r = 150

(long dashed),

r

= 200

(dot,dashed)~

and r

= 400

(sohd), respectively.

(12)

We remark that trie contribution of p~ decreases with the size of the

configuration

space.

The

hmiting

autocorrelation function

(22) corresponds

to both the trivial

spin glass

mortel with Hamiltonian

7i(a)

=

£~ Ajaj,

and to the Nk,mortel with k

= 0. We also note that for

r

In

< 1

the

landscapes

are

essentially uncorrelated,

1-e-, the correlation

length

is

o(n).

Numerical estimates show that the nearest

neighbor

correlation fi(~)

(1) depends only

on the ration r

In

for

large

n

(See Fig. 3).

As a consequence of theorem

13,

we find that

limr-m pli

= 1

~ ~

n ~c 1

for ~c > 2. For ~c

= 2 we find

lim

p(~)(d)

= lim

p(~)(d 1)

= 1-

~~

for even d

(26)

r-ce r-ce +1

The data in

figure

3 indicate that this limit is

already

obtained for

r/n

> 2 to a

good approximation.

7.

Averaging operators

on

spin glass

and Nk

landscapes.

We first consider the p-spm

mortels,

1-e-, the

long

range

spin glasses

with Hamiltonian

7ip(a)

=

£

Aj~j~__,j~ aj~aj~ aj~

(27)

31<32<...<3P

with a~ E

(-1, +1)

and

coupling

constants A~j___k drawn from some common distribution. The

ruggedness

of the

landscapes

increases as the order p of the

spin-interactions

increases. We have then

Theorem 14. Let fl be any

polynomial

of the

adjacency

matrix of the Boolean

Hypercube.

Then the

landscapes 7ip

and

Q7ip

have the same autocorrelation functions.

Proof. As shown in

[16]

the autocorrelation functions of the

p-spin

mortels are

PV

id)

=

(il

~

(ii)~ Ill Ii Il (28)

These are

smtably

normalized Krawtchouk

polynomials,

which are known to be the

eigenvectors

of the

collapsed adjacency

matrix of the Boolean

Hypercube (see,

e-g.,

[25, 29]). Application

of

Corollary

ii

completes

the

proof. ~

In Kauifman's Nk-mortel the fitness of a

binary string

~ is defined as the average over

"Site fitnesses"

f~,

which

depend

on ~~ and k additional

positions ii through ik.

The site fitnesses f~ =

f~(~~,

~~~,

..,

~~~)

are assumed to be uncorrelated random variables drawn from

a common distribution

iii.

The

ruggedness

of the

landscape

mcreases as the number k of

positions contributing

to a

surgie

site fitness mcreases. Diiferent variants of these

landscapes

have been

investigated, depending

on which

positions

contribute to a given site fitness

(see,

e-g.,

[31]).

In contrast to the p-spm

mortels,

the

landscapes

of NÉ

type

are aifected

by

the

smoothing

operator

it.

Theorem 15. Let

f

be an

arbitrary

landscape.

Then the autocorrelation function of the

landscapes it~f

converges to

p(d)

=

((1- 2d/n)

+

(1- ()(-1)~

with o <

(

< 1 as r tends towards

infinity.

(13)

o

0.8

0.6

0.4

n=25 n=50 n=100 n=200 0.2

ù-ù

ù-ù 0.5 1-Ù 5 2.0

r/n

a)

i o

o.8

0.6

0.4

n=25 n=50 n=100 n=200

o-ù

o-o 0.5 1-o 1.5 2.0

r/n

b)

Fig.

3. Nearest

neighbor

correlation for ISLS on

generalized hypercubes

as a function of the number of iterated

smoothings. a)

~

= 2,

b)

«

= 4.

(14)

Proof. It is necessary and sufficient to show that the autocorrelation function p of

f

and pi

are not

orthogonal,

1-e-, that

£ p(d)pi (d) (")

=

( #

0

(29)

"

Î~

d

Note that pi

(d)

= -pi

(n d). Equation (29)

cari hence be rewritten as

f

=

£ (()

pi

(d) lp(d) p(n

d)1

(30)

d<n/2

If

p(d)

>

p(n d)

for ail d <

n/2,

and if the

inequality

holds

strictly

for at least one d <

n/2,

we have

(

> 0. It follows

immediately

from the very definition of the NÉ mortel

(irrespective

of how the

neighboring

sites are

chosen)

that p is

monotonically decreasing

and non-constant-

'

Remark. The vanants of the NÉ mortel do trot behave

identically

under the action of it.

Whenever the autocorrelation function is a

polynomial

of

degree

less than n, then we have

(

= 1 in theorem 15. For exact

expressions

of the autocorrelation functions of varions NÉ

niodels see

[2, 5]. Recently

Nk-mortels on sequences of

arbitrary alphabets

have been considered

[2].

Theorem 15 holds with

(

= for ail

alphabets

with three or more letters.

As a final

exaniple

consider an

isotropic landscape

on the circle of

length

n > 4

(Millonas,

pers.

comniunication).

The

configuration

space hence is the

Cayley graph

of the

cyclic

group

C~

=

l't~)0

< k <

n)

with set of

generators

4l =

16't,'t~~).

Its vertex

degree

is D

= 2. The

eigenvectors

of the

adjacency

niatrix are eh

(~)

=

exp(2grk~ In),

as an immediate consequence of lemma 5. The

eigenvectors

of the

collapsed adjacency

matrix are

pk(d)

=

cos(2grkd/n)

for 0 < k <

n/2,

associated with the

eigenvalues

Àk #

2cos(2grk/n).

The

eigenvalue

of

Î~

corresponding

to

À~/2

is niuch smaller that the one

corresponding

to

Ài

for n >

4,

hence p converges to pi under the iterated action of it whenever p is not

orthogonal

to pi

Suppose f

has a monotone

non-increasing

autocorrelation function p. As a

particular

ex-

ample

consider the random energy niodel on the circle.

By

the same reasoning as in the

proof

of theorem

15,

p is not

orthogonal

to pi, and hence iterated

smoothing eventually

leads to a

landscape

with autocorrelation function pi Note that the smoothed

landscapes

on the Boolean

Hypercube (and

on related

graphs) eventually

have autocorrelation functions of the form

p(d)

= 1-

)d

+

O(d~)

while for the circle

graphs

we find

p(d)

= 1

ad~

+

O(d3)

The

landscape

on the circle hence becomes

exceptionally

smooth [16] as a consequence of the

special geometry

of the

configuration

space.

This

example

also shows that the

degeneracy

of it on the Boolean

Hypercube

is trot a

necessary consequence of the fact that the

configuration

space is

bipartite,

1-e-, that there is

an

eigenvalue

-D of the

adjacency matrix,

since trie circle

graphs

are

bipartite

for even n, and it is not

degenerate

there.

8 Discussion.

A

general theory

of the action of linear

(averagmg)

operators on

isotropic landscapes

is devel-

oped

which allows to

predict

their eifect on the correlation structure of the

landscape.

We have shown that there are

particular Iandscapes (characterized by

autocorrelation functions that are

eigenvectors

of the

collapsed adjacency

matrix of the

underlying configuration space)

which are

(15)

not aifected

by "smoothing" operators.

The

p-spin models,

and in

particular

the

Sherrington- Kirkpatrick [32],

model are members of this dass on the Boolean

hypercube.

Another

example

is the

landscape

of the

graph-bipartitioning problem.

A new

family

of

tunably rugged landscapes

is

established,

which arises

by

iterated

smoothing

of the random

landscape.

These "iterated

smoothing landscapes" (ISLS)

have an

advantage

over the other two known

tunably rugged

families: The

p-spin

mortels are defined

only

for Boolean

hypercubes,

and I<auifman's Nk

family

lives on

general

sequence spaces, while ISLS can be constructed on

arbitrary configuration

spaces. Their

simple

structure

suggests

furthermore

that

analytical

solutions of these mortels

might

be feasible. Iterated

smoothing

of the random energy mortel

eventually

leads to a

landscape

with

linearly decreasing

autocorrelation function

on

general

sequence spaces with

alphabets

of size ~c >

2,

while on Boolean

hypercubes,

~c =

2,

one obtains a

superposition

of a linear function and an oszillation of the form

(-1)d.

On a circle

graph

trie same

procedure

leads to an autocorrelation function of trie form 1-

ad~

+.

This shows

again

that trie structure of a

landscape

cannot be

separated

from the

geometry

of the

underlying configuration

space.

Evolutionary adaptation

cari be descnbed in the

simplest

case as the motion of a

population

in

configuration

space. Almost ail

analytical work, however,

has been doue with

adaptive

walks

(or

related

processes)

which

neglect

the

population aspect.

For a recent review see, e-g-,

[31].

TO a first

approximation

a

population

cari be

replaced by

its "center of mass"

(consensus configuration),

which moves in the

landscape "experienced" by

the entire

population: again

to a first

approximation,

this

landscape

is a

weighted

average of the fitnesses of

configurations

centered around the consensus.

Consequently,

the width of the

population

should

strongly

influence the

dynamics

of evolution: broader

populations

"feel" in

general

smoother

landscapes,

and hence

optimization

should be easier.

Usually

the broadness of the

population

is controlled

via the mutation rate

[3,33]

which at the same time controls the

exploration

rate. Further

investigations

are necessary in order to separate the eifects of enhanced

exploration

and

implicit smoothing

of the

underlying landscape

caused

by

an increased mutation rate.

It should be

kept

in mina that the results

presented

here

depend

on the

assumption

of

isotropic landscapes.

While this requirement is often fulfilled for

simple

statistical

mortels,

it is

unlikely

to holà for real

biophysical

or

physical systems.

It bas been

shown,

for

instance,

that trie

landscapes

of RNA free

energies

are not

isotropic

for natural sequences

[34].

Much work on random

landscapes

such as Derrida's REM bas been motivated with the

notion that a

sufficiently

"coarse

grained"

version of any

rugged landscape

would look random.

More

precisely,

this is to say that it should be

possible

to construct renormalization groups

having

random

landscapes

as attractive fixed

points.

While the

averaging procedures

discussed

m trie this contribution are trot

exactly

trie kind of coarse

graining

one would have in mmd for such a

construction,

the methods

developed

here seem to be useful for such a task.

The

theory

outlined in this contribution is trot

applicable

for the

traveling

salesman

problem

since the raturai

configuration

spaces of the TSP are

Cayley graphs

of the

symmetric

group with the set of ail

transposition,

or the set of ail inversions

(20pt-moves),

as

generators

[7].

It is easy to check that these

graphs

are trot distance

transitive,

and of course the

symmetric

group is non-commutative-

Acknowledgements.

thank the Max Planck

Society, Germany,

for

supporting

my stay at the Santa Fe Institute.

Stimulating

discussions with P-W-

Anderson,

W.

Fontana,

S-A-

Kauifman,

M.

Millonas,

R.

Palmer,

and M. Virasoro are

gratefully acknowledged.

(16)

References

iii

Kaulfman S-A- and Levin. S., Towards a General

Theory

of

Adaptive

Walks on

Rugged

Land-

scapes, J. Theor. Biot, 128

(1987)

11-45.

[2]

Weinberger

E-D-, Local

Properties

of the Nk,Model, a

Tunably Rugged Landscape, Phys.

Reu.

A 44

(1991a)

6399-6813.

[3]

Eigen M.,

Mccaskill J-S- and Schuster

P.,

The Molecular

Quasispecies,

Adu.

Phys.

Chem. 75

(1989)

149-263.

[4]

Weinberger

E-D. Correlated and Uncorrelated Fitness

Landscapes

and How to Tell the

Dilference,

Biot.

Cybern.

63

(1990)

325,336.

[Si Fontana W, Stadler P-F-,

Bornberg,Bauer E-G-,

Griesmacher T., Hofacker I.L., Tacker

M.,

Tara-

zona

P., Weinberger

E-D- and Schuster

P.,

RNA

Folding

and

Combinatory Maps, Phys.

Reu.

E 47

(1993)

2083-2099.

[6] Fontana W.,

Konings

D. A.M-, Stadler

P-F-,

Schuster P., Statistics of RNA

Secondary Structures, Biopolymers

33

(1993)

1389,1404.

[7] Stadler P-F- and Schnabl W., The

Landscape

of the

Traveling

Salesman

Problem, Phys.

Lent. A 161

(1992)

337-344.

[8] Stadler P-F- and

Happel R.,

Correlation Structure of the

Graph Bipartitioning Problem,

J.

Phys.

A: Math. Gen. 25

(1992)

3103-3110.

[9] Stadler P-F-, Correlation in

Landscapes

of Combinatorial

Optimization Problems, Europhys.

Lent. 20

(1992)

479-482.

[loi

Macken C.A.,

Hagan

P-S- and Perelson

A-S-, Evolutionary

Walks on

Rugged Landscapes,

SIAM

J. Appt. Math. 51

(1991)

799-827.

[iii Flyvbjerg

H. and

Lautrup B.,

Evolution in a

rugged

fitness

landscape, Phys.

Reu.

Af15j

46

(1992)

6714-6723.

[12] Bak

P., Flyvbjerg

H, and

Lautrup B.,

Coevolution m a

rugged

fitness

landscape, Phys.

Reu.

Af15j

46

(1992)

6724-6730.

[13] Kaulfman S-A-,

Weinberger

E-D- and Perelson

S-A-,

Maturation of the Immune

Response

Via

Adaptive

Walks on

Aflinity Landscapes,

Theoretical

Immunology,

Part I, Alan Perelson Ed.

(Addison-Wesley,

Redwood City~

1983).

[14]

Weinberger

E-D-, Fourier and

Taylor

Series on

Landscapes,

Biol.

Cybern.

65

(1991b)

321-330.

[15] Amitrano C.~ Peliti L, and Saber M.,

Population Dynamics

in

Spin

Glass Model of Chemical

Evolution,

J. Mol. Euol. 29

(1989)

513-525.

[16]

Weinberger

E-D- and Stadler P-F-,

Why

Some Fitness

Landscapes

are Fractal, J. Theor. Biot.

163

(1993)

255-275.

[17]

Buckley

F. and

Harary

F., Distance in

Graphs (Addison-Wesley, Reading,

MA,

1990).

[18] Bollobàs

B., Graph Theory

An

Introductory

Course

(Springer-Verlag,

New York~

1979).

[19]

Biggs

N.,

Algebraic Graph Theory (Cambridge University

Press~

Cambridge~ 1974).

[20] Cvetkovié D.M.~ Doob M. and Sachs

H., Spectra

of

Graphs Theory

and

Applications (Academic Press~

New York~

1980).

[21]

Hotelling

H.,

Analysis

of a

complex

of statistical variables into

principal

components~ J. Educ.

Psych.

24

(1933)

417-441 and 498-520

[22] Rao C.R., Linear Statistical Interference and Its

Applications.,

2nd Ed.

(Wiley,

New York,

1973).

[23]

Biggs

N-L- and White A.T., Permutation

Groups

and Combinatorial Structures

(Cambridge

Univ.

Press, Cambridge, 1979).

[24] Lovàsz L.,

Spectra

of

graphs

with transitive groups, Pemodica Math.

Hung.

6

(1975)

191-195.

[25] Stadler P-F-, Random Walks and

Orthogonal

Functions Associated with

Highly Symmetric Graphs~

prepnnt 93-06-31

(1993),

Disc. Math.

(1994)

m press.

[26] Binder K. and Young A.P.

Spin

Glasses:

Experimental

Facts~ Theoretical

Concepts~

and Open

Questions.

Reu. Mod.

Phys.

58

(1986)

801-976.

[27] Derrida B. Random

Energy

Model: An exactly solvable model of disordered systems~

Phys.

Reu.

B 24

(1981)

2613-2626.

(17)

[28] Gardner E. and Derrida B.~ The

probability

distribution of the partition function of the random energy model, J. Phys. A 22

(1989)

1975-1982.

[29] Dunkl C.F.,

Orthogonal

Functions on Some Permutation

Groups.

D.K.

Ray-Chaudhun Ed., Proceeding

of

SymposJs

in Pure Mathematic 34

(Amencan

Mathematical

Society,

New York~

1979).

[30]

Rumschitzky D., Spectral Properties

of

Eigen's

evolution

matrices,

J. Math. Biot. 24

(1987)

667-680.

[31] Kaulfman

S-A-,

The

Origins

of Order

(Oxford University

Press, New

York, Oxford, 1993).

[32]

Sherrington

D, and S.

Kirkpatrick,

Solvable Model of a

Spin Glass, Phys.

Reu. Lent. 35

(1975)

402-416.

[33]

Eigen

M.,

Selforganization

of matter and the evolution of

biological

macromolecules, Naturwis-

senscha/ten

58

(1971)

465-523.

[34] Stadler P-F- and W. Grüner,

Anisotropy

m Fitness

Landscapes,

J. Theor. Biot. 165

(1993)

373-388.

Références

Documents relatifs

The amount and shape of the background at short decay time are estimated by comparing the decay-time distributions of real data and of simulated events.. This determination is

O papel da cidade da agricultura familiar do semiárido: o exemplo do submédio São Francisco Ludivine Eloy, Pedro Castelo Branco Silveira, Eldonice Barros, Geneviève Cortes,

Pour r´ epondre ` a ces objectifs, nous avons men´ e une ´ etude exp´ erimentale dans le p´ erim` etre irrigu´ e de Kalaˆ at Landelous dans la basse vall´ ee de la Mejerda, pour

Assessing NR governance from the stakeholders’ perspective and based on the ‘Good Governance Principles’. • Case study (at the

( 34 ) ىذلدا مهف دقل ةيرشبلا سانجلأا تُب عارلإلا خيرات وأ تاقبطلا تُب عارلإلا ونأ ىلع خيراتلا ليوأتك ،أطخ تلايوأتلا هذى الؼراتلا ،ةيجولويديلإا

[r]

More specifically, we aimed to (i) explore how the analysis of landscape metrics by principal component analysis emphasizes the main urban features, (ii) map the corresponding

In this work, the SRB consortiums isolated by inoculating saline Postgate's medium C with injected water obtained from the In Amenas oil field, situated in the South