
HAL Id: jpa-00246757

https://hal.archives-ouvertes.fr/jpa-00246757

Submitted on 1 Jan 1993


Attraction domains in neural networks

L. Viana, A. Coolen

To cite this version:

L. Viana, A. Coolen. Attraction domains in neural networks. Journal de Physique I, EDP Sciences, 1993, 3 (3), pp. 777-786. DOI: 10.1051/jp1:1993162. HAL: jpa-00246757.


J. Phys. I France 3 (1993) 777-786    MARCH 1993, PAGE 777

Classification
Physics Abstracts
87.30 75.10H 64.601

Attraction domains in neural networks

L. Viana and A. C. C. Coolen

Lab. de Ensenada, Instituto de Física, UNAM, A. Postal 2681, 22800 Ensenada, B.C., México
Department of Theoretical Physics, University of Oxford, 1 Keble Road, Oxford OX1 3NP, U.K.

(Received 28 April 1992, accepted in revised form 8 October 1992)

Abstract. We performed a systematic study of the sizes of the basins of attraction in a Hebbian-type neural network in which small numbers of patterns were stored with non-uniform embedding strengths w_μ. This was done by iterating numerically the flux equations for the «overlaps» between the stored patterns and the dynamical state of the system for zero noise level T. We found that the existence of attractors related to mixtures of three or more pure memories depends on the specific values of the embedding strengths involved. With the same method we also obtained the domain sizes for the standard Hopfield model for p ≤ 18.

1. Introduction.

In the last few years, neural networks (NN) have attracted a fair amount of attention due to their properties as content-addressable memories [1]. In these systems the p stored patterns are stable states which act as attractors of the dynamics of an N-spin system, thus allowing the recoverability of information from partial or noisy data. Amit et al. [2] demonstrated that in the thermodynamical limit it is indeed possible to study the space of configurations of NN in a systematic way, by using statistical mechanics tools. In their studies for the Hebb model, they predict the existence of a huge number of spurious stable states in addition to those corresponding to the stored patterns (to be called pure memories); these spurious states deteriorate the memory function, for they also act as attractors of the dynamics of the system.

Their approach has been used to study other Hopfield-like models [3, 4]; however, although it gives us essential information about the existence and stability of attractive states, it does not allow us to evaluate their importance, in the sense that it does not deal with the size of their basins of attraction. If we want to assess the importance of pure and spurious fixed-point attractors, then it becomes necessary to go one step further and evaluate the size of the attraction domains, as this quantity is directly related to the content-addressability of the stored information.

Several analytical and numerical approaches have been followed, at various levels of description, in order to evaluate the basins of attraction for a number of models [5-7]. Among such work, we can mention that of Forrest [6], who calculated numerically the mean fraction f(q_0) of states which are recalled with less than N/16 errors for various initial overlaps q_0 and p = αN, and the work of Horner et al. [7], who made a non-equilibrium treatment for a Hopfield-type NN with different levels of activity that allowed them to study dynamical properties. They evaluated the critical value q_c of the initial (pure) overlap needed to trigger retrieval of this particular pattern, and subsequently defined the quantity R = 1 − q_c to be the corresponding size of the basin of attraction (again for α = p/N finite).

In this work we will define the «size of basin of attraction» as the fraction f_μ of all microstates which evolve towards the μ-th stored pattern, and will calculate this quantity for finite p in the thermodynamical limit N → ∞. This calculation can be done at either a macroscopic or a microscopic level. At a microscopic level, it consists in considering a NN composed of N elements where p patterns have been stored, and then carrying out actual simulations of the (Monte Carlo) dynamics starting from random initial states [8]. On the other hand, the macroscopic level concerns the overlap vector q, whose components q_μ constitute a macroscopic measure of the resemblance between the present microscopic state {S_i} of the system and each of the stored patterns ξ^μ. The procedure is based on the iteration of the flux equations for these overlaps, starting from random Gaussian initial states. This level of treatment is especially convenient, as not all the microscopic details of the system, at the spin level, are relevant. However, it can only be implemented in the case α = 0 (p finite as N → ∞).

By using this last method, Coolen calculated the cumulative size of the attraction domains of the p stored patterns, defined as f_p ≡ 2 Σ_μ f_μ, for the Hebb model [9] (the factor 2 comes from the symmetry ξ^μ ↔ −ξ^μ). He performed this calculation analytically for p ≤ 3 and numerically for p ≥ 4. He found an interesting result: after an initial decrease, the cumulative domain size of the stored patterns begins to increase for p ≥ 6; that is, the effect of mixed states is reduced by increasing the number of patterns stored.

The same kind of work was also performed for a modified Hebb model in which each pattern is stored with a different weight [10]; the importance of this last model is that it is possible to increase or decrease the domain size of individual stored patterns by increasing or decreasing the weight associated to them, so various degrees of training can be accounted for [3]. For this model and p = 3, the sizes of the basins of attraction were found provided the restriction w_μ < Σ_{λ≠μ} w_λ, for all μ, applied; however, there was no answer for the cases violating this restriction. Therefore, in this paper we perform a detailed numerical study of the domain sizes for this modified Hebb model as a function of the embedding strengths {w_μ} for p = 3, 4; we put special emphasis on the problem of spurious memories, by evaluating their cumulative basin of attraction. We also analyse the domain sizes for the standard Hebb model as a function of p, for large values of p.

2. Analytical background.

We will consider the Hamiltonian

    H = − Σ_{(i<j)} J_{ij} S_i S_j ,    S_i = ±1 ,    (1)

describing a system composed of N neuron-like Ising elements S_i, whose (symmetrical) interactions J_{ij} between pairs (ij) reflect the storage of a finite number p of random unbiased patterns ξ_i^μ = ±1, with μ = 1, …, p, according to a modified Hebb rule [3]

    J_{ij} = (1/N) Σ_{μ=1}^{p} w_μ ξ_i^μ ξ_j^μ ,    (2)

where w_μ is the weight associated to the μ-th pattern. In the case where all embedding strengths are equal (w_μ = 1 for all μ), equation (2) reduces to the Hebb rule.
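To make the setting concrete, the following minimal NumPy sketch builds the coupling matrix of equation (2) for a small network and runs a zero-temperature synchronous update S_i ← sign(Σ_j J_ij S_j), monitoring the overlaps with the stored patterns. The network size, the weights and the initial noise level are illustrative choices, not values used in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

N, p = 500, 3                          # network size and number of stored patterns (illustrative)
w = np.array([1.0, 0.8, 0.6])          # embedding strengths w_mu, with w_1 = 1 by convention
xi = rng.choice([-1, 1], size=(p, N))  # random unbiased patterns xi^mu_i = +/-1

# Modified Hebb rule, Eq. (2): J_ij = (1/N) sum_mu w_mu xi^mu_i xi^mu_j
J = (xi.T * w) @ xi / N
np.fill_diagonal(J, 0.0)               # no self-interaction

def overlaps(S):
    """Overlaps q_mu = (1/N) sum_i xi^mu_i S_i with each stored pattern."""
    return xi @ S / N

def step(S):
    """Zero-temperature synchronous update S_i <- sign(sum_j J_ij S_j)."""
    return np.where(J @ S >= 0, 1, -1)

# Start from a noisy copy of pattern 1 and watch the overlap vector evolve
S = xi[0] * rng.choice([1, -1], size=N, p=[0.85, 0.15])
for n in range(4):
    print(n, np.round(overlaps(S), 3))
    S = step(S)
```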

The equilibrium properties of this system are characterized at a macroscopic level by the existence of p order parameters q_μ, or «overlaps», which measure the resemblance between a microscopic stable state of the system and the μ-th pattern. In the thermodynamical limit (N → ∞), the value of these overlaps is given [2, 3] by the solutions to the set of p coupled equations

    q_μ = ⟨ ξ^μ tanh( β Σ_ν w_ν q_ν ξ^ν ) ⟩ ,    μ = 1, …, p ,    (3)

where β is defined as the inverse of the noise level (β ∝ 1/T), and the brackets ⟨ ⟩ indicate an average over the random variables {ξ_i^μ}. In this limit strong averaging applies [11]; as a consequence, for zero noise level (T = 0), this equation can be written as

    q_μ = ⟨⟨ η_μ sign( Σ_ν w_ν q_ν η_ν ) ⟩⟩_η ,    (4)

where the double bracket ⟨⟨ ⟩⟩_η indicates an average over the 2^p corners η (η_μ = ±1) of the hypercube surrounding m = 0.

This static picture has a dynamical counterpart: for a system with a synchronous parallel dynamics, the time evolution of the overlaps is given by the mapping [9]

    q(n + 1) = F(q(n)) ,    (5)

where F(q) is given by

    F(q) = ⟨⟨ η sign( Σ_ν w_ν q_ν η_ν ) ⟩⟩_η ,    (6)

in such a way that fixed points of the dynamics (attractors), that is, points satisfying q(n + 1) = q(n), correspond to the stable points given by equation (4).
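As an illustration of how equations (5)-(6) can be iterated in practice, here is a minimal NumPy sketch of the zero-noise flux map, in which the double-bracket average is carried out explicitly over the 2^p corners of the hypercube. The weights and initial overlap vectors are only illustrative choices (note that numpy.sign returns 0 on the measure-zero boundary corners discussed below).

```python
import numpy as np
from itertools import product

def flux_map(q, w):
    """One synchronous step of the overlap dynamics, Eqs. (5)-(6):
    F_mu(q) = << eta_mu * sign(sum_nu w_nu q_nu eta_nu) >>,
    the average running over the 2^p corners eta in {-1,+1}^p."""
    p = len(q)
    corners = np.array(list(product([-1, 1], repeat=p)), dtype=float)
    fields = corners @ (np.asarray(w) * np.asarray(q))   # sum_nu w_nu q_nu eta_nu
    return (corners * np.sign(fields)[:, None]).mean(axis=0)

def iterate(q0, w, max_steps=50):
    """Iterate q(n+1) = F(q(n)) until a fixed point is reached."""
    q = np.asarray(q0, dtype=float)
    for n in range(max_steps):
        q_next = flux_map(q, w)
        if np.allclose(q_next, q):
            return q_next, n
        q = q_next
    return q, max_steps

# Hebb weights, p = 3: one initial state recalls a pure memory,
# another one ends on the symmetric mixture (1/2, 1/2, 1/2)
print(iterate([0.4, 0.1, 0.05], [1.0, 1.0, 1.0]))   # converges to (1, 0, 0)
print(iterate([0.3, 0.2, 0.15], [1.0, 1.0, 1.0]))   # converges to (0.5, 0.5, 0.5)
```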

On the other hand, the domain size f_μ, related to the μ-th pattern, can be written as

    f_μ ≡ lim_{N→∞} ∫_{Δ_μ} dq D(q) ,    (7a)

where Δ_μ is defined as the region containing all the initial states which eventually evolve towards that pattern, and D(q) is the density of states in the «overlaps» space, given by

    D(q) ≡ 2^{−N} Σ_{{S_i}} δ( q − q({S_i}) ) .    (7b)

In some particular cases and approximations, f_μ can be obtained analytically [9, 10]; in others, this quantity might be evaluated by numerical iteration of the flux equations (5)-(6).

The regions where F(q) takes a given value f_α are convex (bounded by the planes Σ_λ w_λ q_λ η_λ = 0) and, for p finite, this vector can only take a finite number of values f_α, each of them associated to a region D_α in the «overlaps» space. If we now define the set

    R_μ ≡ { q ∈ R^p : w_μ |q_μ| > Σ_{λ≠μ} w_λ |q_λ| } ,    (8)

then we know that F(q_0) = (0, …, q'_μ, …, 0), with q'_μ = ±1, for all q_0 ∈ R_μ. Therefore the set R_μ has the following properties:

(1) R_μ is convex.
(2) For all q in R_μ, F²(q) = F(q). Therefore, all initial q-states in R_μ will evolve towards the μ-th pattern.

This quantity allows us to calculate analytically the fraction of microstates which evolve towards the μ-th pattern in one single step, and therefore gives us a lower bound to the fraction f_μ (clearly R_μ ⊆ Δ_μ). It is important to notice that the boundary of the region defined by R_μ, namely the set obtained by using an «=» sign in equation (8) instead of «>», does not satisfy F²(q) = F(q); however, as N → ∞ this set has measure 0.
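The following sketch estimates this one-step lower bound by simple Monte Carlo: it samples zero-mean Gaussian overlap vectors (the membership condition (8) is scale invariant, so the width of the Gaussian is irrelevant) and counts how often they fall inside ∪_μ R_μ. It is written for the Hebb case w_μ = 1; the sample size is an arbitrary choice.

```python
import numpy as np

def lower_bound_fraction(p, n_samples=200_000, seed=1):
    """Monte Carlo estimate of the one-step lower bound on the cumulative
    domain size for the Hebb model (all w_mu = 1): the fraction of Gaussian
    initial overlap vectors q lying in the union of the sets R_mu of Eq. (8),
    |q_mu| > sum_{lambda != mu} |q_lambda|."""
    rng = np.random.default_rng(seed)
    q = rng.normal(size=(n_samples, p))            # zero-mean Gaussian initial overlaps
    a = np.abs(q)
    in_R = 2 * a > a.sum(axis=1, keepdims=True)    # |q_mu| > sum over the other components
    return in_R.any(axis=1).mean()

for p in (2, 3, 4, 6, 10):
    print(p, round(lower_bound_fraction(p), 3))
```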

Figure 1 includes the lower bound of f_p (×××) as a function of p for the Hebb model (all w_μ = 1), as calculated by integrating equation (7) over the region ∪_μ R_μ; it also includes the value of f_p as obtained by a numerical iteration of the flux equations. As we can see, after an initial decrease, the cumulative domain size of the stored patterns begins to increase for p > 6 and tends asymptotically to a value around 0.88; that is, the effect of mixed states is reduced by increasing the number of stored patterns. This result improves that reported by Coolen et al. [9] by eliminating some finite size effects.

[Figure 1: plot of the cumulative domain size f_p versus p (p = 1 to 20) for the Hebb model, {w_μ} = 1.]

Fig. 1. This figure shows the cumulative domain size f_p = 2 Σ_μ f_μ for the Hebb model, as obtained by a numerical iteration of the flux equations, as a function of p, and the analytical lower bound to this quantity (×××).

3. Fixed points of the flux equations.

Equations (5)-(6) have a number of fixed points, some of them acting as attractors of the dynamics of the system. It has been a common belief that, for a given finite number p of stored patterns in the thermodynamic limit (α = 0, N → ∞), there exists a fixed point related to each of the p pure memories, plus additional fixed points related to any combination of r pure memories, where 3 ≤ r ≤ p (plus their symmetrical counterparts). Although this is true for the Hebb model (for α = 0), in this paper we will show that for the modified Hebb model the existence of attractors related to any mixture of three or more pure memories depends on the specific values of the {w_μ} involved. To this end, we ordered the weights, without losing generality, in such a way that w_1 = 1 and 0 < w_i ≤ w_j for i > j.

Fixed points q(n + 1) = q(n) are classified into two groups depending on their number of non-zero components, these corresponding to the pure and spurious states. A fixed point related only to the μ-th stored memory ξ^μ is characterized mathematically by having only one component different from zero, that is q_μ ≠ 0 and q_ν = 0 for all ν ≠ μ. On the other hand, a fixed point corresponding to an attractor related to a mixture of several stored memories is one that has simultaneously more than one component, or overlap, different from zero.

3.1 p = 1, 2. By inspection of equations (5)-(8) we can observe that, in the case when less than three patterns have been stored, ∪_μ R_μ covers the whole overlap space. Therefore, the system does not have any spurious attractors, and equation (1) can be solved exactly.

3.2 p = 3. For the case p = 3, it is possible to demonstrate (see Appendix A) that the relationship F²(q) = F(q) holds for all q if w_μ < Σ_{λ≠μ} w_λ for all μ; this means that for any initial value q, the flux equations will converge to a fixed point (related either to a pure or to a mixed memory) in a single time step. Due to our convention in the ordering of the weights, this restriction can be summarized as 1 < w_2 + w_3. Clearly, the Hebb case lies within this category.

By an exhaustive analysis of the possible fixed points of the flux equations (5)-(6) we found the following: this system has one fixed point q_ν = δ_νμ related to each stored pattern μ (plus its symmetrical counterpart q_ν = −δ_νμ); these fixed points exist for any set {w_μ} (1 = w_1 ≥ w_2 ≥ w_3 > 0). Additionally, we found two different kinds of fixed points corresponding to mixture states (with any combination of signs):

I)  q_1 = ±1/2, q_2 = ±1/2, q_3 = ±1/2, existent in the region w_2 + w_3 > w_1;

II) q_1 = ±1/4, q_i = ±3/4, q_j = ±1/4 (i, j = 2, 3), existent along the line w_i = (w_1 + w_j)/3.

A stability analysis shows that only solution I, which exists for w_2 + w_3 > w_1 = 1, is stable, being stable in the whole region where it exists. In other words, the solutions of type II do not correspond to attractors. It is interesting to note that fixed points related to spurious memories do not exist for w_2 + w_3 < w_1 = 1; this is remarkable, for it has been a common belief that having more than two memories in Hebbian-type models for α = 0 (N → ∞) implies the existence of spurious stable states. These fixed points are indicated in figure 2.
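The stability statement above can be probed numerically with the flux map of equations (5)-(6): perturb a candidate fixed point slightly and check whether the iteration returns to it. The sketch below does this for solutions I and II; the weight values are illustrative, and flux_map is the same construction as in the sketch given after equation (6).

```python
import numpy as np
from itertools import product

def flux_map(q, w):
    """Zero-noise synchronous overlap map, Eqs. (5)-(6)."""
    corners = np.array(list(product([-1, 1], repeat=len(q))), dtype=float)
    fields = corners @ (np.asarray(w) * np.asarray(q))
    return (corners * np.sign(fields)[:, None]).mean(axis=0)

def attracts(q_star, w, eps=0.02, trials=200, steps=20, seed=3):
    """Return True if every slightly perturbed trajectory comes back to q_star."""
    rng = np.random.default_rng(seed)
    q_star = np.asarray(q_star, dtype=float)
    for _ in range(trials):
        q = q_star + eps * rng.normal(size=q_star.size)
        for _ in range(steps):
            q = flux_map(q, w)
        if not np.allclose(q, q_star, atol=1e-9):
            return False
    return True

w = [1.0, 0.9, 0.8]                       # w_2 + w_3 > w_1: mixture solution I exists
print(attracts([0.5, 0.5, 0.5], w))       # solution I:  expected True (attractor)

w2 = [1.0, 0.9, (1.0 + 0.9) / 3]          # w_3 = (w_1 + w_2)/3: line where solution II exists
print(attracts([0.25, 0.25, 0.75], w2))   # solution II: expected False (not an attractor)
```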

3.3 p = 4. In this case, all solutions mentioned above exist. That is, there is an attractor related to each of the 4 stored patterns, plus one spurious attractor related to each combination of r = 3 stored memories whenever the condition w_i < w_j + w_k, for i ≠ j, k and any value of w_l, is satisfied; here (i, j, k, l) is any permutation of (1, 2, 3, 4) (this restriction leaves out many regions of the parameters' space). Additionally, new attractors appear which are mixtures of 4 patterns. Due to the large number of parameters it is not simple to find out, for p > 3, which are the conditions required for particular types of fixed points to exist. Similarly, it is not possible to derive analytically the number of iterations required for the flux equations to converge, so it becomes necessary to find out numerically the answer to these questions.

[Figure 2: contour plot in the (w_2, w_3) plane, 0 ≤ w_2, w_3 ≤ 1, for p = 3.]

Fig. 2. p = 3. The behaviour of the network in w-space is separated into two regimes by the line w_2 + w_3 = 1. Above this line, the contour levels represent the cumulative domain size f_p. In the shadowed area f_3 = 1 exactly; in this region broken lines represent the percentage of times the flux equations converge on the first iteration. Along the lines marked (+ + +) and (+ +) there exist unstable fixed points.

4. Numerical iteration of the flux equations.

In order to evaluate the domain sizes of the p = 3, 4 attractors, a systematic study was performed of the evolution in time of the flux equations for a NN with synchronous dynamics. The p memories were stored according to the modified Hebb rule [Eqs. (1)-(2)]. This study was done as a function of the embedding strengths {w_μ}, with the conventions previously indicated, and by considering a grid in the parameters' space given by Δw = 0.04. The idea was to calculate the cumulative size of the attraction domains f_p = 2 Σ_μ f_μ by iterating the flux equations (5)-(6) with random initial values for q, obtained from a Gaussian distribution D(q) with zero mean and a dispersion σ → 0. This was done 10,000 times. The choice of this distribution reflects random initial states {S_i} when a set of non-biased random patterns {ξ^μ} has been stored in a network composed of N ≫ 1 spins.
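As a rough illustration of this procedure, the sketch below iterates the flux map from many narrow zero-mean Gaussian initial overlap vectors and counts the fraction of trajectories terminating on a pure-memory fixed point (exactly one non-zero overlap), which gives the cumulative domain size f_p. The number of samples, the Gaussian width (irrelevant here, since the map depends only on signs of linear combinations of q) and the example weight sets are illustrative.

```python
import numpy as np
from itertools import product

def flux_map(q, w, corners):
    """Zero-noise synchronous overlap map, Eqs. (5)-(6)."""
    fields = corners @ (w * q)
    return (corners * np.sign(fields)[:, None]).mean(axis=0)

def cumulative_domain_size(w, n_init=10_000, max_steps=50, seed=2):
    """Fraction of random initial overlap vectors ending on a pure memory
    (a single non-zero component q_mu = +/-1), i.e. f_p = 2 sum_mu f_mu."""
    w = np.asarray(w, dtype=float)
    corners = np.array(list(product([-1, 1], repeat=len(w))), dtype=float)
    rng = np.random.default_rng(seed)
    pure = 0
    for _ in range(n_init):
        q = rng.normal(scale=1e-3, size=len(w))       # narrow zero-mean Gaussian initial state
        for _ in range(max_steps):
            q_new = flux_map(q, w, corners)
            if np.allclose(q_new, q):
                break
            q = q_new
        if np.count_nonzero(np.abs(q) > 1e-9) == 1:   # pure-memory fixed point
            pure += 1
    return pure / n_init

print(cumulative_domain_size([1.0, 1.0, 1.0]))   # Hebb case, p = 3
print(cumulative_domain_size([1.0, 0.5, 0.4]))   # w_2 + w_3 < w_1: expect f_3 = 1
```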

The results obtained were the following. For p = 3 the behaviour of the network was found to be separated into two regimes by the line w_2 + w_3 = 1 (in general, by the line w_2 + w_3 = w_1), as follows:


• Below this line, for w_2 + w_3 < w_1 = 1, all attractors correspond to one of the stored patterns, that is, no spurious memories exist, so f_p = 1. In this regime, the flux equations converge in either one or two time steps. Broken lines in figure 2 show the contour levels for the percentage of times the flux equations converge on the first iteration; the remaining times they require two iterations to converge.

• As the line w_2 + w_3 = w_1 is crossed (entering the parameter region w_2 + w_3 > w_1 = 1), there is an abrupt transition into a different regime: here, all flux equations converge on the first iteration; however, not all the attractor states are related to only one of the p stored patterns. Solid lines in figure 2 indicate the contour levels for the percentages of microstates f_p which evolve towards states related to pure memories; these lines can be obtained analytically [10]. It is interesting to note that the contour lines in figure 2 seem to continue across the line w_2 + w_3 = w_1, but have a different meaning on each side. This indicates that all those cases in which the flux equations do not converge on the first step get transformed into spurious memories as one switches to the other regime.

Another indication of this is the behaviour along the line w_2 + w_3 = w_1 = 1, which happens to be on the boundary of the region R_1; on this line, all the points show both a percentage of spurious memories and a percentage of pure memories for which two steps were needed to obtain the fixed point. In these cases, these two percentages sum to the same value as the percentage of the contour lines they lie on.

For p = 4 we found the following: there are large regions in the parameter space where no spurious attractors with r = 3 exist. Additionally, by numerical iteration of the flux equations we found that there is a region where there are no attractors related to spurious memories at all. However, contrary to what happens for p = 3, the transition between regions with and without spurious memories is a soft one. That is, as we change the embedding strengths {w_μ}, the fraction of microstates which evolve towards a spurious attractor goes smoothly from values equal to zero to values different from zero.

Figures 3 and 4 show the contour lines for the cumulative domain size f_4, for some sets of values {w_μ} of the embedding strengths, as calculated by numerical iteration of the flux equations starting from random initial states, as indicated above. In these figures, shadowed areas correspond to regions with f_4 = 1. The main figure in figure 3 corresponds to w_1 = 1, w_2 = 1, 0 < w_4 ≤ w_3 ≤ w_2. As we can see, the Hebb case presents the lowest cumulative domain size, with f_4 ≈ 0.5; this value increases as w_3 and w_4 decrease, up to a value of almost f_4 = 1. For any set of values {w_μ} included on this graph, some of the pure memories require more than one iteration to converge to a fixed point, the average number of iterations required being the highest in the Hebb case, with n_av = 1.42, and decreasing to about 1.10 for w_1 = 1, w_2 = 1, and w_3, w_4 → 0. The inset in figure 3 represents the contour lines for the case w_1 = 1 and 0 < w_4 ≤ w_3 ≤ w_2 = 0.4; as we can see, there is a soft (second order) transition between a region with f_4 = 1 and another one with f_4 < 1. Figure 4 depicts the contour lines of f_4 for three other cuts in the parameters' space.

[Figure 3: contour plot of f_4 (main panel and inset); lines marked (+ + +) and (+ +) are drawn.]

Fig. 3. The main figure shows the contour lines of f_4 for the case 0 < w_4 ≤ w_3 ≤ w_2 = w_1 = 1; in this region of {w_μ} there are always spurious memories. The inset shows the case 0 < w_4 ≤ w_3 ≤ w_2 = 0.4, with w_1 = 1; the shadowed area shows the region in w-space with f_4 = 1.

[Figure 4: three contour panels of f_4 in the (w_3, w_4) plane at w_1 = 1.0 and different values of w_2.]

Fig. 4. Contour lines for f_4, for three different regions in the parameter space {w_μ}; the shadowed areas indicate the regions where f_4 = 1.

5. Discussion.

The flux equations which are used in this approach are exact in the thermodynamical limit. However, in the results obtained numerically there are two possible sources of finite size effects, in addition to an error of about 1% related to the number of random initial states considered. The first possible source of finite size effects is related to whether the union of the convex sets R_μ (which determine the fixed point to which a given initial state will evolve) for small p indeed covers overlap space. This is true except for those overlap vectors located exactly at the boundaries of the regions, i.e. those for which w_μ |q_μ| = Σ_{λ≠μ} w_λ |q_λ| for some μ.

In the thermodynamical limit these regional boundaries are sets of measure zero.

The second possible source of finite size effects is related to the choice of initial values for the overlaps. Random initial conditions {S_i} lead to a Gaussian distribution for the overlaps {q_μ}, with zero mean and a deviation given by σ ~ 1/√N. The use of a non-zero width Gaussian distribution for the initial overlaps would introduce finite size effects for non-zero noise levels. However, in the noiseless case (T = 0), these effects disappear.

It is very important to stress that storing more than three patterns does not imply having spurious memories, for there are some regions in w-space for which no spurious memories exist. Therefore, it is possible to eliminate the existence of spurious stable states by modifying the weights associated to the patterns. This has an intuitive explanation if we make a comparison with hyperspheres of different sizes, which we know «fill» the space better than spheres of approximately the same size, by leaving less intersection space.

Acknowledgements.

One of the authors (LV) wishes to thank Dr. Miguel Avalos for his advice in computing matters, and C. Martinez for her collaboration in the production of figure 1. This work was partially supported by project DGAPA IN013189 of the National University of Mexico.

Appendix.

The expression F²(q) = F(q) can also be written as

    W ≡ ( Σ_μ w_μ q_μ η_μ ) ( Σ_λ w_λ η_λ F_λ(q) ) ≥ 0 ,    (A.1)

for every corner η of the hypercube surrounding m = 0 (η_μ = ±1). We define z_μ ≡ η_μ w_μ q_μ and Z ≡ Σ_μ z_μ, with Z ≠ 0 (i.e. excluding region boundaries), and consider the case Z > 0; the result for Z < 0 can be obtained by switching z → −z and using W(−q) = W(q). In this way, W can be written as

    2^{p−1} W/Z = Σ_{η: η_1 = +1} ( Σ_μ w_μ η_μ ) sgn( Σ_μ z_μ η_μ ) ,    (A.2)

where the sum runs over the 2^{p−1} corners η with η_1 = +1. For p = 1, 2, 3 this expression corresponds to the following.

Pure solutions (p = 1):

    W/Z = w_1 sgn(z_1) ≥ 0 .

p = 2:

    W/Z = (w_1 + w_2)/2 · sgn(z_1 + z_2) + (w_1 − w_2)/4 · sgn(z_1 − z_2) + (w_2 − w_1)/4 · sgn(z_2 − z_1)
        = (w_1/2) [1 + sgn(z_1 − z_2)] + (w_2/2) [1 − sgn(z_1 − z_2)] > 0 .

As we can see, for p = 1, 2 the expression F²(q) = F(q) is always true for any set of values {w_μ}.

For p = 3, equation (A.2) corresponds to

    4 W/Z = w_1 + w_2 + w_3 + (w_1 + w_2 − w_3) sgn(z_1 + z_2 − z_3) + (w_1 − w_2 + w_3) sgn(z_1 − z_2 + z_3) + (w_1 − w_2 − w_3) sgn(z_1 − z_2 − z_3)
          = w_1 [1 + sgn(z_1 + z_2 − z_3) + sgn(z_1 − z_2 + z_3) + sgn(z_1 − z_2 − z_3)]
          + w_2 [1 + sgn(z_1 + z_2 − z_3) − sgn(z_1 − z_2 + z_3) − sgn(z_1 − z_2 − z_3)]
          + w_3 [1 − sgn(z_1 + z_2 − z_3) + sgn(z_1 − z_2 + z_3) − sgn(z_1 − z_2 − z_3)]
          = w_1 [Λ − 2 sgn(y_1)] + w_2 [Λ − 2 sgn(y_2)] + w_3 [Λ − 2 sgn(y_3)] ,

where y_i ≡ Σ_j z_j − 2 z_i and Λ ≡ 1 + Σ_j sgn(y_j). Since Σ_i y_i = Z > 0, at least one of the y_i must be positive; let us assume that y_μ > 0, with (μ, ν, ρ) any permutation of (1, 2, 3); therefore

    4 W/Z = w_μ [sgn(y_ν) + sgn(y_ρ)] + w_ν [2 + sgn(y_ρ) − sgn(y_ν)] + w_ρ [2 − sgn(y_ρ) + sgn(y_ν)] .

This expression will be positive if w_μ ≤ w_ν + w_ρ for (μ, ν, ρ) any permutation of (1, 2, 3). Therefore, this is a necessary condition for having F²(q) = F(q) in the case p = 4.
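For a quick numerical cross-check of this appendix (and of the claim in section 3.2), one can sample many overlap vectors and test directly whether F(F(q)) = F(q). A sketch follows; the weight choices are illustrative, one satisfying the condition w_μ < Σ_{λ≠μ} w_λ and one violating it.

```python
import numpy as np
from itertools import product

def flux_map(q, w, corners):
    """Zero-noise synchronous overlap map, Eqs. (5)-(6)."""
    return (corners * np.sign(corners @ (w * q))[:, None]).mean(axis=0)

def one_step_convergence_holds(w, n_samples=20_000, seed=4):
    """Sample random overlap vectors and test whether F(F(q)) = F(q) for all of
    them, i.e. whether the flux equations converge in a single step."""
    w = np.asarray(w, dtype=float)
    corners = np.array(list(product([-1, 1], repeat=len(w))), dtype=float)
    rng = np.random.default_rng(seed)
    for q in rng.normal(size=(n_samples, len(w))):
        f1 = flux_map(q, w, corners)
        f2 = flux_map(f1, w, corners)
        if not np.allclose(f1, f2):
            return False
    return True

print(one_step_convergence_holds([1.0, 0.9, 0.8]))   # w_2 + w_3 > w_1: expect True
print(one_step_convergence_holds([1.0, 0.4, 0.3]))   # w_2 + w_3 < w_1: expect False
```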

References

[1] HOPFIELD J. J., Proc. Natl. Acad. Sci. USA 79 (1982) 2554.
[2] AMIT D. J., GUTFREUND H., SOMPOLINSKY H., Phys. Rev. A 32 (1985) 1007.
[3] VIANA L., J. Phys. France 49 (1988) 167.
[4] COOLEN A. C. C. and RUIJGROK Th. W., Phys. Rev. A 38 (1988) 4253.
[5] GARDNER E., J. Phys. A 22 (1989) 1969.
    KEPLER T. B. and ABBOTT L. F., J. Phys. France 49 (1988) 1657.
    KOHRING G. A., Europhys. Lett. 8 (1989) 697.
    KRAUTH W., NADAL J. P. and MÉZARD M., J. Phys. A 21 (1988) 2995.
    KRATSCHMAR J. and KOHRING G. A., J. Phys. France 51 (1990) 223.
[6] FORREST B. M., J. Phys. A 21 (1988) 245.
[7] HORNER H., BORMANN D., FRICK M., KINZELBACH H. and SCHMIDT A., Z. Phys. B 76 (1989) 381.
[8] VIANA L., COTA E., MARTINEZ C., in Statistical Mechanics of Neural Networks, L. Garrido Ed., Lecture Notes Phys. 368 (Springer-Verlag, 1990) p. 97.
[9] COOLEN A. C. C., JONKER H. J. J. and RUIJGROK Th. W., Phys. Rev. A 40 (1989) 5295.
[10] COOLEN A. C. C., Europhys. Lett. 16 (1991) 73.
[11] VAN HEMMEN J. L., GRENSING D., HUBER A. and KÜHN R., Z. Phys. B 65 (1986) 53.
