HAL Id: jpa-00246462
https://hal.archives-ouvertes.fr/jpa-00246462
Submitted on 1 Jan 1992
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Simulation of a multineuron interaction model
J. Arenzon, R. M. C. de Almeida, J. Iglesias, T. Penna, P. M. C. de Oliveira
To cite this version:
J. Arenzon, R. M. C. de Almeida, J. Iglesias, T. Penna, P. M. C. de Oliveira. Simulation of a multineuron interaction model. Journal de Physique I, EDP Sciences, 1992, 2 (1), pp.55-62.
�10.1051/jp1:1992122�. �jpa-00246462�
Classification
Physics
Abstracts87.30G 64.60C 75.10H
Simulation of
amultineuron interaction model (*)
J. J. Arenzon
(I),
R. M. C. de Almeida(I),
J. R.Iglesias (I),
T. J. P.Penna(2)
and P. M. C. de Oliveira(2)
(1) Instituto de Fisica, Universidadc Federal do Rio Grande do Sul, C.P. 15051, 91500Porto
Alegre,
RS, Brazil(2) Instituto de Fisica, Universidade Federal Fluminense, C.P. 100296, 24020 Niter6i, RJ, Brazil
(Received 2
July
1991,accepted
infinal form
6September1991)
Abstract We simulate the
dynamics
of a muitineuron model (RS model) witli an energy functiongiven by
tileproduct
between thesquared
distances inphase
space between the state of tile net and the stored pattems. We obtain the relativefrequency f(m~)
that anarbitrary
pattem is retrieved from an initialoverlap
m~ and estimate the size of the basins of attraction for differentactivities a. Two limit cases are taken into account : when pattems and
antipattems
are stored(p.a.s.)
and whenonly
the pattems are stored(o.p.s.).
For the a= 0.5, p.a.s. nets a limit for the load parameter was not found, but for the other cases (a,0.5 or o.p.s.
configuration)
therelative size of the basins of attraction may become too small.
Most common models for neural networks consist of
systems
of a great number ofinteracting spins (neurons) [1, 2]
where eachconfiguration
is characterizedby
an N- dimensional vectorS, given by
S
=
(Sj,
,
SN
)
;S;
= ±
(1)
The Hamiitonian
function,
and hence theconfigurations
that minimize the energy of thesystem,
is determinedby
thespin
interactions consideredby
agiven
model. For the attractorneural
network,
when any set of P chosen states of the net(pattems),
say(~,
~1 =
1,
...,
P,
areimplemented
as minima of the Hamiltonianthrough convenient,
apriori prescribed tuning
of the interactionsstrengths,
the net is said to have leamt thesepattems.
In this case, agiven pattem (~
is retrievedwhen, starting
from aninitial,
unstableconfiguration
that is
sufficiently
similar to the pattem, the net relaxes towards and stabilizes at the(minimum energy)
state~~.
(*) Work
partially supported by
brazilianagencies
Conselho Nacional de Desenvolvimento Cientificoe
Tecno16gico (CNPq),
Financiadora de Estudos eProjetos
(FINEP), Coordenadoria deAperfeiqoa-
mento de Pessoal de Nivel
Superior
(CAPES) andFundaq6es
de Amparo hPesquisa
dos Estados do Riode Janeiro and Rio Grande do Sul (FAPERJ and FAPERGS).
56 JOURNAL DE
PHYSIQUE
I N° IThe most
extensively
studiedexample
of such idealizedsystems
is theHopfield model,
which in itsoriginal
version considers thefollowing
Hamiltonian[2]
:E
=
jj
J,~S,
S~
(2)
,,j i
and the
synaptic
matrixJ,~,
associated to thestrengths
ofspin interactions,
isgiven by
Hebbleaming
rule[2, 3]
:J,~ =
jj f,~f~~ (3)
N
~
with
(~,
~1=
1,
,
P, being
P vectors in thephase
spacerepresenting
the stored pattems.This
synaptic
matrix J;~ guarantees that thesepattems
arestrongly
correlated to minima of the Hamiitonianequation (2),
ifthey
are uncorrelated among themselves and the load parametera, defined as
a =
~
(4)
is less than a critical value a~ = 0.14
[4].
In this limit case theHopfield
model describes acontent-addressable memory. The uncorrelated pattems are chosen
by assigning equal
probability
a= 0.5 of each
spin if
to be + or at random. The value a defines theactivity
of the net and lower values of a
(favoring spin +1,
forinstance)
introduce correlation between the stored memories. Severalattempts
have been made to model a neural networkcapable
to handlesatisfactorily
with correlatedpattems
and with ahigher
critical loadcapacity,
here measuredby
a~, bothby introducing
modifications in theHopfield
Hamiltonianor
by proposing
new models(see,
forexample,
Refs.[4-11]).
Recently,
de Almeida andIglesias [12] proposed
a new Hamiltonian function that is relatedto the distances in
phase
space between the state of the net S and the P pattems(~.
Thismodel,
named RS after the brazilian state where the model wascreated,
includesmultispin
interactions and can be writtenas
p
l~
NE
=
N
fl jj
(S~ff)~ (5)
~=i
~'~,=i
The
expression
in theproduct
is thesquared
Euclidean distance between S and(~, i.e.,
E is anon-negative
function andE(S)
=
0 if S
=
(~,
for any~1=1,
.,
P.
Consequently
thepattems
arealways
minima of the energy function. Thecapacity
of thenetwork,
as well as itsability
to handle with correlatedpattems
aregreatly
andqualitatively
enhanced in
comparison
withprevious
models. Also, if(~
is a minimum ofequation (5),
it isnot a direct consequence that its
antipode -(~ (antipattem)
should also minimize theHamiltonian,
as ithappens
in theHopfield
Model.Nevertheless,
one canalways explicitly
store the
antipattems,
if one wishes so. This feature of the modelyields
to two differentlimiting
cases for uncorrelated pattems : whenonly
the pattems are stored(o.p.s.)
or when bothpattems
andantipattems
are stored(p.a.s.).
The above considerations were
directly
inferred fromequation (5)
and further conclusionsrequire deeper analysis
of thedynamics implied by
the model. As the system under consideration takes into account a great number ofinteracting
neurons, there are two differentapproaches
to theproblem
: mean field calculationsthrough
statistical mechanicstechniques,
whichpreliminary
results have beenpresented
in reference[12],
and numerical simulations. In this work wepresent
somecomputer
simulation results for the twolimiting
cases
(o.p.s.
and p.a.s.configurations)
where we consider different loadparameters
and activities for the randompattems.
In what follows webriefly explain
theprocedures
to obtain the relative fractionf (mo)
of times that anarbitrary pattem
is retrieved whenstarting
from aninitial state with an
overlap
mo,following
theprescription by
Forrest[13].
Weanalyze
theresults,
obtain an estimate of the size of the basins of attraction and show that the loadcapacity
of the net has beengreatly improved.
Consider a network of N neurons and a zero-temperature
dynamics
such that aspin
isflipped
whenever the energy of the net is lowered.Equation (5) gives
a lot of information about the energylandscape
inphase
space, but it is not in anadequate
form for numericalpurposes. It can be rewritten as
p
E
= N
fl (i
m~) (6)
~=i
in the o.p.s. case, or
p
E
= N
fl (i m( (7)
~=i
in the p.a.s.
configuration [12].
Inequations (6)
and(7)
m~ stands for theoverlap
of the state of the net S with the~1-th
pattem :m»
=Z if St (8)
,
~
i
The numerical simulation is
performed
as follows :I)
initialization : construct P randompattems
withactivity
a, and an initial state that has agiven overlap
mo with one of the storedpattems,
chosen at random. Calculate theoverlaps
m~ with the other onesthrough equation (8)
and then the initial energythrough (6)
or(7)
;it)
consider a virtualflip
in the I-thspin
and calculate virtualoverlaps mf by
mi
= m»
~
S, if
;(9)
iii)
obtain the virtual energy E*through equations (6)
or(7) using
theoverlaps
~ *.
» ,
iv)
whenever E* WEflip
thespin
andupdate
the energy andoverlaps
v)
theupdating
isperformed sequentially
until the system reaches a stable state ; The steps(I)
to(v)
arerepeated
several times and theaveraged frequency f(mo)
that therandomly
chosenpattem
is retrieved from an initialoverlap
mo is obtained. We used themultispin coding approach,
asprescribed by
Penna and Oliveira[14, 15]. Although
the number of arithmeticoperations
perupdating
are of the same order as in theHopfield model,
more
computing
time isrequired
because here the energy function is calculatedthrough
theproduct
of theoverlaps
instead of the sum as in theHopfield-Hebb algorithm.
For morecomputing
details see reference[17].
Figure
shows the relativefrequency f(mo)
for the p.a.s.configuration
and 0.5activity,
uncorrelated sets of memories. The nets considered have N
=128,
256 and 512 andsimulations were
performed
for a=
0.1,
0.5 and I. The average valuesf(mo)
were obtainedconsidering
200 relaxations(50
for the N= 512
net)
for each of five different sets of memories for everypoint
of thefigure
and the estimated errors are less than 10 9b. As the pattems arealways
minima of the energyfunction,
when a pattem is retrieved the final58 JOURNAL DE PHYSIQUE I N°
1-O ~~ _j-~ ~ &@~
/ fi' /
~ /
/O~
O /
O-B
/,~
// /
/~~ ,
t°'~
Q~0. ~/ 0.5~
~0.4
/
02 ~~11
~
~ ~ /°~
/~ Ja ~/
00
0.0 0 2 0 4 0 6 0 fi 0
mo
Fig.
I. Fractionf(mo)
of recalled pattems with an initialoverlap
mo for N= 128 (O), 256 (6) and
512 (a) and « 0.1, 0.5 and I iri the p.a.s. case, and uncorrelated pattems.
overlap
isalways I, I.e.,
there is no errors in the retrieved states.Considering
the p.a.s. case,similarly
to theHopfield model,
curvescorresponding
to nets with the same value of the load parameter a but different number of neurons N(and
hence different number P= NtY of
stored
pattems)
superpose[13],
and the rate at whichf(mo)
increases form 0 to I becomesmore
pronounced
as the size of the system N increases for all values of a in thefigure.
Assuggested by
Forrest[13]
this behaviorapproaches
adiscontinuity
atm~(a)
as N~ cc
implying
that even fora =
i the net
keeps
itsretrieving
abilities. The value lim m~(a gives
N
~ w
a measure of the size of the basins of attraction of the memories.
Rough estimates,
takendirectly
fromfigure I,
arem~(a
=
0.1)
=0.12, m~(a
=
0.5)
= 0.43 andm~(a
=
1)
=
0.61.
Comparisons
with theHopfield
model valuemf(a
=
0.1)
= 0.37 shows
that,
even at very low a the size of the basins of attraction aregreatly
enhanced in this model.Also,
thedependence
of m~ with the load parameter a is much smoother than inHopfield
Model(the
basins of attraction are notdrastically reduced),
because here there is no critical value for the loadparameter,
at least in theregion
a w I. This effect couldoriginate
from the elimination ofspurious
statesby
thehigher
ordersynaptic
connections consideredby
the Hamiltonianequation (2). (At
N ~ cc a few very shallowspurious
states with Em 0 may appear in numerical simulations. At N~ cc these states merge in an absolute
(unstable)
maximum at apoint
inphase
space that isequidistant
from every(uncorrelated)
stored pattem, in agreement with mean field(N
~ cc
)
calculations[12, 16]). Higher
values ofa do not present any
qualitative
difference for theanalyzed
case but for aexpected
decrease in the size of thebasins of attraction.
In
figure 2,
we presentf (mo)
versus mo for 0.5activity,
uncorrelatedpattems
in the o.p.s.configuration
for nets with N=
128,
256 and 512 neuronsagain,
the average is taken over200 relaxations for each of five different sets of memories
(50
for the N=
512).
Here the different N curves with same number P of stored pattems(and
nota)
superpose. This can beexplained
as follows. Theantipattems
are located in aregion
inphase
space that is thespecular
reflexion of thecorresponding region
where thepattems
are. When antimemoriesare also stored
(p.a.s. case)
these tworegions
have similar energylandscapes.
But when antimemories are not considered(o.p.s. case),
theantipattems region comprises
verylarge
energy states, while its
specular image
contains the lower energy ones,forming
ananalogue
ofa multi-hole bowl in the closed space defined
by
the(hyper)
surface of the(hyper)
cube in1-o = ~Q
/
/ ~
o-B ~
/
/ /
d
-°.6
P=13 /~~~ 64 128~
o/~~
+-0.4 /
~/
l~/
lO-Z
~
/
/ ~d
O-O
0.6 0.7 O-B 0.9 1-O
mo
Fig.
2.f(m~)
versus m~ for N=
128
(O),
256(A)
and 512(o)
and P=
13, 64 and 128 in the o.p.s.
case, and uncorrelated pattems. Note the different scale on the horizontal axis.
phase
space. The greater the number of pattems Pis,
thebigger
the difference between these tworegions
and thedeeper
is the bowl. Thepattems
are stored inside thisbig
basin whose centerlays
in the direction C determinedby
the sum of all stored uncorrelatedpattems.
As Pincreases,
the sizes of the basins of attraction of each memorydecrease,
I.e. m~decreases, enlarging
theprobability
that the net stabilizes in a state which isstrongly
correlated to thecenter of the
big
bowl.Figure
3 presents apictorial
sketch of the energylandscape
in thephase
space for this case. In the limit P~ cc, the energy of the state in the center of the bowl goes to zero, the basins of attraction of each stored
pattem
merge, and there isonly
onebig
basin with a
flat,
zero energy bottom in theregion
inphase
space which is delimitedby
aring
obtained
by linking
thepoints
associated to the pattems and contains the direction C. Outside thisring
the energy increasesmonotonically
from zero toinfinity, reaching
its maximum value at thepoint
C. Hence it is notsurprising
that in the o.p.s. case the relative size of the basins ofattraction,
measuredby
m~, is determinedby
the absolute number P while for the p.a.s.configuration
the relativequantity
a =PIN,
thatgives
a measure of thedensity
of storedpattems,
is the relevant number indefining
the size of the basins of attraction. This effect is inagreement with mean field calculations
[16]. Again,
the rate at whichf(mo)
increases from 0to becomes more
pronounced
as the size of the system increases and we can see that the net still retrieves the stored pattems when P is of the order ofN,
at least for finite nets.p<~~n p-on
~ B
AE
Iv f iv I» f iv
Fig.
3. -Pictorialrepresentation
of the energylandscape
in thephase
space for the o.p.s.,a = 0.5 case. For P ~ oJ the direction C, determined
by
the sum of all stored pattems, is a localminimum of the Hamiltonian, with E ~ 0. In the P
~ oJ the center of the bowl and all the stored
pattems merge in a
unique
wide minimum.60 JOURNAL DE PHYSIQUE I N° I
Figure
4 shows theplot
off (mo)
versus mo for different activities(a
= 0.2 and
0.4)
for thep.a.s. case. Here too, as in the o.p.s.
configuration,
the curves with same number P of stored pattems superpose andagain
the effect ofincreasing
the size N of the net is to pronounce the rate at whichf(mo)
increases from 0 to 1. It alsoimplies
that the size of the basins of attraction decreases with a. Theexplanation
is similar to the o.p.s., a = 0.5configuration
there is a
high
energyregion
in thephase
space,comprising
now the a= 0.5 states, where there are not minima of energy. This
region separates
two multi-hole-bowl-likeregions,
each of themcontaining
all the pattems or all theantipattems and,
as theactivity
adecreases,
the bowls growdeeper,
moreenergy-decreasing paths
lead from an initial state to the center of the bowl and the initialconfiguration
should be nearer to agiven
pattem in order to retrieve it : more information isrequired
todistinguish
between correlatedpattems.
Nevertheless itstiff
represents
an enhancement incomparison
to theoriginal
or modifiedHopfield
model[4].
I-o
~~~W
n~ ~
fi
(
,,'~
'i ~~
0.8
/
I ~,'~/
l / ?
/ i
--°.6
/
~~P=13
/ 128
cJ
,
j~ , ~ /
~
-DA j ~ / /
~~'
/ /
~
d
o-Z / / /
Id
/ ~
t' ~/
0.0
DO 0 2 0 4 0 6 0 fi 0
mo a)
1-o
0.fi
--°.6 P=13 128
I i$°
C ~0.2
0.0
0 0 0.? 0 4 0 6 o-B 0
mu b)
Fig. 4. f
(mo
versus m~ for the p.a.s. case with activities : a) a = 0.4 and b) a= 0.2 for two different values of P and for N
=
128 (O), 256 (6) and 512 (o).
In
figure 5,
we present the results for the o.p.s.configuration
for lowactivity pattems
and theonly
effect whencomparing
to the a= 0.5 case is the reduction of the size of the basins of attraction.
1.0
II II
o-a
-°.6 P=13 128 '
£
c~-0.4
~
/ /
o-z /
/ /
/
0.0
0.6 0.7 o-B 0.9 1.0
mo a)
0
o-a
j°.6
P=13 128~o.4,
o-z
o-o
0.6 0.7 o-B 0.9 1-o
mo b)
Fig.
5.f(m~)
versus mo for the o.p.s. case with activities : a) a=
0.4 and b) a = 0.2 for two different values of P and for N
= 128 (O), 256 (6) and 512 (o). Note the different scale on the horizontal axis.
In
conclusion,
wepresented
here the results of numerical simulations for the Hamiltonianequation (5), considering
twolimiting
cases : when bothpattems
andantipattems
are stored(p.a.s.)
and whenonly
thepattems
are considered(o.p.s.).
In the case of a=
0.5,
thecapacity
of the net isgreatly improved, specially
for the p.a.s. nets,compared
to theHopfield
model. Also the retrieval of memories are
always perfect, I.e.,
the finaloverlap
of a retrievedstate is
always I, reflecting
the fact that thepattems
arealways
minima of the energy function.The Hamiltonian function can be
expanded
in a sum of different orders of interactionsand,
in the p.a.s. case, it can be truncated at the second order term,recovering
theHopfield
model.As this cut-off is reasonable
only
when the load parameter is low and thepattems
areuncorrelated 11
2, 16],
the critical value a~ = 0.14 may beregarded
as alimiting
value for thevalidity
of theapproximation implied by
the cut-off at the second order term.JOURNAL DE PHYMQLLJ -' 2,A J"'Ilk1'>.>2 t
62 JOURNAL DE PHYSIQUE I N° I
A
novelty
is that, in the o.p.s.configuration,
the size of the basins of attraction scales with the number P of storedpattems,
because of the bowl-likeshape
of the energy inphase
space.The p.a.s.
configuration
seems to occupy moreefficiently
thephase
space. On the otherhand,
in the o.p.s. case the basins of attraction may become too small. This reduction in the size of the basins is also verified for lowactivity
pattems, for both p.a.s. and o.p.s. nets.A limit value for the load parameter a for a
=
0.5,
p.a.s. nets has not been found.Anyway,
the pattems arealways
zero-energy states of anon-negative
Hamiltonian and hence thedefinition of a limit value for a may
require
someadjustment
in this model : forexample,
alimitation may be
imposed by
thevelocity (or
some otherparameter)
inretrieving
thepattems rather than
by
theability
inaddressing them, although
up to now we did not detect such effect. Work in this direction ispresented
elsewhereii?].
References
[Ii MCCULLOCH W. S. and PITTS W. A., Bull. Math. Biophys. 5 (1943) lls.
[2] HOPFIELD J. J., Proc. Natl. Acad. Sci. USA 79 (1982) 2554.
[3] HEBB D. O., The
Organisation of
Behaviour (New YorkWiley,
1949).[4] AMIT D. J., GUTFREUND H. and SOMPOLINSKY H.,
Phys.
Rev. A 32 (1985) 1007 Ann.Phys.
NY 173 (1987) 30Phys.
Rev. A 35(1987)
2293.[5] GARDNER E., J. Phys. A 20 (1987) 3453.
[6] KANTER I. and SOMPOLINSKY H.,
Phys.
Rev. A 35 (1987) 380.[7] HORN D. and USHER M., J.
Phys.
France 49 (1988) 389.j8] ABBOTT L. F. and ARIAN Y., Phys. Rev. A 36 (1987) 5091.
[9] NADAL J. and TOULOUSE G., Nenvork 1 (1990) 61.
[10] PEREz-VICENTE C. J. and AMIT D. J., J.
Phys.
A 22 (1989) 559.[I Ii PEREz-VICENTE C. J., Europhys. Lett. lo (1989) 627.
[12] DE ALMEIDA R. M. C. and IGLESIAS J. R., Phys. Lett. A146 (1990) 239.
[13] FORREST B. M., J. Phys. A 21(1988) 245.
[14] PENNA T. J. P. and DE OLIVEIRA P. M. C., J.
Phys.
A 22 (1989) L719.[15] DE OLIVEIRA P. M. C.,
Computing
Boolean Statistical Models (World ScientificPublishing,
1990).[16] ARENzON J. J., DE ALMEIDA R. M. C. and IGLESIAS J. R.,
preprint (1991).
[17] PENNA T. J. P., DE OLIVEIRA P. M. C., ARENzON J. J., DE ALMEIDA R. M. C. and IGLESIAS J. R., Int. J. Mod.