HAL Id: jpa-00212356
https://hal.archives-ouvertes.fr/jpa-00212356
Submitted on 1 Jan 1990
Neural networks with many-neuron interactions

G. A. Kohring (*)

Physikalisches Institut der Universität Bonn, Nussallee 12, D-5300 Bonn 1, F.R.G.

(Received 2 October 1989, accepted 3 October 1989)
Abstract. — The static and dynamical properties of neural networks having many-neuron interactions are studied analytically and numerically. The storage capacity of such networks is found to be unchanged from that of the more widely studied case of two-neuron interactions, implying that these networks store information no more efficiently. The size of the basins of attraction in the many-neuron case is calculated exactly from a solution of the network dynamics at full connectivity, and reveals that networks with many-neuron interactions are better at pattern discrimination than the simpler networks with only two-neuron interactions.

J. Phys. France 51 (1990) 145-155, 15 January 1990

Classification: Physics Abstracts 87.30G - 75.10H - 64.60C
Introduction.
The static structure of the simplest single-layer neural networks is described by N two-state neurons, S_i, and a real-valued, N × N coupling matrix, T_ij, which contains information about the architecture of the network as well as determining the stable states. The two-dimensional character of the coupling matrix follows from the assumption that the neurons have only two-neuron interactions. However, it was realized quite early [1] that this assumption is not tenable for biological networks. Biological networks have a propensity for many-neuron interactions, although the reasons for this are far from clear. Several authors [1-3] have pointed out the increased storage capacity of networks using multibody interactions, and Baldi and Venkatesh [3] have estimated that this storage capacity increases like O(N^λ) for networks having (λ + 1)-neuron interactions. For the case of two-neuron interactions it is known from the work of Venkatesh [4] and Gardner [5] that the maximum number of uncorrelated patterns, P, which can be stored without error increases as P = 2N. If one defines the storage capacity, α, as the number of stored patterns per independent coupling,

α = P / C_N ,   (1.1)

where C_N is the number of independent couplings per neuron (C_N = N for two-neuron interactions), then the maximum storage capacity for networks having two-neuron interactions is α = 2. Hence, the important question to be asked about networks with many-neuron interactions is whether this storage capacity is increased, decreased or remains the same. In other words, do many-neuron connections store information more or less efficiently than two-neuron connections? The results derived here will show that the maximum obtainable value of α is α = 2 for all values of λ ≥ 1.
In addition to their static properties, neural networks are also defined by their dynamical properties. The most commonly studied dynamics, for two-neuron interactions, is defined at zero temperature (i.e. in the absence of noise) via the equation:

S_i(t+1) = sgn( Σ_j T_ij S_j(t) ).   (1.2)

For strongly connected networks with only two-body interactions, the only dynamical property known analytically is the first step in this time evolution [6]. However, many other properties, such as the size of the basins of attraction, are known from computer simulations [7-9, 15]. A natural generalization of the dynamics to the multi-body case is defined by:

S_i(t+1) = sgn( Σ_{j_1, ..., j_λ} T_{i j_1 ... j_λ} S_{j_1}(t) ... S_{j_λ}(t) ).   (1.3)
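To make equation (1.3) concrete, a minimal sketch of one parallel update for λ = 2 (three-neuron interactions) is given below, assuming the couplings are held in a NumPy tensor that is symmetric in its last two indices and zero whenever two indices coincide; the names here are illustrative, not part of the original formulation.

```python
import numpy as np

def update(S, T):
    """One parallel sweep of Eq. (1.3) for lambda = 2:
    S_i(t+1) = sgn( sum_{j<k; j,k != i} T[i,j,k] S_j(t) S_k(t) ).
    S: length-N array of +/-1 spins; T: (N,N,N) coupling tensor, assumed
    symmetric in (j,k) and zero for repeated indices and for j or k = i."""
    # The unrestricted sum over (j,k) counts every unordered pair twice,
    # so the restricted sum over j < k is recovered by the factor 1/2.
    h = 0.5 * np.einsum('ijk,j,k->i', T, S, S)
    return np.where(h >= 0, 1, -1)
```

For general λ the local field becomes a degree-λ form in the spins, with the restricted sum running over j_1 < ... < j_λ.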
It turns out that multi-body dynamics are somewhat easier to study than two-body dynamics because of a decrease in the number of macroscopic variables needed to describe a given state of the network. Because of this decrease, an ansatz can be made regarding the dynamics which allows certain parameters of the time evolution to be calculated exactly. These parameters, as will be shown, are in excellent agreement with computer simulations.

The plan of the paper is then as follows. In the next section the many-neuron networks will be defined in more detail and the static properties will be calculated analytically via a generalization of the method first employed by Gardner [5]. Then the first time step will be calculated analytically and compared to previous results. A conjecture will be made at this point regarding the analytic form of the exact solution to the dynamical evolution. Next, after presenting a learning rule suitable for all values of λ, numerical results will be discussed which support the validity of the foregoing analytical arguments.
Static properties.

The properties of a neural network will depend, of course, upon the network architecture. For many-neuron interacting networks a fully connected architecture has some symmetries not present in the two-neuron case. This is illustrated schematically in figure 1 for the simple case of a three-neuron interaction.

Fig. 1. — A schematic diagram of a three-neuron interaction.

The coupling for such an interaction is symmetric with respect to interchanges of the indices j and k, i.e., physically there exists only one three-neuron « synapse » labelled i-j-k. For a general (λ + 1)-neuron interaction the synaptic matrix T_{i j_1 ... j_λ} will be invariant under interchange of any of the j_1 to j_λ indices. This should be reflected in the calculation of the local field by summing only over the independent elements of the synaptic matrix, i.e., the sum in equation (1.3) should be further restricted to run over j_1 < j_2 < ... < j_λ in addition to the normal restriction of running over j_1, j_2, ..., j_λ ≠ i.
With these extra symmetries, it is evident from equation (1.3) that the stable states of the system (denoted by ξ) will be defined as those states with local stability greater than zero:

Λ_i^μ = (ξ_i^μ / e_N) Σ' T_{i j_1 ... j_λ} ξ_{j_1}^μ ... ξ_{j_λ}^μ > 0   (2.1)

for every neuron i, where μ ∈ {1, ..., P}. Here e_N is defined as the square root of the number of independent elements in the sum (e_N² ≈ N^λ/λ! at full connectivity), and Σ' denotes the restricted sum over only the independent elements of the synaptic matrix as described above.
In order to fix the scale for the couplings, a suitable generalization of the two-neuron interaction case to the many-neuron interaction case is:

Σ' T²_{i j_1 ... j_λ} = e_N² .   (2.2)

For λ = 1 this is seen to reduce to the usual normalization [5] and, as is the case there, it can be shown that this is not a real restriction on the network; rather, it is a description of the networks which emerge when constructed from some of the known learning rules (see Sect. 3). Experience with λ = 1 networks has also shown the need to adjust the stability condition in equation (2.1) from Λ_i^μ > 0 to Λ_i^μ > κ (where κ is a positive constant which is taken for convenience to be independent of μ and i), so as to produce stable states with non-negligible basins of attraction.
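With these conventions the local stabilities of equation (2.1) can be evaluated directly. The sketch below does so for λ = 2, taking e_N as the square root of the number of independent couplings per neuron; the function name and array layout are illustrative assumptions.

```python
import numpy as np
from itertools import combinations

def stabilities(T, xi):
    """Local stabilities of Eq. (2.1) for lambda = 2:
    Lambda[mu,i] = (xi[mu,i]/e_N) * sum_{j<k; j,k != i} T[i,j,k] xi[mu,j] xi[mu,k].
    T: (N,N,N) couplings (independent entries at j < k); xi: (P,N) patterns."""
    P, N = xi.shape
    eN = np.sqrt((N - 1) * (N - 2) / 2.0)   # sqrt of number of independent couplings
    pairs = list(combinations(range(N), 2))
    Lam = np.empty((P, N))
    for mu in range(P):
        for i in range(N):
            Lam[mu, i] = xi[mu, i] / eN * sum(
                T[i, j, k] * xi[mu, j] * xi[mu, k]
                for (j, k) in pairs if j != i and k != i)
    return Lam
```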
Following Gardner [5], the storage capacity is then obtained from the fractional volume of coupling space compatible with the stability conditions, the maximum capacity being defined by the vanishing of this volume. The fractional volume is simply V_T = Π_i V_i, where V_i is the normalized volume of couplings T_{i j_1 ... j_λ}, subject to the normalization above, for which Λ_i^μ > κ for every μ. The typical fractional volume is then found by averaging V_T over all realizations of the stored patterns. Since V_T factorizes over the index i, the assumption of self-averaging with respect to i should be valid. Therefore, write ln V_T = Σ_i ln V_i and perform the quenched average on ln V_T, i.e., on each subsystem ln V_i. This average can be done via the replica method:

⟨ln V_i⟩ = lim_{n→0} ( ⟨V_i^n⟩ − 1 ) / n .
For the definition of V_i given above, the calculation of the quenched average reduces to evaluating ⟨V_i^n⟩ over the random patterns. Using the standard integral representation of the θ-function, the integrals over the x-variables are restricted to the range [κ, ∞) while the other integrals remain unrestricted. In order to calculate the average over the patterns, the exponential is rewritten so that the sum over the patterns acts only on terms in square brackets; when these brackets are expanded out, for λ = 1 the extra terms vanish identically. For λ > 1 they do not vanish, but since the patterns are random and uncorrelated, a priori one expects that the T_{i j_1 ... j_λ} will also be uncorrelated. Hence, with on the order of N^{λ(λ+1)/2} terms in the sum and e_N^{λ+1} ~ N^{λ(λ+1)/2}, these sums are of order N^{-λ(λ+1)/4}. For large N these terms are therefore negligible and, to lowest order in 1/N, the quenched average takes the same form as in the two-neuron case. If the overlap between coupling replicas, q_ab, is now introduced, then equation (2.11) becomes identical to that found by Gardner for the case of two-neuron interactions. Hence, the rest of the analysis proceeds identically to her calculation and the interested reader is referred to her original paper for the details. The main result, obtained at the replica symmetric point, which can be shown to be stable, is that the storage capacity at saturation, i.e., the maximum allowed α for a given κ, behaves as:

α_c(κ) = [ ∫_{-κ}^{∞} Dz (z + κ)² ]^{-1} ,   (2.13)

where Dz denotes the unit Gaussian measure.
When κ = 0 the patterns are minimally stable and the storage capacity has its maximum value of α = 2. This proves that networks having many-neuron interactions store information no more efficiently than networks having only two-neuron interactions. They do store much more information than two-neuron connections, with the number of storable patterns growing as N^λ/λ!; however, they require of the same order of magnitude more connections to reach this higher level of storage.
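Equation (2.13) is straightforward to evaluate numerically. The following short check, a sketch assuming SciPy quadrature, reproduces the limiting value α = 2 at κ = 0 and the corresponding pattern count α N^λ/λ!:

```python
import numpy as np
from math import factorial
from scipy.integrate import quad

def alpha_c(kappa):
    """Saturation capacity from Eq. (2.13):
    alpha_c(kappa) = 1 / int_{-kappa}^{inf} Dz (z + kappa)^2,
    with Dz the unit Gaussian measure; independent of lambda."""
    gauss = lambda z: np.exp(-z**2 / 2.0) / np.sqrt(2.0 * np.pi)
    val, _ = quad(lambda z: gauss(z) * (z + kappa)**2, -kappa, np.inf)
    return 1.0 / val

print(alpha_c(0.0))                            # -> 2.0, the maximum capacity
N, lam = 64, 2
print(alpha_c(0.0) * N**lam / factorial(lam))  # storable patterns ~ alpha N^lam / lam!
```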
Dynamic properties.

The dynamical behaviour of these models is at least as important as the storage properties for application purposes. In order to function as an associative memory, the stable states must be able to attract almost all states within a « reasonable » Hamming distance, i.e., the distance must not be so large that states having spurious correlations with the stored state are attracted, but it also must not be so small that only states which are almost identical to the stored state are attracted. The determination of this distance is a very difficult problem analytically. For strongly connected networks with two-neuron interactions, only the first step of the time evolution is known exactly [6], although the attraction basins have been studied by computer simulations [7-9].
To gain a simple understanding of the dynamics for λ > 1, first compute the probability, P(m(1)|m(0)), that a state of the network at time t = 1 has overlap m(1) (where m(t) ≡ (1/N) Σ_i ξ_i^μ S_i(t)) with one of the stored states, given that it had overlap m(0) with the same stored state at time t = 0 and random overlap with all of the other stored states. The evaluation of this probability proceeds along the lines of Kepler and Abbott's [6] work for the two-neuron interaction network and, using the same argument as in section 2 for computing the sum over configurations, the result (equation (3.2), with Λ as defined in equation (2.1)) is quite simple. If the distribution of the Λ_i's is known, then (3.2) gives an explicit equation for m(1). However, the only restriction on the Λ_i's is Λ_i > κ ∀i, which is independent of λ; hence, they should have the same distribution as for λ = 1 [10]. (It is very easy to check this via a replica calculation similar to that of Sect. 2.) Thus, with probability one, m(1) is given by equation (3.3).
For λ = 1, the calculation of m(2) is very difficult because it depends not only upon m(1) but also upon the parameter p(1) := Σ_μ [m_μ(1)]², which measures the projection of the state of the network onto the subspace spanned by the stored states. (At the first time step this parameter does not enter explicitly, since the initial state of the network has been assumed to have macroscopic overlap with only one stored pattern and random overlap with all of the other stored patterns.) The calculation of the time evolution of p(t) appears to be an intractable problem [11].
When λ > 1, however, the analogous variable, p_λ(t) ∝ Σ_μ [m_μ(t)]², does not play such a fundamental role, because the complete configuration space is always spanned by the set of stored patterns (provided of course that α ≠ 0, and in the limit N → ∞), so that p_λ is independent of time. The effect of this time independence can be seen by considering the basins of attraction around the stored states.

A convenient measure of the « size » of the basins of attraction around a given stored state can be computed from equation (3.3). This measure is often called the radius of attraction and is denoted by R [15], with R being defined as R = 1 - m_u, where m_u is the unstable fixed point of (3.3) at a given value of κ (and, as a consequence of Eq. (2.13), an equivalent value of α). R is plotted in figure 2 as a function of α for several values of λ. If m(0) > m_u, then the network will iterate toward the stable fixed point, m = 1. If, on the other hand, m(0) < m_u, it will flow toward m = 0. For λ = 1, however, it has been noticed that the system may iterate in the first few time steps toward the fixed point m = 1 and then at later times reverse direction and flow toward m = 0 [12]. Once the system starts flowing toward m = 0, however, it has not been observed to reverse itself. (For the related case of feed-forward networks one can solve the dynamics exactly and prove rigorously that such an effect does indeed exist [13].)
Thus it can be conjectured, and is supported by the data (see next Sect.), that figure 2 gives the upper bound for the radius of attraction when λ = 1. The reversal of the network's direction of flow has previously been related to the time evolution of p(t) [11]. Since, for λ > 1, p_λ should be independent of time, it can be conjectured that no such reversal takes place and that the fixed points of equation (3.3) determine the radius of attraction exactly. In the next section, numerical simulations will show this in fact to be the case.

Fig. 2. — R = 1 - m_c, where m_c are the fixed points of equation (3.3), vs. α for several values of λ.
vs. a for several values of À.There is another
qualitative
difference between the flowdiagram
for À = 1 andÀ >
1,
namely
the nature of the transition to R = 1. For À =1,
equation
(3.3)
predicts
asecond order transition to R = 1 at the value of rc defined
by :
or a = 0.419.
While,
for À >1,
equation
(3.3)
predicts
a first orderjump
to R = 1 at a = 0. The size of thisjump
increases as À increases and goes to 1 as A - oo. In otherwords,
as A - oo, R --> 0 for all a > 0 and R = 1 at a = 0.
For the dynamical properties, then, there are quantitative as well as qualitative differences between networks with only two-neuron interactions and networks with many-neuron interactions.

Numerical simulations.
A learning algorithm to generate networks satisfying the constraints of section two for many-neuron interacting networks is easily constructed from a generalization of the learning rules developed for two-neuron interacting networks [5, 8, 14]. Although any of the proposed learning rules can be generalized, the generalization here is based upon the learning rule developed by the Edinburgh group. Starting with the first pattern and moving sequentially through all the patterns (and in a sequential or parallel manner through the neurons), define an « error mask », ε_i^μ, equal to 1 whenever the stability condition Λ_i^μ > κ is violated and 0 otherwise, then update the coupling matrix as:

T_{i j_1 ... j_λ} → T_{i j_1 ... j_λ} + (ε_i^μ / e_N) ξ_i^μ ξ_{j_1}^μ ... ξ_{j_λ}^μ .

After all the patterns have been checked in this manner, the procedure is repeated until the stability condition is satisfied for every pattern on every neuron. This learning algorithm can be shown to converge to a solution of the stability conditions provided such a solution exists. The proof makes no essential departures from that given in reference [5] for the case of two-neuron interactions and so will not be repeated here.
For finite size networks, one must compensate for the deviation of α from that given by equation (2.13). This is done by looking for the value of κ_N at which the learning time tends to infinity. κ_N will be somewhat less than the κ predicted by equation (2.13), but the finite size system will have properties which are very close to those of the infinite volume system [9].
As a test of the foregoing analytical calculations, m(1) was measured as a function of m(0) for α = 0.20 and α = 0.40 at λ = 2 and a network size of N = 64. As seen in figures 3a and 3b, the results agree quite nicely with the analytic predictions. As m(0) becomes small, the interference from the other stored states increases (a priori one expects interference to set in around m(0) ~ O(1/√N)) and the agreement with the theoretical calculations is predictably worse.
Now, in the previous section it was conjectured that the radius of attraction could be calculated from equation (3.3) for λ > 1. To show that this is indeed the case, a definition of R is needed which takes the finite size of the network into account. A working definition of the radius of attraction R is 1 - m_c, where m_c is the value of the overlap such that, as N → ∞, almost all of the states having m > m_c will evolve toward m = 1. For finite size systems, R is reduced due to the O(1/√N) overlap between the stored states. Hence, a better definition of R is [15]:

R = ⟨ (1 - m_c) / (1 - m_av) ⟩ ,

where m_av is the average overlap of the given state with all of the other stored states and the bracket indicates an average over all stored states and all starting configurations. (As N → ∞, the m_c defined here tends to the m_u defined in the previous section and m_av tends toward zero.)
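Numerically, m_c can be bracketed by preparing states at a prescribed initial overlap and iterating the dynamics until they settle; the sketch below does this for a single stored pattern, reusing an update function such as the one sketched in the introduction, with all parameter choices illustrative.

```python
import numpy as np

def basin_fraction(T, xi, m0, update, n_trials=20, t_max=50, rng=None):
    """Fraction of states at initial overlap m0 with the stored pattern xi
    that flow back to it: flip a random fraction (1 - m0)/2 of the spins
    and iterate.  Scanning m0 brackets m_c, from which
    R = <(1 - m_c)/(1 - m_av)> can be estimated."""
    rng = rng or np.random.default_rng()
    N = len(xi)
    n_flip = int(round(0.5 * (1.0 - m0) * N))   # overlap after flips is 1 - 2f
    hits = 0
    for _ in range(n_trials):
        S = xi.copy()
        flip = rng.choice(N, size=n_flip, replace=False)
        S[flip] = -S[flip]
        for _ in range(t_max):
            S = update(S, T)
        if np.mean(S * xi) > 0.95:   # converged essentially onto the pattern
            hits += 1
    return hits / n_trials
```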
The results obtained when calculating R using this prescription are plotted in figure 4. (The results for λ = 1 [9] are also plotted for comparison.) As can be seen, the fixed points of equation (3.3) agree very well with the numerical simulations for the radius of attraction when λ = 2. (Note also that for λ = 1, equation (3.3) becomes a very good predictor of R for large α, where p(t) is much more restricted.) Although it would certainly be desirable to have results for larger values of λ, such simulations become increasingly costly for fully connected networks near saturation, with the memory growing like O(N^λ/λ!). Hence, these simulations must await the next generation of parallel computers.

Fig. 3. — a) m(1) vs. m(0) at α = 0.20. The solid line is a plot of equation (3.3). b) m(1) vs. m(0) at α = 0.40. The solid line is a plot of equation (3.3). All the data was taken on a network with N = 64 neurons.
Discussion.

It has been shown that networks with many-neuron interactions do not store information more efficiently than networks with only two-neuron interactions, which points to the global nature of the limiting value of α = 2. Although the storage capacity of the original Hopfield model can be improved upon by deleting connections [16], or by using Ising (± 1) variables for the synapses [17], or by using restricted range interactions [18], none of these approaches leads to increasing α above 2 as long as random uncorrelated patterns are used. It would be interesting to know if α is limited simply by the two-state nature of the spin variables or by some more fundamental mechanism. Of course, one can consider correlated patterns, in which case the storage capacity defined above goes to infinity as all of the patterns become perfectly correlated; however, if one then looks at the quantity information/synaptic coupling, this decreases as the correlation between the patterns increases [5], implying once again some fundamental limit to the information storage capacity of neural networks.

Fig. 4. — The radius of attraction for λ = 1 (○) and λ = 2 (□). For λ = 1 the network size was N = 100 while for λ = 2 the network size was N = 64. Also shown are the theoretical predictions.
On the other hand, many-neuron interacting networks do have a different dynamical behaviour, as indicated by equation (3.3). This solution of the dynamics indicates that many-neuron interacting neural networks are far better at pattern discrimination than two-neuron interacting networks, i.e., when operating at a given radius of attraction, many-neuron interacting networks are able to store more patterns than the simple two-neuron interacting networks. For example, a reasonable radius of attraction for the purpose of associative memory is R = 0.20 (10 % wrong spins). Operating at this value of R, networks with two-neuron interactions can store approximately 0.52 N patterns, while a network with three-neuron interactions can store approximately 0.35 (N²/2!) patterns and a network with four-neuron interactions can store 0.18 (N³/3!) patterns. Hence, the real power of networks with many-neuron interactions will become apparent in applications where it is necessary to obtain a predetermined level of discrimination between large numbers of patterns. (For real world applications this will be important only if the cost of making many-neuron interactions is not significantly more than that for making two-neuron interactions. Such is certainly the case in fast ...) Near saturation, however, the radius of attraction shrinks toward zero, thus making the network useless for most associative memory applications independent of the value of λ. Instead of working at large α, then, these results imply that it would be better to go to a larger value of λ and then work at a smaller value of α.

Finally, it should be added that the calculations presented above can be extended to include networks of mixed type, i.e., networks having, for example, two-body and three-body interactions, etc., and to networks having less than full connectivity. In such cases, the efficiency of information storage is unchanged from that above, but the dynamics undergoes predictable changes depending upon the exact architecture one is considering.
Acknowledgments.

I would like to thank the theory group at the University of Bonn for their hospitality throughout the earlier phases of this work and the HLRZ for its hospitality while the computer calculations were in progress.

References
[1] PERETTO P. and NIEZ J. J., Biol. Cybern. 54 (1986) 53.
[2] PERSONNAZ L., GUYON I. and DREYFUS G., Europhys. Lett. 8 (1987) 863; GARDNER E., J. Phys. A 20 (1987) 3453; ABBOTT L. F. and YAIR ARIAN, Phys. Rev. A 36 (1987) 5091.
[3] BALDI P. and VENKATESH S. S., Phys. Rev. Lett. 58 (1987) 913.
[4] VENKATESH S. S., Neural Networks for Computing, Ed. J. S. Denker (American Inst. of Phys., New York) 1986.
[5] GARDNER E., Europhys. Lett. 4 (1987) 481; J. Phys. A 21 (1988) 257; GARDNER E. and DERRIDA B., J. Phys. A 21 (1988) 271.
[6] KEPLER T. B. and ABBOTT L. F., J. Phys. France 49 (1988) 1657; KRAUTH W., MÉZARD M. and NADAL J.-P., Complex Syst. 2 (1988) 387.
[7] WEISBUCH G., J. Phys. Lett. France 46 (1985) L623; KRAUTH W., NADAL J.-P. and MÉZARD M., J. Phys. A 21 (1988) 2995.
[8] FORREST B. M., J. Phys. A 21 (1988) 245.
[9] KRÄTZSCHMAR J. and KOHRING G. A. (to be published).
[10] KINZEL W. and OPPER M., Physics of Neural Networks, Eds. J. L. van Hemmen, E. Domany and K. Schulten (Springer Verlag; to be published).
[11] KOHRING G. A., Europhys. Lett. 8 (1989) 697.
[12] GARDNER E., DERRIDA B. and MOTTISHAW P., J. Phys. France 48 (1987) 741.
[13] MEIR R. and DOMANY E., Phys. Rev. Lett. 59 (1987) 359; Phys. Rev. 37 (1988) 608.
[14] KRAUTH W. and MÉZARD M., J. Phys. A 20 (1987) L745; ABBOTT L. F. and KEPLER T. B., J. Phys. A 22 (1989) L711.
[15] KANTER I. and SOMPOLINSKY H., Phys. Rev. 35 (1987) 380.
[16] CANNING A. and GARDNER E., J. Phys. A 21 (1988) 3275; DERRIDA B., GARDNER E. and ZIPPELIUS A., Europhys. Lett. 4 (1987) 167.
[17] AMALDI E. and NICOLIS S., J. Phys. France 50 (1989) 2333; KRAUTH W. and OPPER M., J. Phys. (to be published); KRAUTH W. and MÉZARD M., Laboratoire de