Generalization in a Hopfield network

(1)

HAL Id: jpa-00212540

https://hal.archives-ouvertes.fr/jpa-00212540

Submitted on 1 Jan 1990

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

Generalization in a Hopfield network

J.F. Fontanari

To cite this version:

(2)

2421

Generalization in

a

Hopfield

network

J. F. Fontanari

Instituto de Fisica e

_Quimica

de São Carlos, Universidade de São Paulo, Caixa Postal 369, 13560 São Carlos SP, Brasil

(Received

23

_April

_1990,

_accepted

in

final

form

12

_{July 1990)}

Abstract. 2014 The

performance

of a

_Hopfield

network in

_learning

an extensive number of concepts

having

access

_only

to a finite

_supply

of

_typical

data which

_exemplify

the _conceptsis studied. The

minimal number of

_examples

which must be

taught

to the network in order it starts to create

representations

for the _conceptsis calculated

_{analitically.}

It is shown that the mixture states

_{play a}

crucial role in the creation of these

representations.

J.

_Phys.

France 51

₍₁₉₉₀₎

2421-2430 ler NOVEMBRE 1990,

Classification

Physics

Abstracts 87.10 - 64.60C

1. Introduction

Learning

and

_{generalization}

in neural networks has been the

_subject

of intensive research in the

_past

_{few years}

_[1-7].

The most recent studies have been carried out in the context of

supervised learning

in

_single-layer

feedforward neural networks

_[5-7],

_following

the theoreti-cal framework

_presented

in Gardner’s seminal paper

[8]. Comparatively,

little attention has been devoted to the

_ability

of

_simple

feedback neural

_networks,

_e.g.

_Hopfield’s

model

_[9],

to

perform computational

tasks

_beyond

the

_simple

_storage

of a set of

_activity

_patterns.

The

_learning

_{process in}

_Hopfield’s

model consists of

_setting

the value of the

_coupling

.l¡j

between neurons i

_{and j}

for all

_pairs

of neurons such that a

_given

set of

_activity

_patterns

is memorized

_by

the network. In this model the states of the neurons are

_{represented by Ising}

spins, Si

+ 1

_{(firing) or Si= - 1 (rest). Storage}

of an

activity

_pattern

{ç r

= ±

_1,

i =

1, ...,

N }

into the _memoryof the network is achieved

by modifying

the

couplings

according

to the

_generalized

Hebb rule

After

_{being exposed}

_{to p}

_activity

_patterns

the

_couplings

are set to

where we have

_{assumed Jii}

= 0

initially (tabula rasa).

(3)

Once the

_couplings

are

_fixed,

the

dynamical

retrieval _processis

_{governed by}

the Hamiltonian

_[9]

The network can

_potentially

retrieve a

_given

_activity

_pattern

if it is a minimum or if it is very near a minimum of H. The natural

parameters

for

_measuring

the

_performance

of the network in

_retrieving

the stored

_patterns

are the

_overlaps

where the state

_{{Si, i = 1, ..., N } is a}

minimum of H. The

_properties

of these minima have been

_fully

studied

_by

Amit et al.

[10, 11 ]

using

statistical mechanics tools

_developed

in the

analysis

of _{infinite range}

_{spin-glasses [12].}

It has been shown that besides the retrieval states

which have

_{macroscopic overlaps, 0(1),}

with

_only

one stored

_pattern

there exist mixture

states which have

_{macroscopic overlaps}

with several stored

_patterns

_[10].

Since the interest in

Hopfield’s

model is

_mainly

due to its

_prospective

use as an associative memory, the attention has been focused on the retrieval states, the mixtures states

_{being regarded}

as a nuisance

which can be eliminated either

by adding

external noise to the

system

[10]

or

_{by modifying}

the

learning

rule

[13].

On the other

hand,

these

_spurious

states have been seen as

_proof

of the

ability

of the network to create new

_{representations}

to handle the information contained in the stored

_patterns

_[14].

In this paper we show that the mixture states

_play

a crucial role when the task

_posed

to the network is to extract

_meaningful

information from the

_activity

_patterns

it is

_exposed

to

_during

the

_learning

_stage.

We consider the

_{following problem.}

Let us _{suppose that}

_during

the

_learning

_stage

the network is

_exposed

to s

_examples

of a

given

_concept.

The

_examples

are embedded in the memory of the network

by

the Hebbian

_learning

process,

equation (1.2).

The

question

we address is whether the network can create a

representation

for the

_concept

to which it had been

_{exposed only through examples.}

We say that the network has a

_{representation}

for a

concept

if the

_concept

is a minimum or if it is _verynear a minimum of H. More

_{specifically,}

we

consider p

= a N

_concepts

represented by

the

activity

patterns {T}.

=

1,

..., p. For each

concept,

a finite number of

_examples

_{{ç jJl}, V}

=

1, ..., s

is

generated.

Their

_components

_are

statistically independent

random variables drawn from the distribution

with O:s; b :s; 1. The

_examples

can be

thought

of as

_noisy

versions of the

_concepts

_they

exemplify.

The

_parameter

b measures the

difficulty

of the task

_posed

to the network : Small

values of b result in low correlations between

_examples

and

_concepts,

_making

the

_grasping

of

the

_concepts

more difficult. For

_simplicity,

the

_components

of the

_concepts

are

_randomly

chosen as ± 1 with

_{equal probability.}

As an alternative

_viewpoint,

one may consider the

concepts

as

defining

_pclasses each one

containing s

individuals

_(examples).

Thus,

the task of the network would be to _groupthe

examples

in their

_respective

_classes,

i.e. the network should

_categorize

the

_examples.

However,

in this paper we follow the

_point

of view

_expressed

in Denker et al.

_[1]

_]

that

categorization

is a

_particular

case of rule extraction

_{(generalization)}

in which the rule may be

(4)

2423

Finished the

_learning

_stage,

the

_couplings

are set to

The

_quantities

we focus on are the

_{generalization}

errors eJL defined

_by

where

m e,

the

_{generalization overlaps,}

are the

_overlaps

between the

concepts

and a certain minimum of H which will be

_specified

later.

The aim of this paper is to calculate the

_dependence

of the

_{generalization}

error

(el-")

on the number of

_{examples taught}

to the network

_(s)

for a

given

task characterized

by

the

parameter

b. To achieve this we

study

the

thermodynamics

of

Hopfield’s

Hamiltonian,

equation (1.3),

with the

_couplings

set as in

_{equation (1.6).}

Our

_analysis

is restricted to the

noiseless

_{(zero temperature)}

limit. A first

_attempt

to tackle this

_problem

has been

_published

recently [15].

This paper

presents

a

_simpler

and more

_{general approach.}

The paper is

organized

as follows. In section 2 we

_study

the

_{thermodynamics}

of the model in the limit a =

p

/N -

0. The

simplicity

of this

limit,

which

dispenses

the use of the

replica

trick in the

_computation

of the

_averaged

_{free energy}

_density,

allows us to find

_analytical

expressions relating

ee with s and b. The

_analysis

of the nonzero a limit is

_performed

within the

_{replica symmetric}

framework in section 3. We summarize our results and

_present

some

concluding

remarks in section 4. 2. Finite number of

_concepts.

The Hamiltonian of

_Hopfield’s

model with the

_{couplings given by equation (1.6)}

can be written as

where we have omitted a constant term and included an additional term in order to

_compute

M,". In

_fact,

_writing

the

_averaged

_{free energy}

_density

as

where Z =

Trs e- pH

_onehas

The

_{parameter 8 =-}

T-1

in

the

expression

for the

_partition

function Z is a measure of the amount of noise

_acting

on the

_system.

The noiseless limit is obtained

_by

_taking

/3

- oo. The

_{notation ... >}

stands for the _averagesover the

_examples

and over the

concept

taken in this order. The calculation

_{of f is straightforward}

in the limit a = 0

[10]

_{so we}

_present

(5)

The order

parameters,

with

_{(... ) T standing}

for the thermal average, are

_{given by}

the

_{saddle-point equations}

with hu = 0.

Using

equation

(2.3)

we can write the

generalization overlaps

me in terms of the retrieval

_overlaps

m p, JI

We restrict our

analysis

to a

particular

class of solutions for

m J.L 11,

i.e. to a

_particular

class of minima

_{of f (or}

H for T =

0).

Since there _{are no}

macroscopic

correlations between different

concepts

we consider solutions of the form

_{m J.L 11 = m 1 11 8 J.L 1.}

Moreover,

we choose

m1v

to be of the form

The motivation for

choosing

this solution is that it

_gives

a bias to the network to behave as an associative memory : each

example

is

singled

out

_(the

solution is s

_degenerate

since we can select any of the

examples

to be

_{ml 1)}

and treated as an

_{independent piece}

of information to be stored. If any other behaviour emerges, it will be a

_spontaneous

_property

of the network and not an artifice due to the

_particular

choice

_{expressed by equation (2.8).}

At T =

0,

m11

_{and ms -}1

satisfy

the

equations

where Xs -

_{1 = ¿ ç 1 JI.}

For s > 10 the binomial distribution of Xs - 1 can be

replaced by

a

v > i

Gaussian with mean

_{(s - 1 )}

bç

1 and

variance

4)

=

(s - 1 ) ( 1 -

b 2).

Performing

the averages

over ç Il,

Xs - 1 i

and ç

1 in

this

order,

we find

where

(6)

2425

Next we discuss the solutions of

_equations

_(2.10).

For small s the network retrieves the

examples

almost

_perfectly,

i.e.

m11 ~ 1

and

_{ms - 1 = b2.}

Hence the

_{generalization}

error

E = E ,

equation (1.7),

is

Strictly,

the retrieval is

_perfect

for

_{(s -}

_{1 ) -- b-2}

as can be

_easily

seen from

equations (2.9)

since 1 xs - 1 1 -- s -

1. This result is not recovered

_{by equations (2.10)}

because the

_replacement

of the discrete distribution of _{xs _}₁

_by

a Gaussian

_makes

_{1 Xs - 11 [}

unbounded. As s

_increases,

m 11

decreases

_slightly

until s reaches a critical value Sc, above which

m11

_jumps

to a much smaller

value,

almost

_equating

_{with ms - 1. This}behaviour is reflected in the

_{generalization}

error which we show in

_figure

1 as a function of s for several values of b. For s - Sc the network

just

memorizes the

patterns

it is

_exposed

to. This behaviour is referred to as the retrieval

_{phase (R).}

_{For S:> sc the network}no

_longer

treats the

_examples

as

independent pieces

of information and starts to mix

_them,

_creating

the

_{representation}

for the

concept.

This is the

_{generalization phase (G).}

The

_{phase diagram}

in the

_{(s, b ) plane}

indicating

the

_regions

where each

_regime

occurs is shown in

figure

2.

Although

mIl = m s -

i is solution of

equations (2.10) only

in the limit s -+0 _oo, in the

generalization regime

one has

m ms ms

where ms is the

_symmetric

solution of

equation (2.6),

In

_{fact, m 11}

never

_equals

_{ms _}1 because when

averaging

over the

examples

we have

explicitly

ruled out this

_{possibility by retaining}

the discrete nature

_{of e}

11 while

_making

the Gaussian

approximation

for the distribution of _{Xs - 1.}

_{Nevertheless,}

since the differences between

m 11,

ms -

and

m, are

_negligible

in the G

_phase

we can

_{approximately}

characterize this

_regime

by

the

_symmetric

_{solution ms.}

_Following

similar

_steps

to the ones

_leading

to

_{equations (2.10)}

one finds

Fig.

1. -

The

generalization

error as function of the number of

_examples

for b = 0.18, 0.20, 0.25, 0.40

(7)

where

Thus,

the

_{generalization}

error in the G

_phase

can be

_approximate

to

The values of 8

_computed

_through

this

_equation

are

_{indistinguishable}

from the exact

generalization

error,

computed through equations (1.7)

and

(2.13),

in the scale of

figure

1.

3. Infinité number of

_concepts.

In this section we consider the case where the network has to create

_{representations}

for an extensive number of

_{concepts, a}

=

p /N +

0,

being exposed

to a finite number of

examples s

of each

_concept.

In this case we have to take into account the fact that the combined

_overlap

of a

_concept

with all the other

_concepts

is of

0 (-,/à ).

To handle this situation we follow Amit et al.

_{[11] and}

assume that

_only

the

_overlaps

m

JI

condense,

i.e. are of

₀₍₁₎

while the others are of

O(N- 1/2).

The

averaged

free _energy

density

is calculated

through

the

replica

trick

where Z’ is the

_partition

function

_{replicated n}

times,

Averaging over rJl(J.L

>

1) explicitly

and

using

the

self-averaging

_property

of ei ’

yields

where we have omitted

_{multiplicative}

factors which vanish in the

_{thermodynamic}

limit and

withBJlÀ = b2 +

(1 - b2)

8J1À

and

_Qpu

=

qpu

+

_{(1 -}

_qpu)

_8pu.

_{The integrals in equation (3.3)}

(8)

2427

resulting

in the

_{following expression}

for the

averaged

free energy

density

where

with

_{C = 6 (1 - q )}

and Dz =

_{dz /}

J2;

e - z 2/2

Thé order

_parameters

are

_{given by}

the

saddle-point equations

which,

in the limits

_f3

-+ oo and

hl...

0,

are written as

The

_equations

for the standard

_Hopfield

model

_[11]

are recovered in the cases s = 1 and

b = 1 with _an

appropriate rescaling

of C and r. For b = 0 the network is

effectively storing

sp uncorrelated

_patterns

and

_{Hopfield’s equations}

are obtained

_{by rescaling a ’ -}

sa. Once

m

"

and C are known we can

_compute

m

through

the

_equation

Next we discuss the solutions of the

_{saddle-point equations (3.10)-(3.12).}

In addition to the solutions considered in the a = 0

limit,

there exist a

_{spin-glass solution m 1 JI =}

0

‘d v which is stabilized

_by

the Gaussian noise due to the

_overlaps

of

_{O(N- 1/2)}

between

e

"

and el’ ’,

» > 1.

However,

adding

noise to the

_system

has a

_{destabilizing}

effect for the

retrieval

_{phase [ 13,}

_16,

_{17], reducing}

its domain to a

_region

much smaller than the one shown in

_figure

2. Therefore the

_phase

_diagram

in the

_{(s, b ) plane}

will be dominated

_by

the

generalization phase

characterized

_by

the

symmetric

solution,

equation (2.15),

and the

spin-glass phase.

In the

_following

we will focus

only

on the

_interplay

between these two

_phases.

For the

_symmetric

solutions

_{equations (3.10)}

and

_(3.11)

reduce to

(9)

Fig.

2.

_Fig.

3.

Fig.

2. - Phase

_{diagram showing}

the

generalization

phase (G)

and the retrieval

phase (R)

for

a = 0. The transition at s = _SCis discontinuous. The retrieval of the

examples

is

perfect

below the

dashed curve, b = 1

/

Bls - 1.

Fig.

3. - The

generalization

error as function of the number

of examples

for

_{a/a 0}

= 0.05, 0.2, 0.4, 0.5

and b = 0.4.

respectively,

where Xs

_{= L çl}

JI.

For s > 1 0 one can

_replace

_Xs

_by

a Gaussian variable of mean

v

sbe

1 and

variance

_{s( 1 -}

b

_2).

_Performing

the averages as in section 2

yields

where

₄₎

=

a r + m.; s (1 - b 2).

Equations (3.12), (3.16)

and

(3.17)

must be solved

numeri-cally

in order to

_compute

the

_{generalization overlap,}

and,

consequently,

the

_{generalization}

error e =

( 1- m 1)/2.

In the limit s - oo the noise in the

_examples

is

_averaged

out, i.e. Xs -

sbe

1 and

the standard

Hopfield

model with the

_concepts

_replacing

the

_examples

in

_{equation (1.6)}

is recovered. This

result can be

_easily

obtained

_{by rescaling}

C’ =

sb 2 C

and r’ =

r / S2 b 4

which

implies

that

ms =

bm

1 with m

satisfying

the standard

Hopfield’s equations.

Next _wediscuss the behaviour

of the

_{generalization}

error as a function

_{of s, b}

and a. In

_figure

3 we show e as a function of s

for b = 0.4 and several values of

a / a o,

where

ao = 0.138

gives

the

_storage

capacity

of the

standard

_Hopfield

model. As in the a = 0 _case,there is _aminimal number of

examples

which

must be

_taught

to the network in order it starts to

_generalize.

The transition between the SG

phase,

where E = 0.5 since the

spin-glass

states have no

_macroscopic

correlations with the

concepts,

and the G

_phase

is

_always

discontinuous. The values of

_{a lao}

for which the transition occurs are shown in

_figure

4 for several values of b. As b --+ 1 or s --> _00,

(10)

2429

Fig.

4.

_{Fig. 5.}

Fig.

4. - The critical values of

_{a / a 0}

below which the network starts to

_generalize

as function of the

number of

_examples

for b = 0.2, 0.4, 0.6, 0.8.

Fig.

5. - The

generalization

error at

_criticality

as function of the number of

examples.

generalization

error at the transition

_(Be)

is shown in

_figure

5. As b --+ 1 or s --+ _oo,

ec tends to

_0.0165,

the critical retrieval error for the standard

_Hopfield

model

[11].

4. Conclusion.

In this _paperwe have

s*hown

that a

_simple

feedback neural network

_using

a local Hebbian

learning

rule is able to learn a set of _concepts

_having

access

_only

to a finite

supply

of

typical

data which

_exemplify

the

_concepts.

The network

_acomplishes

that

_{by creating representations,}

i.e. minima of the energy function

governing

the retrieval process, which

capture

the

underlying

statistical structure of the

_{examples, allowing}

the extraction of

_meaningful

information from the data

_supply.

For the

_{specific problem}

we have considered in this paper, the

_symmetric

mixture states

_provide

for the

_{representations}

which allow the network to _grasp

the

_concepts.

Since the research on

_Hopfield’s

model has focused

_mainly

on the retrieval

states, very little is known about the relation between the statistical structure of the

_patterns

presented

to the network

_during

the

_learning

_stage

and the structure of the

_{representations}

created

_by

the network

_[13,

_17].

This seems to be a crucial issue if one intends to lead the

study

of

_Hopfield’s

model

_beyond

its

_{memorizing capabilities.}

Next we summarize our main results. For finite _pwe have found that there is a

regime

where the network

_simply

memorizes the

_{examples ignoring}

their statistical structure

(retrieval phase).

However,

as the number of

_examples

increases

_passing

a certain critical

number Sc

the behaviour of the network

_undergoes

an

_abrupt

_{change entering}

a new

regime

where the

representations

of the

concepts

are created

_{(generalization phase).}

For p = a N

(a >0)

the retrieval

phase

is confined to a _{very small}

_region

of the

(s, b )

plane.

However,

a

_{spin-glass regime,}

where the network

_ignores

both the

_examples

and the

concepts,

appears to

_compete

with the

_{generalization regime.}

The

_interplay

between these

two

_regimes

is

qualitatively

similar to the one discussed in the finite _plimit with the

_spin-glass

replacing

the retrieval

_regime.

It is

_interesting

to _compareour results with the ones

_presented

in the literature for

(11)

the

present

work is the existence of a critical number of

_{examples (sc) beyond}

which the

network

_generalizes

well

(Figs.

1 and

_3).

Whether a similar behaviour occurs in the case of

feedforward networks is an unsettled issue. On the one

hand,

simulations of

_multilayer

networks for the

_contiguity

_problem

and some

_general

theoretical

_arguments

point

out for the

existence of a critical size of the

training

set above which the

_{generalization}

error

(E)

falls

_{of exponentially}

fast

_{[1, 3].}

On the other

hand,

an

_{analytical study}

of the

_performance

of a

_single-layer

_perceptron

in

classifying examples according

to their

_Hamming

distance from a set of

_prototypes

indicates that such a critical number does not exist

_[6].

_However,

since the

behaviour of E seems to

_{depend strongly}

on the architecture of the network considered

_[3]

there may be no

simple

answer for this issue. A similar

controversy

could _verywell arise in the

context of feedback neural networks if we use other

_leaming

rules than the one considered in

this _paper.

Finally,

we should mention the relevance of our results to the

_problem

of

_{categorization}

in neural networks. As

_pointed

out in the

Introduction,

the

_{generalization regime}

may be

interpreted

as a

categorization

of the

examples

into the classes defined

by

the

concepts.

We have found that

_{categorization}

emerges

spontaneously

when a critical number of

examples

is

presented

to the network

_during

the

_learning

_stage.

This result corroborates the

_viewpoint

that

_{categorization}

is related with the limitation of an associative _memory.This

point

was

beautifully expressed by

Virasoro : « we

_categorize

not because we want to but because we cannot do otherwise »

_[18].

Acknowledgments

It is a

_pleasure

to thank

Ronny

Meir for many

helpful

discussions.

References

[1]

DENKER J., SCHWARTZ D., WITTNER B., SOLLA S., HOWARD R., JACKEL L. and HOPFIELD J. J.,

Complex Systems

1

₍₁₉₈₇₎

877.

[2]

PATARNELLO S. and CARNEVALI P.,

Europhys.

Lett. 4

₍₁₉₈₇₎

503 ;

CARNEVALI P. and PATARNELLO

_{S., Europhys.}

Lett. 4

₍₁₉₈₇₎

1199.

[3]

TISHBY N., LEVIN E. and SOLLA S. _A.,

_{Proceedings of}

the International Joint

_Conference

on Neural

Networks

(1989).

[4]

VALLET _F.,

_Europhys.

Lett. 8

₍₁₉₈₉₎

747.

[5]

DEL GIUDICE P., FRANZ S. and VIRASORO M. A., J.

_Phys.

France 50

₍₁₉₈₉₎

121.

[6]

HANSEL D. and SOMPOLINSKY H.,

Europhys.

Lett. 11

₍₁₉₉₀₎

687.

[7]

GYÖRGYI G. and TISHBY N.,

Proceedings of

the STATPHYS-17

_Workshop

on Neural Networks

and

Spin

Glasses, Eds. W. K. Theumann and R. Köberle

_(World

_Scientific,

_Singapore)

1990.

[8]

GARDNER _E.,J.

_Phys.

A 21

₍₁₉₈₈₎

257.

[9]

HOPFIELD J. J., Proc. Natl. Acad. Sci. USA 79

₍₁₉₈₂₎

2554.

[10]

AMIT D. J., GUTFREUND H. and SOMPOLINSKY H.,

Phys.

Rev. A 32

₍₁₉₈₅₎

1007.

[11]

AMIT D. J., GUTFREUND H. and SOMPOLINSKY H., Ann.

_Phys.

N. Y. 173

₍₁₉₈₇₎

30.

[12]

MEZARD M., PARISI G. and VIRASORO M.

_{A., Spin}

Glass

_Theory

and

Beyond (World Scientific,

Singapore)

1987.

[13]

FONTANARI J. F. and THEUMANN W. K., J.

_Phys.

France 51

₍₁₉₉₀₎

375.

[14]

ANDERSON J. A., IEEE Transactions on

_Systems,

Man and

_Cybernetics

13

₍₁₉₈₃₎

799.

[15]

FONTANARI J. F. and MEIR R.,

Phys.

Rev. A 40

₍₁₉₈₉₎

2806.

[16]

FONTANARI J. F. and KÖBERLE R., J.

_Phys.

A 21

₍₁₉₈₈₎

2477.

Generalization in a Hopfield network

HAL Id: jpa-00212540

https://hal.archives-ouvertes.fr/jpa-00212540

Submitted on 1 Jan 1990

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

Generalization in a Hopfield network

J.F. Fontanari

To cite this version:

Generalization in

Hopfield

network

Quimica

(Received

April

accepted

final

form

July 1990)

performance

Hopfield

learning

having

only

supply

typical

exemplify

examples

taught

representations

analitically.

play a

representations.

Phys.

(1990)

Physics

Learning

generalization

subject

past

[1-7].

supervised learning

single-layer

[5-7],

following

presented

[8]. Comparatively,

ability

simple

networks,

Hopfield’s

[9],

perform computational

beyond

simple

storage

activity

patterns.

learning

Hopfield’s

setting

coupling

.l¡j

and j

pairs

given

activity

patterns

by

represented by Ising

_Quimica

_April

_accepted

_{July 1990)}

_Hopfield

_learning

_only

_supply

_typical

_exemplify

_examples

_{analitically.}

_{play a}

_Phys.

₍₁₉₉₀₎

_{generalization}

_subject

_past

_[1-7].

_single-layer

_[5-7],

_following

_presented

_ability

_simple

_networks,

_Hopfield’s

_[9],

_beyond

_simple

_storage

_activity

_patterns.

_learning

_Hopfield’s

_setting

_coupling

_{and j}

_pairs

_given

_activity

_patterns

_by

_{represented by Ising}

_{(firing) or Si= - 1 (rest). Storage}

_pattern

_1,

_generalized

_{being exposed}

_{to p}

_activity

_patterns

_couplings

_{assumed Jii}

_couplings

_fixed,

_{governed by}

_[9]

_potentially

_given

_activity

_pattern

_measuring

_performance

_retrieving

_patterns

_overlaps

_{{Si, i = 1, ..., N } is a}

_properties

_fully

_by

_developed

_{spin-glasses [12].}

_{macroscopic overlaps, 0(1),}

_only

_pattern

_{macroscopic overlaps}

_patterns

_[10].

_mainly