HAL Id: jpa-00248077
https://hal.archives-ouvertes.fr/jpa-00248077
Submitted on 1 Jan 1994
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Phase diagram of imprinted copolymers
Vijay Pande, Alexander Grosberg, Toyoichi Tanaka
To cite this version:
Vijay Pande, Alexander Grosberg, Toyoichi Tanaka. Phase diagram of imprinted copolymers. Journal
de Physique II, EDP Sciences, 1994, 4 (10), pp.1771-1784. �10.1051/jp2:1994231�. �jpa-00248077�
Classification
Physics
Abstracts61.40K 87,15 36.20C 82.35 64.60C
Phase diagram of imprinted copolymers
Vijay
S.Pande,
Alexander Yu.Grosberg (*)
andToyoichi
TanakaDepartment
ofPhysics
and Center for Materials Science andEngineering,
Massachusetts Institute ofTechnology, Cambridge,
Massachusetts 02139, U-S-A-(Received
10 April 1994, accepted 30 June1994)
Abstract. Recently, a model for the preparation of
"protein-like" heteropolymers
with aunique
and stableground
state has been proposed and examinedcomputationally. Formally,
this model is similar to another
recently proposed
andcomputationally
examined model of theevolutionary design
ofprotein-like heteropolymers. Using
mean fieldreplica theory,
wefind,
in addition to thefreezing
transition of randomchains,
a transition to the target "native" state. Thestability
of this state is shown to be greater than that of theground
state of random chains. The results derived here should at leastqualitatively
beapplicable
to knownbiopolymers,
which are conjectured to be m viva"designed" by
evolution. Furthermore, we present a crude prescription for alaboratory procedure
in which chains can besyntheeized
m vitro.1. Introduction.
Due to its
biological importance
andphysical complexity,
theproblem
of thefolding
transition ofcopolymers
has attracted considerable attention[1-3]. Biologically, proteins represent
a sort of"designed" heteropolymer,
in this case the result of evolution. It is also known that pro- teins have aunique
structure. It isintriguing
to consider that the existence of aspecific (I.e.
designed) unique ground
state could be the result of evolution. This has been studied com-putationally, by
a Monte Carloprocedure
which swaps monomers via theMetropolis
criterion such that thepolymer
energy is minimized [4].A method to
synthesize
renaturableheteropolymers
in alaboratory procedure
has also beenrecently suggested.
In thisdesign procedure,
monomers areequilibrated
in space and then in-stantly polymerized.
It has been showncomputationally
that thepolymerization
conformation will often be theground
stateconformation,
I-e- thepolymerization
conformation has beendesigned,
or"imprinted"
[5]. Thesignificance
of theimprinting
of thepolymerization
con-formation is that if monomers are
equilibrated
in the presence of atarget
molecule and then(*)
On leave from: Institute of ChemicalPhysics,
RussianAcademy
of Sciences, Moscow 117977,Russia
polymerized,
thecomplementary
site formedby equilibration
of the monomers with thetarget
moleculeprior
topolymerization
would bepreserved
in thepolymerization
conformation. If thispolymerization
conformation is theground state,
then the sequence has beendesigned
sothat it folds to a conformation which has a site
capable
ofrecognizing
thetarget
molecule.Clearly,
inspirit,
these twodesign procedures
areradically
different.However,
in the mean fieldapproximation,
both models areformally indistinguishable,
as both choose sequences witha fixed monomer
composition
such that the energy of interaction is minimized[5].
In this paper, we
employ
thereplica approach
to describe the effect ofdesign
on thefreezing
transition
previously predicted
for random chains[2, 6].
The mean fieldreplica approach
is believed to beapplicable
to disorderedpolymers,
as thepolymer problem
is similar to thelong
range SK
spin-glass [7]. Indeed,
due topolymer flexibility,
all monomers can come in contact with each other in real space,and, therefore,
interact with eachother,
no matter how farthey
are
along
the chain. In this sense, theheteropolymer
isperhaps
the bestphysical
realization of the SKsystem,
with atruly
infinite radius of interaction [6].The
freezing
transition in randomcopolymers
is due to thecompetition
between theentropic favorability
of alarge
number of conformations and theenergetic tendency
toward one or a few conformations withdistinctively
lowenergies. Qualitatively,
we expect that thedesign procedure
should lead to sequences whoseunique ground
state conformation is thetarget
conformation,
as wehave,
in a sense, exerted a "field" which chooses an ensemble of sequences which have beenoptimized
for theparticular target
conformation.2. Formulation of the model.
Consider a
heteropolymer
chain with a frozen sequence of monomers sI, where I is the numberof monomer
along
the chainIi
< I <N)
and sI is thetype
of monomer in thegiven
sequence.In the
present model,
we consideronly
two values for s, s=
+1,
and have the interaction Hamiltonian of the form~
7i =
)B ~
s I s j
6(r
I r
j) (1)
~ ~
where
6(r)
is the Dirac delta function. As we wish to concentrate onheteropolymeric effects,
wedo not
explicitly write,
butimplicitly
assume, an overall attractive second virial coefficient as well as arepulsive
third virial coefficient.Specifically,
we assume that thecomplete
Hamiltonian isgiven by
the sum ofheteropolymeric
andhomopolymeric
terms:7i'
=
~B f
sI s j
6(rI
rj)
+B0p
+Cp~ (2)
~ I,J
where
80
and C are the mean second and third virial coefficients. As ~~assume that
(Bo(
»B,
we can
optimize
the free energy withrespect
to pindependently
of anyheteropolymeric
proper-ties.
Thus,
thesehomopolymeric
terms lead to a compactglobular
state with constantdensity
p =
-80/2C. Furthermore,
B is due toheteropolymeric effects;
it is the"preferential"
en-ergy: for two types of
species
labeled I and2,
thepreferential
energy is the energy differenceE12 (l~ii
+E22).
Themeaning
and value of thepreferential
energy for any real systemdepen/
on the nature of the actual interactions involved
le.
g.hydrogen bonding, hydrophobic
forces,
etc.). Essentially,
some conformations with agiven density (fixed
due to thehomopoly-
meric
terms) might
be morethermodynamically
favorable thanothers,
due toheteropolymeric
effects. This will be the main
subject
of ouranalysis.
The
partition
function isexpressed
aszlseq)
=jj
exP
I-yli lconf, Seq) 13)
conforrnat;ens
Note that the Hamiltonian
depends
on both conformation and sequence. The standard wayto
approach
thepartition
function of asystem
with frozen disorder is toemploy, first,
theprinciple
ofself-averaging
of free energyand, second,
thereplica
trick:Fjseq)
t F=
jfiseqjj~~q
=
-TjIn Zjseq))~~q
= -T limi~~i~~~)i~~~
,
j4)
where
(.. )sea
means average over the set of sequences.In the works
[2, 6],
whileaveraging,
the sequences were considered to be random. The main purpose of this work is toincorporate
the fact that sequences are somehow selected. Thismeans
(. )seq
"~j
l~seq(5)
seq
where
Pseq
is theprobability
distribution for different sequences which appear in the process ofdesign
orsynthesis
of chains.Both of the
recently suggested
models of sequencepreparation is,
4](see above) employ
in the selection process the same volume interactions with which the links of chains interact. In bothcases,
Pseq
isgoverned by
the Boltzmann factor related to the same HamiltonianIi
taken forthe
"target"
conformation *. Since we are not interested in anyparticular
* conformationand, besides,
this conformation seems to be out of control in any real(not computer) experiment,
we average over the conformation *:
~seq ~j
~XP~
li(*, ~~)1, (~)
~
where
Tp
is"polymerization" temperature
at whichdesign procedure
isperformed
andz =
~ ~
exP(-j7ii*, eq)j 17)
seq *
is the normalization constant. The
probability Pseq
includes allpossible
sequences, notonly
the ones with any
given composition.
Collecting
the aboveequations,
we can write then-replica partition function,
up to the constantfactors,
as~ ~~~~~, ~~~~
"
i~"~~~~~~~~~ ~ ~j )~*l~~~~ °~~°~~'°~~~[
~~ j
~, ~, ~
exp
n7i ion°
~~~~T
ni
~
~~"'~~~~
' ~~~
where
C~
=
Co, Cl,
.,
Cn
stand for conformations ofreplica
number a, and index a= 0 is
attributed to the
target
conformation *. Asexpected,
we return to the usual case ofcompletely
random sequences atTp
- cc. The newphysics
which appears at finiteTp
is the mainsubject
of further
analysis.
3~
Replica theory analysis~
We write the
n-replica partition
functionlz~ls~q))seq
"
~j ~j (9)
seq Coocio.. Cm
x exp
If f
/dRidR2
sI6(r) Ri
s j6(r) R2 6(Ri R2)
,
n=0
~
" I,J=1
where T"
=
Tp
for o = 0 and T*= T for o >
0,
and weperform
Hubbard-Stratonovich transformation(Z"(seq))sea
=
/ D# ~
exp
1- f
/dRidR2
~~4" (Ri )4" (R2 6(Ri R2
~
cn,ci,.. cm
~
n=0
~
x
~
expIf /dR f
#"(R)
sI6(rJ R) (10)
seq n=0 1=1
Here
#"(R)
are the fieldsconjugated
to thecorresponding
densities£ )~~ sI6(rJ R), fD#.
means functional
integration
over all the fields(#"(R)),
and we havedropped
all irrelevantmultiplicative
constants from thepartition
function. Note that the sum over sequences entersonly
in the last "source" term of(10).
The summation over sequences can beeasily performed
to
yield
exp
jsource termj
=
~ ij
exp
sI f /dR
j"jR)6jrl R)
si,s2;. sN=+11=1 n=0
=
jj ~
exp
lsl f /dR
#*(R)6(rJ R)
I=1s=+1 n=0
=
jj
2 cosh
If /dR
#"(R)6(rJ R) Ill
1=1 n=0
We
perform
now theexpansion
over#.
It is the mostimportant approximation
of this work.The
corresponding
conditions ofapplicability
will begiven
later.Keeping
the terms up totJ(#~),
weget
then-replica partition
function in the form:lz~ls~q)lseq
"
~j / vi
eXp
j-Ejooflil l121
where the effective energy of
n-replica system
isgiven by
E(Qnp)
=(/dRidR2
x
f § <"(RI )4"lR2) 6jRi R2) f 4"lRi)4~lR21QaPlRi, R2) l13j n=o
n,P=o
and
N
Qnp(Ri, R2)
"~ 6(rJ Ri) 6(r( R2 (14)
1=1
is the standard two
replica overlap
order parameter[6, 3].
Recall that the value ofQ~p(Ri, R2
has the verysimple phyiical meaning:
it isproportional
toprobability
offinding simultaneously
one monomer of the
replica
o at thepoint RI
and one monomer of thereplica fl
at thepoint
R2.
Also note that the normalization conditions/dRi Qap(Ri, R2)
"pp(R2)
and/dRidR2 Qap(Ri, R2)
= N(IS)
are obvious from the definition of
Qnp, equation (14),
wherepp(R2)
is thedensity.
As we are concerned here with alarge globule, density
is assumed constantthroughout
theglobule,
such thatpp(R)
= p
(constant
in space, same for allreplicas) (16)
and therefore
Qnfl(Rl, R2)
"
Qafl(Rl R2) II?)
We also mention that the
diagonal
element isgiven by
Q~~ iRi, R~
= p
61Ri R~) j18)
We can therefore rewrite the effective
n-replica
energy in the formi " ~na
E(Qnp)
"
~ /dRidR2 4"(Ri)4~(R2) ((- ) 6~p6(Ri R2) Qn#p(Ri 2)j
~
",P"0
~
(19)
Now we pass from summation over conformations
(microstates)
to functionalintegration
over
Qnp (macrostates). Qnp
is theonly
relevant orderparameter.
Thecorresponding entropy
isgiven by
[6]e~lo"#I IN
=
~j
6Q~p(Ri, R2) ~ 6(r) Ri 6(r( R2
,
(20)
Co,Ci;. Cm I=1
and therefore
(Z" (seq) )sea
=
/DQ /D#
exp(-F
+Fo )
F=
E(Q~p )
S(Q~p Fo
= In z(21)
The mean field evaluation of this
partition
functionimplies
a saddlepoint approximation
for theintegral
overQ (Eq. (21)). Normally,
this meanstaking
the maximal value of theintegrand.
It is
commonly believed, however,
that in order to find the correctanalytic
continuation in then - 0
limit,
one has to take the maximal rather than the minimal value of the relevant free energyF,
because there aren(n -1) /2 off-diagonal
elements in theQap matrix,
and thereforefor the 0 < n < 1 case, the
integral
overQ~p represents
summation over anegative
numberof variables.
Following
thisprinciple,
we write(Z"(seq))sea
=/D#
exp[-Maxj~jf(Q)]
=
Minj~j /D#
exp[-F(Q)]
,
(22)
I.e.,
we have to maximize the effective free energy functional(21).
4.
Replica symmetry breaking~
We choose some standard function
q~(x),
sayGaussian,
with the normalization conditionfdx q~(x)
=
1,
and say thatQnp(Ri R2)
=
~
~
7l~~ ~/~
,
(23)
(R/~) ~t
where d is the
dimensionality
of space,R/~
can beinterpreted
as the diameter of the tube in whichreplicas
o andfl coincide,
and the normalization condition defines the coefficient. Wenow
repeat
thearguments
of [6]: as theentropy
scales like-(ll~"~)
~ atn <
1,
weget
eacha
# fl
term of the free energy functional of the form~
Ai
~
A~
~~ ~~~~
iRi~)~ (RjP)~
where
AI
andA2
arepositive
numbers. For d >2,
which is the main concern of thiswork,
we find two
maxima, namely R/~
= cc and
R/~
= 0
(in
the later case, see the discussion in [6]concerning
the short distance cut-off ll~*~ =v~/3,
where v is the excludedvolume).
The firstcorresponds
to tworeplicas,
a andfl
which areindependent
and do notoverlap
at all(Qnp
"0),
while the secondcorresponds
toreplicas
which coincide at themicroscopic
level(Qap
= P6(Ri R2)). Thus,
from thesescaling arguments
inR/~,
we conclude thatQ~p
is of the formQ~~jRi R~)
= p q~~6(Ri R~) ja ~ p)
,
j2s)
where
off-diagonal
matrix elements of the new matrix q~p are either 0 or 1. If weadditionally
definediagonal
matrix elements q~~ asQna + 1-
)
,
(26)
we can write
(Z"(seq))sea
=
Minj~
~je~lo"#I /D#
exp~j
q~p
#* #P
,
(27)
z " 2
~
~
a,#=0
where
integration
over Rdisappears leaving
theproduct
of N= p
f
dRintegrals. Moreover,
we can
perform
Gaussianintegration
over# yielding (~)
(~) It is instructive to perform first the Gaussian functional
integrals
over#° yielding
~
N
(Z"(seq))sea
=
Minj~~
~je~lo"~l /l~#
exp~
~j )~p #~ #P
,
(28)
2 2
",P"1
where
~ q~oqop
(29)
q«P " q«P
~~~
In this form, we reduce the
problem
to a form similar to that of Sfatos et al. [2], with a new effective order parameter)~p.
This formexplains
thatreplica
0plays
the role of external field inreplica
space, adsorbing other replicas. In other words, two replicas cv, fl > 0 attract each other directly, as in therandom
polymer,
and,additionally,
because both are attracted to target replica.lz"lseq))sea
= exp
[-Maxjq~~jfiqnp ii Fiqnpi
=
)
inldet i-qnp)i sjq~pi
in zj30)
where we have
dropped
all irrelevant additive constants.To maximize the
n-replica
free energy over q~p means in factfinding
theoptimal grouping
of
replicas.
There is thefollowing
obvioustransitivity
rule:if,
say,R/~
= 0 and
Rf~
=
0,
meaning
that conformations ofreplicas
o,fl
and ~ are all the same, then Jl~°~ = 0 as well.In other
words,
if Qap " 1 and Qp~ "1,
then Qn~ = 1 as well.Using
matrix row and columnoperations,
we canorganize
any such matrix into blockdiagonal
form. This meansgathering replicas
thatoverlap
in the groups andplacing replicas
of the same group into the samediagonal
block in the matrix. One of the blocks is
comprised
of some y + Ireplicas
which dooverlap (I.e., practically coincide)
with the"target" replica
0. OtherIn
+I) Iv
+I)
= n y
replicas belong
to nlx
groups, some xreplicas
in each:--fi' 0 0
Bpl --+---
y+I
Ii-j
I~
0 0
(
'"(
l n+10 0 0
j
i- T
BP i
0 0 0
x
One can say that y
replicas
here are "adsorbed" on thetarget conformation,
whichplays
the role of external field for n other
replicas.
A similar situation exists in neural networks[11],
where the memorizedimage plays
a similar role to thetarget
conformation. On the otherhand,
the
grouping
of otherreplicas
is due to spontaneousreplica permutation symmetry breaking.
The determinant of the Qnp matrix can be
directly
calculated.First,
since the matrix isblock-diagonal,
its determinant is theproduct
of block determinants. Each x x x block hasa
lx I)-fold degenerate eigenvalue ii
Q) and one distincteigenvalue ii
q +xQ),
where4
= 1T/Bp
and" 1 are
diagonal
andoff-diagonal
matrixelements, respectively.
As tothe
Iv +1)
x(y +1) block,
it has one distinctdiagonal
element qoo=
Tp/Bp
=
fp
and forthis reason the
eigenvalue if
q] isonly (matrix
size2)
=(y -1)-fold degenerate,
while thetwo others are
(1/2) ii
+#p
+q(y I)]
+Iii fp
+
q(y 1)]~
+4yq2 Taking
theproduct
of all
eigenvalues throughout
allblocks,
andnoting
that det(-q~p)
=(-1)"+~
det(q~p),
weobtain
~~ l~~
~~~]~ ~ ~)~
~
,
In
[det (-Qn#)j
~~~~"
V
~~~
T
~
~~~T
~
~
~
~ ~~~ ~~~
To estimate the
entropy S(q~p)
related to thegrouping
ofreplicas,
we follow reference [2]to argue that due to the
polymeric
bondsconnecting
monomersalong
thechain,
once onemonomer is fixed in space, the next must be
placed
within a volumea3.
Sincereplicas
thatbelong
in the same group coincide within a tube of radiusRt
'~
v~/3,
there area3 Iv
ways toplace
the next monomer and thus theentropy
per monomer isIn(a3 Iv).
But since allreplica
conformations coincide within the group, we must restrict theposition
of the next monomer to asingle place. Thus,
the entropy loss for each group issix I),
where s=
In(a3 Iv)
is related to theflexibility
of thechain,
and thereforeS = Ns
fi(x 1)
+ y
(32)
X
As to the last term in
(30), lnz,
it isformally
related to the normalization condition for theprobability Pseq,
butphysically
it is the free energy ofsingle replica
0 taken at thepolymerization
temperatureTp.
It can be thereforeeasily
foundby taking
n =0,
y= 0 in the
preceding
formulae:In z
= N ln
~~
lj (33)
BP
Collecting equations (30), (31),
and(32),
we obtain~~ ~2x~
~~~
~~~
~ ~~~
l~)Tp~~
~ ~~
~x
~
~~~~
where we have
employed
the fact that the last two terms in(31)
cancel with normalizationconstants from Gaussian
integration
notexplicitly
written. We areleft, therefore, only
withmaximization over x and y in the n - 0
limit, yielding
theopportunity
to comment on thephysics
of thepossible phases.
5. Phase
diagram~
Let us discuss the
possible
values of x and y in the n - 0 limit. For thereplica
system, whenn is
positive integer,
we have 1 < x < n and 0 < y < n.Clearly,
x = 1 means there is nogrouping,
I.e. noreplica symmetry breaking.
On the otherhand,
x = n means all thereplicas belong
to the same group, orreplica symmetry
is broken. When n becomes less than I and goes to0, inequalities flip. Nevertheless,
x must remain in between of n andI,
andapproaching
x to I means
disappearance
ofreplica permutation symmetry breaking.
In otherwords,
x = Icorresponds
to thefreezing
transition. This transition has beeninvestigated
in[2]. Maximizing
x in
equation (34),
we recover the result of [2] for random chains(~)
~~ ~'~
~
~~~
~ l~~~~/~)x
~~~~The solution of this
equation
is of the form x=
T((s)/Bp,
where((s)
is the function definedby
theequation
2s=
In(I ()
+(/(I (). According
to ourdiscussion,
this solution is valid when itgives
x <I,
I-e- at T <Tf,
whereTf
isgiven by
~ /~)
°~ ~~ ~~~ ~~
~l
~i~~Tf
~~~~
Tf
is the temperature offreezing
transition for the chain with random sequence[2],
and we canwrite
~
~~~ #l~~s~ ~
~~~~We find that xo and
Tf
areindependent
of anydesign parameters
such as y andTp.
Thishas a clear
physical meaning:
if one considers the chainprepared by
ourprocedure
in someparticular
conformation *, then for almost all of the conformationsexcept
*, this chain behavesas if it had a
completely
random sequence. This iswhy freezing
into a random conformation is not at all affectedby
theprocedure
of sequence selection.Consider now maximization with respect to y. The condition 0 < y < n is obvious for
positive integer
n: y= 0 means no
replicas
in thetarget
group(3),
while y = n means allreplicas
are in thetarget
conformation. When n becomes less then I and goes to0,
y remains in between 0 and n, which is also 0. Since y isalways small,
the secondlogarithmic
term in(34)
can be linearized to obtainjF
=
) (In
I ~xj 2s)
+ sn)
(In
I ~xj
+)j~/)
2s(38)
X X P
P
Thus,
the effective free energy(38)
is linear in y. The maximal value is therefore reachedalways
at theboundary
of theinterval,
I.e. either at y = 0(no replicas
in thetarget group)
or at y = n
(all
thereplicas
are in thetarget group).
Thecorresponding phase
transitionoccurs when the y
dependence
of the free energyflips sign,
and the transitionpoint Tj~
canbe
easily found,
since linear in y term of the free energy(38)
vanishes at the transitionpoint.
We substitute s from the condition
(36)
and findTir
=BP i
in
imiiq
When T >Tf
i~~~
Tf
otherwiseClearly,
this is a first order transition.(~)In
fact, this result corresponds exactly to the so-called Parisi ansatz [9] with one-step replica symmetrybreaking.
In our model, however, itcan be obtained in
a more
sophisticated
manner,without any ansatz. Indeed, we can easily consider the general case of some g groups of replicas, with different numbers x, of replicas in each
group. We have then
£)_~In [I @x,j
instead ofQ
In[I @xj
in the In(det (-q~p))
term and s£(_~(x~ I)
instead of sfit(x I)
in the entropyterm. Maximization with respect to x, within the constraint
£(~~
= n y gives that all x~
= x are
the same,
leaving
us with thesimplified
version considered above.(~) In
thermodynamic
limit, theprobability
to obtain given target conformation out of random choice is negligible.~
Random
~
(
T(
~Target
# ~~°~~~
traps
lj
Polymerization Temperature Tp
Fig.
I. Phasediagram
fordesigned copolymers.
There are threephases: I)
randomglobule,
inwhich a vast number of conformations
(folds)
are allowed for the chain in theequilibrium; 2)
fro2englobule,
in whichonly
a few conformations or even one conformationare
allowed; 3)
targetglobule,
in which the
designed
conformation(*)
is the only allowed one. Note that the targetglobule phase region
ofphase diagram
can be divided in two parts: the target conformation is the most stable statein_both,
but a few of the other random conformations may bethermodynamically
either metastable, thusserving
as traps inkinetics,
or unstable without traps. Lines at low T andTp
represent the areasof
inapplicability
of thetheory.
Summarizing
thisdiscussion,
we conclude that there are three differentglobular phases
forheteropolymers prepared by
ourprocedure: Ii)
randomglobule, essentially
similar tohomopoly-
meric one, where
energetical preferences
between monomers are not sufficient to stabilize anyparticular conformation,
so that thethermodynamic equilibrium
is realized as the mixture ofastronomically large
number ofconformations; (it) frozen globule,
where each chain choosessome small number of the
minimally
frustrated [12]conformations,
but the choice isessentially unpredictable
and remains out ofcontrol; (iii) target globule,
where chain choosesexactly
the conformationprescribed
in thepreparation procedure.
This is shown inphase diagram (Fig. ii.
Now we are
prepared
tofinally perform
the n ~ 0 limit.Indeed,
for both y= 0 and y
= n
casks,
the effective free energy(38)
is linear in n.According
to theoriginal expression
of thereplica approach (4),
the real free energy of theheteropolymer
chainequals
to~ _ _~n
~irr~
iz"iS~t~S~~
- -Tiii
~~~~i~
"l
~~°~
From
this,
we write the freeenergies
of all threeglobular phases:
randomlx
= 1; y =
0),
frozen
(x
=
T/Tf;
y=
0),
andtarget (x
=
T/Tf
for T <Tf,
x = I for T >Tf;
y=
n)
)Frandom
" TIn(1- ~)~~j (41)
~'~°~~~
~~'~~~ ~~~~~ ~/~j)~) ~( ~
~~~~~~~~~~
~~'~~~ ~~~~~ ~
/~~)~ (
l
((s)~~Tp
~~~~
Note that these are
already
real freeenergies,
so that a lower free energycorresponds
to a more stablephase, according
to usualphysical logic. By looking
at the freeenergies above,
one caneasily reproduce phase diagram (Fig. 1):
in eachregion
of thediagram
thecorresponding
freeenergy is minimal.
6. Discussion.
The free
energies
of both frozen andtarget phases
do notdepend
ontemperature
in the low T limit:These limits are
naturally interpreted
as theenergies
ofground
state conformations for a chain with random and selected sequence,respectively.
Theground
state energy for a randomsequence is
independent
ofTp,
while the energy of thetarget
conformation increases withTp.
We see that the selection of sequences, orpreparation
ofheteropolymers by
oursynthetic procedure,
reduces the energy of theground
state. Thisimplies
a verypeculiar
character of thedensity
of states of the selected chains(Fig. 2). Indeed,
since the selected sequence looksrandom for all conformations
except
for thetarget
one, its energyspectrum
includes thetarget
conformation as theground
state and thetypical ground
state of random chain as the first excited state.As was
recently
understood[14],
this kind of spectrum is veryimportant
from thepoint
of view of the kineticaccessibility
of theground
state. Of course, one cannotanalyze
kineticspurely by thermodynamic
considerations. Ingeneral, self-organization
of the correctglobular
structure includes
coil-to-globule compaction
and some search for the correctglobular
confor- mation. We are not in theposition
to estimate the time scales involved in those processes.However,
we canqualitatively
compare the kinetics if thetarget phase self-organization
for the two cases T <Tf
and T >Tf.
Indeed,
consider thetarget phase
on thephase diagram
and examine first the T <Tf
case. Inthis case, the frozen
phase is,
from athermodynamic point
ofview,
metastable. Eventhough
itis less stable than the
target
state,metastability
means that amacroscopic
free energy barriermust be overcome to leave this state. It
is, therefore,
a verystrong
trapalong
the way ofchain
self-organization
into thetarget
conformation. Weconclude,
that at T <Tf,
thetarget
conformation may not bekinetically accessible,
eventhough thermodynamically
it is the most stable. On the otherhand,
at T >Tf,
therandomly
frozen conformation is not stable atall;
thus,
there are no effectivelong-living
traps on thepathway
ofself-organization, and,
thereforeself-organization
isexpected
to beconsiderably
faster and more reliable.We now
analyze
the conditions ofapplicability
of ourapproach.
Infact,
besides the fact that we weredoing
mean fieldtheory,
there isonly
one delicateapproximation
which comes inequation (13),
where weneglect higher
order terms in theexpansion
over#.
It is easy toshow,
that all thesubsequent
terms in#
arepositive land
therefore do not cause thedivergence
of theintegrals
over#
likeEq. (12)).
Inparticular,
the next term in#
looks like/dRidR~dR~dR~ ,p~=o injRi)iPiR~)i~iR~)16jR~)Qnp~~iRi, R~,R~,R~) 146)
fit~
I
~f©~enUJ P==E#m
--=w=-
"t"
~ ,
l~
-, ,
' ' ,
, , ,
~~ ~ '
Polymer(2ation Temperature (Tpj
Fig.
2.Sample
energy spectra for sequencesimprinted
at differentpolymerization
temperatures(Tp).
The energy of the target conformation(Etarget)
vs,polymerization
temperature(Tp)
isplotted.
As Tp is increased to Ti,
Etarget
increases. In the region Tp @ Ti(magnified section),
we see thatEtarget
isequal
toEfff~~~,
the averageground
state energy of a random chain. This is related to thephase
transition between target and frozenphases (see phase diagram, Fig, I).
It is instructive to see a realisticrepresentation
of the very bottom part of the energy spectrum, as is shown here in themagnified
section. As was shown in [13], conformations of low energy areabsolutely
differentstructurally, and therefore, different
pairs
of monomers are in contact and arecontributing
to the energyin those conformations. For this reason, the bottom part of the spectrum
obeys
the random energymodel
(REM)
[8]. With thechange
ofTp,
the energy of the target conformationchanges
in aregular
fashion, a5plotted.
Other states represent differentindependent
realizations of the REM system.Eight examples
are shown in the inset. ForTp
> Ti, the average energy for the target conformationstate is
larger
thanEff$~~~.
Note thatEtarget
is the average energy, and that forTp
» Ti, theprobability
distribution of the energy of the target conformation becomes(up
tonormalization) equal
to the density of all other states.
This should be
negligible compared
to theQ~p-term
inequation (13) throughout
theregion
of# contributing
to theintegration
over#.
In otherwords,
if we treatequation (13)
in terms of effectivej-dependent
Landau free energy and write itschematically
in the formf
=Q24~
+Q44~,
then the fourth power term should benegligible
up to wherequadratic
term is of order one. Fromequation (27),
it is clear that thequadratic
term can be estimated asNl#~,
where I is thesmallest,
and therefore mostdangerous, eigenvalue
of the -Qnp matrix.On the other
hand,
the normalization condition forQnp~& implies
thatQ4
'~ N.
Therefore,
the condition ofapplicability
isNl#(
»N#(,
where#o
isgiven by Nl#(
~-
I, yielding
I »
N~~/2. Note,
that this has a clearphysical meaning:
I goes to 0 means that the#- dependent
Landau free energyapproaches
aphase transition,
which is known asmicrophase segregation
[2].Thus,
ourtheory
becomesinapplicable
close to themicrophase segregation regime.
As toI,
we know all of theeigenvalues: they
areT/Bp, T/Bp
x, and(at
y =0) Tp/Bp
I. From(36),
we haveBp
=Tf((s); therefore,
the condition ofapplicability
of theapproach
can be written in the formT 1
~
Tp
1 ~ l
~P
>
jjs) 147)
~~jj~)
~@
~~Tffls) 4 Tf
On the other
hand,
the mean-fieldapproach
for theglobule
is valid ata3 Iv
»I,
or s » I. Inthis case,
((s)
m1-1/2s.
If we defineT and Tp
according
to T=
Tf Ii T)
andTp
=Tf(1- Tp),
then the conditions of
applicability (47)
take the form1 1
~~~~ @
~~~~P~£~@' (~~)
Thus,
our results are validonly
in a rather narrowregion
below thephase
transition. This is understandablephysically:
at lowpolymerization temperature phase segregation
occurs inpreparation system
of twotypes
of monomers,giving
rise to verylong homopolymeric parts
of theprepared
sequence. This of course prevents effectivefreezing
of the chain to either randomor
target
conformation. For this reason we expect, that notonly
ourtheory
breaksclosely
below thefreezing temperature,
but also the veryphenomenon
offreezing
andimprinting
exists
only
in rather narrowregion
of parameters for the two-letterheteropolymer.
Toimprove
thesituation,
one has to pass to a richer set of monomerspecies,
as it is indicated incomputer
simulations [5]. The
corresponding analytic theory
is therefore achallenging problem.
7. Conclusions.
In
conclusion,
we comment on the relevance of our results.First,
in the mean field approx-imation,
there is no difference between the sequencedesign
model ofbiological
evolution [4]and the
imprinting
model [5].Thus,
the above results should be valid for both. As forgeneral heteropolymers, including proteins,
we expect that thequalitative
results found here should also bevalid,
as thephysical origins
of the transition to thetarget
state is notdeeply
connectedto the nature of the
polymer investigated,
but the existence ofdesigned
sequences.From the
experimental point
ofview,
theimprinting
model is a method tosynthesize
het-eropolymers capable
ofrenaturing
to theirpolymerization conformation,
and thuscapable
ofrecognizing
someligand
moleculepresent prior
topolymerization.
In this theoreticalwork,
wehave indeed shown that this is
possible. Moreover,
based upon ourresults,
we formulate thefollowing
crudeprescription
to theexperimental
realization of thistheory. First,
thepolymer-
ization
temperature
must besufficiently low,
or in otherwords,
the set of monomers must be chosen such that thepreferential
energy(B)
should be not less than thepolymerization
temper-ature.
Furthermore,
toprovide
fast reliablefolding
one has to choose theacting temperature
in between the
freezing
and targetphase
transitions.The existence of a
simple procedure
which isautomatically,
I.e. without the biochemicalsynthesis
apparatus of theliving cell, capable
ofproducing
fast andreliably folding polymer
chains with
specific
actiie sites for molecularrecognition
may shedlight
also on apossible
scheme for
prebiotic evolution,
since all of the elements in ourpolymerization procedure
weremost
likely present
in the"primordial soup"
ofearly
Earth.After the
completion
of thiswork,
we were informed of the work[15] discussing
a similarsubject.
We are indebted to the authors forsending
us thepreprint
of their work.Acknowledgments.
We
acknowledge helpful
discussions with E. Shakhnovich and A. Gutin. The work was sup-ported by
NSF(DMR 90-22933)
and NEDO ofJapan.
VSPacknowledges
thesupport
of an NSFFellowship.
AYGacknowledges
the support of KacFellowship.
References
ill Wolynes P.G.,
inSpin
Glass Ideas inBiology,
D. Stein Ed.(World Scientific, Singapore, 1991).
[2] Sfatos
C.,
Gutin A, and Shakhnovich E.,Phys.
Rev. E. 48(1993)
465.[3] Garel T, and Orland
H., Europhys.
Lett. 6(1988)
307.[4] Shakhnovich E-I- and Gutin A.M., Proc. Nat. Acad.
Sci.,
USA 90(1993)
7195.[5] Pande
V.S., Grosberg
A. Yu. and TanakaT.,
Proc. Nat. Acad.Sm.,
USA(1994) (in press).
[6] Shakhnovich E, and Gutin A., Btophys. Chem. 34
(1989)
187.[7]
Sherington
D, andKirpatrik
S.,Phys.
Rev. Lett. 35(1975)
1792.[8] Derrida B.,
Phys.
Rev. Lett 45(1980)
79.[9] Parisi G., J.
Phys-
A 13(1980)
1887.[10] Mezard M-G-, Parisi
G.,
Sourlas N. etal.,
J.Phys.
France 45(1984)
843.[III
Amit D-J-, Gutfreund H. andSompolinsky
H.,
Ann.
Phys.
173(1987)
30-67.[12]
Bryngelson
J.D, andWolynes P-G-,
Proc. Nat. Acad.Sci.,
USA 84(1987)
7524.[13] Shakhnovich E-I- and Gutin A.M., Nature 346
(1990)
773.[14] Shakhnovich E-I-,
Phys.
Rev. Lett. 72(1994)
3907.[15] Ramanathan S. and Shakhnovich