HAL Id: jpa-00247343
https://hal.archives-ouvertes.fr/jpa-00247343
Submitted on 1 Jan 1997
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
A Kinetic Model for Chaperonin Assisted Folding of Proteins
Henri Orland, D. Thirumalai
To cite this version:
Henri Orland, D. Thirumalai. A Kinetic Model for Chaperonin Assisted Folding of Proteins. Journal
de Physique I, EDP Sciences, 1997, 7 (3), pp.553-560. �10.1051/jp1:1997174�. �jpa-00247343�
A Kinetic Model for Chaperonin Assisted Folding of Proteins
Henri
0rland
(~>*)and
D. Thirumalai(~)
(~)
CEA,
Service dePhysique Thdorique, CE-Saclay,
91191 Gif-sur-Yvette Cedex, France (~) Institute forPhysical
Science andTechnology, University
ofMaryland, College
Park,MD 20742-2341, USA
(Received
17July
1996, received in final form 22 November 1996,accepted
26 November1996)
PACS.82.20.-w Chemical kinetics PACS.87.15.-v Molecular
biophysics
PACS.87.15.Da
Physical chemistry
of solutions ofbiomolecules;
condensed statesAbstract. A master equation formalism is used to account for the possible kinetic action of
chaperonin
assistedfolding
of proteins. The model is based on the assumption that chaperoninsrescue misfolded protein structures and
stochastically provide
thempathways
to reach the na-tive state
by
a mechanisminvolving
ATPhydrolysis.
We assume that the misfolded structures are characterizedby
distinct freeenergies
and that these structures are connected to the nativeconformation by a suitable transition probabilities. The chaperonins do not
recognize
the native state. For this model it is shown that in the presence ofchaperonins
the native state is pop- ulatedexponentially.
The exponential population of the native state, which is in accord with experiments, isindependent
of the distribution of the freeenergies
of the misfolded structuresas well as the transition
probabilities connecting
them to the native state.1. Introduction
The classic
experiments
of Anfinsen[lj
showed thatprotein folding, namely
theacquisition
ofa
unique
structure with a well definedtopology
from theprimary
sequence of aminoacids,
isa
self-assembly
process. Recent studies[2-5j, however,
have shown that in many instances a group ofchaperones, belonging
to thefamily
of heat shockproteins,
isimplicated
in in vivafolding
ofproteins.
It issuspected
[6j that the molecularchaperones
do not convey additional information(thus biasing
thepolypeptide
chain to the nativeconformation)
to enable thefolding
of the nascentpolypeptide
chains. Thus the in vivafolding
ofproteins
should be viewed as an assistedself-assembly
process. Thechaperonins
involved in thefolding
ofproteins
in E. Coli are GrOEL and GrOES
(and perhaps others) [7j.
A
major challenge
is to understand the mechanism of the action ofchaperonins
in thefolding
of
proteins
in cells. In verygeneral
terms one couldclassify
the molecularchaperones
into two groups,namely,
intramolecularchaperones
and intermolecularchaperones.
In the formercategory
are those classes ofproteins
that serve assignal
sequences and whose presence seemsnecessary in the
folding
ofa-lytic
proteases[8j,
and in thefolding
of bovinepancreatic trypsin
inhibitor
[9j.
To ourknowledge
no mechanism for the kinetic action of these intramolecular(* Author for
correspondence (e-mail: orlandtlspht.saclay.cea.fr)
©
Les(ditions
de Physique 1997554 JOURNAL DE
PHYSIQUE
I N°3chaperonins
has beenproposed
so far. In this article we consideronly
thepossible
role of intermolecularchaperones, namely,
thechaperonin family.
The kinetic action of
chaperonins
has not beencompletely
understood. The overallgeometry
of GrOEL is that of a double toroid with seven subunits in eachring.
Theapproximate length
of the
roughly cylindrical
structure is looI
while the diameter of the hole is about 45I [10j.
The
geometry
of GrOEL was usedby
Saibil et al.ill,12j
tosuggest
that thepartially
foldedprotein gets trapped
in the hole of GrOEL.Assembly
ofproteins
isthought
to takeplace
in thesequestered
environment offeredby
GrOEL. In thisscenario,
thechaperonins play
apassive
role and their presence is meant to ensure acompletely protected environment,
I-e-they basically provide
an Anfinsen cage forfolding.
Fersht et al.[13]
havesuggested
thatfolding
could take
place
while theprotein
is still bound to the GrOEL. These authorssuggested
this mechanismby studying
thefolding
of barnase in the presence of GrOEL. Theirexperiments
did notrequire
ATPhydrolysis which,
onfairly general grounds,
has been observed to be necessary in the assistedself-assembly
ofproteins.
Thissuggests
that under the conditions used in theexperiments
of Fersht andcoworkers,
thefolding
of bamase would not haverequired GrOEL,
I.e. there are
permissive folding
conditions. In other words the in vitrofolding
of barnase is almostspontaneous involving, perhaps,
discrete intermediates.A
proposal
very different from the two described above has been madeby
Todd et al.[14j
based on the energylandscape perspective,
and the in vitro kineticpartition
mechanism. Thismechanism termed the iterative
annealing
mechanism(IAM)
assumes that themajor
role of GrOEL is to utilize the energy from ATPhydrolysis
to rescue misfolded structures that areseparated
from the native conformationby
barriers which are difficult to overcome onbiological
time scale. With this energy, the misfolded
protein
is released into solution and a fraction of such moleculespartition
to the native conformation whereas theremaining
onesget trapped
in a
large
number of misfolded conformations.Then,
another round of ATPhydrolysis
ensuesand this kinetic
partitioning
isrepeated
until sufficientyield
is obtained. This process ofrepeated
release of misfolded andsubsequent annealing
is the reason forlabeling
this scenario the iterativeannealing
mechanism. In thismechanism,
GrOEL isthought
toplay
an active role which whencoupled
with ATPhydrolysis
serves to enhance the rate offolding
as wasconjectured
earlier. It should beemphasized
that- the IAM does notimply
that thepolypeptide
chain does not
undergo large
structural fluctuations thatmight
lead tounfolding
even after the native state is reached. Theonly assumption
is that the GrOEL does not bind to the native conformation of theprotein.
The IAM mechanism seems to haveexperimental support, although
more work isrequired
to confirm variousaspects
of this scenario. The IAM bears some resemblance to the kineticproofreading
mechanism familiar in the context of DNAreplication.
The contrast between IAM and that based on kinetic
proofreading
treatment and the random energy model[lsj
ispresented
elsewhere. We propose in this paper a masterequation approach
to
predict
some of the consequences of the IAM.Recently
there have been lattice simulations of m vivafolding
based on the TAM[16,17j.
2. The Model
It is useful to recall the
properties
ofchaperonins
which are crucial in theirability
to rescue misfolded structures[14j. Briefly they
are:(I) perhaps
the mostimportant requirement
of thechaperonin system
is thatthey
enablefolding
to occur undernon-permissive
conditions I-e- whenspontaneous folding
underbiologically
relevant time scale isunlikely.
Ingeneral
thisrequires
the presence of both GrOEL andGrOES; (2)
it has been shownby
Lorimer et al.[18j
thatchaperonins
arepromiscuous
in their interaction with substrateproteins
I.e.they
recognize
alarge
number ofstructurally
unrelatedproteins
aslong
asthey
are not in the nativeconformation; (3)
upon ATPhydrolysis
the substrateprotein
is released in a nonspecific
stochastic manner and the released
protein
either reaches the native conformation orgets trapped
in one of the many misfolded structures. These twoproperties
are mimickedusing
asimple
masterequation
and the kinetic consequences of thetheory
are derived below.For
simplicity,
we model theprotein
asexisting
in several free energy statesfa (a
=
0, 1, 2, ).
We
imagine
that to alarge extent,
these statescorrespond
to misfolded structures that haveto cross
large
free energy barriers toget
to the nativeconformation,
which is chosen to beconformation a
= 0. The master
equation giving
theprobability
ofoccupation
of each of thestates
Pa (t)
can be written as(Pa (t)
=
~j I~~abPb(t) ~j WbaPa it)
11)b b
where as usual the transition
probabilities Wab (probability
for thesystem
to make a transitionfrom state to a per unit
time) satisfy
detailedbalance, namely
Wab exp( fl fb)
"Wba exp(- fl fa (2)
with
fl
=
I/kT.
For concreteness we chooseWab
"Ti~ ~XPI~(iia ib)) (3)
where To is a time constant that sets the scale for an
elementary
event in theproblem.
In order toadapt
thisgeneral
formulation to thechaperonin problem,
we make additional ap-proximations
for the transitionprobabilities involving
the 0th state(referred
to as the nativestate of the
protein).
It is known that GrOEL does notrecognize
the native conformation andonly-
binds to the misfolded structures[18j.
Furthermore the IAM allows for thepossibility
of any misfolded structures to be connected with someprobability (however small)
to the nativeconformation. These
important properties
can be mimickedby setting Wao
= 0 for all statesa
#
0. Thisimplies
that on certain timescales,
the native conformation acts as anabsorbing
sink and once it isreached,
transitions to other states areunlikely.
With theseapproximations,
which are intended to be a minimal
representation
ofchaperonin
actionaccording
to theTAM,
the
equations
to beanalyzed
become:jPo(t)
=~ wo~P~(t) (4)
b#o
and
jp~~~~ r~i ii e< <,a,b>P~(t~ i e-~
<>~>~>~~~~~~ ~~~for a
#
0. It iseasily
seen that this set ofequations
conserves the sum of thePa (t).
These
equations
can bereadily
solved toyield:
t
Po(t)
=
rp~e~P'°/~ / dt'PT it') (6)
o
where
I~T(t)
"~e~'~~~Pb(~) (~)
b#0
556 JOURNAL DE
PHYSIQUE
I N°3and
Pb(t)
=
rp~e~P'b/~ /~ dt'e~~b"~~~')/~°PT It')
+Cbe~~b'~/~° (8)
o
In
equation (8),
the various constants aregiven by
~ =
£ e~P'b/~ (9)
all b
Ab
"eP'b/~ (lo)
Cb
"Pb(t
=
0) Ill)
and for
simplification,
we have assumed that the initialpopulation
of the nativeprotein
is 0.Using
theLaplace
transform ofPo(t)
andanalyzing
the structure of itspoles,
it can be shown thatPo It) decays exponentially. Indeed, denoting by
PO ~P) theLaplace
transform ofPo It),
after somealgebra,
we obtain:1 1
0CbAb
~~~~ (i~)
~02
1~j ~
~ ~
~~~
a#0 ~ ~
~~~
where z
= TOP-
The function PO
It)
isgiven by
the inverseLaplace
transform:c+toJ ~ PO
(t)
=
~ eP~ PO
(P) (13)
/c__
2iorwhere C is a constant
larger
than thelargest singularity
of theintegrand.
This
iIitegral
can beperformed by
the residue method.Denoting respectively by
za = roPaand
Ra
thepoles
and residues of PO(P),
we have:Po(t)
=
flRa e~"~/" (14)
The
large
time behavior ofPo(t)
is determinedby
thepole
structure ofPo(P).
As seen inequation (12),
there is apole
at z =0,
with a residue which iseasily
seen to beequal
to I.
One
might
think that there arepoles
at zb "-~Ab,
b# 0,
but thesepoles
are neutralizedby
the denominator of the
premultiplying fraction,
which alsodiverges
at thesepoles. Therefore,
PO(P)
isregular
at zb. Note that due to the definition of ~ andAb, equations (9, lo),
we seethat for any distribution of
energies,
zb < -I.Finally,
there is one series ofpoles given by
theequation:
~
z +
~Aa
~' ~~~~This
equation
caneasily
be solvedgraphically (see Fig. I).
To characterize this set ofpoles, they
are interleaved with the(zb)
and are all smaller than-I,
except for onepole,
which we denote
by
zi=
-Ko,
which isseparated
from the others and satisfies theinequality
-I < zi < 0. This result
again
holds for any distribution ofenergies.
Inparticular,
ifIQ
j ,
-_
-')
0
~
'j
-io
~[
i.1
i~
-6 -4 -2 0
z
Fig.
I.Graphical
solution ofequation (12)
to locate thepoles.
The poles arerepresented by
diamonds. One can see the set ofpoles
below -I, and the isolatedpole larger
than -1.the
spectrum
ofenergies
becomescontinuous,
allpoles
will form a continuous band below-I,
except for thepole
at z= 0 and the
pole
zi which remains above -I.As a result of our
analysis,
we see that thelarge
time behavior of PO(t)
isgiven by:
PO
It)
r~ I
Rie~~°°~ (16)
A
simple
estimate of the effective rate constantKo
can be obtained if we assume that it is close torp~
We obtain:~°
-l +
~Aa Koro
= 1 ~~°~
Ii?)
~ i-I
+~Aa)2
a#o
The form for PO
It) given
inequation (16)
asserts that no matter what the distribution of freeenergies
of the misfolded structures are, the native state ispopulated exponentially, provided
that on a certain time
scale,
transitions from the native conformation areunlikely.
We have verifiedexplicitly
that similar results are obtained for other choices ofWab.
Inparticular
we have shown thatequation (16)
is obtained(with
a differentKo)
ifWab
"
rp~ exp(-flfa).
Recently Zwanzig [19j
hassuggested
asimple
model forprotein folding
kinetics in which the reaction kinetics is described in terms ofchanges
in an orderparameter
that measures thesimilarity
of agiven
conformation to the native state. A masterequation description
of thismodel,
in which the native conformation has thelargest probability
ofoccupation,
also shows that the native conformation ispopulated
in anexponential
manner. Our results show that this is moregeneral.
In factthey
are obtainedusing
the rather weakassumptions
that the misfolded structures are connected with someprobability
to the native conformation and that the transition out of the native state isunlikely
on a certain time scale. For thechaperonin
mediated
folding
this time scale isessentially
setby Kp~
558 JOURNAL DE
PHYSIQUE
I N°3An estimate of the rate of
production
of the native state in thechaperonin
inducedfolding
is
complicated
due to several reasons. Consider asingle
turnoverexperiment.
In such anexperiment
the GrOEL isinitially
bound to thepolypeptide
chain which is in a misfolded conformation. After a round of ATPhydrolysis,
which leads to the release of thepolypeptide chain,
GrOEL goes to the off state for aperiod
of time. In the off state GroEl does not bind to the misfolded states. After the ATPhydrolysis
iscomplete
and a fraction of thepolypeptide
chain molecules haspartitioned
to the native conformation(referred
to as thepartition
factor in[14j)
GrOEL reaches the on state and onceagain
binds to the molecules still in the misfolded conformation. Thefolding
rateKp~,
whichdepends
on the concentration ofGrOEL,
isessentially
set upby
the transition time for the GrOEL to reach the on state fromthe off state. In
experiments
it is of the order of a minute. Theutility
ofequation (16)
isthat it can be used to estimate the fraction of molecules that reach the native conformation (oc
RI
in one round of ATPhydrolysis. Using
theexperimental
rate ofpopulation
of thenative conformation for RUBISCO
[14j
we estimate thepartition
factor to about5%
which is consistent with theexperimental
measurement[14j.
3. Conclusions
In this paper, we have
proposed
asimple
kinetic model forchaperonin
facilitatedfolding
ofproteins.
The crucialexperimental
observation[14j
that GrOEL does notrecognize
the nativestate has been used to construct a master
equation
in which theprobability
of escape from thenative conformation is
negligible.
Thissimple assumption
leads to the fact that thepopulation
of the native conformation occurs
exponentially,
inagreement
with recentexperiments.
It is remarkable that our result appears to beindependent
of the freeenergies
of the misfolded structures, I.e. the states other than the native one. Previous studies of the use of masterequations
for disordered systems have shown that one could obtain stretchedexponential [20j
orpower law
[21j
behaviordepending
upon the exact form of the transitionprobability Wab
used.The
Wab
used in this paperdepend explicitly
on the initial and final states. We haveexplicitly
checked that our conclusion
(cf. Eq. (16))
remainsunchanged
if theWab
of reference[20j
is utilized. Thephysical
reason forobtaining exponential
relaxation is due to the fact that the native conformation actsessentially
as a trap and that there is somepath
that connects all misfolded structures to the nativestate,
I-e-Woa # 0,
for all a. In the case ofchaperonin
assistedfolding, lfia #
0 is ensured becauseaccording
to the iterativeannealing
mechanism the barriersseparating
the misfolded structure and the native states are overcomeby consuming
the energy from ATP
hydrolysis.
Thus frozen states, which are the hallmark ofglassy systems,
are not
essentially
allowed and this translates into fastpopulation
of the lowest free energy minimum which in the case ofproteins corresponds
to the native conformation.In
describing
the model in this article we have notexplicitly
addressed the role of GrOESa
peptide
which like GrOEL also has a seven foldsymmetry
and acts as a dome when bound to GrOEL.Although
it has been demonstrated thatprotein folding
can occur in the absence of GrOES[2-4j
recentexperiments [23j suggest
that GrOES doesplay
animportant
role. Thebasic
assumption
ofIAM, namely,
thechaperonin machinery (GrOEL, GrOES,
andothers)
enables misfolded structures an
opportunity
to reach the native state, still remains valid.However,
in the presence of GrOES the release of misfolded structures takesplace only
whenthe dome opens. This additional
event,
which may havebiological significance,
does not alterthe mathematical model
presented
here. Theonly input
in our treatment is thatchaperonins
provide
the necessary free energy so that barriersconnecting
agiven
misfolded structure and the native conformation can be overcome. Thus the various misfolded structures are connected to the native state with someprobability.
The
theory proposed
here is based on the iterativeannealing
mechanism(IAM)
ofchaperonin
action. One of the
assumptions
of IAM is that the free energy released upon ATPhydrolysis
is sufficient to release the substrate
protein
from GrOEL so that thefolding
to the native conformation can occur via the kineticpartition
mechanism. If the interaction between theprotein
and GrOEL is toostrong
then it follows that theGrOEL-protein complex
would be too stable. If this were the casefolding
rate(if
theprotein
folds atall)
would beconsiderably
slowed down. An estimate of the
optimal
interactionstrength
between thechaperonin system
and the substrateprotein
can be obtainedusing scaling arguments
and the fact that ATPhydrolysis
occurs in theheptameric
GrOEL structure in a"quantized" [22]
manner. These calculations show that the interaction energy per residue between theprotein
and the GrOEL shouldsatisfy
theinequality [22]
~
~ ~
~i~BT~~~~
~~~~
where n is the
stoichiometry
of the moles of ATP involved in eachiteration,
A is the free energy released upon ATPhydrolysis,
N is the number of residues in the substrateprotein,
and T is thetemperature.
If the aboveinequality
is not satisfied weexpect
thatfolding
in the presence ofchaperonin system
is either retarded or does not takeplace
at all. In the GrOELsystem
n is sevencorresponding
to theheptameric ring
with eachacting
as a center for ATPhydrolysis,
A isroughly
about 7kcal/mole.
Thisgives
e between(0.6-1.2) kcal/mole
for N= 150
depending
on theprecise
value used for A. Thusoptimal folding
isexpected
to occurby
the IAM mechanism if e lies between(1 2) kBT.
The assessment that the
strength
of interaction between GrOEL and the substrateprotein
andthe
folding
process could be correlated is consistent with recentexperimental
observations[24].
The above
arguments
alsosuggest
that if theinequality
inequation (18)
is not satisfied then thefolding
of the substrateprotein,
if it occurs atall,
would have to takeplace
while it is still bound to GrOEL. If this were the case the mechanismby
which the foldedprotein
would be released is not at all clear. These considerationssuggest
that in all likelihood theproteins
thatrequire
thechaperonin system (I.
e. whenspontaneous folding
does not takeplace)
are bound to GrOEL in such a way thatequation (18)
is satisfied and hence the native conformationwould be reached
by
the iterativeannealing
mechanism. It is clear that furtherexperiments
offolding
of avariety
ofproteins
in the presence ofchaperonin system
as well as theenergetics
of interaction of the substrate
proteins
with GrOEL arerequired
toclarify
these issues.Acknowledgments
One of us
(DT)
isgrateful
to G-H- Lorimer for several useful discussions on variousaspects
ofchaperonin
mediatedprotein folding.
This work wassupported
in partby
agrant
from the National Science Foundation and the Air Force Office of scientific research.References
[lj
AnfinsenC-B-,
Science181(1993)
223-230.[2j
ToddM.J.,
Viitanen P-V. and LorimerG.H.,
Science 365(1994)
659-666.[3j Martin
J., Mayhew M., Langer
T. and HartlF.-H.,
Nature 366(1993)
228-233.[4j Weissman
J.S.,
KashiY.,
Fenton W.A. and HorwichA.L.,
Cell 78(1994)
693-702.[5j Ellis
R-J-,
Curr.Opin.
Struct. Biol. 4(1994)
l17-122.560 JOURNAL DE
PHYSIQUE
I N°3[6j Ellis R.J. and Van der Vies
S.M.,
Ann. Rev. Biochem. 60(1991)
321-333.[7j
Craig E.A.,
Science 260(1993)
1902-1903.[8j Baker
D.,
Sohl J.L. andAgard D.A.,
Nature 356(1992)
263-265.[9j
Creighton
etal.,
J. Mol. Biol. 232(1993)
l176-l196.[10j Braig
etal.,
Nature Struct. Biol. 2(1995)
1083-1094.[llj
Saibil etal.,
Curt.Biology
3(1993)
265-273.[12j Creighton T-E-,
Nature 352(1991)
17-18.[13j
Corrales F. and FershtA-R-,
Proc. Natl. Acad. Sci. 92(1995)
5326-5330.[14j
ToddM.J.,
Lorimer G.H. and ThirumalaiD.,
Proc. Natl. Acad. Sci. 94(1996)
4030-4034.[lsj
Gulukota K. andWolynes P.G.,
Proc. Natl. Acad. Sci. 91(1994)
9292-9296.[16j
Chan H.S. and DillK.A.,
Proteins Struct. Funct. Genet.(1995)
in press.[17j
SfatosC-D-,
GutinA.M.,
Abkevich A.I. and ShakhnovichE., Biochemistry (1996)
in press.[18j
LorimerG.H.,
Todd M.J. and VittanenP-V-,
Phil. Ykans. R. Sac. London Ser B 339(1993)
297-314.[19j Zwanzig R.,Proc.
Natl. Acad. Sci. 92(1995)
9801-9804.[20j
De DominicisC.,
Orland H. and LainAeF.,
J.Phys.
Lett. France 46(1985)
L463-466.[21j Koper
G.J.M. andHillhorst, Europhys.
Lett. 3(1987)
1213-1217.[22j
ThirumalaiD.,
In "StatisticalMechanics,
ProteinStructure,
and Protein-SubstrateInteraction",
S.Doniach,
Ed.(Plenum Press,
NewYork, 1994)
pp. lls-134.[23j Buchberger A.,
ShroderH., Hesterkamp T.,
Schonfeld H.-J. and BukauB.,
J. Mol. Biol.261