arXiv:0912.3703v1 [physics.bio-ph] 18 Dec 2009
Frédéri Fourré1∗
DenisBaurain2†
De ember 18,20091
Aliation: MonteoreInstitute,DepartmentofAppliedMathemati s, Univer-sityofLiège,B28,B-4000Liège,Belgium
2
Aliation: Algology,My ologyandExperimentalSystemati s,Departmentof LifeS ien es, UniversityofLiège,B22,B-4000Liège,Belgium
Abstra t
This paper is on erned with the redu tion of a noisy syn hronous Booleannetworktoa oarse-grainedMarkov hainmodel. Consideran
n
-nodeBooleannetworkhavingatleasttwobasinsofattra tionandwhere ea hnodemaybeperturbedwithprobability0 < p < 1
duringonetime step. Thispro essisadis rete-timehomogeneousMarkov hainwith2
n
possiblestates. Nowunder ertain onditions,thetransitionsbetweenthe basins ofthe network may beapproximatedby a homogeneousMarkov hainwhereea hstateofthe hainrepresentsabasinofthenetwork,i.e. the size ofthe redu ed hainis the numberof attra torsof the original network.
Contents
1 NoisyBoolean networks(NBNs) 2
1.1 Denition . . . 2
1.2 Themeanspe i path. . . 5
1.3 Timeevolutionequation . . . 6
1.4 Stationaryprobabilitydistribution . . . 7
2 Methodfor the al ulation of the sojourn timedistribution in
a basin ofa NBN 9
3 The problem ofgeometri approximation 10
∗
PresentAddress:SystemsBiologyGroup,LifeS ien esResear hUnit,Universityof Lux-embourg,162a,avenuedelaFaïen erie,L-1511 Luxembourg,Grand-Du hyofLuxembourg. Corresponden eshouldbeaddressedtofrederi .fourreuni.lu
†
PresentAddress:UnitofAnimalGenomi s,GIGA-RandFa ultyofVeterinaryMedi ine, UniversityofLiège,B34,B-4000Liège,Belgium
4 The geometri approximation in
(n, C)
networks 204.1
(n, C)
networksand onden eintervals . . . 224.2 Simulationresultsanddis ussion . . . 25
4.3 Approximationformula forthe meantime spentin abasin of a NBN . . . 31
5 Methodforthe redu tion of a NBN 36 5.1 Dis rete-time redu tion . . . 36
5.1.1 Thelow
p
regime . . . 365.1.2 Case
p = 1/2
. . . 405.2 Continuous-timeredu tion. . . 40
6 Statisti al u tuationsin NBNs 42
7 Redu tion ofa NBN to a two-state Markov hain 46
8 Con lusion 46
9 Note 48
10A knowledgment 48
1 Noisy Boolean networks (NBNs)
1.1 Denition
Boolean networks (BNs) havebeenused for several de ades asmodels of bio- hemi al networks, mainly to predi t their qualitative properties and, in the aseofgeneti regulatorynetworks,to infertheinputsandintera tionrulesof theirnodesfrommi roarraydata(Kauman,1969;GlassandKauman,1973; Kauman,1993;Lietal.,2004;Martinet al.,2007; SamalandJain,2008).
An
n
-node BNmodel onsists ofn
intera ting nodeswhi h, in the ontext ofbio hemi alnetworks,generallyrepresentgenesormole ularspe iessu has proteins,RNAsormetabolites. LetX
i
(k)
denotetheBooleanvariable des rib-ingthestateofnodei
atdis retetimek = 0, 1, . . .
Ifnodei
representsaprotein, thenwesaythatifthevalueofthenodeis1
,thentheproteinispresent,inits a tiveform and itstarget (or substrate)present. UsingDe Morgan'slaw, the negationofthis onjun tiongivestheinterpretation forvalue0
: theproteinis absentorina tiveoritstarget(orsubstrate)absent.Remark 1 A more on ise interpretation instead of present (resp. absent) is present and not being degraded (resp. absent orbeingdegraded)or evenpresent andprodu tion rate greater thandegradation rate (resp. absent ordegradationrate greaterthanprodu tion rate).
Foran
n
-nodesyn hronousBN,theintera tionsbetweenthenodesare mod-eledbyasetofn
Booleanintera tionfun tions su h that:X
i
(k + 1) = F
i
[X
1
(k), X
2
(k), . . . , X
n
(k)],
i = 1, 2, . . . , n,
(1)with
F
i
intera tion fun tion of nodei
. If the state of nodei
at time(k + 1)
depends onthe state of nodej
at previoustimek
, then nodej
is saidto be aninputfornodei
. Thenumberofinputsofanodeis alled the onne tivity of that node. Thenetwork is said to besyn hronousbe ause asexpressed in (1)then
nodesareupdatedsyn hronously. Alsofrom(1)thedynami sof the network deterministi .Letustakeanexampletoillustratesomeessentialpropertiesof BNs. Con-siderthefollowingBN:
X
1
(k + 1) = ¬X
4
(k)
X
2
(k + 1) = ¬X
3
(k)
X
3
(k + 1) = X
1
(k) ∨ X
2
(k) ∨ X
4
(k)
X
4
(k + 1) = X
1
(k) ∧ X
2
(k) ∧ X
3
(k),
(2)where symbols
¬
,∨
and∧
represent logi al operators NOT, OR and AND respe tively. The size of the state spa e of this network, i.e. the number of possiblestates, is2
n
= 16
sin e
n = 4
here. From(2), thenextstatefor ea h possiblestate an be omputedtoobtainthestatetableofthenetwork. Using this state table then, the state diagram an be built. This is shown in Fig 1 whereit anbe seenthat thestate spa ehasbeenpartioned intotwodisjoint sets. Thesesetsare alledbasinsofattra tionandaredenotedbyB1andB2in thegure. Ea h basin onsistsof anattra torand sometransientstates. The attra torof B1 is the xed point1010
: whatever the initial state in B1, the network will onvergeto1010
and stay there forever. Attra tors may also be periodi asisthe aseforbasinB2. Thesizeofabasinorofanattra toristhe numberofstatesthat onstituteit.Now suppose that, due to random perturbations, ea h node of a BN has probability
0 < p < 1
to swit h its state (0
to1
or vi e versa) between any twotimesk
and(k + 1)
. Then we get anoisy BN(NBN)1
. The introdu tion of disorder
p
is supported by the sto hasti nature of intra ellular pro esses oupled to the fa t that, thermodynami ally speaking, bio hemi al networks areopensystems.One important dieren e between a BN and a NBN is that in the latter, transitionsbetweenthebasinsofthenetworkareallowed. Forexample,in the statediagramofFig.1,ifweperturbsimultaneouslynodes
2
and3
ofattra tor state1010
whi h isin B1,thenwegoto transientstate1100
whi hbelongsto B2.1
Here,forthesakeofsimpli ity,perturbationprobability
p
issupposedtobeindependent oftimek
andofthestatesofthenodes.If,forinstan e,foraparti ularnode,0
to1
random swit hingprobabilityissetgreaterthan1
to0
one,thenitmeansthatrandomperturbations tendtoturnthenodeONratherthanOFF.0110 1010 0001 1001 1000 1011 0010 0011 0111 1111 B1 B2
Figure1: Statediagramofthefour-node networkdened byintera tionrules(2). Intera tionsbetweenthenodesleadtothepartitioningofthestatespa eintotwo basinsB1 andB2 ofrespe tivesizes
6
and10
.Remark2 FlippingthevalueofsinglenodeshasbeenenvisagedbyKauman(1993 ) ingeneti Booleannetworkstostudytheirstabilitytowhathe alledminimal perturba-tions. Shmulevi h etal.(2002 )proposedamodelforrandomgeneperturbationsbased onprobabilisti Booleannetworks (a lassofBooleannetworks thaten losedthe lass ofsyn hronousBooleannetworks byassigning more than oneintera tion fun tion to ea hnode)inwhi hanygeneofthenetworkmayipwithprobability
p
independentlyof othergenesandinterpretedtheperturbationeventsastheinuen eofexternalstimuli onthea tivityof thegenes.Let
L(x; n, p)
be the probability that exa tlyx
nodes will be perturbed duringonetimestep. Thenlettingq = 1 − p
:L(x; n, p) =
n
x
p
x
q
n−x
if0 < x < n,
q
n
ifx = 0,
p
n
ifx = n.
(3)This is the binomial probability distribution with parameters
n
andp
. On average,np
nodes are perturbed at ea h time step. The probability that at leastonenodewillbeperturbedduring onetimestepistherefore:n
X
x=1
L(x; n, p) = 1 − q
n
= r.
(4)Sin ewearemainlyinterestedinthebehaviourofthenetworkas
p
varies,inthe followingn
andx
will be onsideredas parameterswhilep
will be onsidered asavariable. Thusweshould writeL(p; n, x)
insteadofL(x; n, p)
2 . 2
p
0.002 0.02 0.2 0.5L(p; 4, 1)
0.0080 0.0753 0.4096 0.2500L(p; 4, 2)
0.0000 0.0023 0.1536 0.3750L(p; 4, 3)
0.0000 0.0000 0.0256 0.2500L(p; 4, 4)
0.0000 0.0000 0.0016 0.0625L(p; 8, 1)
0.0158 0.1389 0.3355 0.0312L(p; 8, 2)
0.0001 0.0099 0.2936 0.1094L(p; 8, 3)
0.0000 0.0004 0.1468 0.2188L(p; 8, 4)
0.0000 0.0000 0.0459 0.2734Table1: Some valuesofthefun tion
L(p; n, x)
.For
0 < x < n
,L(p; n, x)
has a maximum atp = x/n
andL(0; n, x) =
L(1; n, x) = 0
.TheMa laurinseriesof
L(p; n, x)
uptoorder2
is:L(p; n, x) =
1 − np + n(n − 1)p
2
/2 + . . .
ifx = 0,
np − n(n − 1)p
2
+ . . .
ifx = 1,
n(n − 1)p
2
/2 + . . .
ifx = 2,
0 + . . .
if2 < x ≤ n.
Thus onehas:
L(p; n, x) =
1 − np + o(p)
ifx = 0,
np + o(p)
ifx = 1,
o(p)
if2 ≤ x ≤ n.
(5)This means that,
n
being xed, for su iently smallp
the probability that2 ≤ x ≤ n
nodes be perturbed during onetimestep isnegligible ompared to theprobabilitythat just onenodebeperturbed. Table1givessomevaluesofL(p; n, x)
roundedtofourde imalpla es. Weseethatatp = 0.002
,L(p; 4, 1) =
0.0080 = 4p
andL(p; 8, 1) = 0.0158 ≈ 8p
.1.2 The mean spe i path
Consideraseries ofBernoullitrials whereea h timestepdenesonetrialand where asu ess means that at least onenodehas beenperturbed. From (4), theprobabilitythattherstsu esswillo urontrial
i
is:p
i
= r(1 − r)
i−1
,
i = 1, 2, . . . ,
(6)whi h is ageometri distribution with parameter
r
. The rst momentof this distribution(the meantimebetweentwosu esses) is:τ =
1
r
=
1
1 − q
n
.
(7)Weseethat
τ
isade reasingfun tionofp
andn
,thattendsto1
asp → 1
and to∞
asp → 0
. Sin etime stepisunity,τ
equalsthemeannumberd
of state transitions between two su esses. By analogy with the mean free path of a parti lein physi s3
,
d
will be alled mean spe i path. Thetermspe i isusedtore allthatbetweentwosu esses,thetraje toryin thestatespa eis spe i tonodeintera tions,i.e.,itisentirelydeterminedbynodeintera tions.Forsu ientlysmall
p
onehasr = 1 − q
n
≈ np ≈ L(p; n, 1)
,wherethelast approximation omesfrom (5). Hen e,forsu ientlysmall
p
wemaywrite:p
i
≈ np(1 − np)
i−1
,
i = 1, 2, . . .
Dependingon thevalueof
p
, dierentdynami alregimes arepossible. Forp = 0
, the network is trapped by an attra tor where it stays forever. For0 < p < 1
, transitions between the basins of the network o ur. The morep
is lose to one, theshorter the times spent on the attra tors,the more the network suers from fun tional instability. In the lowp
regime, the network may both maintain a spe i a tivity for a long time period and hange its a tivity: fun tionalstabilityandexibility(ordiversity) oexist.1.3 Time evolution equation
ANBNis infa t adis rete-timeMarkov hain
{X
k
, k = 0, 1, . . .}
,whereX
k
is therandomvariablerepresentingthestateofthenetworkattimek
. Thestate spa eofann
-nodeBNwill bedenoted by{1, 2, . . . , E}
,withE = 2
n
andwith state
i
orrespondingto binaryrepresentationof(i − 1)
.Let
π
ij
= Pr{X
k+1
= j|X
k
= i} ≥ 0
bethe onditional probabilitythat the networkwill beinstatej
at(k + 1)
whileinstatei
atk
. ThematrixofsizeE
whoseelementsare theπ
ij
'swill bedenoted byΠ
andis alled thetransition probabilitymatrixin Markovtheory. Thesumofelementsin ea hrowofΠ
is unity. Letz
(k)
i
betheprobabilitythat thenetwork will bein statei
at timek
and denote byz
(k)
theve torwhose elementsare the stateprobabilities
z
(k)
i
. Givenaninitialstateprobabilityve torz
(0)
,theve torsz
(1)
, z
(2)
, . . .
arefound from(Kleinro k,1975):z
(k+1)
= z
(k)
Π,
k = 0, 1, . . .
(8)Matrix
Π
isthesumof twomatri es: 1. TheperturbationmatrixΠ
′
,whose
(i, j)
thelementisequaltoπ
′
ij
=
p
h
ij
q
n−h
ij
ifi 6= j,
0
ifi = j,
3Inkineti theory of gases,the meanfree path ofa gas mole uleis the meandistan e traveledbythemole ulebetweentwosu essive ollisions.
where
h
ij
refers to(i, j)
th element of Hamming distan e matrixH
, a symmetri matrix that depends only onn
. Elementh
ij
is equal to the numberof bits that dierin the Boolean representationsof statesi
andj
. DiagonalelementsofH
aretherefore0
. Forexample,ifn = 2
,H
takes theform:H =
0 1
1 2
1 0
2 1
1 2
0 1
2 1
1 0
.
(9)Note that in ea h row of
H
, integersx = 1, 2, . . . , n
appear respe tivelyn
x
times. Thusfrom(3) and(4), thesumofelementsin ea h rowof
Π
′
mustbe
r
.2. Theintera tionmatrix
Π
′′
. Ifthenodeintera tionsaresu hthatstarting in state
i
thenetwork isfor ed totransition to statej
in onetimestep, then(i, j)
th element ofΠ
′′
equals
q
n
= 1 − r
, otherwise it is
0
. Noti e that ifi
th diagonalelementofΠ
′′
equals
q
n
thenstate
i
is axed point (attra torofsize1
).Therefore,forxed
n
,anyprobabilityπ
ij
willbeeither0
orafun tionofp
.1.4 Stationary probability distribution
Sin e
p
does not depend onk
, theπ
ij
's are independent of time. The hain isthushomogeneous. Asshown by Shmulevi hetal.(2002)forgeneti proba-bilisti Boolean networks,for0 < p < 1
,the hainisirredu ibleandaperiodi . Consequently, forgivenp
there is aunique stationary distribution¯
z
whi h is independentofz
(0)
(Kleinro k,1975). Thestationary distributionsatises the twoequations:
z
= zΠ
andX
i
z
i
= 1.
(10) Inaddition,wehave:lim
k→∞
Π
k
= ¯
Π,
(11)whereea h rowof
Π
¯
isequaltove tor¯
z
.Fig. 2showsthestationary stateprobabilities
z
¯
i
forthenetwork ofFig. 1 andtwop
values. Bluepoints orrespondtop = 0.04
and redonestop = 0.4
. As anbeseenin thegure,whenp = 0.04
,thestationary probabilitiesof the transientstatesare small ompared to those of theattra torstates(attra tor statesarerepresentedinboldtypeinthegure). Thereforeinthelowp
regime, thea tivityofthenetworkinthelongtermisgovernedmostlybyitsattra tors4 . 4
Equivalently,inthelow
p
regime, itismostly attra torsthatdeterminethe fateof the ells.0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0000
0001
0010
0011
0100
0101
0110
0111
1000
1001
1010
1011
1100
1101
1110
1111
Boolean states
S
ta
ti
on
ar
y
pr
ob
ab
ili
ti
es
¯z
i
Figure 2: Stationary state probabilitiesfor the network shownin Fig. 1and two
p
values. Blue points:p = 0.04
; red points:p = 0.4
. Attra tor states are in boldtype. The stationary probability ofB1 (obtained by summingthe stationary probabilitiesofthestatesofB1)is0.3270
whenp = 0.04
and0.3708
whenp = 0.4
. Sin ethesizeof B1is6
, thisprobability onvergesto6/16 = 0.375
asp
tendsto1
.Infa t,thestationaryprobabilitiesofthetransientstatestendto
0
asp
tends to0
. Alsonoti eforp = 0.04
thestationaryprobabilitiesofthese ondattra tor almostequal.As
p
tendsto unity,thestationaryprobabilitiestend tobeuniform,i.e. asp → 1
onehasz
¯
i
→ 1/E = 1/16 = 0.0625 ∀i
(seethe asep = 0.4
). Thusforp
su iently lose tounity, thestationary probabilitiesof thetransient states arenotnegligible omparedtothoseoftheattra torstates.Remark 3 Forsu iently small
p
, the stationary probabilityof an attra tor state maybeapproximatedby¯
Z
∗
/A
where¯
Z
∗
isthelimitas
p → 0
ofthestationary o upa-tionprobabilityZ
¯
ofthebasin ontainingtheattra torandA
thesizeoftheattra tor (see further 4.3). Thus in the ase of the network of Fig. 1, we get for attra tor state11
(xed point) thatz
¯
11
≈ 1/3 = lim
p→0
Z/1
¯
, and forthe states of these ond attra tor,thatea hstationary stateprobability≈ 1/6 = lim
p→0
¯
Z/4
.distribution in a basin of a NBN
ConsideraBNhavingatleasttwobasins. ConsiderabasinBofsize
B
ofthis network andaxedperturbationprobabilityp
. LetΠ
b
=
Q
a
0
1
.
(12)Thesquare
B × B
sub-matrixQ
ontainstheone-step transition probabilitiesπ
ij
between the states of B. For example, in the ase of basin B1 of Fig. 1, element(1, 4)
ofQ
isequaltotheone-steptransitionprobabilitybetweenstate2
(0001
) and state10
(1001
). Elementi
of ve tora
represents the one-step transitionprobabilitybetweenstatei ∈
B andan absorbingstate5 regrouping states
j /
∈
B ,i.e.a
i
=
X
j /
∈
Bπ
ij
=
X
j /
∈
Bπ
′
ij
=
X
j /
∈
Bp
h
ij
q
n−h
ij
.
0
isarowve toroflengthB
withallelementszeroand1
isas alar. FrommatrixQ
,we an al ulatetheprobabilityW
(k)
thatthenetworkwill beinbasinBattime
k
givenaninitialprobabilityve torb
(0)
oflengthB
with sumofelements1
:b
(k+1)
= b
(k)
Q,
k = 0, 1, . . . ,
(13) andbydenition:W
(k)
=
B
X
i=1
b
(k)
i
,
k = 0, 1, . . . ,
withW
(0)
= 1
.Nowwewantthe sojourntime distributionin B. Weshall denoteby
S
the dis reterandomvariablerepresentingthesojourntimeinB,withsamplespa es = 1, 2, . . .
, andbyψ
s
theprobabilitydistribution ofS
, i.e.ψ
s
= Pr{S = s}
. Foragivenbasin,ψ
s
dependsonp
andinitial ve torb
(0)
. Itisgivenby:
ψ
s
= W
(s−1)
− W
(s)
,
s = 1, 2, . . . ,
(14)whilethe umulativedistributionfun tionof
S
is:ˆ
ψ
s
= Pr{S ≤ s} = 1 − W
(s)
,
s = 1, 2, . . .
(15)Matrix
Π
b
in(12)representsanabsorbingMarkov hain(Snell ,1959),i.e. it hasatleastoneabsorbingstateandfromeverynon-absorbingstate(everystate∈
B )one anrea hanabsorbingstate(anystate∈
/
B ). Ifthe hain isinitially in statei ∈
B , then the mean time spent in statej ∈
B before absorption is5
the
(i, j)
th elementof fundamental matrix(I − Q)
−1
(Snell, 1959). Thus if
µ
denotesthemeanofψ
s
then:µ = b
(0)
(I − Q)
−1
1
.
(16)Here
1
is a olumnve toroflengthB
withea helement1
.Alsonoti e from the denitions ofthe meanof
S
and ofψ
s
thatµ
anbe expressedasfollows:µ =
∞
X
k=0
W
(k)
= 1 + W
(1)
+ W
(2)
+ . . .
3 The problem of geometri approximation
Let
g
s
be a geometri distribution with parameterp
µ
= 1/µ
, i.e.ψ
s
andg
s
have same mean. Consider the maximum deviation between the umulative distributionfun tions ( dfs)of thesetwodistributions,namely:δ
∗
= max
s≥1
δ
(s)
,
(17) withδ
(s)
= | ˆ
ψ
s
− ˆ
g
s
|
andg
ˆ
s
the df ofg
s
,i.e.g
ˆ
s
= 1 − (1 − p
µ
)
s
,
s = 1, 2, . . .
In probabilitytheory,δ
∗
is alledtheKolmogorovmetri . As
ψ
s
,δ
∗
isnotdened for
p = 0
and, for a given basin, depends onp
andb
(0)
. It omes from (15) that:δ
(s)
= |(1 − p
µ
)
s
− W
(s)
|,
s = 1, 2, . . .
(18) Thusδ
(s)
representstheabsoluteerrorbetween
W
(s)
anditsgeometri approx-imation
(1 − p
µ
)
s
. Forbasinsof size
B = 1
(one xed pointand notransient states),ψ
s
is given by(6) whi h is geometri . Hen e for su h basinsδ
(s)
= 0
∀s, p
andthusδ
∗
= 0 ∀p
. Thisisaparti ular aseandinthefollowingweshall studythebehaviourof
δ
∗
as
p
tendsto0
withoutanyassumptionsonψ
s
. Theprobabilitytoexit Bduring(k, k + 1)
is:p
e
(k, k + 1) = Pr{S = k + 1|S > k} = 1 −
W
(k+1)
W
(k)
,
k = 0, 1, . . .
(19)Thisprobabilitydependson
k
,p
andb
(0)
. Inparti ular,for
B = 1
,p
e
(k, k + 1)
is onstant and equaltor
. Whenp > 0
, any statei ∈
B ommuni ates with any statej /
∈
B, thusp
e
(k, k + 1) > 0 ∀k = 0, 1, . . .
and therefore probabilityW
(k)
isstri tlyde reasing. Inversely,sin eW
(0)
= 1
,wehave:W
(s)
=
s−1
Y
k=0
[1 − p
e
(k, k + 1)],
s = 1, 2, . . .
(20)δ
(s)
= |(1 − p
µ
)
s
−
s−1
Y
k=0
[1 − p
e
(k, k + 1)]|,
s = 1, 2, . . .
(21)Wesee thatif
p
e
(k, k + 1) = p
µ
∀k
thenδ
∗
= 0
. Alsonotethefollowing: 1.
lim
p→0
µ = ∞ ∀b
(0)
.
(22)
Whatevertheinitial onditions,as
p
be omessmaller,ittakesonaverage moreandmoretimetoleavethebasin. Alsothismeansthat∀b
(0)
:p
µ
→ 0
asp → 0
. 2.lim
p→0
p
e
(k, k + 1) = 0
∀k, b
(0)
.
(23)Whatever
k
andthe initial onditions,the probability toleavethebasin during(k, k + 1)
goesto0
asp → 0
. Indeed,fromequations(13)and(19), it omesthatp
e
(k, k + 1) =
X
i∈
Ba
i
ˆb
(k)
i
=
X
i∈
Bn
X
x=1
Γ
x
i
p
x
q
n−x
ˆb
(k)
i
,
k = 0, 1, . . . ,
(24) withΓ
x
i
≥ 0
thenumberofwaysofleavingBbyperturbingx
bitsofstatei ∈
B 6 andˆb
(k)
i
=
b
(k)
i
W
(k)
,
i.e.X
i∈
Bˆb
(k)
i
= 1,
k = 0, 1, . . .
Inordertoshowthat
∀b
(0)
any
δ
(s)
tendsto
0
asp
tendsto0
,weintrodu e twopropositions.Proposition1 ForanybasinofanoisyBooleannetworkwithxed
0 < p < 1
, thefundamental matrix(I − Q)
−1
has areal simpleeigenvalue
λ
∗
> 1
whi h is greaterin modulusthanany other eigenvaluemodulus, thatis
λ
∗
> |λ|
for any othereigenvalueλ
of(I − Q)
−1
. Wehave:lim
k→∞
p
e
(k, k + 1) =
1
λ
∗
.
6Thisnumberisequaltothenumberofelementsinrow
i
oftheHammingdistan ematrixForagiven basin,the rate of onvergen edependson
p
andb
(0)
whilethelimit
1/λ
∗
dependsonly on
p
.From that proposition,weseethat if
ψ
s
is geometri thenp
µ
= 1/λ
∗
. For example,ifB = 1
thenp
µ
= r = 1/λ
∗
. Demonstration.Wehavetoshowthat:
lim
k→∞
W
(k+1)
W
(k)
= λ
b
,
withλ
b
= 1 − 1/λ
∗
arealsimpleeigenvalueof
Q
greaterin modulusthanany othereigenvaluemodulusand0 < λ
b
< 1
.Matrix
Q
isnonnegative. Itisirredu ibleandaperiodi thusprimitive. From thePerron-Frobeniustheorem,wehavethat: (1)Q
hasarealeigenvalueλ
b
> 0
whi h is greater in modulus than any other eigenvalue modulus. Sin eQ
is substo hasti , wehave0 < λ
b
< 1
. (2)λ
b
isasimplerootofthe hara teristi equationofQ
. Moreover(DouglasandBrian,1999):lim
k→∞
Q
k
λ
k
b
= vu
T
,
with
v
andu
rightandlefteigenve torsasso iatedwithλ
b
hoseninsu haway thatu > 0
,v > 0
andu
T
v = 1
. Thegreater
k
, thebetterthe approximationQ
k
≈ λ
k
b
vu
T
. Thus:W
(k+1)
= b
(0)
Q
k+1
1
≈ b
(0)
λ
k+1
b
vu
T
1
≈ b
(0)
λ
b
Q
k
1
= λ
b
W
(k)
.
Therefore:W
(k+1)
W
(k)
→ λ
b
ask → ∞.
Proposition 2µ
isasymptoti ally equivalenttoλ
∗
:lim
p→0
µ
λ
∗
= 1.
Sin eλ
∗
doesnot depend on
b
(0)
, the last statement is equivalent to say thatas
p
tendsto0
,µ
islessandlessdependentonb
(0)
. Inotherwords,from (16),theelementsofve tor
(I − Q)
−1
1
tendstobeequalas
p → 0
. Demonstration.If
ψ
s
isgeometri thenµ
doesnotdependonb
(0)
. Thusfrom(16)it omes that:
(I − Q)
−1
1
= µ1,
orequivalentlyQ1 = (1 −
1
This means that
(1 − 1/µ)
is an eigenvalue ofQ
with asso iated eigenve tor1
. Sin eQ
isnonnegativeprimitive,bythePerron-Frobeniustheorem,(1 −
1
µ
)
mustequalλ
b
,whi himpliesµ > 1
. Nowλ
b
= 1 − 1/λ
∗
. Thus
µ = λ
∗
. From (25) we see that the sum of elements in any row of
Q
is a onstant. This is be ausewhenψ
s
isgeometri ,theprobabilitytoleavethebasinfromanystate is onstant.When
Q
isnotgeometri ,thePerron-Frobeniustheoremgives:Qv = λ
b
v,
0 < λ
b
< 1.
Nowas
p
tendsto0
,Q
tendstoasto hasti matrix(be auseve tora
tendsto ve tornull). Therefore asp
tends to0
,λ
b
musttend to1
andv
must tendto1
. Thusforsu ientlysmallp
wemaywrite:Q1 ≈ λ
b
1
,
(26)orequivalently
(I − Q)
−1
1
≈ λ
∗
1
.
Thereforeµ = b
(0)
(I − Q)
−1
1
≈ b
(0)
λ
∗
1
= λ
∗
.
Thus as
p → 0
,µ
willbelessand lessdependentoninitial onditionsand the error in the aboveapproximationwill tend to0
. Note that sin eλ
b
→ 1
asp → 0
wemusthaveµ → ∞
asp → 0
whi hisresult(22).Remark 4 Aswillbe dis ussed later, approximation
µ ≈ λ
∗
may be good in some neighborhood ofsome
p
(typi allyintheneighborhoodofp = 0.5
, seeFig.13in4.3).From Proposition 2now,
(1 − p
µ
)
s
will tend to
(1 − 1/µ)
s
= λ
s
b
asp → 0
. Ontheotherhand,from(26),wegetQ
s
1
≈ λ
s
b
1
,
andthus:W
(s)
= b
(s)
1
= b
(0)
Q
s
1
≈ λ
s
b
.
Hen efrom(18)anyδ
(s)
willtendto
0
asp
tendsto0
,whateverb
(0)
. Thusthe maximumwill doso:
lim
p→0
δ
∗
= 0
∀b
(0)
.
Sin e the maximum deviation between the two dfs of
ψ
s
andg
s
vanishes asp → 0
, these two dfs tend to oin ide asp tends to0
. Alsothis means that(1 − p
µ
)
s
isagoodapproximationtowithin±δ
∗
ofW
(s)
atanyandevery
s
and thatthe error an bemade arbitrarilysmall bytakingp
su iently loseto0
(butnotequalto0
).Inaddition,from(22)wemaywriteforsu ientlysmall
p
:ˆ
g
s
= Pr{S ≤ s} = 1 − e
s ln(1−p
µ
)
≈ 1 − e
−p
µ
s
.
Yetthe dfofanexponentialdistribution withparameterγ
is:Pr{T ≤ t} = 1 − e
−γt
.
For
t = s
andγ = p
µ
,thetwoprobabilitiesareequal. Therefore,asp → 0
and∀b
(0)
,the dfof
ψ
s
tendsto oin idewiththe dfofanexponentialdistribution. Sin eµ
is lessand less dependent onb
(0)
as
p → 0
(see Proposition 2), one has for su iently smallp
that an exponential distribution an be used to approximatethetimespentinabasinofaNBN.Remark5 Dependingon
b
(0)
,
ψ
s
maybevery losetoageometri distributionwhilep
isnot small. See Examples1and2below.Suppose that for ea h basin of a NBN the sojourn time
S
is su iently memoryless. Then, under someadditional assumptions that will bedis ussed in se tion5,onemaydeneanother dis rete-timehomogeneousMarkov hain˜
Y
k
whosestatesarethebasinsofthenetwork. (1)Thisnewsto hasti pro ess is oarserthanX
k
sin eonlytransitionsbetweenthebasinsofthenetworkare des ribed. (2) The size of its state spa e is in general mu h smaller thanE
, thesize ofthestatespa eoftheoriginalpro ess. (3)Ithastobeviewedasan approximation. ThetildesymbolinY
˜
k
isusedtore allthatingeneral,abasin doesnotretaintheMarkovproperty(i.e.,ψ
s
is notgeometri ).Example 1 Cal ulation of sojourn time distributions and illustration of the geo-metri approximation problem. Let us take basin B1 of Fig. 1 with
p = 0.02
andb
(0)
i
= 1/6 ∀i = 1, 2, . . . , 6
, andlet us omputeψ
s
andg
s
. Thetwo distributionsare plottedinFig.3a. The ir lesstandfortheprobabilitiesψ
s
(µ = 13.3530
),thepoints forthe probabilitiesg
s
(p
µ
= 1/µ = 0.0749
). Fig. 3bshowsW
(k)
and itsgeometri approximation
(1 − p
µ
)
k
versus time
k
. Maximum deviationδ
∗
between the two is
2.3007%
.Table2gives
µ
andδ
∗
fordierent
p
valuesanddierentinitial onditionsforthe twobasinsofFig. 1 . Ea hof thefour olumnsbelowµ
andδ
∗
orrespondsto one
p
value,fromlefttoright:p = 0.002
,0.02
,0.2
and0.8
. Themeanvaluesofthe olumns arealsogiven(seetherowswithMean). Forthe al ulationofµ
,we onsideredtwo typesofinitial onditions: (1)thenetworkstartsinonestateofthebasin(su essively ea hstateofthebasinwastakenasinitialstate)and(2)b
(0)
i
= 1/B ∀i = 1, 2, . . . , B
(seeUnif. intheTable). Weseethatfor therstthreevaluesof
p
themeansojour timesinB1aresmallerthanthoseinB2(seetherstthree olumnsbelowµ
). ThusB1 islessstablethanB2. Also al ulatedforea hbasinandthefourp
valuesthevariation oe ientofthesojourntimesobtainedwiththersttypeofinitial onditions. The resultsare( olumns1
,2
,3
and4
):0.1876
,1.7880
,9.3454
and15.9039%
forB1;0.1566
,1.3030
,2.5152
and17.0044%
forB2. Thereforeµ
islessandlesssensitivetotheinitial stateasp
de reases,whi hisina ordan ewithProposition2.Anotherimportantobservationinthistableisthat,dependingontheinitial ondi-tions,
ψ
s
maybe losetoageometri distributionwhilep
isnotsmall(seeinthetable thevaluesofδ
∗
when
p = 0.2
). Fig. 4showsthefun tionsδ
∗
(p)
forthetwobasinsof Fig. 1andthe uniforminitial ondition. Weseethat inboth ases
δ
∗
(p) → 0
p → 0
,δ
∗
(0.5) = 0
(ψ
s
isgeometri atp = 0.5
) 7 andδ
∗
(p)
tendstoaglobalmaximum as
p → 1
. ForbasinB1(blue urve),thereisonelo almaximumatp = 0.112
,while for basinB2 (green urve) thereare two lo al maxima, one atp = 0.0530
and the otheratp = 0.3410
,and onelo alminimumatp = 0.2600
(0.0677%
). Asmentioned above,fun tionδ
∗
(p)
dependsoninitial onditions. Forexample,ifthenetworkstarts instate
0000
,thenB2hasstillonelo alminimumbutthisnowo ursatp = 0.3640
(0.0830%
).Example2 Inthisexample,weillustratethefa tthatwhile
p
isnotsmall,ψ
s
may be lose toageometri distribution,dependingonb
(0)
. Wetake basinB1 ofFig. 1 and al ulate
ψ
s
wheninitiallythenetworkis(a)instate0101
and(b)instate0110
. Fig. 5showstheresults. As anbeseeninthegure,whenthenetworkstartsin0110
,ψ
s
is lose toageometri distributionwhi hisnotthe ase whentheinitialstate is0101
.To omparethetwoapproximations,we omputedthevarian es
σ
2
and
σ
2
g
ofψ
s
and
g
s
respe tivelyandthetotalvariationdistan ed
T V
betweenψ
s
andg
s
(another probabilitymetri )whi hisgivenby:d
T V
=
1
2
X
s≥1
|ψ
s
− g
s
|.
ThevaluesoftheseparametersaregiveninTable 3 . Weseethatthetotal variation distan eisabout
10
timesgreaterwhentheinitialstateis0101
thanwhenitis0110
. Themoregeometriψ
s
,thesmallerd
T V
,thebettertheapproximationsσ
2
≈ σ
2
g
andµ ≈ λ
∗
.
Tillnow,wehaveassumedea hnodeofthenetworkhasthesame probabil-ityofbeingperturbed. InExample
3
below,welookathowthenetworkofFig. 1behaveswhenoneof itsnodes isperturbed withaprobabilitywhi his high ompared to the other nodes. Suppose nodes represent proteins. Within the ellinterior, some proteinsmay be moresubje tto ompeting rea tionsthan others. Theserea tionsmaybeassumedtoa trandomly8
,eithernegativelyor positively,onthestateofthetargetproteins. Forexample,somerea tionsmay leadto protein unfolding while others, like those involving mole ular haper-ones,may res ue unfoldedproteins(Dobson,2003). Onthe other hand,some proteinsmaybemoresensitivetophysi o- hemi alfa tors,liketemperatureor pH,in reasingtheirprobabilitytobeperturbed.
Example3 Inthisexample,itisassumedthatonenodeisperturbedwitha proba-bilitywhi hishigh omparedtotheothernodes. Thebehaviourofthenetworkwhen
p → 1
is omplexand will notbedis ussed here so we takep
1
= p
2
= p
3
= 10
−3
(therstthreenodesare rarelyperturbed)and
0 < p
4
≤ 0.9
. Fig. 6shows¯
z
for two valuesofp
4
. Forp
4
= 0.04
(thebluepoints)the networkbehavesasifall thenodes hadthe sameprobabilityp = 0.04
ofbeing perturbed(seeFig. 2 , theblue points). Simulationshave shown this to be true whateverthe three rarelyperturbednodes. Whenp
4
in reasesin]0, 0.9]
,thestationarystateprobabilitiesdonottendtobeequal (seeFig. 6 , the asep
4
= 0.9
), rather,sometransient states, typi allystate0011
of B2,tendtobemorepopulatedattheexpenseofattra torstates(seeattra torstates7
Whatevertheinitial onditions,at
p = 0.5
,µ = 8/5
forB1and8/3
forB2. See5.1.2for analyti alexpressionofµ
whenp = 0.5
.8
0
10
20
30
40
50
0
0.01
0.02
0.03
0.04
0.05
0.06
0.07
0.08
Sojourn time (s)
P
ro
ba
bi
lit
ie
s
(a)
0
10
20
30
40
50
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Time (k)
W
(k
)
(b)
Figure 3: Sojourn time distribution and geometri approximation for basin B1 of Fig. 1 with
p = 0.02
andb
(0)
i
= 1/6 ∀i = 1, 2, . . . , 6
. (a) Cir les: sojourn timedistributionψ
s
(with meanµ = 13.3530
); points: approximatinggeometri distributiong
s
(with parameterp
µ
= 1/µ = 0.0749
). (b) Cir les:W
(k)
; points: geometri approximation(1 − p
µ
)
k
. Maximum deviationδ
∗
is2.3007%
.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0
5
10
15
20
25
30
35
p
δ
∗
(p
)
Figure 4: Maximumdeviation
δ
∗
(in
%
)as a fun tion ofp
for the twobasinsof Fig. 1 andinitial onditionsb
(0)
i
= 1/B ∀i = 1, 2, . . . , B
. Blue urve: basin B1. Green urve: basinB2. One hasδ
∗
(0.5) = 0
forbothbasins(
ψ
s
is geometri atp = 0.5
).Ini. ond.
µ
δ
∗
B10001
126.00 13.52 2.37 1.54 0.39 3.44 7.47 10.390101
126.00 13.52 2.39 2.07 0.39 3.47 8.49 4.090110
125.50 13.02 1.94 2.28 0.00 0.00 0.92 10.661001
126.00 13.52 2.39 2.07 0.39 3.47 8.49 4.091010
125.50 13.02 1.94 2.28 0.00 0.00 0.92 10.661101
126.00 13.52 2.37 1.54 0.39 3.44 7.47 10.39 Mean 125.83 13.35 2.23 1.96 0.26 2.30 5.63 8.38 Unif. 125.83 13.35 2.23 1.96 0.26 2.30 4.59 2.72 B20000
252.85 27.68 4.27 3.77 0.39 2.96 2.31 5.400010
251.86 26.81 4.08 2.25 0.00 0.11 0.50 20.870011
252.36 27.26 4.22 3.77 0.20 1.63 2.59 5.420100
251.86 26.80 4.02 3.60 0.00 0.12 1.99 0.930111
251.86 26.80 4.02 3.60 0.00 0.12 1.99 0.931000
251.86 26.80 4.02 3.60 0.00 0.12 1.99 0.931011
251.86 26.80 4.02 3.60 0.00 0.12 1.99 0.931100
252.36 27.26 4.22 3.77 0.20 1.63 2.59 5.421110
251.86 26.81 4.08 2.25 0.00 0.11 0.50 20.871111
252.85 27.68 4.27 3.77 0.39 2.96 2.31 5.40 Mean 252.16 27.07 4.12 3.40 0.12 0.99 1.88 6.71 Unif. 252.16 27.07 4.12 3.40 0.12 0.81 0.24 2.81Table2: Meansojourntime
µ
andmaximumdeviationδ
∗
(in
%
)forthetwobasins of Fig. 1. Twotypes of initial onditions have been onsidered: (1) the network startsinonestateofthebasin(su essivelyea hstateofthebasinistakenasinitial state)and(2)ea helementofb
(0)
isonedividedbythesizeofthebasin(seeUnif. intheTable). The four olumnsbelowaparameter(
µ
orδ
∗
) orrespondfromleft toright to
p = 0.002
,0.02
,0.2
and0.8
. The mean valueforea h olumn isalso indi ated(seeMean). Attra torstatesareinboldtype.0101
0110
δ
∗
(%) 8.49 0.92d
T V
(%) 10.75 1.18σ
2
2.38 1.91σ
2
g
3.34 1.82λ
∗
2.00 2.00µ
2.39 1.940
2
4
6
8
10
12
14
16
18
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
s
P
ro
ba
bi
lit
ie
s
(a)
0
2
4
6
8
10
12
14
16
18
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
s
P
ro
ba
bi
lit
ie
s
(b)
Figure5: Comparisonbetweentwogeometri approximations. Twodierentinitial statesaretakenin basinB1 ofFig. 1with
p = 0.2
(seetable2). Cir les: probabil-itiesψ
s
. Points: geometri approximationg
s
. (a)Initial state0101
.δ
∗
= 8.49%
,µ = 2.39
. (b)Initialstate0110
.δ
∗
= 0.92%
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0000
0001
0010
0011
0100
0101
0110
0111
1000
1001
1010
1011
1100
1101
1110
1111
Boolean states
S
ta
ti
on
ar
y
pr
ob
ab
ili
ti
es
¯z
i
Figure6: Stationarystate probabilitiesforthenetwork shown inFig. 1and non-equiprobablenodeperturbations. The rstthreenodesarerarely perturbed:
p
1
=
p
2
= p
3
= 10
−3
. Blue points:p
4
= 0.04
; red points:p
4
= 0.9
. Attra torstates areinboldtype.1000
,1010
,1011
and1110
inFig. 6 ) 9.
Fig. 7 ashows distribution
ψ
s
with approximating distributiong
s
for basinB2,p
4
= 0.1
andstate0011
as initialstate. Maximumdeviationδ
∗
is
8.2011%
. Notethe splittingofψ
s
into twosubdistributions, one for odd frequen ies and the other for evenfrequen ies. ProbabilitiesW
(k)
andapproximations
(1 − p
µ
)
k
areplottedinFig. 7b.
4 The geometri approximationin
(n, C)
networks In Example 1, we illustrated, using the network of Fig. 1, the fa t that asp
tendsto0
,the dfofthetimespentinabasinofattra tionapproa hesthe df of ageometri (resp. exponential) distribution and that theexpe ted sojourn timetends tobeindependentof initial onditions(see thede reasesofδ
∗
and 9Atthe ellpopulationlevel,thismeansthata ellpopulationwouldexhibitphenotypes that ouldnothavebeenobserved(ornot easilyobserved) inthe low
p
regime. Notethat thesephenotypesarenotne essarilysurvivableforthe ells.0
10
20
30
40
50
60
70
0
0.01
0.02
0.03
0.04
0.05
0.06
0.07
0.08
0.09
0.1
Sojourn time (s)
P
ro
ba
bi
lit
ie
s
(a)
0
10
20
30
40
50
60
70
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Time (k)
W
(k
)
(b)
Figure 7: Sojourntimedistribution andgeometri approximation for basin B2 of Fig.1. Therstthreenodesarerarelyperturbed:
p
1
= p
2
= p
3
= 10
−3
;
p
4
= 0.1
. Initially, the network is in state0011
. (a) Cir les: distributionψ
s
(with meanµ = 22.9653
); points: approximating geometri distributiong
s
(with parameterp
µ
= 0.0435
). Forthesakeof larity,sojourntimess > 70
havebeenomitted. (b) Cir les:W
(k)
;points: geometri approximation
(1 − p
µ
)
k
. Maximumdeviation
δ
∗
variation oe ient as
p
be omes smaller, respe tively in Table 2 and in the text). As will be seen later, for this network,a two-state dis rete-time (resp. ontinuous-time) homogeneousMarkov hain may be used for approximating basin transitions in the lowp
regime,i.e., a oarse-graineddes riptionof this network existsinthelowp
regime.Nowweaddresstheproblem ofgeometri approximationin randomly on-stru ted
(n, C)
networks. Arandom(n, C)
networkisbuiltbyrandomly hoos-ingforea hnodeC
inputsandoneintera tionfun tion (Kauman,1993).4.1
(n, C)
networks and onden e intervalsSix
(n, C)
ensembleswereexaminedtakingn = 6
,8
or10
(E = 64
,256
or1024
) andC = 2
or5
. Ea hbasinofarandomly onstru tednetwork wasperturbed withp = 0.002
,0.01
,0.02
,0.1
,0.3
,0.5
and0.8
.A ordingtothe lassi ationofKauman(1993),
(n, 2)
networksare om-plexwhile(n, 5)
onesare haoti . Onedieren ebetweenthesetwoensembles of networks,whi h is of parti ular interest here, is that in the former If the stabilityof ea h state y leattra tor isprobed by transient reversing ofthe a -tivity of ea h element in ea h state of the state y le, then for about80
to90
per ent ofall su hperturbations,the systemowsba ktothe samestate y le. Thusstate y lesareinherentlystabletomostminimaltransientperturbations. (Kauman, 1993, p. 201). In haoti networks, the stability of attra tors to minimalperturbationsisat best modest(Kauman,1993,p. 198).For ea h
(n, C)
ensemble, we generated about2500
basins (whi h orre-sponds to about700
to800
networks depending on the ensemble). We es-tablished onden e intervals for three statisti al variables of whi h two are probabilities:1. Consideran
n
-node network. Forea h of its basins, one an dene two onditional probabilities: (1)the onditional probabilityα
of leavingthe basingiventhatoneattra tor bitoutofnA
hasbeenipped,withA
the size oftheattra tor,and (2)the onditional probabilityβ
ofleaving the basingiventhatonebasinbitoutofnB
hasbeenipped.In ea h basin sample, a small proportion of basins having
α = 0
were found. The25
thper entileofthe relativesizeB/E
ofα = 0
basinswas morethan0.8
forC = 2
networks and0.9
forC = 5
ones. Thusα = 0
basinsaremostoftenbigbasins. We al ulated2
ratios: ratioκ
¯
between the median mean sojourn times ofα = 0
andα > 0
basins and ratioκ
∗
betweenthemaximummeansojourntimesofthetwotypesofbasins. TheseratiosaregiveninTable4for
(n, 2)
networksandfourp
values. In the aseofα = 0
basins,the omputationofψ
s
forp = 0.01
andstopping riterionψ
ˆ
s
> 0.9999
variesbetweenafewminutestoseveraldaysusinga PowerEdge2950serverwith2
Quad-CoreIntelXeonpro essorsrunningat 3.0GHz. As anbeseenfromthistable,withp
de reasing,themaximum meansojourntimeofα = 0
basinsin reasesdrasti ally omparedtothatp
0.03 0.05 0.1 0.5¯
κ
n = 6
30.1612 19.9012 12.1390 6.8534n = 8
44.3541 26.0081 15.1438 11.9929n = 10
35.5351 21.6376 14.5090 10.3930κ
∗
n = 6
122.3414 19.3446 3.6270 1.0000n = 8
300.3482 32.4353 3.1643 1.0000n = 10
542.8109 67.3100 4.9887 1.0000Table 4: Comparison between medianmean sojourn times of
α = 0
andα > 0
basins(ratioκ
¯
)andbetweenmaximummeansojourntimesofbothtypesofbasins (ratioκ
∗
)for
(n, 2)
networksandfourp
values. Cal ulationofψ
s
: attime0
,the statesof thebasinareequiprobable. The proportionsofα = 0
basinsfor samplesn = 6
,8
and10
werefoundtobe4.0759
,3.1989
and2.5971%
respe tively.of
α > 0
basins 10. Theproportion of
α = 0
basins variesin oursamples from1.7
to4.1%
dependingontheensemble. ForxedC
,itisade reasing fun tionofn
andforxedn
,itissmallerinthe haoti regimethaninthe omplexone. Although we dobelievethatα = 0
basins havenot to be reje tedfromabiologi alinterpretationperspe tive,networkswithatleast oneα = 0
basinwereomittedduringthesamplingpro edure(essentially be ause of the high omputational time that is needed to omputeψ
s
whenα = 0
andp
issmall).Fig. 8 shows the onditional probabilities
α
for sample(8, 2)
. For ann
-nodenetwork,one andenen
mainprobabilitylevels1, 2, . . . , n
orre-spondingto probabilities1/n, 2/n, . . . , 1
. The per entage ofdata points lo atedinmainprobabilitylevelsaregiveninTable5forthesixsamples and thetwo onditional probabilitiesα
andβ
. It anbe seenfrom this tablethat whateverthe onne tivityandthe onditionalprobability,this per entagede reaseswhenn
in reases. Foragiven onditionalprobability andanyn
,itisgreaterinthe omplexregimethaninthe haoti one. Thestatisti al analysis of thesix data sampleshas shownthe following. ForC = 2
,there is a learquantizationofα
andaweakoneofβ
. Also thehistogramsofα
andβ
are symmetri (skewnessequalto∼ 0.2
). ForC = 5
,thequantizationofα
isweakandthereisnoobviousquantization ofβ
. Thehistogramsofα
andβ
arenegativelyskewed(skewnessequalto∼ −0.9
)andthelasttwomainprobabilitylevels(n−1)
andn
arestrongly populated omparedto theotherones.10
For
α = 0
basins,the onditionalprobabilityα
x
ofleavingthebasingiventhat2 ≤ x ≤ n
bitsofanattra torstatehavebeenperturbedmaybenonnull.HoweverL(p; n, x) = o(p)
for2 ≤ x ≤ n
. Ontheotherhand,fortransientstatesandsu ientlylargek
onehasˆ
b
(k)
i
→
0
0
500
1000
1500
2000
2500
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Basins
α
Figure8: Conditionalprobabilities
α > 0
sortedin as endingorder,sample(8, 2)
. About74%
ofthedatapointsarein main probabilitylevels. These orrespondto probabilities1/8, 2/8, . . . , 1
.C
n
α
β
2
6
78.6 52.08
73.9 45.910
70.2 35.35
6
56.6 31.58
50.8 23.510
43.6 15.4Table 5: Per entages of datapoints lo ated in main probability levels for the six ensembles
(n, C)
andthetwo onditionalprobabilitiesα > 0
andβ > 0
.α
β
C
n
Mean Median Mean Median2
6
0.5057 0.5000 0.4817 0.50008
0.4523 0.4375 0.4294 0.414410
0.4149 0.4000 0.3907 0.40005
6
0.6869 0.7917 0.6852 0.77788
0.7032 0.8125 0.7038 0.821910
0.7050 0.8125 0.7106 0.8375Table6: Meanandmedianof onditionalprobabilities
α > 0
andβ > 0
forthesix ensembles(n, C)
.Table6 givesthe mean and median of probabilities
α
andβ
for thesix ensembles(n, C)
. Forxedn
, themeanand median ofboth onditional probabilities arehigher for haoti basins than for omplex ones. Addi-tionally,forC = 2
basins,themeanandmediande reasewithn
whileforC = 5
,bothin reasewithn
(ex eptforthemedian ofα
).2. The third statisti al variable is the mean time spent in a basin,
µ
. ForC = 2
andanyn
,thereisa learquantizationatp = 0.002
whi hrapidly disappearsasp
in reases. ForC = 5
andanyn
,thereareonlytwolevels ofquantizationatp = 0.002
andtheserapidlyvanishasp
in reases. Also noti e that the minimum ofµ
is equal to the mean spe i pathd = τ
whi h, a ordingto(7), doesnotdependonC
.Most ofthe limitsof the
95%
onden e intervalsranged between1.5
and2.5%
(forα
andβ
: onden e intervalforthe mean ifC = 2
, forthe median andforthetrimmedmeanifC = 5
;forµ
: onden e intervalforthetrimmed mean)11 .
4.2 Simulation results and dis ussion Mostofthebasinvariables(su has
µ
orδ
∗
)werepositivelyskewed. Therefore forea hofthesevariableswe al ulatedquartiles
Q
1
,Q
2
(themedian)andQ
3
.Forthe al ulationof
µ
,twotypesofinitial onditionswere onsidered:1. Ea helementof
b
(0)
isequalto
1/B
. We allthis onditiontheuniform initial ondition.11
Twomethodswereused:anonparametri methodbasedonthebinomialdistribution(for the medianonly) andthe bootstrap method (usedforthe medianand the trimmedmean). Forthetrimmedmean,weaveragedthesampledatathatwere(1)betweenthe
25
thand75
th per entiles,(2)lessthanthe75
thper entileand(3)lessthanthe90
thper entile.in that state(the elementsof
b
(0)
areall
0
ex ept onewhi his equalto1
). We allthis onditiontherandominitial ondition.Let's start with the basin variable
µ
. First the uniform initial ondition. Coe ients of skewness forµ
distributions mostly ranged from5
to15
with meanof10.0568
(strongskewness). Forexample,forsample(8, 5)
withp = 0.01
, weobtaineda oe ient ofskewnessof10.2415
, ameanof44.7471
,amedian of15.7922
, a75
th per entile of26.0523
, a99
th per entile of608.4743
and a maximumof3.0117 × 10
3
. A small proportionof the meansojourntimesare thereforeveryfarfrom the median (takenhereasthe entral tenden y). The threequartilesof
µ
versusln p
areshowninFig.9forthesixensembles(n, C)
. The blue squares orrespond toC = 2
and the red triangles toC = 5
. The size of a symbol is proportional ton
. Forthe sakeof larity, the abs issæ of thepoints orresponding toensembles(6, C)
and(10, C)
havebeentranslated (respe tivelytotheleftandtotherightofthep
values).It an beseenfromFig. 9that: (1)forxed
p
andanyC
,themedianofµ
is ade reasingfun tion of the sizen
of the network(not trueatp = 0.5
and0.8
),althoughα
de reaseswithn
(seeTable6). (2)ForxedC
andanyn
,the medianofµ
isade reasingfun tionofp
. (3)Forxedp
andanyn
,medianµ
of haoti basins(C = 5
)islessthanthatof omplexones(C = 2
). Atp = 0.002
, themediansofµ
forC = 5
ensemblesarerespe tively1.58
,1.86
and2.02
times smallerthan those forC = 2
ones. Similarbut smallervalueswere foundforp = 0.01
andp = 0.02
. Wetherefore onrmtheresultsofKauman (1993, p. 198,201and488-491)that haoti basinsarelessstabletonodeperturbations than omplexones.We looked forthe relationshipbetweenthe median of
µ
andp
. Wefound thatforsu ientlysmallp
:Q
(2;µ)
≈
c
2
p
,
(27)i.e. the median of the mean sojourntime is inversely proportional to
p
. The proportionality onstantc
2
has been estimated by the least squares method forthesixensembles(n, C)
and wasfoundto de reasewhenn
in reases(only therstthree data pointswere tted,i.e. thepoints withabs issap = 0.002
,0.01
and0.02
). ForC = 2
, we obtainedc
2
= 0.3342
,0.2871
and0.2510
; forC = 5
,c
2
= 0.2117
,0.1547
and0.1242
. Thusforxedn
, haoti basinsareless sensitivetoavariationinp
than omplexones. Theresultoftheleastsquares tispresentedinFig. 10. Graph(a)showsthehyperboli relationshipbetween themedian ofµ
andp
forthesixensembles(n, C)
. Forthesakeof larity, the fun tions were drawnup top = 0.05
. Whenalogarithmi s ale forea h axis is used, oneobtains the graph(b) whi h showsfor the three ensembles(n, 2)
a linear relationship betweenln Q
(2;µ)
andln p
at lowp
(up top = 0.02
on thegraph)12
. Thesameruleappliesto thethreeensembles
(n, 5)
. Wewillsee furtherhowtoexpressc
2
infun tion ofn
andmedianα
probability.12
0.002
0.01
0.02
0.1
0.3
0.5 0.8
0
50
100
150
200
250
300
p
Q
ua
rt
ile
s
of
µ
Figure9: Medianandinterquartilerangeofmeansojourntime
µ
versusln p
forthe sixensembles(n, C)
. Cal ulationofψ
s
withtheuniforminitial ondition. QuartilesQ
1
andQ
3
areindi atedbyhorizontalbars. Bluesquares: omplexregime(C = 2
); redtriangles: haoti regime(C = 5
). Thesize ofasymbol(squareortriangle)is proportionalton
.0
0.01
0.02
0.03
0.04
0.05
0
20
40
60
80
100
120
140
160
180
p
M
ed
ia
n
µ
(a)
0.002
0.01
0.02
0.1
0.3
0.5 0.8
1
5
10
50
150
p
M
ed
ia
n
µ
(b)
Figure10: Fitof(27)bytheleastsquareste hnique. (a)Linears aleforbothaxis, tsfor thesixensembles
(n, C)
. Forthesakeof larity, thepointswithp > 0.05
have been omitted. Blue squares:C = 2
; red triangles:C = 5
. The size of a symbol is proportional ton
. (b) Logarithmi s ale for both axis (ln-ln), ts for ensembles(n, 2)
only. Thesizeofasymbolisproportionalton
.With therandominitial ondition,thequartilesof
µ
arevery losetothose obtainedwiththeuniforminitial ondition.We then al ulatedfor ea h basin the relative errorbetween themean so-journtimeobtainedfrom theuniform initial onditionandthatobtainedfrom therandominitial ondition. This erroris nullat
p = 0.5
(as forthenetwork ofFig. 1,thisisbe auseψ
s
isgeometri atp = 0.5
). Thequartilesoftheerror tendto0
asp → 0
andtheyrea hamaximumatp = 0.1
whateverthenetwork ensemble. Thusthesmallerp
,themoreµ
isindependentofinitial onditions.To end, we turn to the maximum deviation
δ
∗
. First the uniform initial ondition. Theresultsforthesix
(n, C)
ensemblesarepresentedinFig. 11. Atp = 0.002
andforxedC
,thequartilesin reaselinearlywithn
,ex epttherst quartileofC = 2
basinswhi his onstantandapproximatelyequalto0.0005%
(whatevern
,25%
ofC = 2
basins have a sojourn time that is geometri or loselyfollowsageometri distribution).Forboth onne tivities,thefun tions
Q
(2;δ
∗
)
(p)
andQ
(3;δ
∗
)
(p)
have similar-itieswiththefun tionsδ
∗
(p)
ofFig. 4: theytend to
0
whenp → 0
,theyhave atleastonelo almaximumin0 < p < 0.5
andarenullatp = 0.5
.With therandom initial ondition, these ond andthird quartiles of
δ
∗
in- reaseformostofthesixensembles
(n, C)
omparedtotheuniform ase. Qual-itatively, thebehaviourof thequartiles with respe ttop
is similar tothe one foundwiththeuniforminitial ondition.Tosummarizese tions3and4,wehavethefollowingproposition:
Proposition 3 Consider abasinof anoisy Boolean network. As
p
tends to0
then:1. whateverthe initial onditions,
µ → ∞
.2.
µ
tends to be independent of the initial onditions:µ → λ
∗
with
λ
∗
> 1
the Perron-Frobeniuseigenvalue offundamentalmatrix(I − Q)
−1
. 3. whatever the initial onditions,δ
∗
→ 0
,where
δ
∗
is the Kolmogorov dis-tan e between the umulative distributionfun tion of the sojourn time
S
inthebasinandthatofageometri distributiong
s
havingthesamemean asS
.4.
g
s
onvergestoan exponentialdistribution.Proposition 3 doesnot guaranteethe existen e ofa oarser representation ofaNBNin thelow
p
regime. What anbesaidfrom thispropositionis that if su h a representation exists, then it is in general an approximation of the originalpro essandit analwaysbeexpressedinadis reteor ontinuoustime framework. Thusweareleadtothepropositionbelowthatwillbedis ussedin moredetailsin thenextse tion:thelinearizedproblem:
Y
2
= a
2
−
X
whereY
2
= ln Q
(2;µ)
,a
2
= ln c
2
andX = ln p
.Onlythe rstthreedatapointsweretted.0.002
0.01
0.02
0.1
0.3
0.5 0.8
0
0.5
1
1.5
2
2.5
p
Q
ua
rt
ile
s
of
δ
∗
Figure11: Geometri approximationfor
(n, C)
ensembles. Medianandinterquartile rangeofδ
∗
(in
%
)versusln p
. Cal ulationofψ
s
withtheuniforminitial ondition. Bluesquares:C = 2
; red triangles:C = 5
. Asp → 0
, the threequartilesofδ
∗
oarser time-homogeneous representation of this network in the low
p
regime. Then the dynami s between the basins of the network may be approximated by a system of linear ordinary dierential equations with size being equal to the numberofattra torsof the network.Re all that Proposition 3 does not mean that
ψ
s
an never be geometri norbeapproximatedbyageometri distributionwhenp
isnotsu iently lose to0
. For example we knowψ
s
is geometri whenp = 0.5
or whenB = 1
whatever0 < p < 1
. Onedieren ebetweenastronglyandaweaklyperturbed networkisthatinthelatter,sin ethemeanspe i pathd
islarge omparedto1
,the network spends longperiods ontheattra torswithoutbeingperturbed. Undernormal onditions,bio hemi alnetworksaresupposedtoworkinthelowp
regimebe auseas explainedin 1.2, this regimeis asso iatedwith fun tional stability.4.3 Approximation formula for the mean time spent in a basin of a NBN
Forsu ientlysmall
p
,(i, j)
th elementofΠ
′
anbeapproximatedby(i 6= j
):π
′
ij
= p
h
ij
q
n−h
ij
≈
p
ifh
ij
= 1,
0
ifh
ij
> 1.
From(24)thenwemaywrite:
p
e
(k, k + 1) ≈ p
X
i∈
BΓ
1
i
ˆb
(k)
i
.
(28)Wesee that in thelow
p
regime,theprobabilityp
e
(k, k + 1)
depends ontime andinitial onditions.Nowforsu ientlysmall
p
, wehavethefollowing:µ ≈ 1/npα,
(29)whi h, fromProposition2,isequivalentto:
lim
p→0
npαλ
∗
= 1.
Toshowapproximationformula(29),we onsidertwo ases:
1. Supposeforanygiven
0 < p < 1
,theelementsofve tora
areequal. Then from(24)p
e
(k, k + 1)
is onstantwhi hmeansψ
s
geometri . Thustakingp
su ientlysmall,Γ
1
i
mustbethesameforallstatei ∈
Bandtherefore from(28):1/µ = p
µ
≈ pΓ
1
= npα,
(30)with