SEMI-MARKOV PROCESSES FOR RELIABILITY STUDIES
CHRISTIANE COCOZZA-THIVENT AND MICHEL ROUSSIGNOL
Abstract. We study the evolution of a multi-component system which is modeled by a semi-Markov process. We give formulas for the avaibility and the reliability of the system. In ther-positive case, we prove that the quasi-stationary probability on the working states is the normalised left eigenvector of some computable matrix and that the asymptotic failure rate is equal to the absolute value of the convergence parameter
r.
1. Introduction
The motivation of this paper comes from reliability studies. We consider a system whose possible states form a nite set
E
. The setE
is splitted into two subsets, M corresponding to the working states and P corresponding to the failure states. We are interested in the time evolution of the system.There is an important literature on this subject when the evolution is mod- eled by a Markov process (cf Pages and Gondran (1980)). However there are cases where the evolution cannot be modeled by a Markov process, but can be modeled by a semi-Markov process. In this case by standard tech- niques on semi-Markov processes (cf Cinlar (1975)), it is possible to obtain formulas for the availability and the reliability. Furthermore, in order to obtain a description of this evolution up to the rst failure time, it is inter- esting to compute the quasi-stationary distribution onM. The existence of quasi-stationary distributions for semi-Markov processes has been proved in Cheong (1970). The main goal of this paper is to give a method to compute this distribution in the context of reliability studies. In addition we prove that the convergence parameter is equal to the asymptotic failure rate.
In section 2 we recall classical properties of semi-Markov processes and we give a formula to compute the Laplace transform of the availability. In section 3 we dene a transient semi-Markov process which allows to compute the Laplace transform of the reliability. A family
A
(s
) of matrices appears in the formulas of availability and reliability. In the Markov case these matrices are equal to the generating matrix of the process. In section 4 we recall properties ofr
-recurrent andr
-positive semi-Markov processes wherer
is the convergence parameter. If the process isr
-recurrent, we prove thatr
is the Perron-Frobenius eigenvalue of the matrixA
1(r
), restriction toMof the matrixA
(r
). In section 5 we prove that when the process isr
-recurrent, with additional hypotheses, the process isr
-positive and that the quasi-stationary distribution on M is the normalized left eigenvector of the matrixA
1(r
).URL address of the journal: http://www.emath.fr/ps/
Received by the journal October 20, 1995. Revised September 16, 1996 and January 14, 1997. Accepted for publication February 14, 1997.
c Societe de Mathematiques Appliquees et Industrielles. Typeset by LATEX.
This gives a practical method to compute the quasi-stationary distribution onM. We prove in section 6 that under natural hypotheses, the asymptotic failure rate of the system is equal to j
r
j. We give in section 7 a numerical example.2. Some properties of a semi-Markov process
The semi-Markov process (
Y
t)t 0 is dened by means of a random se- quence (X
nT
n)n2Nwhere (X
n)n2Nrepresents the dierent positions of the process and (T
n)n2Nthe jump times. The process (X
n)n2Ntakes its values in a nite setE
and (T
n)n2Nis an increasing sequence of positive random variables withT
0 = 0. We have:Y
t=Xn 0
X
n1fTnt<Tn+1gThe random properties of the process are characterized by the following semi-Markov property. For any Borel set
A
ofR+, any statej
inE
and anyn
in N, we have:P(
X
n+1=jT
n+1;T
n2A=X
0::: X
nT
0::: T
n)= P(
X
n+1=jT
n+1;T
n2A=X
n)=
Z
A
Q
(X
njdu
)where
A
!Q
(ijA
) is a bounded measure on R+ for alli
andj
inE
. We shall use results on semi-Markov processes which can be found in Cinlar (1975). The process (X
n)n 0 is a Markov chain with transition probabilitiesP
(ij
) =Q
(ij
R+) =RR+Q
(ijdu
). Letf
(idu
) be the probability law ofT
n+1;T
n conditionally onfX
n=i
gandh
(it
) =R]t+1f
(idu
). We have:f
(idu
) =Xj2E
Q
(ijdu
)Let
Q
(n)(ijdu
) be the probability law of (X
nT
n) conditionally onfX
0 =i
g:P(
X
n=jT
n2A=X
0=i
) =Z
A
Q
(n)(ijdu
)We have
Q
(0)(ij:
) =I
(ij
)0(:
) whereI
is the identity matrix onE
E
and 0 the Dirac measure on 0,Q
(1)(ij:
) =Q
(ij:
),Q
(n+1)(ik:
) =P
j2E
Q
(ij:
)Q
(n)(jk:
).We dene the renewal Markovian kernel by:
R
(ijdu
) = Xn 0
Q
(n)(ijdu
):
We have:P(
Y
t=j=Y
0=i
) =Z
0t]
h
(jt
;s
)R
(ijds
):
We shall deal with Laplace transforms
Q
(ijs
),f
(is
),Q
(n)(ijs
),R
(ij
s
),h
(is
),P
(ijs
) respectively of measuresQ
(ijdu
),f
(idu
),Q
(n)(ij
du
),R
(ijdu
) and of functionst
!h
(it
),t
! P(Y
t =j=Y
0 =i
). LetQ
(s
),Q
(n)(s
),R
(s
),P
(s
) be the associated matrices. We have:Q
(n)(s
) = (Q
(s
))nf
(is
) =Xj2E
Q
(ijs
)s h
(is
) = 1;f
(is
)R
(s
) = Xn 0
(
Q
(s
))nP
(ijs
) =R
(ijs
)h
(js
)Since the measures
Q
(ijdu
)are bounded, the Laplace transformsQ
(ijs
),Q
(n)(ijs
) andf
(is
) are nite fors
non negative.We get the following result.
Proposition 2.1. Let us dene the matrix
A
(s
) onE
E
byA
(ijs
) =Q
(ijs
)h
(is
)if i
6=j A
(iis
) =; Xfj:j6=ig
Q
(ijs
)h
(is
)If
s
is strictly positive, the Laplace transform of the transition probabilities matrix is given by:P
(s
) = (sI
;A
(s
));1Proof. We know that:
P
(kjs
) =R
(kjs
)h
(js
). ReplacingR
(kjs
) by Ph(k js)(js) in the equationR
=I
+R
Q
gives:P
(ijs
)h
(js
) =I
(ij
)+Xk 2E
P
(iks
)h
(ks
)Q
(kjs
) The relationh
(is
) = 1;fs(is) can be written:h
(1js
) =s
+f
(js
)h
(js
) =s
+Xk 2E
Q
(jks
)h
(js
) So we get:sP
(ijs
)+P
(ijs
)Xk 2E
Q
(jks
)h
(js
) =I
(ij
)+Xk 2E
P
(iks
)Q
(kjs
)h
(ks
) This relation is equivalent to:P
(s
)(sI
;A
(s
)) =I
which is the desired result.In the Markovian case, the matrices
A
(s
) are constant and equal to the generating matrix of the Markov process. The formula above is a generali- sation of a well-known formula in the Markovian case.3. The transient process
To compute the reliability of the system, we dene a new semi-Markov process (
Z
t)t 0 which behaves exactly like the process (Y
t)t 0 until the rst failure time. However, when the process reaches a failure state, it stays in failure states forever. Let ~Q
(ijdu
) be the measures dening this new process, then we get:for
i
in M: ~Q
(ijdu
) =Q
(ijdu
)for
i
in P andj
in M: ~Q
(ijdu
) = 0for
i
in P andj
in P: ~Q
(ijdu
) = anything.Of course the new process is not irreducible in
E
.From now on, we suppose that M is a transient irreducible set for the process (
Z
t)t 0.We denote by \tilde" letters the quantities related to the process (
Z
t)t 0. LetQ
1(s
) andR
1(s
) be the restrictions of the matrices ~Q
(s
) and ~R
(s
) toMM
It is easy to check that
8
ij
2M: ~A
(ijs
) =A
(ijs
) ( ~Q
(s
))n(ij
) = (Q
1(s
))n(ij
)R
1(s
) = Xn 0
(
Q
1(s
))nR
1(s
) =I
1+R
1(s
)Q
1(s
) (3.1) whereI
1 is the identity matrix on MM.3.1. Some definitions
Let ~
r
(ij
) be the convergence abscissa of the Laplace transformQ
1(ijs
)and
r
~= supij
r
~(ij
):
Remark 3.1. If there exist
i
andj
(in M) such thatQ
1(ij
~r
) = +1, we shall say that we are in the usual case. The reason for this terminology is that this is the case for measures with rational Laplace transforms.It was shown in Cheong (1968) that all the convergence abscissas of the Laplace transforms
R
1(ijs
) are equal (since the setMis irreducible). We denote this number byr
.Definition 3.2. The common convergence abscissa of the Laplace trans- forms
R
1(ijs
) is called the convergence parameter.We get:
r
~r
0:
Let
(s
) be the Perron-Frobenius eigenvalue of ~Q
1(s
). This function en- ables us to computer
.Remark 3.3.
1. The function
s
! (s
) is decreasing (Seneta (1973)) and continuous (since the coecients of the matrixQ
1(s
) are continuous) on the in- terval ]~r
+1.2. If
(0) exists, then, since M is a transient irreducible set, the Perron Frobenius theorem gives(0)<
1.3. If
(s
) tends to a limit strictly greater than 1 ass
decreases to ~r
(which is true in the usual case - see remark 3.1 - ) and if (0)<
1 then the greatest values
such that (s
) = 1 is the convergence parameterr
by formula (3.1).3.2. Reliability
In order to compute the reliability of the system, we must compute the transition probabilities of the new process.
Proposition 3.4. Suppose that for
s > r
,Q
(ijs
)<
1 for alli
in M andj
in P. IfA
1(s
) is the restriction of the matrixA
(s
) to MM, then we have:8
s > r P
~1(s
) = (sI
1;A
1(s
));1where ~
P
1(s
) is the restriction to MM of the Laplace transforms matrix of the transition probabilities for the new process.Proof. For
s > r
we havef
(is
)<
1. It is easy to check thatf
(is
)<
1 impliesh
(is
)<
1. Hence we can prove proposition 3.4 in the same way as proposition 2.1.Remark 3.5. Note that if
Q
(ijdu
) =P
(ij
)f
(idu
), then fors > r
,f
(is
)<
1 for alli
in M, andQ
(ijs
)<
1for everyj
. So the assump- tion in the above proposition is satised.Proposition 3.4 enables us to calculate the Laplace transform
RE
(is
) of the reliability for the initial process starting from the working statei
and the associated mean time to failureMTTF
(i
). Given thatRE
(it
) =P
j2M
P(
Z
t=j=Z
0=i
) andMTTF
(i
) =RE
(i
0), we get:RE
(i
) = Xj2M
(
sI
1;A
1(s
));1(ij
)MTTF
(i
) =; Xj2M
(
A
1(0));1(ij
)For
i
6=j
,A
1(ij
0) =P
(ij
)=
E(T
1=Y
0 =i
) andA
1(ii
0) = (P
(ii
); 1)=
E(T
1=Y
0=i
). This proves the following result: since theMTTF
depends only on the embedded markov chain (X
n)n 0 and the mean sojourn time in the states, it is equal to the mean time to failure of a Markov process with the same transition probabilitiesP
(ij
) and the same mean sojourn time in each state.4. The r-recurrence property Before introducing
r
-recurrence, we need some material.For any state
j
, let ~F
(jj:
) be the law of the rst return time toj
of the process (Z
t)t 0 starting fromj
for any statei
dierent fromj
, let ~F
(ij:
) be the law of the rst visit time toj
of the process (Z
t)t 0 starting fromi
. These laws are dened on R+S+1. Because the process is irreducible and transient on M, the quantities ~F
(jj
R+) are strictly less than 1 forj
in M. Ifi
is in P andj
in M, ~F
(ij
R+) equals 0. Let ~F
(ijs
) be theLaplace transfom of ~
F
(ij:
). Given that ~R
(ijs
) = ~F
(ijs
)~R
(jjs
) fori
6=j
and ~R
(jjs
) =Pn 0( ~F
(jjs
))n (cf Cinlar (1975) 10.2.12 and 10.2.13), then fors > r
andj
in Mwe have:F
~(jjs
)<
1R
~(jjs
) = 11;
F
~(jjs
)R
~(ijs
) =F
~(ijs
)1;
F
~(jjs
)if i
6=j
Remark 4.1.
If lims&r
R
~(jjs
) = +1, thenlim
s&rF
~(jjs
) = ~F
(jjr
) = 1.If lims&r
R
~(jjs
)<
+1, then lims&rF
~(jjs
) = ~F
(jjr
)<
1 so in any case ~F
(jjr
)1.Lemma 4.2. For
s > r
andi
,j
in M, we have the relations:F
~(ijs
) = Xk 2M
Q
~(iks
)~F
(kjs
)+ ~Q
(ijs
)(1;F
~(jjs
))F
~(ijs
) = Xk 2M
F
~(iks
)~Q
(kjs
)) 1;F
~(jjs
) 1;F
~(kks
) + ~Q
(ijs
)(1;F
~(jjs
))Proof. If
i
is dierent fromj
, replace ~R
(jjs
) by 1=
(1;F
~(jjs
)) andR
~(ijs
) by ~F
(ijs
)=
(1;F
~(jjs
)) in formulas ~R
(s
) =I
+ ~Q
(s
) ~R
(s
) and ~R
(s
) =I
+ ~R
(s
) ~Q
(s
).Remark 4.3. We can deduce from the preceeding lemma and the irre- ducibility assumption that for
s
r
andi
,j
in M, we have:F
~(ijs
)<
1and Q
~(ijs
)<
1:
The notion of
r
-recurrence for a semi-Markov process was dened in Cheong (1968). It is known that if for at least one statej
in Mwe havelim
s&r
R
1(jjs
) =R
1(jjr
) = +1 then this property applies for any statej
in M.Definition 4.4. The process is said to be
r
-recurrent on Mif for at least one statej
in Mwe havelim
s&r
R
1(jjs
) =R
1(jjr
) = +1Remark 4.5. The equality
R
1(jjr
) = +1 is equivalent to the equalityF
~(jjr
) = 1 forj
2M(cf remark 4.1).Remark 4.6. Note that since,
R
1(jjr
) = Pn 0(Q
1)n(jjr
), the condi- tionR
1(jjr
) = +1 is equivalent to(r
)1.Remark 4.7. We have seen that in the usual case the Perron-Frobenius eigenvalue
(r
) ofQ
1(r
) is equal to 1. In that case the process isr
-recurrent.Remark 4.8. Since the semi-Markov process (
Z
t)t 0 is transient and irre- ducible on M, we have forj
in M, ~F
(jj
0)<
1. Thus if the process isr
-recurrent,r
is necessarily strictly negative.In order to nd other properties of the value
r
, we need a technical lemma.Lemma 4.9. If the process is r-recurrent we have for
i
andj
in M:F
~(ijr
) = Xk 2M
Q
~(ikr
)~F
(kjr
)F
~(ijr
)( ~
F
)0(jjr
) =X
k 2M
F
~(ikr
)( ~
F
)0(kkr
)Q
~(kjr
)Proof. We let
s
tend tor
in the rst formula of lemma 4.2 and we obtain, using remark 4.1, forij
in M:F
~(ijr
) = Xk 2M
Q
~(ikr
)~F
(kjr
)):
Now multiply the second formula of lemma 4.2 by (
s
;r
)=
(1;F
~(jjs
)) and take the limit ass
decreases tor
. The limit of (1;F
~(jjs
))=
(s
;r
) = ( ~F
(jjr
);F
~(jjs
))=
(s
;r
) always exists ass
decreases tor
and is equal to ;( ~F
)0(jjr
) = R01xe
;r xF
~(jjdx
) (use the dominated convergence theorem or Fatou's lemma). This quantity is nite or innite. In either case we obtain the last formula.Proposition 4.10. If the process is r-recurrent, then
r
is the Perron-Frobe- nius eigenvalue ofA
1(r
).Proof. For
j
and`
in Mwe have:X
i2M
A
1(`ir
)~F
(ijr
)= X
fi2M=i6=`g
Q
(`ir
)h
(`r
)F
~(ijr
); Xfi2E=i6=`g
Q
(`ir
)h
(`r
)F
~(`jr
)= X
i2M
Q
(`ir
)h
(`r
)F
~(ijr
);Xi2E
Q
(`ir
)h
(`r
)F
~(`jr
) Using lemma 4.9, we obtain:X
i2M
A
1(lir
)~F
(ijr
) = ~F
(ljr
)1;f
(lr
)h
(lr
)=
r F
~(ljr
):
Therefore
r
is an eigenvalue ofA
1(r
) associated with the positive vector ( ~F
(ijr
)i
2M). By the subinvariance theorem (cf Seneta (1973) p.23),r
is the Perron-Frobenius eigenvalue ofA
1(r
).In the following sections, we shall use the renewal theorem to prove the existence of limits. The next proposition gives the version of that theorem we shall use. We say that a nite measure is spread out if, for some
n
, itsn
;th
convolution power has a component which has a density with respect to the Lebesgue measure (cf Asmussen (1992) p.140).Proposition 4.11. Suppose that for any
j
in M the measures ~F
(jjdu
) are spread out and that the process is r-recurrent. Ifg
is a positive function on R+, such that the functionx
!e
;r xg
(x
) is bounded, tends to 0 asx
tends to +1 and is Lebesgue integrable, then for anyi
andj
inMwe have:lim
t!1
e
;r tZ t0
g
(t
;s
)R
(ijds
) =g
(r
) ~F
(ijr
);( ~
F
)0(jjr
) (whereg
(s
) is the Laplace transform of the functiong
).This theorem can be deduced from the renewal theorem given in Asmussen (1992) corollary VI.1.3.
The limit in the above theorem is strictly positive if the quantity ( ~
F
)0(jj r
) is nite. It was proved in Cheong (1968) that if for at least onej
in M the quantity ( ~F
)0(jjr
) is nite, then it is nite for anyj
in M.Definition 4.12. The process is said to be
r
-positive if it isr
-recurrent and if( ~
F
)0(jjr
) =;Z 10
xe
;r xF
~(jjdx
) is nite for anyj
in M.In the following proposition we consider a process which is not
r
-recurrent.Proposition 4.13. Suppose that the process is not r-recurrent. If
g
is a positive function on R+, such that the functionx
!e
;r xg
(x
) is bounded, tends to 0 asx
tends to +1 and is Lebesgue integrable, then for anyi
andj
in M we have:lim
t!1
e
;r tZ t0
g
(t
;s
) ~R
(ijds
) = 0 Proof. Fori
6=j
in Mwe have:e
;r tZ t0
g
(t
;s
) ~R
1(ijds
) =Z
t
0
e
;r (t;s)( ~F
(ij:
)g
(:
))(t
;s
)e
;r sR
~1(jjds
):
Since the process is notr
-recurrent, the measuree
;r sR
1(jjds
) is nite onR
+. The function
t
!e
;r (t;s)( ~F
(ij:
)g
(:
))(t
;s
) tends to 0 ast
tends to +1and is bounded uniformly ins
. It is sucient now to use the dominated convergence theorem.The proof for the case
i
=j
is similar.5. Quasi-stationary probability on working states A quasi-stationary probability on Mis given by :
lim
t!1
P(
Z
t=j=Z
0=i
)P
j2M
P(
Z
t=j=Z
0=i
):
The existence of the quasi-stationary probability onMis proved in Cheong (1970) if the process is r-positive and if the total variations in 0
1) of the functionst
!e
;r th
(it
) are nite. In this section we rst prove the existence of the quasi-stationary distribution under slightly dierent conditions. Our main improvement on the results of Cheong (1970) is that we are able to identify the quasi-stationary distribution as a left eigenvalue of a computablematrix. As we are on a nite space we also prove that under our assumptions, an
r
-recurrent process isr
-positive.For the next theorem, we will introduce the following assumption:
(
A
0)8
<
:
1)
the process is r
;recurrent
2)8j
2MF
~(jjdu
)is spread out
3)8i
2M 8j
2PQ
~(ijr
)<
+1Remark 5.1. Since the process (
Z
t)t 0 is irreducible on M, for allj
in M there exists a pathi
1i
2::: i
k in Msuch that the process starting from j returns to j by this path with a strictly positive probability. If at least one of the probability lawsQ
(i
l;1i
ldu
) (1l
k
+ 1i
0 =ji
k +1=j
) has a density with respect to the Lebesgue measure then the measure ~F
(jjdu
) is spread out. Then condition 2) in assumption (A
0) is satised.Remark 5.2. Suppose that
Q
(ijdu
) is equal toP
(ij
)f
(idu
) for anyi
andj
inE
. We know thatQ
(ijr
) is nite for anyi
andj
in M(remark 4.3), consequentlyf
(ir
) is nite. In this case, condition 3) is always true.Proposition 5.3. Under assumption (
A
0), for anyi
andj
in Mwe have:lim
t!1
e
;r tP(Z
t=j=Z
0=i
) =h
(jr
) ~F
(ijr
);( ~
F
)0(jjr
) Proof. We have:P(
Z
t=j=Z
0=i
) =Z
t
0
h
(jt
;s
) ~R
(ijds
):
We want to apply proposition 4.11 with
g
(:
) =h
(j:
). Using the fact thatr
is strictly negative (remark 4.8), lemma 4.2 and the assumption (A
03), one gets :e
;r uh
(ju
)Z
]u+1
e
;r sf
(jds
) =Z
]u+1
e
;r sXk 2E
Q
(jkds
)X
k 2E
Q
(jkr
) =f
(jr
)<
+1:
Thus the function
u
!e
;r uh
(ju
) is bounded and tends to 0 asu
tends to +1. The Lebesgue integrability condition of the proposition 4.11 is equivalent toh
(jr
)<
+1, which is true sincef
(jr
)<
+1.Then under assumption (
A
0), if the process is notr
-positive, we have for anyij
in M:lim
t!1
e
;r tP(Z
t=j=Z
0 =i
) = 0A direct application of proposition 4.13 implies that if the process is not
r
-recurrent we obtain the same limit. Then the limit is strictly positive only in ther
-positive case.Proposition 5.4. Under assumption (
A
0), the process is r-positive.Proof. For
i
andj
in M, let'
ij(t
) =e
;r tP(Z
t=j=Z
0 =i
).If the process is not
r
-positive we have forij
in M: limt!1
'
ij(t
) = 0and hence lim
u&0
u '
ij(u
) = 0 ('
ijLaplace transform of '
ij):
Using limu&0
u'
ij(u
) = lims&r(s
;r
) ~P
(ijs
) and proposition 3.4, we obtain:lim
s&r
(
s
;r
)(sI
1;A
1(s
));1(ij
) = 0:
Since
r
is a simple eigenvalue ofA
1(r
) by proposition 4.10, and sinces
!A
1(s
) is continuous, in a neighbourhood ofr
there exists a simple eigenvalue (s
) ofA
1(s
) such that: lims&r(s
) =r
.We can triangularise the matrix
A
1(s
) asA
1(s
) = (s
)T
(s
);1(s
) and then (sI
1;A
1(s
));1= (s
)(sI
1;T
(s
));1;1(s
). The matrixT
(s
) is lower triangular we can suppose that its elementT
(s
)(11) is equal to(s
) and the other elements of its rst column are equal to 0. The rst column of the matrix (s
) is equal to a right eigenvectorU
(s
) ofA
1(s
) associated with (s
). The rst row of ;1(s
) is equal to a left eigenvectorV
(s
) ofA
1(s
) associated with (s
). The element (sI
1;T
(s
));1(11) is equal to s;(s)1 , and we then get:lim
s&r
(
s
;r
)(sI
1;T
(s
));1(11) = 1:
For (ij
)6= (11), (sI
1;T
(s
));1(ij
) satisfy the following:K
(ij
)(s
;(s
))Q
i(
s
;T
(s
)(ii
)) =K
(ij
)Q
i6=1(
s
;T
(s
)(ii
)):
Then for (ij
)6= (11) we get:lim
s&r
(
s
;r
)(sI
1;T
(s
));1(ij
) = 0:
On the other hand we havelim
s&r
(
s
;r
)(sI
1;A
1(s
));1(ij
) =U
(ri
)V
(rj
):
But the quantity
U
(ri
)V
(rj
) is non null for anyi
andj
. So there is a contradiction and the process isr
-positive.Proposition 5.5. Under assumption (
A
0), for anyi
andj
in Mwe have:lim
t!1
P(
Z
t=j=Z
0=i
)P
j2M
P(
Z
t=j=Z
0=i
) =B
(ij
)P
j2M
B
(ij
) with 0< B
(ij
) =h
(jr
) ~F
(ijr
);( ~
F
)0(jjr
)<
+1Proof. This is a direct application of proposition 5.3 and proposition 5.4.
We want to understand the structure of the matrix
B
(ij
) on MM. Proposition 5.6. Under assumption (A
0), for anyi
in M, the vector (B
(ij
)j
2 M) is a left eigenvector of the matrixA
1(r
) associated with the eigenvaluer
and for anyj
in M, the vector (B
(ij
)i
2M) is a right eigenvector of the matrixA
1(r
) associated with the eigenvaluer
.Proof. Using lemma 4.2, we get for
l
andj
in M:X
i2M
B
(li
)A
1(ijr
) = Xfi2M=i6=jg
h
(ir
)~F
(lir
)Q
(ijr
);( ~
F
)0(iir
)h
(ir
); X
fi2E=i6=jg
Q
(jir
)h
(jr
)h
(jr
)~F
(ljr
);( ~
F
)0(jjr
)=
F
~(ljr
);( ~
F
)0(jjr
)(1;f
(jr
))=
F
~(ljr
);( ~
F
)0(jjr
)rh
(jr
)=
rB
(lj
):
In the same way we get:X
i2M
A
1(lir
)B
(ij
) = Xfi2M=i6=lg
Q
(lir
)h
(jr
)~F
(ijr
);
h
(lr
)(~F
)0(jjr
); X
fi2E=i6=lg
Q
(lir
)h
(lr
)h
(jr
)~F
(ljr
);( ~
F
)0(jjr
)=
h
(jr
);
h
(lr
)(~F
)0(jjr
)F
~(ljr
)(1;f
(lr
))=
rB
(lj
):
We have seen that
r
is the Perron-Frobenius eigenvalue ofA
1(r
) and we know that the eigenvector is unique up to scalar multiples. IfV
andW
are left and right eigenvectors, we can immediately deduce that:B
(ij
) =CW
(i
)V
(j
). We then obtain the following theorem.Theorem 5.7. Under assumption (
A
0), for anyi
andj
in M: limt!1
P(
Z
t=j=Z
0 =i
)P
j2M
P(
Z
t=j=Z
0=i
) =V
(j
)P
j2M
V
(j
) whereV
is a left eigenvector ofA
1(r
) associated withr
.6. Failure rate
An important quantity in reliability studies is the failure rate and espe- cially the asymptotic failure rate.
Definition 6.1. For any state
k
and any timet
, let (kt
) be the failure rate of the system dened by : (kt
) = lim&0
1
P(Z
t+2P=Z
t2MZ
0 =k
) when this limit exists.We can also write:
(kt
) = 1P
i2M
P(
Z
t=i=Z
0 =k
)X
i2M X
j2P
lim
&0
1
P(Z
t+=jZ
t=i=Z
0=k
)In this section, we suppose that the following assumptions are satised:
(
A
1) :8
>
>
>
>
<
>
>
>
>
:
1)
assumption
(A
0)2)8
i
2M 8j
2PQ
(ijdu
) =q
(iju
)du
3)8
i
2M 8j
2Pu
!q
(iju
)is continuous a:e:
4)8
i
2M 8j
2Pu
!q
(iju
)is bounded
5)8i
2M 8j
2P limu!+1e
;r uq
(iju
) = 0Proposition 6.2. Under assumption (
A
1), we have for anyk
inM andt
in R+: (kt
) = 1P
i2M
P(
Z
t=i=Z
0=k
)X
i2M X
j2P Z
t
0
q
(ijt
;s
) ~R
(kids
):
Proof. Fori
in Mandj
in P we get:P(
Z
t+=jZ
t=i=Z
0 =k
) =X
n 0
P(~
T
nt < T
~n+1t
+< T
~n+2X
~n=i X
~n+1=j= X
~0=k
)+X
n 0 X
l
1 2E
X
l
2 2E
P(~
T
nt < T
~n+1< T
~n+2< t
+X
~n=i X
~n+1=l
1X
~n+2=l
2Z
t+ =j= X
~0=k
):
The last term is less than:
X
n 0 X
l
1 2E
X
l
2 2E
Z
fu
1 t<u
1 +u
2
<u
1 +u
2 +u
3
<t+ g
Q
~n(kidu
1) ~Q
(il
1du
2) ~Q
(l
1l
2du
3)X
l
1 2E
X
l
2 2E
Z
0t]
R
~(kidu
1)Z
ft;u
1
<u
2 u
2 +u
3
<t;u
1 + g
Q
~(il
1du
2) ~Q
(l
1l
2du
3)o
():
Elsewhere:X
n 0
P(~
T
nt < T
~n+1t
+< T
~n+2X
~n=i X
~n+1=j= X
~0=k
)= X
n 0 Z
(]0t]
Q
~n(kids
)P( ~X
1=jt
;s < T
~1t
;s
+< T
~2= X
~0=i
)= X
n 0 Z
0t]
Q
~n(kids
)Z]t;st;s+ ]
Q
~(ijdu
1)Xl Z
ft;s+ <u
1 +u
2 g
Q
~(jldu
2)= X
l Z
0t]
R
~(kids
)Z
Q
~(jldu
2)Z
ft;s<u1t;s+ <u1+u2g
q
(iju
1)du
1:
Sinceq
(iju
) is a.e. continuous, we havelim
!0
1
Z
1ft;s<u1t;s+ <u1+u2g
q
(iju
1)du
1= lim
!0
1
Z
1ft;s<u1t;s+ g
q
(iju
1)du
1=
q
(ijt
;s
)a:s:
We can conclude with the dominated convergence theorem.
We want to identify the asymptotic failure rate, that is the limit, if it exists, of
(kt
) whent
tends to +1.Theorem 6.3. Under assumption (
A
1), we have:lim
t!1
(kt
) =jr
j:
Proof. The conditions of theorem 5.7 are fullled and we know that:
lim
t!1
e
;r tP(Z
t=i=Z
0=k
) =CW
(k
)V
(i
):
Now apply proposition 4.11 with the function
g
equal to the functionu
!q
(iju
) withi
in Mandj
in P. We then obtain fork
in M:lim
t!1
e
;r tZ t0
q
(ijt
;s
) ~R
(kids
)) =q
(ijr
)~F
(kir
);( ~
F
)0(iir
)=
CW
(k
)V
(i
)A
(ijr
)for i
6=j:
and lim
t!1
(kt
) = 1P
i2M
CW
(k
)V
(i
)X
i2M X
j2P
CW
(k
)V
(i
)A
(ijr
)= 1
P
i2M
V
(i
)X
i2M
V
(i
)(Xj2P
A
(ijr
))= ; 1
P
i2M
V
(i
)X
i2M
V
(i
)(Xj2M
A
(ijr
))= ; 1
P
i2M
V
(i
)X
j2M
rV
(j
)= j
r
j:
So we nd for the semi-Markov process that the same result as for the Markov process holds.
7. A numerical example: a two-unit parallel system with sequential preventive maintenance
7.1. The reliability model
A semi-Markov reliability model of a two-unit parallel system with se- quential preventive maintenance (PM) will now be considered. This model has been used as an example in Alam (1984) and Csenki (1995).
The system consists of two units, A and B. The model has nine states.
The states and the transitions are shown in Fig. 1.
Preventive maintenance is carried out o-line on A and B alternately.
The unit which is due for PM is removed from the system after
c
hours of parallel service and returned to service after a random time of maintenance.Thus, until failure occurs, states 1, 2, 3 and 4 are visited in this order.
In states 1 and 3 the units A and B are up. In state 2 unit A is under preventive maintenance and unit B is up. In state 4 unit A is up and unit B is under preventive maintenance. This implies that from state 1 the next PM is on unit A and not B. The process makes a transition from state 1
to state 6 if unit B fails during a sojourn in state 1. In state 6, unit A is in service while unit B undergoes repair. There are two possible transitions from state 6. If unit A fails before B's repair is completed, system failure ensues by transition to state 9 where the two units are down. The other possible move from state 6 is back to state 1 which happens if A remains in service throughout B's repair. From state 1, the only transition hitherto not considered is that of state 5 which occurs if unit A fails within the projected
c
hours of parallel service with unit B still being in the up state. Since it is assumed that completed repair includes PM, the system enters state 3 (the next projected PM is on unit B) upon successful completion of the repair on A in state 5. State 8 can be entered from state 2 only. This happens if unit B fails while unit A is under PM. The system then resides in state 8, after which unit A enters service and repair is started on unit B (state 6).
2
A PM, B up
1 A up, B up
3 A up, B up
4 A up, B PM
6 B downA up
5 A downB up
9 A downB down
A PM, B down8
A down, B PM7
6
?
-
;
;
;
;
;
;
;
;
;
;
;
;
@
@
@
@
@
@
@ R
@
@
@
@
@
@
@ I A
A
A
A
A
A
A
A
A
A
A
A
A
A
A U
@
@
@
@
@
@
@ I
@
@
@
@
@
@
@ R
A A A A A A A A A A A A A A A K
6
? 6
?
PM : preventive maintenance Fig. 1
Departure from state 9 happens as soon as the repair on any one of the two units is completed. The remaining aspects of the system are obtained by interchanging the roles of A and B. In this sense, 3, 4, 5 and 7 correspond to 1, 2, 6 and 8. We assume that all the random variables corresponding