HAL Id: hal-00589696
https://hal.archives-ouvertes.fr/hal-00589696
Submitted on 30 Apr 2011
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires
Estimation of the parameters of a Markov-modulated loss process in insurance
Armelle Guillou, Stéphane Loisel, Gilles Stupfler
To cite this version:
Armelle Guillou, Stéphane Loisel, Gilles Stupfler. Estimation of the parameters of a Markov-
modulated loss process in insurance. Insurance: Mathematics and Economics, Elsevier, 2013, 53,
pp.388-404. �10.1016/j.insmatheco.2013.07.003�. �hal-00589696�
loss process in insurance
Armelle Guillou
(1)
, Stéphane Loisel
(2)
& GillesStuper
(1)
(1)
UniversitédeStrasbourg&CNRS,IRMA,UMR7501,7rueRenéDescartes,
67084Strasbourgcedex,France
(2)
UniversitédeLyon,UniversitéLyon1,InstitutdeScienceFinancièreetd'Assurances,50
avenueTonyGarnier,69007Lyon,France
Abstract. We present a new model of loss processes in insurance. The process is a
couple
(N, L)
whereN
isaunivariateMarkov-modulatedPoissonprocess(MMPP)andL
isamultivariatelossprocesswhosebehaviourisdrivenby
N
. Weprovethestrongconsistency of the maximumlikelihood estimator of theparameters of this model, and presentan EMalgorithmtocomputeitinpractice. Themethodisillustratedwithsimulationsandrealsets
ofinsurancedata.
Keywords: Markov-modulatedPoisson process, maximumlikelihood estimator,strong
consistency,EMalgorithm.
1 Introduction
AMarkov-modulatedPoissonprocess(MMPP)isadoublystochasticPoissonprocesswhose
intensityisdrivenbyanon-observablecontinuous-timeMarkovchainwithnitestatespace.
A comprehensive surveyof the properties of MMPPs is given in [15]. Such processes are
used tomodelcommunicationnetworks (see[18, 21]), environmentalphenomena asin[13],
and the surplus of an insurance company as in [1]. It has then been crucial to develop
methodstoestimatetheparametersofsuchprocesses. Fromatheoreticalpointofview,the
strongconsistencyof themaximumlikelihoodestimator(MLE) foranMMPPis shownby
Rydénin [32];his proofisstronglyinuencedby[23], inwhichconsistencyfortheMLEfor
general hiddenMarkov models (HMMs) is established. Theproperties of theMLE in this
contexthavebeenextensivelystudiedsinceBaumandPetrie [3]: in additiontoconsistency
in [23], asymptotic normality was proved in [5]. Now, from a practicalpoint of view, the
the MLE. For other referenceson EM algorithms, we refer thereader to Baum et al. [4],
whorstpresentedsuchalgorithmforHMMs;recentsurveysonEMalgorithmsincludethe
monograph byMcLachlan and Krishnan[27]. Other possible approachesinclude matching
moments and covariance functions, see [17, 31], or maximizing a split-time likelihood, as
introduced by Rydén in [33, 34], further studied by Vandekerkhove[36] in the context of
hiddenmixturesofMarkovprocesses. In[25],Loiselsuggestedthatcorrelationbetweenlines
of business of an insurance company could be caused by common shocks and modulation
by a common Markovian environment process. Our goal is to extend the MLE approach
to estimatethe parametersofaprocess
(N, L)
whereN
is aunivariateMMPPandL
isa(possiblymultivariate)lossprocesswhosebehaviorisdrivenby
N
, inorder toestimatetheparametersofsuchaprocessintworealsetsofinsurancedata. Wealsocarryoutasimulation
study of loss processesfor 2 and 3lines of business modulated by acommon environment
process. Our results conrm that themethod works quitewell aslong as theobservation
periodcontainsenoughchangesoftheMarkovianenvironmentprocess.
2 Model, assumptions and notation
We consider an MMPP
(J, N )
, whereJ
is an irreducible continuous-time Markov process with generatorL
on thestate space{1, . . . , r}
, wherer ∈ N \ {0}
, andN
is a univariatecountingprocess such that, when
J
is in statei
,N
is aPoisson process with intensityλ i
.Wefurther consideralossprocess
S = (S 1 , . . . , S n )
(namely, theS k
arepiecewiseconstantprocesseswithnonnegativeincrements)whosebehaviorisdrivenby
N
inthefollowingsense:assume that the
S k
canonly jump whenN
does, and that ifN
jumps at timet
and ifJ
is in state
i
, then asimultaneous jump ofthe processesS k 1 , . . . , S k p
at timet
occurs withprobability
p i (e)
wheree = {k 1 , . . . , k p }
is a subset of{1, . . . , n}
. We then assume thatthe random variables
E s
, such that theS k
withk ∈ E s
jumped (and only these) at thetimeofthe
s−
thjump ofN
,areindependentgiventheprocess(J, N )
. Finally,assumethatthevalue ofthejump
X s
hasdistributionP θ(i, e)
, where( P θ ) θ∈Θ
isaparametric statistical model,thatisP (X s = x | J (τ s ) = i, E s = e) = P θ(i, e) (∀ m, m ∈ e ⇒ X m = x m )
where
τ s
isthetime of thes−
thjump ofN
, withclearlyx m = 0
ifm / ∈ e
. Note that thismodelcanbeseenasacommonshockmodelasin[24]: itisassumedthatgiventheprocess
(J, N )
andthesequence(E s )
,theX s
areindependentrandomvariables.Thecontextofourworkisthefollowing: letusassumethattheprocess
S
hasbeenobserveduntiltime
T
,sothattheavailabledatais:1. Thenumber
r
ofstatesofJ
;2. Thefullknowledgeoftheprocesses
N
andS
betweentime0
andtimeT
,bothassumedto betimeswhen
N
jumps.Thegoalisto estimatetheunknownparametersofthemodel,namely:
1. Theelements
` ij
ofthetransitionintensitymatrixL
ofJ
;2. Thejump intensities
λ i
ofN
;3. Theprobabilities
p i (e)
,wheree
isasubsetof{1, . . . , n}
;4. Theparameters
θ(i, e)
.Remarkthattheprocess
J
isnotobserved,whichinducestechnicaldiculties. Forthesakeofshortness,welet
Φ
betheglobalparameterofthemodel. Thedistributionoftheprocess withparameterΦ
isthen denotedbyP Φ
.3 Asymptotic properties of the maximum likelihood esti-
mator
Our aim is to estimate the parameters with a maximum likelihood estimator (MLE). Let
then
Y i = τ i − τ i−1
betheamountoftime betweenthe(i − 1)−
thandthei−
thshock,andΛ = diag(λ 1 , . . . , λ r )
.Theavailabledatais:
1. Thevalues
0 < t 1 < . . . < t k = T
oftheτ i
,i.e. thetimeswhenN
jumps(equivalently, theinter-eventtimesy 1 , . . . , y k
,wherey j = t j − t j−1
,t 0 = 0
);2.
e 1 , . . . , e k
thesuccessivevaluesoftheE k
;3.
x 1 , . . . , x k
thesuccessivevaluesofthejumpsofS
.Letnow
f ij (t, Φ) dt := P Φ (T 1 ∈ dt, J(t) = j | J (0) = i) F ij (t, Φ) := P Φ (T 1 > t, J(t) = j | J (0) = i).
Therefore(see[28]),wehave
f (t, Φ) = exp(t(L(Φ) − Λ(Φ)))Λ(Φ), F(t, Φ) = exp(t(L(Φ) − Λ(Φ))).
p(e, Φ) = diag((p i (e, Φ)) 1≤i≤r ),
P θ(·, e, Φ) (X = x) = diag(( P θ(i, e, Φ) (X = x)) 1≤i≤r ),
andin matrixnotation
∀ e ⊂ {1, . . . , n}, e 6= ∅ , g(t, e, x, Φ) = f (t, Φ) · p(e, Φ) · P θ(·, e, Φ) (X = x) g(t, ∅ , x, Φ) = f (t, Φ) · p( ∅ , Φ) · 1l {x=0} .
With thesenotations,the
(i, j)−
thelementofthematrixg(t, e, x, Φ)
is∀ e ⊂ {1, . . . , n}, e 6= ∅ , g ij (t, e, x, Φ) = f ij (t, Φ) p j (e, Φ) P θ(j, e, Φ) (X = x) g ij (t, ∅ , x, Φ) = f ij (t, Φ) p j ( ∅ , Φ) 1l {x=0} .
It is now sucient to specify the starting distribution of
J
to compute the likelihood oftheobservations. Denote by
P (Φ)
thetransition matrix ofthe discrete-timeMarkovchain(J i = J (τ i ))
: integratingf
,onegetsP (Φ) = (Λ(Φ) − L(Φ)) −1 Λ(Φ).
According to [32],
P (Φ)
hasa uniquestationary distributionπ(Φ)
and we have, ifa(Φ)
istheonlystationarydistributionofthecontinuous-timeprocess
(J(t)) t≥0
andη
isthecolumnvectorofsize
r
withallentriesequalto1
,π(Φ) = 1
a(Φ)Λ(Φ)η a(Φ)Λ(Φ).
Weassumethatthestartingdistributionof
J
isπ(Φ)
; theprocess((J i , Y i , E i , X i )) i
isthenP Φ −
stationary,becausethebivariateprocess((J i , Y i )) i
isaMarkovrenewalprocess(seee.g.[12,p. 313]). Thus, thelikelihoodoftheobserveddataunderthedistribution
P Φ
isL((y i , e i , x i ) 1≤i≤k , Φ) = π(Φ) Y k i=1
g(y i , e i , x i , Φ)
! η.
Assuming nowthat weknowthestates
j 0 , j 1 , . . . , j k
ofthe (hidden) MarkovprocessJ
atthetimeswhen
N
jumps,thecompletelikelihoodofthedataisL((j i ) 0≤i≤k , (y i , e i , x i ) 1≤i≤k , Φ) = π j 0 (Φ)
Y k i=1
g j i−1 , j i (y i , e i , x i , Φ)
! .
TogivearesultonthestrongconsistencyoftheMLE, werstneedsomenotations: foran
arbitraryparameter
Φ
,denotebyF Φ
thesetofallparametersΦ 0
suchthatforalle (∀ j λ j (Φ) p j (e, Φ) = 0) ⇔ (∀ j λ j (Φ 0 ) p j (e, Φ 0 ) = 0).
F Φ
can be thought of asthe set of the elementsΦ 0
such that a simultaneous jump of the processesS k 1 , . . . , S k q
isa.s. impossibleunderthelawP Φ
ifandonlyifitisa.s. impossibleunder the law
P Φ 0
. WritefurtherΦ ∼ Φ 0
whenever((Y i , E i , X i )) i
hasthe samelawunderP Φ
and underP Φ 0
.Wenallywritedownthehypothesesweneedto stateourmainresult:
(A 1 )
Foralle 6= ∅
,thedistributionsP θ(·, e)
havethesamesupport,withnoatomat0
.(A 2 )
Foralle 6= ∅
and allΦ, Φ 0
, there exists aneighborhoodG
ofΦ 0
such that foreverysubset
G Φ 0
ofG
andalli, j ∈ {1, . . . , r}
,Z ln sup
ϕ∈G Φ0
P θ(i, e, ϕ) (m ∈ e ⇒ X m = x m )
P θ(j, e, Φ) (m ∈ e ⇒ X m = x m ) dx < ∞.
(A 3 )
For alle 6= ∅
, alli ∈ {1, . . . , r}
and allx
,ϕ 7→ P θ(i, e, ϕ) (m ∈ e ⇒ X m = x m )
is acontinuousfunction.
Thisallowsustostateourmain result:
Theorem 1. Assumethat
(A 1 − A 3 )
hold. LetΦ 0
be the true value of the parameter, andlet
C
beacompact setofF Φ 0
suchthatΦ 0 ∈ C
. LetΦ b p
bethe MLEforΦ 0
onC
,computedwith
p
observations. Then ifO ⊂ C
isan open set inF Φ 0
containing the equivalence class ofΦ 0
modulo∼
,onehasΦ b p ∈ C
a.s. forp
largeenough.Proof of Theorem 1. We closely follow the proof of Theorem 1 in [32]: pick
Φ
andΦ 0 ∈ F Φ 0
such thatΦ 0 Φ
. Lemma 8impliesthat thereexistsε > 0
such thatH (Φ, Φ 0 ) <
H (Φ, Φ) − 2ε
. Now, with the notations of Lemma 3, Lemma 5 entails that there existsN ∈ N \ {0}
with1
N E Φ (q 0N (Φ 0 )) − H (Φ, Φ 0 ) < ε
sothat
1
N E Φ (q 0N (Φ 0 )) < H (Φ, Φ) − ε.
We then pick aneighborhood
G
ofΦ 0
inF Φ 0
given by Lemma 3; in particular, for every subsetG Φ 0
ofG
containingΦ 0
,E Φ ln sup
ϕ∈G Φ0
q 0N (ϕ) < ∞.
Letting
B 1/t
betheopenballcenteredatΦ 0
withradius1/t
,thecontinuityofq 0N
gives:ln sup
ϕ∈G ∩ B 1/t
q 0N (ϕ) −−−→
t→∞ ln q 0N (Φ 0 ).
Set now
A t = (
sup
ϕ∈G ∩ B 1/t
q 0N (ϕ) ≤ 1 )
, and let
A c t
denote the complement ofA t
. Noticethat
ln sup q 0N (ϕ) = − ln
"
sup q 0N (ϕ)
#
1l A + ln
"
sup q 0N (ϕ)
#
1l A c
whichentails
ln sup
ϕ∈G ∩ B 1/t
q 0N (ϕ)
≤ | ln q 0N (Φ 0 )| + ln sup
ϕ∈G
q 0N (ϕ) .
Wecanthenusethedominatedconvergencetheoremto getaneighborhood
G Φ 0 ⊂ G
ofΦ 0
in
F Φ 0
suchthat1 N E Φ
ln sup
ϕ∈G Φ0
q 0N (ϕ) ≤ 1
N E Φ (ln q 0N (Φ 0 )) + ε
2 < H(Φ, Φ) − ε 2 .
Now,because
(Z st = ln sup ϕ∈G Φ0 q st (ϕ))
isP Φ −
subadditiveandergodic,Kingman'stheorem (see[22])impliesthatthereexists aniteconstantH(Φ, Φ 0 , G Φ 0 )
such thatn→∞ lim 1 n E Φ
"
ln sup
ϕ∈G Φ0
q 0n (ϕ)
#
= H (Φ, Φ 0 , G Φ 0 )
and
n→∞ lim 1
n ln sup
ϕ∈G Φ0
q 0n (ϕ) = H (Φ, Φ 0 , G Φ 0 ) P Φ −
a.s.Theorem1.1in [22]entails
H (Φ, Φ 0 , G Φ 0 ) ≤ 1 N E Φ
"
ln sup
ϕ∈G Φ0
q 0N (ϕ)
#
< H (Φ, Φ) − ε 2 ;
putting
p st (ϕ | J (0) = j) = L((Y i , E i , X i ) s+1≤i≤t , ϕ | J (0) = j)
andremarkingthat forall
ϕ ∈ G Φ 0
q 0n (ϕ) =
X
i∈C(ϕ)
π i (ϕ)
max
i∈C(ϕ) p 0n (ϕ | J (0) = i)
≥ X
i∈C(ϕ)
π i (ϕ)p 0n (ϕ | J(0) = i)
= p 0n (ϕ),
onegets
ln sup
ϕ∈G Φ0
p 0n (ϕ) − ln sup
ϕ∈G Φ0
q 0n (ϕ) ≤ 0
andthuslim sup
n→∞
( 1 n ln sup
ϕ∈G Φ0
p 0n (ϕ) )
≤ H (Φ, Φ 0 , G Φ 0 ) < H (Φ, Φ) − ε 2 .
Covernowthecompactset
O c ∩ C
bytheG Φ 0 i
,1 ≤ i ≤ d
. Wehavesup
ϕ∈O c
{ln p 0n (ϕ) − ln p 0n (Φ 0 )} ≤ max
1≤i≤d
( ln sup
ϕ∈G Φ0 i
p 0n (ϕ) − ln p 0n (Φ 0 ) )
−−−−→
n→∞ −∞
with
P Φ
0 −
probability1
. This showsthat necessarilyΦ b p ∈ C
a.s. forp
large enough, andcompletestheproof.
∼
. Inthat sense,thisresultisthebest possibleone.Undersomeadditionalassumptions,onecanapply theasymptoticnormalitytheorem in[5]
in order to obtain the one of our estimator. This result is rather technical: we refer the
readerto[16]fordetails.
4 An EM algorithm to compute the MLE
WenowgiveanEMalgorithm,adapted from[35],allowingus to computethe MLEin our
context. Recalltheavailabledata:
1. Thevalues
0 < t 1 < . . . < t k = T
oftheτ i
,i.e. thetimeswhenN
jumps(equivalently, theinter-eventtimesy 1 , . . . , y k
,wherey j = t j − t j−1
,t 0 = 0
);2.
e 1 , . . . , e k
thesuccessivevaluesoftheE k
;3.
x 1 , . . . , x k
thesuccessivevaluesofthejumpsofS
.Wewantto estimate
1. Theelements
` ij
ofthetransitionintensitymatrixL
ofJ
;2. Thejump intensities
λ i
ofN
;3. Theprobabilities
p i (e)
,wheree
isasubsetof{1, . . . , n}
;4. Theparameters
θ(i, e)
.Welet
0 < u 1 < . . . < u m < T
bethejumptimesofJ
inthetimeinterval[0, T ]
,u 0 = 0
andu m+1 = T
;letfurthers i
bethestateofJ
ontheinterval[u i−1 , u i [
,∆u i = u i − u i−1
andz i
bethenumberofjumpsof
N
intheinterval[u i−1 , u i [
.Recall that, if
N 0
is an homogeneous Poisson process, then given{N 0 (t) = n}
, the eventtimes of
N 0
in the interval[0, t]
are uniformly distributed. Consequently, Bayes' formula impliesthatthecompletelikelihoodofthedataisL c = π s 1
" m Y
i=1
` s i , s i+1
−` s i , s i
· (−` s i , s i exp(` s i , s i ∆u i ))
#
exp(` s m+1 , s m+1 ∆u m+1 )
×
" m+1 Y
i=1
(λ s i ∆u i ) z i
z i ! exp(−λ s i ∆u i ) · z i ! (∆u i ) z i
#
× Y r i=1
Y
e⊂{1, ..., n}
e6= ∅
p i (e) card(A i (e)) Y
j∈A i (e)
P θ(i, e) (∀ m ∈ e, X m = x m, j )
· p i ( ∅ ) card(A i (∅))
where
A i (e) = {j ∈ {1, . . . , k} | J (t j ) = i, e j = e}
standsforthesetof thejump timesofN
whenthe
S k
withk ∈ e
(andonlythese) jump andJ
isin statei
;A i ( ∅ )
standsforthesetofthejumptimesof
N
whennoneoftheS k
jumpsandJ
isin statei
.Fromthat identity,wededucethatthecompletelog-likelihoodis
ln L c = X r
i=1
1l {X(0)=i} ln(π i ) + X r
i=1
T i ` ii + X r
i=1
X r
j=1 j6=i
m ij (T ) ln(` ij ) + X r i=1
(n i ln(λ i ) − λ i T i )
+ X r i=1
X
e⊂{1, ..., n}
card(A i (e)) ln(p i (e))
+ X r i=1
X
e⊂{1, ..., n}
e6=∅
X k j=1
ln P θ(i, e) (∀ m ∈ e, X m = x m, j )1l {j∈A i (e)}
where
1.
T i = Z T
0
1l {J(u)=i} du
isthetimespentbytheprocessJ
in statei
untiltimeT
;2.
m ij (T ) = card({s : 0 < s ≤ T | J (s − ) = i, J (s) = j})
is the numberof jumps fromstate
i
tostatej
oftheprocessJ
;3.
n i = X k j=1
1l {J(t j )=i}
isthenumberofeventsthatoccurredwhenJ
isinstatei
.The Mstep. Wenowcomputetheconditionalexpectationof
ln L c (Φ)
underaparameterϕ
,giventheevent{N(u), S(u), 0 ≤ u ≤ T }
: onehasE ϕ (ln L c (Φ) | N (u), S(u), 0 ≤ u ≤ T )
= X r i=1
1l {X(0)=i} \ ln(π i ) + X r i=1
T b i ` ii + X r i=1
X r
j=1 j6=i
m \ ij (T ) ln(` ij ) + X r i=1
( n b i ln(λ i ) − λ i T b i )
+ X r i=1
X
e⊂{1, ..., n}
card(A \ i (e)) ln(p i (e))+
X r i=1
X
e⊂{1, ..., n}
e6= ∅
X k j=1
ln P θ(i, e) (∀ m ∈ e, X m = x m, j ) 1l {j∈A \ i (e)}
where
A b = E ϕ (A | N (u), S(u), 0 ≤ u ≤ T )
.For
T
largeenough,thersttermmaybeneglected;recallingthat` ii = − X r
j=1 j6=i
` ij , p i ( ∅ ) = 1 − X
e⊂{1, ..., n}
e6=∅
p i (e), card(A \ i ( ∅ )) = n b i − X
e⊂{1, ..., n}
e6=∅
card(A \ i (e)),
onegets,forall
i, j ∈ {1, . . . , r}
andi 6= j
,theidentitiesb
p i (e) =
card(A \ i (e)) b n i
, ` b ij =
m \ ij (T ) T b i
, λ b i = n b i
T b i
, X k
j=1
∂
∂θ(i, e) ln P θ(i, e) (∀ m ∈ e, X m = x m, j )
θ(i, e)=b θ(i, e)
1l {j∈A \ i (e)} = 0
,where
b p i (e)
,` b ij
andλ b i
arethedesiredestimators,andthelastsetofequationsistobesolved takingthepropertiesofthestatisticalmodel( P θ )
intoaccount.The E step. Accordingto Lemma9,if
A(e) = [ r i=1
A i (e) = {j ∈ {1, . . . , k} | e j = e}
,thenT b i = Z T
0
P ϕ (J (v) = i, N (u), S (u), 0 ≤ u < v) P ϕ (N (u), S(u), 0 ≤ u ≤ T)
× P ϕ (N(u), S (u), v ≤ u ≤ T | J (v) = i) dv, b
n i = X k q=1
P ϕ (J(t q ) = i, N (u), S(u), 0 ≤ u ≤ T ) P ϕ (N (u), S(u), 0 ≤ u ≤ T ) , 1l {j∈A \ i (e)} = 1l {j∈A(e)} P ϕ (J (t j ) = i | N (u), S(u), 0 ≤ u ≤ T ), card(A \ i (e)) =
X k j=1
1l {j∈A \ i (e)} = X k j=1
1l {j∈A(e)} P ϕ (J (t j ) = i | N(u), S(u), 0 ≤ u ≤ T ), m \ ij (T ) = ` ij (ϕ)
Z T 0
P ϕ (J (v) = i, N (u), S(u), 0 ≤ u < v) P ϕ (N (u), S(u), 0 ≤ u ≤ T )
× P ϕ (N (u), S(u), v ≤ u ≤ T | J (v) = j) dv.
Let
w i
bethecolumnvectorofsizer
withallentriesexceptthei−
thequalto0
,anditsi−
thentryequalto
1
. Firstly,P ϕ (N(u), S (u), 0 ≤ u < v, J (v) = i) = π(ϕ)
N(v) Y
q=1
g(y q , e q , x q , ϕ)
F (v − t N (v) , ϕ)w i .
Secondly,if
w t i
isthetransposeofw i
,P (N (u), S(u), v ≤ u ≤ T, ϕ | J(v) = i)
= w t i g(t N (v)+1 − v, e N (v)+1 , x N (v)+1 , ϕ)
Y k q=N(v)+2
g(y q , e q , x q , ϕ)
η,
andnally
P ϕ (J(t q ) = i, N (u), S(u), 0 ≤ u ≤ T )
= π(ϕ) Y q p=1
g(y p , e p , x p , ϕ)
! w i w t i
Y k p=q+1
g(y p , e p , x p , ϕ)
! η.
θ
is generallyestimatedwithanumerical(e.g. quasi-Newton)method.Procedure. Here,wedescribeawayto implementouralgorithm,byinduction on
` ∈ N
. Dene, ifΦ `
istheparameterestimateatstep`
,1.
G ` (0) = π(Φ ` )
and∀ 0 ≤ q ≤ k − 1, G ` (q + 1) = G ` (q) · g(y q+1 , e q+1 , x q+1 , Φ ` );
2.
D ` (k) = η
and∀ 0 ≤ q ≤ k − 1, D ` (k − q − 1) = g(y k−q , e k−q , x k−q , Φ ` ) · D ` (k − q)
.Setthen
A ij (Φ ` ) = B i (·, Φ ` ) = C i (Φ ` ) = 0
anddo,forallq ∈ N
suchthat1 ≤ q ≤ k
,A ij (Φ ` ) ← A ij (Φ ` ) +
Z t q
t q − 1
G ` (q − 1) F (t − t q−1 , Φ ` )w i w t j g(t q − t, e q , x q , Φ ` ) D ` (q) dt, B i (q, Φ ` ) ← G ` (q)w i w t i D ` (q),
C i (Φ ` ) ← C i (Φ ` ) + B i (q, Φ ` ).
Theestimatesatstep
` + 1
arethenb
p i (e) = P k
j=1 1l {j∈A(e)} B i (j, Φ ` )
C i (Φ ` ) , ` b ij = ` ij (Φ ` ) · A ij (Φ ` )
A ii (Φ ` ) , λ b i = C i (Φ ` ) A ii (Φ ` ) ,
andthe
θ(i, e) b
that maximizethefunctionalsθ 7→
X k j=1
ln P θ (∀ m ∈ e, X m = x m, j )B i (j, Φ ` )1l {j∈A(e)} .
5 A posteriori reconstruction of the states, with a maxi-
mum likelihood method
Once theparametersof themodel are estimated, itcanbeinterestingto estimate thesuc-
cessivestatesoftheMarkovchain
(J i )
. Tothisend, wecanadapt theprocedure describedin[28]: considerthelog-likelihoodofboththeobservedandmissing data
(j 0 , . . . , j k ) 7→ ln(π j 0 (b Φ)) + X k i=1
ln g j i−1 , j i (y i , e i , x i , Φ). b
An estimatorof
(j 0 , . . . , j k )
isthena(k + 1)−
tuple( j b 0 , . . . , j b k )
whichmaximizesthisfunc-tional. Such an estimatorhasexcellentproperties, see [8]. Froma practicalpointof view,
onemayreconstructthestatesusingtheViterbialgorithm(see[37]), namely:
1. Set
V j = 0
andC j = [j]
forallj ∈ {1, . . . , r}
,andq = 1
.2. If
q ≥ k + 1
,gotostep6. Otherwise,setα (q) i, j = ln g ij (y k−q+1 , e k−q+1 , x k−q+1 , Φ). b
3. Forall
i
,j ∈ {1, . . . , r}
,computeβ i, j (q) = α (q) i, j + V j
andanindexj i (q)
suchthatβ (q)
i, j (q) i =
j∈{1,..., r} max β i, j (q)
.4. Forall
i ∈ {1, . . . , r}
,replaceV i
byβ (q)
i, j i (q)
and
C i
by[j i (q) , C i ]
.5. Replace
q
byq + 1
andgobacktostep2.6. Find anindex
i
suchthatV i = max
j∈{1,..., r} V j
.An estimateof thestatesisthenthesequence
( j b 0 , . . . , j b k ) = C i
.6.1 Computing a rst estimate
Providing a rst estimate for an iterative algorithm is usually a daunting task. Here, we
describeaprocedure,adapted fromtheonedescribedin [28], thatworkedquitewellin our
examples:
1. Compute the average of the inter-eventtimes
λ c ∗ = k/T
, and mobile averages of theinter-eventtimes
y i
,denotedbyz i
(fortherstandlasttimesoftheobservedsample,put
z i = y i
).2. Set
J b (·) = 0
;pickq 1 ≤ 1 < q 2 < · · · < q r−1
. Foralli ∈ {1, . . . , k}
:(a) if
z i > 1/(q 1 c λ ∗ )
, setJ b (t i ) = 1
;(b) forall
j ∈ {1, . . . , r − 2}
,if1/(q j+1 λ c ∗ ) < z i ≤ 1/(q j λ c ∗ )
,setJ b (t i ) = j + 1
;(c) if
z i ≤ 1/(q r−1 λ c ∗ )
,setJ b (t i ) = r
.3. Compute
n b j =
k−1 X
i=1
1l { J(t b i )=j}
forj ∈ {1, . . . , r}
.4. Compute,forall
i, j ∈ {1, . . . , r}
P b ij = X k
`=2
1l { J(t b `
− 1 )=i, J(t b ` )=j}
b
n i ,
which is the rst estimate of
P ij
, the probability that the Markov chain(J (t k )) k≥0
jumps fromstate
i
tostatej
.5. Calculate,forall
j ∈ {1, . . . , r}
,b π j = n b j + 1l { J(t b k )=j}
k
,therstestimateofπ j
.6. Thankstotheidentities
∀ j ∈ {1, . . . , r} λ j = λ ∗ π j a −1 j
andL = Λ(Id −P −1 ),
(where
λ ∗ = P r
j=1 λ j a j
istheaveragejump rateofN
),considerL
andΛ
asfunctionsof
a 1 , . . . , a r−1
,andmaximizethecompletelikelihoodwithrespectto theparametersa 1 , . . . , a r−1
givenc λ ∗
,π b 1 , . . . , π b r
,P b
,y 1 , . . . , y k
andJ b
: letb a 1 , . . . , b a r−1
betheestimateobtainedthisway.
7. For all
j ∈ {1, . . . , r}
, computec λ j = λ c ∗ b π j b a −1 j
, letΛ b
be the diagonal matrix withcoecients
c λ 1 , . . . , λ c r
in that order andcomputeL b = Λ(Id b − P b −1 )
. These areroughestimatesfor
Λ
andL
.8. Use
L b
andΛ b
asinitialvaluesforanEMalgorithmtoprovideestimatesforL
andΛ
(see[35]),whichwedenoteby
L
andΛ
. Computethecorrespondingstationarydistributionsa
andπ
.9. Performastatereconstructionof
J
withtheViterbialgorithm usingL
andΛ
,andletJ
betheprocessobtainedthisway.10. Forall
j ∈ {1, . . . , r}
,calculaten j =
k−1 X
i=1
1l {J(t i )=j}
.11. Forall
i 1 , . . . , i n ∈ {0, 1}
andj ∈ {1, . . . , r}
,ife
isthesubsetof{1, . . . , n}
suchthatk ∈ e ⇔ i k = 1
,computep j (e) = 1 n j
k−1 X
`=1
1l {J(t ` )=j} 1l {∀p∈{1, ..., n}, S p (t ` )−S p (t ` − 1 )>0 ⇔ i s =1}
whichistheinitialestimateof
p j (e)
.12. Forall
j = 1, . . . , r
ande 6= ∅
, consider theX i
such thatJ(t i ) = j
andE i = e
asindependent andidentically distributed randomvariableswith parameter
θ(j, e)
, andestimate
θ(j, e)
withastandardmethod(maximumlikelihoodmethod forinstance).This procedure isadapted in the particularcasewhen
λ 1 < · · · < λ r
strongly dier,whichshallbethecaseinournumericalstudybelow.
6.2 A non-life insurance example
We now use our algorithm on a real set of non-life insurance data. From January
2004
to November
2009
,594
accidents corresponding to blazes causing industrial damages or losses were observed. The days of these events were recorded, and so were, if necessary,the compensationsfor thevictims; the processes
N
andS
obtainedthiswayare shown onFigure 12. This situation corresponds to the case
α = n = 1
of our model. We nallychoose
r = 2
, which isjustiedbythefact thattheMLE, computedonlyforL
andΛ
withr = 3
setsallparameterscorresponding tothethird stateto0
. Beforemodelingtheclaimsthemselves,theparametersofthismodelare
1.
` 12
and` 21
,thejump ratesofthehiddenMarkovprocessJ
;2.
λ 1
andλ 2
,thejump intensitiesoftheshockcountingprocessN
;3.
p 1 (1)
andp 2 (1)
, theprobabilitiesthat, when anaccidenthappens, theinsurancerm hastocompensate.0 500 1000 1500 2000 0
100 200 300 400 500
Figure1: Thecountingprocess
N
0 500 1000 1500 2000
0e+000 1e+006 2e+006 3e+006 4e+006 5e+006 6e+006 7e+006 8e+006
Figure2: Thelossprocess
S
Asfortheclaimsizes, aquickanalysisofthedatashowsthatsomeclaimshaveasmallsize
and afew others are very large, which prevents us from modeling the situation by a log-
Normal,GammaorGeneralizedParetodistribution (GPD).Inactuarialstatistics,onemay
inmanySolvencyIIpartialinternalmodels,ordealdirectlywithamixtureofdistributions,
or with a distribution that looks like Lognormal or Gamma distributions for small values
and gets moreand more Pareto-typefor largevalues, likethe Champernownedistribution
(see[9, 10] and[20]). Anotherpossibility isto useaclassicalkerneldensity estimatorafter
transformingthedata (see[6]). Here,we useamixtureof alight-tailed and aheavy-tailed
distribution,namelyaGammadistributionandaGPD.
P θ
thenhasdensityx 7→ q (bx) a−1
Γ(a) be −bx 1l {x>0} + (1 − q) 1 σ
1 + ξ(x − µ) σ
−1−1/ξ 1l {x>µ}
where
a, b, σ, ξ > 0
,0 < q < 1
andµ = 49.33
istheminimal(observed)claimsize (theunitistheeuro).
Consequently,theparameterstobeestimatedare
` 12
,` 21
,λ 1
,λ 2
,p 1 (1)
,p 2 (1)
,a 1
,a 2
,b 1
,b 2
,σ 1
,σ 2
,ξ 1
,ξ 2
,q 1
andq 2
.EstimatingtheparametersviatheEMalgorithm,withaquasi-Newtonalgorithmtoestimate
theparameters
a i
,b i
,σ i
,ξ i
andq i
duringtheMstepgivesthefollowingresults:L b =
−0.0065 0.0065 0.0018 −0.0018
, Λ = b
0.462 0 0 0.214
,
b p(1) =
0.963 0 0 0.947
, p(0) = b
0.037 0 0 0.053
,
b a =
4.52 4.14
, b b =
0.011 0.0073
, b σ =
1145 1216
, ξ b =
1.45 1.31
, q b =
0.230 0.335
.
Theclaim sizesthus haveinnitemeansin bothstatesin theory. This meansthat thetail
of the claim size distribution is very heavy. However, reinsurance mechanisms and other
guaranteesmayenable theinsurer to provide insurancecoverage of those risksupto some
highthresholdlevel. Afurtheranalysis thenshowsthat
1. Sojourntimesin state
1
areonaverage3.5
timesshorterthaninstate2
;2. There aremoreaccidentswhen
J
isin state1
thaninstate2
;3. Because
p b 1 (1)
is slightly greater thanb p 2 (1)
, these accidents cause morelosses to theinsurancerm;
4. Losses instate
1
aremorelikelytobeheavy-tailedthaninstate2
.An aposteriorireconstruction ofthestatesof
J
isgiveninFigure3.6.3 A life insurance data set
Let us now present an application in the life insurance eld. From January
2006
to July2010
,1507
closuresofsavingsaccounts(alsocalledsurrenders)were observed. Themonths0 500 1000 1500 2000 1.0
1.2 1.4 1.6 1.8 2.0
Figure3: Aposteriorireconstructionofthestatesof
J
oftheseeventswererecorded,alongwiththeamountofmoneywithdrawn. Earlysurrenders
canberegardedasclaimsfor theinsurancecompanyin somecases,becauseit corresponds
toadropinfuturebusiness,andbecausesometimestheinsurerhasbeenunabletochargeall
thefees(thatareoftenpartlypaidbythepolicyholderateachtimeperiodandnotupfront)
beforethe surrender. Surrender riskiscomplex: taxand penalty relief,interest ratelevels,
competitionbetweeninsurancecompanies,aswellasotherfactorsareatstake. Forareview
on surrendertriggers, theinterested readermightconsult [29] or[26]. Inthe present data
study, we are interested in the big picture in aquite stable regime (and notin prediction
offuture surrenderrates): intheconsideredperiod, theportfolioseemstohavebeenpretty
stable,mainlysensitivetoexternalcompetition(whichisdiculttoobserveinpractice). We
assumethat conditionally with respect to thestateof theenvironment,the probabilityfor
onepolicyholdertosurrenderhercontractdoesnotdependontheamountofsavings. Toset
aprecisedateforthe
k−
thsurrender,wedrawauniformrandomvariableandaddittothemonth ofthis eventto obtainan exactdate. Here, theclaims arethe amountsof money
withdrawn;theprocesses
N
andS
are representedonFigure 45. Again,thissituation ts the caseα = n = 1
of ourmodel; weuse atwo-state model for this situation,so that theparametersare
1.
` 12
and` 21
,thejump ratesofthehiddenMarkovprocessJ
;λ 1 λ 2 N
Notethatinthisexample,thereisnoneedtoestimate
p 1 (1)
andp 2 (1)
. Onthegraphsbelow,theunitoftimeis themonth:
0 10 20 30 40 50
0 200 400 600 800 1000 1200 1400
Figure4: Thecountingprocess
N
0 10 20 30 40 50
0.0e+000 5.0e+006 1.0e+007 1.5e+007 2.0e+007 2.5e+007 3.0e+007 3.5e+007
Figure5: Theprocess
S
representingthecumulativeamountofmoneywithdrawnInstate
1
,weuseamixtureofalight-tailedandaheavy-taileddistribution,namelyaWeibull distributionandaGPD,thedensityofP θ
thenbeingx 7→ q a b
x − µ b
a−1
e −((x−µ)/b) a 1l {x>µ} + (1 − q) 1 σ
1 + ξ(x − µ) σ
−1−1/ξ
1l {x>µ}
where
a, b, σ, ξ > 0
,0 < q < 1
andµ = 1.1
is theminimal (observed)amount(the unit istheeuro). Instate
2
,wetaGPD, whosedensityisx 7→ 1
σ
1 + ξ(x − µ) σ
−1−1/ξ
1l {x>µ}
(1)where
µ, σ, ξ > 0
. Of course, surrender amounts are not completely independent at the microscopic level aseach policyholder has acertain balance onhis savingsaccountthat isknown at a precise date. We are aware that in theory, the
X i
are not independent and identically distributed in each state, but in practice there are enough policyholders andenough randomness in the surrendered amounts for this assumption to be acceptable in
practice at the macroscopic level in each state of the environment (this is supported by
statisticaltests).
Consequently,theparameterstobeestimated are
` 12
,` 21
,λ 1
,λ 2
,a
,b
,σ 1
,σ 2
,ξ 1
,ξ 2
andq
.EstimatingtheparametersviatheEMalgorithm,withaquasi-Newtonalgorithmtoestimate
theparameters
a
,b
,σ i
,ξ i
andq
during theMstepgivesthefollowingresults:L b =
−0.254 0.254 0.373 −0.373
, Λ = b
34.2 0 0 17.4
,
b a = 1.65, b b = 9141, σ b =
22350 14591
, ξ b =
0.17 0.40
, q b = 0.306.
An a posteriori reconstruction of the states of
J
is shown in Figure 6. Note that resultsshowthat during someerce competition periods, surrenderrates become moreimportant
(they double from one stateto the other). In the statewhere surrender rates are higher,
the surrenderedamount tteddistribution is composed of a light-tailed part and aheavy-
tailed part,whereas forsmaller surrenderrates, this distribution does notincorporateany
light-tailed part. This suggeststhat policieswith smallerfacialamountsaremoresensitive
tochangesin theenvironment. Onceagain,here,theheavy-tailed partmustberegardedas
astatisticalt,andthetailwouldhaveto becutatanappropriatelevelaposteriori.
6.4 Simulations in the multivariate setting
6.4.1 Motivation
Oneof themain purposes of insuranceisrisk diversicationand mutualization: thelaw of
largenumbersandthecentral limittheorem oftenapply inpracticewhenindependencebe-
0 10 20 30 40 50 1.0
1.2 1.4 1.6 1.8 2.0
Figure6: Aposteriorireconstructionofthestatesof
J
insuranceportfolios(withoutmotorliabilityinsurance)atthenationallevel. However,when
itcomes to hurricane risksorearthquakerisks,individualrisksare onlyconditionallyinde-
pendentwithrespecttotheoccurrenceornotofsucheventsinthecountry. Thiscorrelation
makesitdiculttodiversifythoserisksatthenationallevel,andoneoftenusesreinsurance:
risksarethendiversiedatthegloballevel(oodsinAustralia,tsunamisinAsia,hurricanes
intheEastCoastofNorthAmerica,earthquakesinJapan,MonteCarloandSanFrancisco,
stormsinEuropeforinstance). Nevertheless,thoserisksarenotreallyindependent,assome
(often ignored) correlation factors are present. Even if they are geographically scattered,
meteorological phenomena like theEl Nino-La Nina Southern Oscillation (ENSO) may si-
multaneouslyinuenceclaimoccurrenceandseverityin thosedierentzones. Forexample,
itis nowacceptedthat theprobabilitiesofsevereoods in Australia,strongsnowstormsin
NorthAmericaandhurricanesontheUSEastCoastincreaseduringLaNinaepisodes,while
other kindsofeventsare morelikelyduring El Nino episodes. Tobuild amodel forENSO
and tounderstand allits impactsondierentareasof theworldis farbeyond thescopeof
this paper. Ofcourse, ENSO isobservedand canbe(partly) measured,itsbehaviorisnot
really Markovianand claim arrivalprocesses feature seasonality. There are certainly other
kindsofunobservedenvironmentprocessesthatjointlymodulateclaimprocessesindierent
regions of the world. In our illustrative example, we just imagine that some unobserved
Markov process inuences claim frequencies in three regions A (
k = 1
), B(k = 2
) and C(
k = 3
). RegionsA and Bare assumed to be close to each other, so that common shockschangesaremorefrequentthanfortheENSO cycle. Wesimulate thecorrespondingmulti-
variateriskprocess,andwecheckwhetheritwouldbepossibleornotforustoestimatethe
parametersof themodel and to re-buildthestates ofthe environment modulating process
(without observingitofcourse).
6.4.2 Amodel with 2states ofthe environment
Werst assumethat
r = 2
: in state1, claimsarelessfrequentand lessseverein the threezones, and common shocks are not present (
p 1 (e) = 0
ifCard(e) ≥ 2
). In state 2, claimsare more likely and more severe in average, and common shocks are possible for zones A
and B(
p 2 ({1, 2}) > 0
). Takeλ 1 = 20
,λ 2 = 200
,p 1 ({1}) = p 1 ({2}) = 0.3
,p 1 ({3}) = 0.4
,p 2 ({1}) = p 2 ({2}) = 0.2
,p 2 ({3}) = 0.4
andp 2 ({1, 2}) = 0.2
. Theunivariateclaimseveritydistributions arechosentobeGPdistributedasin(1),withtheparametersbeing
µ({1}) = µ({2}) = µ({3}) = 1, σ(1, {1}) = σ(1, {2}) = σ(1, {3}) = 1, σ(2, {1}) = σ(2, {2}) = σ(2, {3}) = 20, ξ(1, {1}) = ξ(1, {2}) = ξ(1, {3}) = 1/2, ξ(2, {1}) = ξ(2, {2}) = ξ(2, {3}) = 2
.Univariateclaimsarethereforemoresevereinaverageandinthetailforstate2forallthree
lines. Asfarasthebivariateclaimsinstate2areconcerned,wemodelthembyabivariate
GPDasin[7,11];namely,theirdensityhastheform
(x, y) 7→ α(α + 1) σ 1 σ 2
1 + x − µ 1
σ 1
+ x − µ 2
σ 2
−α−2
1l {x>µ 1 } 1l {y>µ 2 }
where
α, µ 1 , µ 2 , σ 1 , σ 2 > 0
,andwechooseµ({1, 2}) =
3 3
, σ(2, {1, 2}) =
30 20
, α(2, {1, 2}) = 2.
Assumethatweobservethemultivariateclaimprocessduring30years,andthattheaverage
timespentin state1(before switchingtostate2)is1year,whiletheaveragetimespentin
state2(beforeswitchingtostate1)is3months. Namely,
` 12 = 1
and` 21 = 4
.The estimate of
µ({e})
,e 6= ∅
ischosenas thevectorof theminima of the claims arising whenashockaectssimultaneouslythelinesL k 1 , . . . , L k p
, withe = {k 1 , . . . , k p }
. Resultsaregivenbelow:
b ` 12 = 1.064, ` b 21 = 3.891,
b λ 1 = 21.21, b λ 2 = 195.7,
b
p 2 ({1}) = 0.227, p b 2 ({2}) = 0.182, p b 2 ({3}) = 0.394, b
p 2 ({1, 2}) = 0.197, b
µ({1}) = 1.002, µ({2}) = 1.000, b µ({3}) = 1.004, b b
σ(1, {1}) = 0.950, b σ(1, {2}) = 1.393, b σ(1, {3}) = 0.999, b
σ(2, {1}) = 18.22, b σ(2, {2}) = 19.18, b σ(2, {3}) = 24.83, ξ(1, b {1}) = 0.552, ξ(1, b {2}) = 0.507, ξ(1, b {3}) = 0.493, ξ(2, b {1}) = 2.206, ξ(2, b {2}) = 2.220, ξ(2, b {3}) = 1.888, b
µ({1, 2}) =
3.142 3.040
, b σ(2, {1, 2}) =
25.98 18.06
, α(2, b {1, 2}) = 1.79
.Theestimationprocedureworksquitewellandthestatesarecorrectlyretrieved,seeFigure9.
Ofcourse,iftheobservationperiodwasshorter,orifthephasechangeintensitiesweresmaller,
thenitwouldbeimpossibletoestimatetransitionratesaccurately.
0 5 10 15 20 25 30
1.0 1.2 1.4 1.6 1.8 2.0
0 5 10 15 20 25 30
0 100 200 300 400 500 600 700 800
0 5 10 15 20 25 30
0 100 200 300 400 500 600 700
0 5 10 15 20 25 30
0 100 200 300 400 500 600 700
Figure7: Thecountingprocesses: topleft: thetrueprocess
J
,topright:thecountingprocessrelated to
S 1
, bottom left: the counting process related toS 2
, bottomright: the countingprocessrelatedto
S 3
6.4.3 Amodel with 3states ofthe environment
Wenowassumethat
r = 3
andthatcommonshocksarenotpresent(fori = 1, 2, 3
,p i (e) = 0
if
Card(e) ≥ 2
). Instate 1, claims arenot veryfrequent and notverysevere in the threezones. In state 2, claims are more likely and moresevere in average for the three zones.
State3correspondstoexceptionalconditionsthatfavorextremelysevereclaimsforzonesA
0 5 10 15 20 25 30 1.0
1.2 1.4 1.6 1.8 2.0
0 5 10 15 20 25 30
0 2.0e+5 4.0e+5 6.0e+5 8.0e+5 1.0e+6 1.2e+6 1.4e+6
0 5 10 15 20 25 30
0 2.0e+5 4.0e+5 6.0e+5 8.0e+5 1.0e+6 1.2e+6 1.4e+6 1.6e+6
0 5 10 15 20 25 30
0 2.0e+5 4.0e+5 6.0e+5 8.0e+5 1.0e+6 1.2e+6 1.4e+6 1.6e+6 1.8e+6
Figure 8: Thelossprocesses
S k
,topleft: thetrueprocessJ
, topright: thelossprocessS 1
,bottomleft: thelossprocess
S 2
,bottomright: thelossprocessS 3
0 5 10 15 20 25 30
1.0 1.2 1.4 1.6 1.8 2.0
0 5 10 15 20 25 30
1.0 1.2 1.4 1.6 1.8 2.0
Figure9: ReconstructionofthehiddenMarkovprocess
J
: top: thetrueprocessJ
,bottom:thereconstructedprocess
J b
and B but protectzone C.Take
λ 1 = 20
,λ 2 = 200
,λ 3 = 1000
,p 1 ({1}) = p 1 ({2}) = 0.3
,p 1 ({3}) = 0.4
,p 2 ({1}) = p 2 ({2}) = 0.3
,p 2 ({3}) = 0.4
,p 3 ({1}) = p 1 ({2}) = 0.45
andp 1 ({3}) = 0.1
. TheclaimseveritydistributionsareonceagainmodeledbyGPdistributions, withµ({1}) = µ({2}) = µ({3}) = 1, σ(1, {1}) = σ(1, {2}) = σ(1, {3}) = 1, σ(2, {1}) = σ(2, {2}) = σ(2, {3}) = 20, σ(3, {1}) = σ(3, {2}) = 200, σ(3, {3}) = 0.5,
ξ(1, {1}) = ξ(1, {2}) = ξ(1, {3}) = 1/4, ξ(2, {1}) = ξ(2, {2}) = ξ(2, {3}) = 1/2, ξ(2, {1}) = ξ(2, {2}) = 1, ξ(2, {3}) = 1/3,
These parametersare chosensothat claimsfor zoneC in state3 areverysmall compared
to those forzones Aand B.Assume that weobservethe multivariateclaim processduring
30years,that theaveragetimespentinstate1(before switchingtoanotherstate)is1year
(resp. 3 months for state2, 1 month for state3), and that jumps from state 1to state3
orfrom state3to state1area.s. impossible. Assume nally that whenoneleavesstate2,
theprobabilitytogotostate1is
2/3
. Theintensitytransitionparametersarethen` 12 = 1
,` 13 = 0
,` 21 = 8/3
,` 23 = 4/3
,` 31 = 0
,` 32 = 12
.Again, theestimate of
µ({i})
,i = 1, 2, 3
is chosenastheminimum of theclaims aectingline
i
. Theresultsarethefollowing:` b 12 = 1.691, b ` 13 = 0, b ` 21 = 2.513, ` b 23 = 1.288, b ` 31 = 0, b ` 32 = 10.76, b λ 1 = 27.44, b λ 2 = 198.3, b λ 3 = 976.3,
b
p 1 ({1}) = 0.289, p b 1 ({2}) = 0.332, p b 1 ({3}) = 0.379, b
p 2 ({1}) = 0.306, p b 2 ({2}) = 0.298, p b 2 ({3}) = 0.396, b
p 3 ({1}) = 0.448, p b 3 ({2}) = 0.444, p b 3 ({3}) = 0.109, b
µ({1}) = 1.003, µ({2}) = 1.001, b µ({3}) = 1.000, b b
σ(1, {1}) = 1.013, b σ(1, {2}) = 1.065, b σ(1, {3}) = 1.016, b
σ(2, {1}) = 19.17, b σ(2, {2}) = 19.85, b σ(2, {3}) = 20.83, b
σ(3, {1}) = 191.9, b σ(3, {2}) = 191.2, b σ(3, {3}) = 0.472, ξ(1, b {1}) = 0.356, ξ(1, b {2}) = 0.298, ξ(1, b {3}) = 0.251, ξ(2, b {1}) = 0.504, ξ(2, b {2}) = 0.437, ξ(2, b {3}) = 0.433, ξ(2, b {1}) = 0.957, ξ(2, b {2}) = 0.948, ξ(2, b {3}) = 0.443
.Onceagain,resultsarecorrectbecausewehaveenoughenvironmentprocesschangesduring
our observation period, see Figure 12. Results are slightly less accurate than in the 2-
dimensional case, forexample regarding
λ 1
. Note that even ifresultswould be completelyreconstructionresultsareacceptablefor
3
linesand3
statesoftheenvironment.0 5 10 15 20 25 30
1.0 1.5 2.0 2.5 3.0
0 5 10 15 20 25 30
0 500 1000 1500
0 5 10 15 20 25 30
0 500 1000 1500
0 5 10 15 20 25 30
0 200 400 600 800 1000 1200 1400
Figure 10: The counting processes: top left: the true process
J
, top right: the countingprocess related to
S 1
, bottom left: the counting process related toS 2
, bottom right: thecountingprocessrelatedto
S 3
Acknowledgments
TheauthorsthankverymuchAlexandreYouforhisvaluablehelponthisproject,andJean-
Baptiste Gouere for a useful comment. The second author acknowledges partial support
from the research chair ActuariatDurable sponsoredby Milliman, from the research chair
ActuariatResponsable sponsored byGenerali, and from French Research National Agency
(ANR)under thereferenceANR-08-BLAN-0314-01.
References
[1] Asmussen, S. (1989). Risk theory in a Markovian environment. Scand. Actuar. J. 2,
69100.
[2] Asmussen,S. (2000).Ruinprobabilities,WorldScientic.
[3] Baum,L.E., Petrie,T. (1966).Statisticalinferenceforprobabilisticfunctions of nite-
0 5 10 15 20 25 30 1.0
1.5 2.0 2.5 3.0
0 5 10 15 20 25 30
0 1.0e+5 2.0e+5 3.0e+5 4.0e+5 5.0e+5 6.0e+5 7.0e+5
0 5 10 15 20 25 30
0 1.0e+5 2.0e+5 3.0e+5 4.0e+5 5.0e+5 6.0e+5 7.0e+5 8.0e+5
0 5 10 15 20 25 30
0 5000 10000 15000 20000 25000 30000 35000 40000
Figure11: Thelossprocesses
S k
,topleft: thetrueprocessJ
,topright: thelossprocessS 1
,bottomleft: thelossprocess
S 2
,bottomright: thelossprocessS 3
0 5 10 15 20 25 30
1.0 1.5 2.0 2.5 3.0
0 5 10 15 20 25 30
1.0 1.5 2.0 2.5 3.0
Figure12: ReconstructionofthehiddenMarkovprocess
J
: top: thetrueprocessJ
,bottom:thereconstructedprocess
J b
inthestatisticalanalysisofprobabilisticfunctionsofMarkovchains.Ann.Math.Statist.
41,164171.
[5] Bickel, P.J., Ritov, Y., Rydén, T. (1998). Asymptotic normality of the maximum-
likelihoodestimatorforgeneralhiddenMarkovmodels.Ann.Statist.26(4),16141635.
[6] Buch-Larsen,T.,Nielsen,J.P.,Guillen,M.,Bolanc,C.(2005).Kerneldensityestimation
forheavy-taileddistributionsusingtheChampernownetransformation.Statistics39(6),
503516.
[7] Cai,J.,Tan,K.S. (2007).Optimalretentionforastop-lossreinsurance undertheVaR
andCTEriskmeasure.ASTINBull. 37(1),93112.
[8] Caliebe, A. (2006). Properties of the maximum aposterioripath estimator in hidden
Markovmodels.IEEETrans.Inform. Theory 52(1),4151.
[9] Champernowne,D.G.(1936).TheOxfordMeeting,September2529.Econometrica5,
October1937.
[10] Champernowne,D.G. (1937).Thetheoryofincomedistribution,Econometrica5,379
381.
[11] Chiragiev,A.,Landsman,Z.(2007).MultivariateParetoportfolios: TCE-based capital
allocationanddivideddierences.Scand.Actuar. J.2007(4),261280.
[12] Çinlar,E.(1975).Introduction tostochastic processes,Prentice-Hall.
[13] Davison, A.C., Ramesh, N.I. (1993). A stochastic model for times of exposures to air
pollutionfromapointsource,inStatisticsfor theenvironment, editors: V.Barnettand
K.F. Turkman,Wiley,NewYork.
[14] Deng, L., Mark, J.W. (1993). Parameter estimation for Markov modulated Poisson
processesviatheEMalgorithm withtimediscretization.Telecomm. Syst.1,321338.
[15] Fischer, W., Meier-Hellstern, K.S. (1993). The Markov-modulated Poisson process
(MMPP)cookbook.Perf. Eval.18,149171.
[16] Guillou, A., Loisel, S., Stuper, G. (2011). Asymptotic normality of the maximum
likelihood estimator of a loss process, available on the webpage http://www-irma.u-
strasbg.fr/guillou/supplement.pdf.
[17] Gusella, R. (1991). Characterizing the variability of arrival processes with indexes of