HAL Id: hal-00271136
https://hal.archives-ouvertes.fr/hal-00271136
Preprint submitted on 8 Apr 2008
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires
On asymptotic normality of sequential LS-estimate for unstable autoregressive process AR(2).
Leonid Galtchouk, Victor Konev
To cite this version:
Leonid Galtchouk, Victor Konev. On asymptotic normality of sequential LS-estimate for unstable autoregressive process AR(2).. 2008. �hal-00271136�
On asymptotic normality of sequential LS-estimate for unstable autoregressive process AR(2).
By Leonid Galtchouk and Victor Konev
∗Louis Pasteur University of Strasbourg and University of Tomsk
Abstract
For estimating parameters in an unstableAR(2) model, the paper proposes a sequential least squares estimate with a special stopping time defined by the trace of the observed Fisher information matrix. It is shown that the sequential LSE is asymptotically normally distributed in the stability region and on its boundary in contrast to the usual LSE, having six different types of asymptotic distributions on the boundary depending on the values of the unknown parameters. 1 2
∗The second author is partially supported by the RFFI-Grant 04-01-00855.
1AMS 2000 Subject Classification: 62L10, 62L12
2Key words: Autoregressive process, least squares estimate, sequential estimation, asymptotic normality .
1 Introduction
Consider an autoregressiveAR(2) model
xn=θ1xn−1+θ2xn−2+εn, n= 1,2, . . . , (1.1) where (xn) is the observation, (εn) is a sequence of independent identically distributed (i.i.d.) random variables withEε1 = 0 and 0<Eε21 =σ2 <∞, σ2is known,x0 =x−1= 0. The process (1.1) is assumed to can be unstable, that is, both roots of the characteristic polynomial
P(z) =z2−θ1z−θ2 (1.2) lie on or inside the unit circle. The model (1.1) is a particular case of unstable autoregressive processesAR(p) which have been studied by many authors due to their applications in automatic control, identification and in modeling economic and financial time series (we refer the reader to Anderson (1971), Ahtola and Tiao (1987), Dickey and Fuller (1979), Chan and Wei (1988), Rao (1978) for details and futher references).
A commonly used estimate of parameter vector θ= (θ1, θ2)0 is the least squares estimate (LSE)
θ(n) = (θ1(n), θ2(n))0 =Mn−1
n
X
k=1
Xk−1xk, Mn=
n
X
k=1
Xk−1Xk0−1, (1.3) whereXk= (xk, xk−1)0; the prime denotes the transpose;Mn−1 denotes the inverse of matrix Mn if detMn>0 and Mn−1 = 0 otherwise.
It is well known that
√n(θ(n)−θ)=L⇒ N(0, F), asn→ ∞,
for allθ ∈Λ, where Λ is the stability region of process (1.1), that is, Λ ={θ= (θ1, θ2)0 :−1 +θ2 < θ1<1−θ2, |θ2|<1}, (1.4)
F = F(θ) is a positive definite matrix (see, e.g., Anderson (1971), Th.
5.5.7),=L⇒indicates convergence in law. Ifθbelongs to the boundary∂Λ of the stability region Λ, the limiting distribution of LSE is no longer normal.
Moreover, there is no one universal limiting distribution for allθ∈∂Λ and the corresponding set of limiting distributions numbers 6 different types depending on the values of roots z1 and z2 of the polynomial (1.2). Each limiting distribution of LSE on the boundary coincides with that of the ratio of certain Brownian functionals (we refer the reader to the paper of Chan and Wei (1988) for general results on the limiting distributions of the least squares estimates for unstable AR(p) processes and further details). For example, for conjugate complex roots z1=eiϕ, z2 =e−iϕ one has
n·(θ1(n)−2 cosϕ)=L⇒ (W12(1)−W22(1)) sinϕ+ (W12(1) +W22(1)−2) cosϕ R1
0[W12(s) +W22(s)]ds , n·(θ2(n) + 1)=L⇒(2−W12(1)−W22(1))/
Z 1 0
[W12(s) +W22(s)]ds , where (W1(t),0 ≤t ≤1) and (W2(t),0 ≤ t≤1) are independent standard Brownian motion processes; ifθ= (2,−1), then (see Theorem 3.1.2 ibid)
n2 0
0 n
(θ(n)−θ)=L⇒G−1ξ, ξ=
R1
0 Z(t)dW(t) R1
0 W(t)dW(t)
, where
G=
R1
0 W2(t)dt R1
0 W(t)Z(t)dt R1
0 W(t)Z(t)dt R1
0 Z2(t)dt
, Z(t) = Z t
0
W(s)ds .
It is well-known that a similar situation takes place in case ofAR(1) process
xn=θxn−1+εn, (1.5)
for which the limiting distributions of the least squares estimate are not normal at the end-points θ = ±1 of stability interval (-1,1) (see White (1958), Lai and Siegmund (1983)).
Lai and Siegmund (1983) for a first order non-explosive autoregressive process (1.5) proposed to use a sequential sampling scheme and proved that the sequential least squares estimate for θ with the stopping time based on the observed Fisher information is asymptotically normal uniformly in θ∈[−1,1] in contrast with the ordinary LSE.
In this paper we develop a sequential sampling scheme for estimating parameter vector θ = (θ1, θ2)0 in model (1.1). We will use the sequential least squares estimate defined by the formula
θ(τ(h)) =Mτ(h)−1
τ(h)
X
k=1
Xk−1xk, (1.6)
whereτ(h) is the stopping time for the thresholdh >0 : τ(h) = inf{n≥1 :
n
X
k=1
(x2k−1+x2k−2)≥hσ2}, inf{∅}= +∞. (1.7) This construction of sequential estimate is similar to that proposed in the paper of Lai and Siegmund forAR(1) which is defined as
θˆτ(h)=
τ(h)
X
k=1
x2k−1
−1 τ(h)
X
k=1
xk−1xk, (1.8)
τ(h) = inf{n≥1 :
n
X
k=1
x2k−1 ≥hσ2}. (1.9) It should be noted, however, that the first factor in (1.6) is a random matrix and not a random variable, as in (1.8), and this makes additional difficulties.
For AR(1) the stopping time (1.9) turns the denominator in (1.8) prac- tically into a constant hσ2 and this allows to use the central limit theorem for martingales. In the case of AR(2) the stopping time (1.7) enables one to control the inverse matrix Mτ(h)−1 in (1.6) only partially since it remains random. Nevertheless, we will see that such a change of time also enables one to improve the properties of the estimate (1.3).
In our paper (2006) we proved the following result.
Theorem 1.1. Let (εn)n≥1 in (1.1) be a sequence of i.i.d. random variables with Eεn= 0, 0< Eε2n=σ2 <∞. Then, for any compact set K⊂Λ1,
hlim→∞sup
θ∈K
sup
t∈R2|Pθ
Mτ(h)1/2(θ(τ(h))−θ)≤t
−Φ2(t/σ)|= 0, where Φ2(t) = Φ(t1)Φ(t2), Φ is the standard normal distribution function,
Λ1={θ= (θ1, θ2)0 : −1 +θ2< θ1 <1−θ2, −1≤θ2<1}, t= (t1, t2)0. This theorem implies, in particular, that estimate (1.6) is asymptotically normal not only inside the stability region (1.4) but also on the part of its boundary{θ = (θ1,−1)0 :−2< θ1<2}in contrast to the LSE (1.3).
The goal of this paper is to prove the asymptotic normality of the esti- mate (1.6),(1.7) in the whole region [Λ] including its boundary∂Λ.
Our main result (Theorem 3.1) claims that, as h→ ∞,
Mτ(h)1/2(θ(τ(h))−θ)=L⇒ N(0, σ2I), (1.10) for any θ= (θ1, θ2)0 inside the stability region Λ (1.4) and on its boundary
∂Λ, whereI is the identity matrix. Thus the sequential estimate (1.6), (1.7) has a unique normal asymptotic distribution in the closure [Λ] of the stability region (1.4). It will be observed that the normalizing factor Mτ(h)1/2 in the limit theorem (1.10) remains the same in the whole region [Λ] in contrast to the case of the LSE (1.3), which has seven different limiting distributions in [Λ] and in order to apply the limiting distributions one needs some knowledge about the location of unknown parameters (see Chan and Wei (1988)). The convergence of the sequential estimate (1.6), (1.7) to the normal distribution in (1.10) is not uniform inθforθ∈[Λ]. It can be explained by the fact that in the case, when the polynomial (1.2) has one root inside and the other on the unit circle, the rates of information provided by sample valuesxnabout the unknown parametersθ1 and θ2 may differ greatly.
Theorem 3.1 permits setting up tests of hypotheses aboutθand forming asymptotic confidence regions forθ on the basis of standard normal distri- bution. Moreover, the asymptotic normality holds in [Λ] for a broad class of the distributions of noises (εn).
The remainder of this paper is arranged as follows. Section 2 gives the asymptotic distribution of the stopping time (1.7) (Theorem 2.1) and some properties of the observed Fisher information matrix. In section 3 the asymptotic normality of sequential estimate (1.6) for unstableAR(2) model is established (Theorem 3.1). Section 4 proposes the sequential estimation scheme for the case of unknown variance σ2 in model (1.1). The appendix contains some technical results.
2 Properties of the stopping time τ ( h ) and the ob- served Fisher information matrix M
n.
In this section the attention is mainly focused on the case when the unknown parameterθ = (θ1, θ2)0 belongs to the boundary∂Λ of the stability region (1.4). The boundary∂Λ includes three sides:
Γ1 ={θ:−θ1+θ2= 1,−2< θ1<0},Γ2 ={θ:θ1+θ2 = 1,0< θ1 <2}, Γ3 ={θ :−2< θ1 <2, θ2 =−1} (2.1) and three apexes (0,1),(−2,−1),(2,−1). Denote
A=
θ1 θ2
1 0
, B=
1 0 0 0
,
W(n)(t) = 1 σ√
n
[nt]
X
i=0
εi, W1(n)(t) = 1 σ√
n
[nt]
X
i=0
(−1)iεi, 0≤t≤1, (2.2)
and introduce the following functionals J1(x;t) =
Z t
0
x2(s)ds, J2(x;t) = Z t
0
Z s 0
x(u)du 2
ds, (2.3) J3(x;y;t) =
Z t 0
(x2(s) +y2(s))ds, J4(x;t) = Z t
0
x(s)ds 2
.
Theorem 2.1. Let (εn)n≥1 in (1.1) be a sequence of i.i.d. random variables with Eεn = 0, Eε2n = σ2 and τ(h) be defined by (1.7). Denote by a and b real roots of the polynomial (1.2), −1≤a < b≤1. Then, for each θ∈Λ,
Pθ− lim
h→∞τ(h)/h= 1/trF, F −AF A0 =B. (2.4) Moreover, for eachθ ∈∂Λ, as h→ ∞,
τ(h) ψ(θ, h)
=L⇒
ν1(W1) = inf{t≥0 :J1(W1;t)≥1} ifθ∈Γ1, ν2(W) = inf{t≥0 :J1(W;t)≥1} ifθ ∈Γ2,
ν3(W, W1) = inf{t≥0 :J3(W;W1;t)≥1} ifθ ∈Γ3∪ {(0,1)}, ν4(W) = inf{t≥0 :J2(W;t)≥1} ifθ = (2,−1),
ν5(W1) = inf{t≥0 :J2(W1;t)≥1} ifθ= (−2,−1),
(2.5) where inf{∅}=∞, Λ is defined in (1.4),
ψ(θ, h) =
(1 +b)p
h/2 if θ∈Γ1, (1−a)p
h/2 ifθ ∈Γ2,
√2hsinϕ ifθ= (2 cosϕ,−1)0 ∈Γ3,
√2h if θ= (0,1),
(h/2)1/4 ifθ∈ {(−2,−1),(2,−1)},
(2.6)
W(t), W1(t) are independent standard Brownian motions.
Proof Assertion (2.4) easily follows from Lemma 3.12 in [6].
For θ ∈ ∂Λ we decompose the original process (1.1) into two processes (uk)k≥1 and (vk)k≥1 using the transformation
QXk= (uk, vk)0, (2.7)
where Q is a non-degenerate constant matrix of size 2×2 which will be chosen later depending on the values of θ. The limiting relation (2.5) for θ∈ ∪3i=1Γi has been proved in [7], Th 2.2. It remains to consider the apexes (2,−1),(−2,−1),(0,1).
Forθ = (2,−1), putting in (2.7)
Q=
1 0
1 −1
(2.8)
one obtains vk=
k
X
j=1
εj, uk=
k
X
j=1
(xj−xj−1) =
k
X
j=1
vj =
k
X
j=1 j
X
i=1
εi,
n
X
k=1
kXk−1k2 =
n
X
k=1
u2k−1+
n
X
k=1
u2k−2 = 2
n
X
k=1
u2k−1−u2n−1. (2.9) By the definition of τ(h) in (1.7), one gets
Pθ{τ(h)≤th1/4}=Pθ{
[th1/4]
X
k=1
kXk−1k2≥hσ2} (2.10)
=Pθ{ 2 hσ2
[th1/4]
X
k=1
u2k−1− 1
hσ2u2[th1/4]−1 ≥1}.
Further we show (by the argument similar to that in the proof of Lemma 2.3 in the Appendix) that the sum
Sn(t) = 1 n4σ2
[nt]
X
k=1
u2k−1
satisfies the relation
Sn(t) =J2(W(n);t) +g(n)(t), whereg(n)(t) is a random process such that, for anyδ >0,
nlim→∞Pθ(|g(n)(t)|> δ) = 0.
Now we check that
nlim→∞u2n/n4 = 0 Pθ −a.s.. (2.11) By the Cauchy-Schwarz-Bunyakovskii inequality and the law of iterated log- arithm we have
u2n/n4≤n−3
n
X
k=1
k
X
j=1
εi
2
,
n
X
k=1
1 k3
k
X
j=1
εi
2
<∞ Pθ−a.s.. These inequalities, in virtue of the Kronecker Lemma, imply (2.11).
From here and (2.10), (2.11), we obtain
Pθ(τ(h)/ψ(θ, h)≤t) =Pθ(νθ(n)≤t) +βθ(h), where
νθ(n) = inf{t≥0 :J2(W(n);t)≥1}, lim
h→∞βθ(h) = 0,
W(n)(t) is given in (2.2). This, by the functional Donsker theorem (see Billingsley (1968)), leads to (2.5) for θ= (2,−1).
The case of the apexes (0,1),(−2,−1) can be considered similarly with the use of Theorem 5.14 given in the Appendix. This completes the proof of Theorem 2.1.
Now we will establish some properties of the observed Fisher information matrixMn. Introduce the following subsets of the closed region [Λ] :
Λd = [Λ]\
2
[
i=1
Bi, Λd = Λd,1+ Λd,2, (2.12) where
Λd,1= Λd∩Vd, Λd,2 = Λd\Λd,1; Vd =
θ :−2 + d
√2 ≤θ1≤0, −θ21 4 +d2
8 < θ2≤1 +θ1
∪
θ: 0≤θ1≤2− d
√2, −θ12 4 +d2
8 ≤θ2 ≤1−θ1
;
Bi are open balls of radiusd >0 centered at the apexes (−2,−1),(2,−1).
In view of Theorem 1.1, it suffices to study the properties of Mn only for the parametric subset Λd,1 and the apexes (−2,−1),(2,−1). In the case of Λd,1, one can use the transformation (2.7) with
Q=
1 −b 1 −a
, (2.13)
where−1≤a < b≤1. Substituting (2.7) and (2.13) inMn (1.3) yields Mn=Q−1Sn(Q0)−1 =Q−1R−n1JnR−n1(Q0)−1, (2.14) where
Sn=
(u, u)n (u, v)n (u, v)n (v, v)n
, Rn=
(u, u)−n1/2 0 0 (v, v)−n1/2
,
Jn=RnSnRn=
1 ξn ξn 1
, (2.15)
ξn= (u, u)−n1/2(v, v)−n1/2(u, v)n, (u, v)n=
n
X
k=1
uk−1vk−1. (2.16) Proposition 2.2. Under conditions of Theorem 2.1, for any d >0, δ >0,
hlim→∞ sup
θ∈Λd,1
Pθ kJτ(h)−T(θ1, θ2)k> δ
= 0, (2.17)
where
T(θ1, θ2) =
1 r(a, b) r(a, b) 1
, r(a, b) =
√1−a2√ 1−b2
1−ab . (2.18) The proof of Proposition 2.2 is given in the Appendix.
Further we consider the asymptotic behaviour of the matrix Jn in the extreme cases when the process xk is ”most” unstable, that is, θ coincides with one of the apexes (−2,−1),(2,−1) of the parametric region [Λ].
Forθ = (2,−1) we take the matrix Q from (2.8). This yields uk =
k
X
j=0 j
X
i=0
εi, vk=
k
X
j=0
εj, k≥1, u0=v0=ε0 = 0. (2.19) Forθ= (−2,−1) we take
Q=
1 0 1 1
.
This implies
uk= (−1)k
k
X
j=1 j
X
i=1
(−1)iεi, vk=
k
X
j=1
(−1)jεj.
Lemma 2.3. Let ξn be given by (2.16) and θ∈ {(−2,−1),(2,−1)}. Then
ξn=L⇒
ϕ(W) if θ= (2,−1), ϕ(W1) if θ= (−2,−1),
as n→ ∞, (2.20)
where
ϕ(W) = 2−1J2−1/2(W; 1)J1−1/2(W; 1)J4(W; 1). (2.21) The proof of Lemma 2.3 is given in the Appendix.
3 Asymptotic normality.
It is known that the sequential least squares estimate (1.6),(1.7) is asymp- totically normal just like the ordinary LSE for any value ofθin the stability region Λ. Moreover, according to Theorem 1.1, this convergence of sequen- tial LSE to normal law is uniform in θ belonging to any compact set in Λ supplemented with the part of its boundary corresponding to complex roots of the polynomial (1.2). In this section, we will show that in contrast with the ordinary LSE (c.f. Chen and Wei (1988)), the sequential LSE is asymptotically normal also on the boundary∂Λ of the stability region Λ.
Theorem 3.1. Suppose that in AR(2) model (1.1), (εn)n≥1 is a sequence of i.i.d. random variables, Eεn = 0 and 0 < Eε2n = σ2 < ∞. Define τ(h), θ(τ(h))and Mτ(h) as in (1.6),(1.7) and (1.3). Then for any θ∈[Λ]
hlim→∞sup
t∈R2
Pθ
Mτ(h)1/2(θ(τ(h))−θ)≤t
−Φ2(t/σ)
= 0, (3.1) where Φ2(t) = Φ(t1)Φ(t2), t= (t1, t2)0, Φ is the standard normal distribu- tion function; [Λ] is the closure of the stability region (1.4).
Proof of Theorem 3.1 In view of Theorem 1.1, we have to show (3.1) forθ ∈ Γ1∪Γ2∪ {(0,1),(−2,−1),(2,−1)}. First we note that ifθ ∈ Γ1∪ Γ2∪ {(0,1)}, the minimal and the maximal rootsaand bof the polynomial (1.2) satisfy the inequalities −1 ≤ a < b ≤ 1. Therefore one can use the transformation (2.7),(2.13) to decompose the original process AR(2) (1.1) into two processes (uk) and (vk) which obey the equations
uk=auk−1+εk, vk=bvk−1+εk, u0=v0= 0. (3.2) Since the matrix Q in (2.13) is non-degenerate, one can represent the ob- served Fisher information matrixMn in the form (2.14) to obtain
Mn1/2 =Q−1R−n1Jn1/2. (3.3) Substituting this matrix in the standardized deviation of the sequential es- timate (1.6), one gets
Mτ(h)1/2(θ(τ(h))−θ) =Mτ(h)−1/2
τ(h)
X
k=1
Xk−1εk
=Jτ(h)−1/2Rτ(h)
τ(h)
X
k=1
QXk−1εk =Jτ(h)−1/2Zτ(h), where
Zn=
(u, u)−n1/2Pn
k=1uk−1εk (v, v)−n1/2Pn
k=1vk−1εk
. (3.4)
Further we note that Proposition 2.2 implies that, for anyδ >0,
hlim→∞ sup
θ∈Γ1∪Γ2∪{(0,1)}
Pθ
kJτ(h)−1/2−Ik> δ
= 0. (3.5)
Therefore in order to prove (3.1) for θ ∈ Γ1 ∪Γ2 ∪ {(0,1)} it suffices to establish the following result.
Proposition 3.2. Let θ∈Γ1∪Γ2∪{(0,1)}. Then, for each constant vector λ= (λ1, λ2)0 ∈R2 with kλk= 1, the random variable
Yh =λ0Zτ(h)/σ (3.6)
is asymptotically normal with mean 0and unit variance, as h→ ∞, that is,
hlim→∞sup
t∈R|Pθ(Yh ≤t)−Φ(t)|= 0.
The main difficulty in the analysis of Yh is that the stopping time (1.7) enables one to control the sums (u, u)τ(h), (v, v)τ(h) in the denominators of (3.6) only partially because one of them or both are random variables even in the asymptotics as h→ ∞.
The proof of Proposition 3.2 is given in the Appendix. The key idea of the proof is to replaceYhby a more tractable random variable ˜Yhequivalent to Yh in distribution by making use of the Skorohod coupling theorem and then apply the Central Limit Theorem for martingales. The appendix con- tains also the proof of Theorem 3.1 for the case of θ ∈ {(−2,−1),(2,−1)}. This case is considered separately because the matrixJn in (3.3) converges, according to Lemma 2.3, only in distribution.
4 Asymptotic normaliy in the case of unknown variance.
In this section, we extend the sequential estimation scheme to model (1.1) with unknown variance. It is shown that the sequential least squares es-
timate modified to embrace this case remains asymptotically normal uni- formly inθ for any compact set in the region Λ1 = Λ∪Γ3 (Th. 4.1) and it is asymptotically normal in the closure of the stability region [Λ] (Th. 4.2).
Suppose that the variance σ2 in (1.1) is unknown. A commonly used estimate for σ2 in autoregression processes on the basis of observations (x1, . . . , xn) is defined as
ˆ
σn2 =n−1
n
X
k=1
(xk−θ0(n)Xk−1)2, (4.1) whereθ(n) is the least squares estimate ofθ defined in (1.3). Now we must modify the stopping time (1.7). At first sight, to this end one should replace σ2 in (1.7) by ˆσn2. However, we will use a different modification similar to that proposed by Lai and Siegmund for AR(1) model, which turns out to be more convenient in the theoretic studies. Define the sequential estimate as
θ(ˆτ(h)) =Mτ(h)ˆ−1
ˆ τ(h)
X
k=1
Xk−1xk, (4.2)
ˆ
τ(h) = inf{n≥3 :
n
X
k=1
(x2k−1+x2k−2)≥hs2n}, (4.3) wheres2n= ˆσn2∨δn, δn is a sequence of positive numbers withδn→0.
The main results of this section are stated in the following theorems.
Theorem 4.1. Let(εn)n≥1in (1.1) be a sequence of i.i.d. random variables, Eεn= 0, 0< Eε2n=σ2 <∞. Then, for any compact set K ⊂Λ1,
hlim→∞sup
θ∈K
sup
t∈R2|Pθ
Mτ(h)ˆ1/2(θ(ˆτ(h))−θ)/ˆσˆτ(h)≤t
−Φ2(t)|= 0, (4.4) where Φ2(t) = Φ(t1)Φ(t2), Φ is the standard normal distribution function,
Λ1={θ= (θ1, θ2)0: −1 +θ2< θ1 <1−θ2, −1≤θ2<1}, t= (t1, t2)0.
Theorem 4.2. Let(εn)n≥1in (1.1) be a sequence of i.i.d. random variables, Eεn= 0, 0< Eε2n=σ2 <∞. Then, for any θ∈[Λ],
hlim→∞sup
t∈R2|Pθ
Mτˆ1/2(h)(θ(ˆτ(h))−θ)/ˆστ(h)ˆ ≤t
−Φ2(t)|= 0.
The proofs of Theorems 4.1- 4.2 proceed along the lines of those of The- orems 1.1 and 3.1 though they become more laborious because one needs to control the additional terms appearing as a result of the unknown variance.
We will give only the proof of Theorems 4.1.
Proof of Theorems 4.1. Substituting (1.1) in (4.2) yields
Mτ(h)ˆ1/2(θ(ˆτ(h))−θ)/ˆστ(h)ˆ =Mτ(h)ˆ−1/2
ˆ τ(h)
X
k=1
Xk−1εk/ˆστ(h)ˆ
=
Mˆτ(h)ˆστ(h)2ˆ /(σ4h/2)−1/2 ˆτ(h)
X
k=1
Xk−1εk/(σ2p
h/2). (4.5) Further we need the following results.
Lemma 4.3. Let Mn,τˆ(h) be given by (1.3), (4.3). Then, for any compact setK ⊂Λ1 andδ >0,
hlim→∞sup
θ∈K
Pθ
kMτ(h)ˆ σˆ2ˆτ(h)/(σ4h/2)−L(θ1, θ2)k> δ
= 0, (4.6) where
L(θ1, θ2) =
1 θ1/(1−θ2) θ1/(1−θ2) 1
.
Lemma 4.4. Under the assumptions of Theorem 1.1, for any compact set K⊂Λ1 and for each constant vector λ= (λ1, λ2)0 with kλk= 1,
hlim→∞sup
θ∈K
sup
t∈R|Pθ(Yh≤t)−Φ(t)|= 0, where
Yh=λ0L−1/2(θ1, θ2)
ˆ τ(h)
X
k=1
Xk−1εk/(σ2p h/2).
The proofs of these Lemmas are given below in this section.
Now we rewrite (4.5) as Mτ(h)ˆ1/2(θ(ˆτ(h))−θ)/ˆστ(h)ˆ =
Mτˆ(h)σˆ2τ(h)ˆ /(σ4h/2)−1/2
L1/2(θ1, θ2) (4.7)
×L−1/2(θ1, θ2)
ˆ τ(h)
X
k=1
Xk−1εk/(σ2p h/2). According to Lemma 4.3 we have for eachδ >0
hlim→∞sup
θ∈K
Pθ
k
Mτ(h)ˆ σˆτ(h)2ˆ /(σ4h/2)−1/2
L1/2(θ1, θ2)−Ik> δ
= 0. From here and (4.7) by applying Lemma 4.4, we come to (4.4). This com- pletes the proof of Theorem 4.1.
In order to prove Lemmas 4.3, 4.4, we need the following result.
Proposition 4.5. Let θ(n) and σˆn2 be given by (1.3) and (4.1). Then, for any compact set K⊂Λ1 and δ >0,
mlim→∞sup
θ∈K
Pθ(kθ(n)−θk> δ for some n≥m) = 0, (4.8)
mlim→∞sup
θ∈K
Pθ |σˆ2n−σ2|> δ for some n≥m
= 0. (4.9) Proof. We have
θ(n)−θ= (Mn/(x, x)n)−1(x, x)−n1
n
X
k=1
Xk−1εk.
By Lemma 3.3 in [7], for any δ > 0 and any compact K ⊂Λ= [Λ]◦ \ {(0,1),(−2,−1),(2,−1)},
mlim→∞sup
θ∈K
Pθ(kMn/(x, x)n−L(θ1, θ2)k> δ for some n≥m) = 0. (4.10) Further it will be observed that, for any 0 < C < ∞ and compact set K, there exists a positive number ∆ that, for all matrices L(θ1, θ2) with θ= (θ1, θ2)0 ∈KandB such thatkB−L(θ1, θ2)k<∆, one haskB−1k ≤C.
LetC, B be such a pair. Then, for each θ∈K, we have the inclusions (kθ(n)−θk> δ for some n≥m)
⊆ k(Mn/(x, x)n)−1k(x, x)−n1k
n
X
k=1
Xk−1εkk> δ for some n≥m
!
= (•)⊆(•,kMn/(x, x)n−L(θ1, θ2)k ≤∆ for all n≥m)
∪(•,kMn/(x, x)n−L(θ1, θ2)k>∆ for somen≥m)
⊂ C(x, x)−n1k
n
X
k=1
Xk−1εkk> δ for some n≥m
!
∪(kMn/(x, x)n−L(θ1, θ2)k>∆ for some n≥m) . This yields
Pθ(kθ(n)−θk> δ for some n≥m)
≤Pθ (x, x)−n1k
n
X
k=1
Xk−1εkk> δ0 for somen≥m
!
+Pθ(kMn/(x, x)n−L(θ1, θ2)k>∆ for some n≥m), δ0 =δ/C . By Lemmas 3.2,3.3 from [7], limiting m→ ∞, we come to (4.8).
Consider (4.9). Rewrite ˆσn2 in (4.1) as ˆ
σn2 =n−1
n
X
k=1
(εk+ (θ−θ(n))0Xk−1)2=n−1
n
X
k=1
ε2k
+2n−1(θ−θ(n))0
n
X
k=1
Xk−1εk+n−1(θ−θ(n))0
n
X
k=1
Xk−1Xk0−1(θ−θ(n)). Substituting hereθ(n) from (1.3) yields
ˆ
σ2n−σ2 = n−1
n
X
k=1
ε2k−σ2
!
−2n−1
n
X
k=1
Xk0−1εk
! Mn−1
n
X
k=1
Xk−1εk
+n−1
n
X
k=1
Xk0−1εk
!
Mn−1MnMn−1
n
X
k=1
Xk−1εk
= n−1
n
X
k=1
ε2k−σ2
!
−n−1
n
X
k=1
Xk0−1εk
! Mn−1
n
X
k=1
Xk−1εk
= n−1
n
X
k=1
ε2k−σ2
!
− 1 n(x, x)n
n
X
k=1
Xk0−1εk
!
(Mn/(x, x)n)−1
n
X
k=1
Xk−1εk. The first term in the right-hand side of this equality converges to zero in virtue of the strong law of large numbers. Therefore, in order to prove (4.9), we have to verify that, for eachK ⊂Λ1 and δ >0,
mlim→∞sup
θ∈K
Pθ
1 n
n
X
k=1
Xk0−1εk
! Mn−1
n
X
k=1
Xk−1εk > δ for some n≥m
!
= 0 In view of Lemma 3.3 in [7], it is equivalent to the following limiting relations
mlim→∞sup
θ∈K
Pθ
n
X
k=1
xk−1εk
2
> δn(x, x)n for some n≥m
= 0, (4.11)
mlim→∞sup
θ∈K
Pθ
n
X
k=1
xk−2εk
2
> δn(x, x)n for some n≥m
= 0. (4.12) To prove these relations we will make use of Lemma 2.2 from [10]. First we note that the matrix Adefined in (2.2) possesses the property (see, [7]):
sup
θ∈KkAnk ≤κ, n= 1,2, . . . , (4.13) whereκ is some positive number. This implies the following inequality
(x, x)n≤κ2
n
X
k=1
(
k
X
j=1
|εj|)2 =:Un. (4.14) Indeed, writing down (1.1) in the vector form
Xk=AXk−1+ξk, ξk= (εk,0)0, and using the formulaXk=Pk
j=1 Ak−jξj, lead to the estimate
|xk| ≤ kXkk ≤κ
k
X
j=1
|εj|
and, hence, to (4.14). By making use of the law of iterated logarithm and the Kronecker Lemma, one can show thatUnin (4.14) satisfies the following relation
Un=o(n4) a.s. (4.15)
Now let us prove, for example, (4.11). From the inequality under the sign of probability in (4.11), it follows that
n
X
k=1
xk−1εk
> δ1/2(x, x)5/8n n4/(x, x)n1/8
≥δ1/2(x, x)5/8n n4/Un1/8
. (4.16)
This enables us to obtain the following inclusions for ∆< σ2:
|
n
X
k=1
xk−1εk|> δ1/2n1/2(x, x)1/2n for some n≥m
!
⊆ •, |n−1
n
X
k=1
ε2k−σ2| ≤∆ for all n≥m
!
∪ |n−1
n
X
k=1
ε2k−σ2|>∆ for some n≥m
!
⊆ •, n−1
n
X
k=1
ε2k> σ2−∆ for all n≥m
!
∪ |n−1
n
X
k=1
ε2k−σ2|>∆ for some n≥m
!
⊆ •, n−1
n
X
k=1
ε2k > σ2−∆ all n≥m, Un/n4≤1 alln≥m
!
∪ |n−1
n
X
k=1
ε2k−σ2|>∆ for some n≥m
!
∪(Un
n4 >1 for some n≥m). From here one gets
Pθ
n
X
k=1
xk−1εk
> δ1/2n1/2(x, x)1/2n for somen≥m
!
≤Pθ
n
X
k=1
xk−1εk
> δ1/2
4−1n3/2(σ2−∆)∨(x, x)5/8n
for some n≥m
!
+Pθ |1 n
n
X
k=1
ε2k−σ2|>∆ for some n≥m
! +Pθ
Un
n4 >1 for somen≥m
.
In order to come to (4.11), it remains to use Lemma 2.2 from [10], the strong law of large numbers and putm→ ∞ . This completes the proof of Proposition 4.5.
Proof of Lemma 4.3. We start with the representation Mτ(h)ˆ σˆ2τ(h)ˆ
σ4h/2 = Mτ(h)ˆ
(x, x)ˆτ(h) · (x, x)τˆ(h) 2−1Pτ(h)ˆ
k=1kXk−1k2 · Pτ(h)ˆ
k=1kXk−1k2 hs2ˆτ(h) · s2τ(h)ˆ
σ2 ·σˆ2ˆτ(h) σ2 . It suffices to show, for any δ >0, the limiting relations
hlim→∞sup
θ∈K
Pθ
kMτ(h)ˆ /(x, x)ˆτ(h)−L(θ1, θ2)k> δ
= 0, (4.17)
hlim→∞sup
θ∈K
Pθ
(x, x)ˆτ(h)
2−1
ˆ τ(h)
X
k=1
kXk−1k2
−1
−1
> δ
= 0, (4.18)
hlim→∞sup
θ∈K
Pθ
ˆ τ(h)
X
k=1
kXk−1k2/(hs2τ(h)ˆ )−1
> δ
= 0, (4.19)
hlim→∞sup
θ∈K
Pθ
|s2ˆτ(h)/σ2−1|> δ
= 0, (4.20)
hlim→∞sup
θ∈K
Pθ
|σˆτ(h)2ˆ /σ2−1|> δ
= 0. (4.21)
Consider (4.17). We have Pθ
kMˆτ(h)/(x, x)τ(h)ˆ −L(θ1, θ2)k> δ
≤Pθ(ˆτ(h)≤m) (4.22) +Pθ(kMn/(x, x)n−L(θ1, θ2)k> δ for somen≥m) .
In view of (4.10), we need to check only that, for each sufficiently largem,
hlim→∞sup
θ∈K
Pθ(ˆτ(h)≤m) = 0. (4.23) Letm0 be a number such that, for all m≥m0, the sequence (δm) satisfies the inequality δm ≤ σ2/2. By the definition of the stopping time ˆτ(h) in (4.3), it follows that
Pθ(ˆτ(h)≤m) =Pθ(
m
X
k=1
kXk−1k2 ≥hs2m)
=Pθ(
m
X
k=1
kXk−1k2 ≥hδm, δm≥σˆm2) +Pθ(
m
X
k=1
kXk−1k2 ≥hˆσ2m, δm<ˆσ2m)
≤Pθ(ˆσm2 ≤δm) +Pθ m
X
k=1
kXk−1k2≥hˆσm2
!
≤Pθ(|σˆ2m−σ2| ≥σ2/2) +Pθ m
X
k=1
kXk−1k2 ≥hˆσm2
!
. (4.24)
Further we have Pθ
m
X
k=1
kXk−1k2≥hˆσm2
!
=Pθ(•, |σˆ2m−σ2| ≤∆) +Pθ(•, |σˆ2m−σ2| >∆)
≤Pθ m
X
k=1
kXk−1k2 ≥h(σ2−∆)
!
+Pθ(|σˆm2 −σ2|>∆). (4.25) The inequalities (4.24),(4.25), in view of Proposition 4.5, imply (4.23). This leads to (4.17). To show (4.18) we use the identity
(x, x)n= 2−1
n
X
k=1
kXk−1k2+x2n−1/2,
(4.23) and apply Lemma 3.1 from [7]. The relations (4.19)-(4.21) can be checked in a similar way. This completes the proof of Lemma 4.3.
Proof of Lemma 4.4. We will use the argument similar to that in the proof of Proposition 2.1 in [10]. First we introduce a sequence (ˆxn) of truncated observations (xn) defined as
ˆ xn=
xnif x2n≤δ2h , δ√
h ifx2n> δ2h, 0< δ <1, and the set
Ωˆh = (xn = ˆxn for all n <τˆh).
Along the lines of the proof of Proposition 2.1, one can verify that
hlim→∞sup
θ∈K
Pθ( ˆΩch) = 0. (4.26)