Adaptive and non-adaptive estimation for degenerate diffusion processes

(1)

HAL Id: hal-02488402

https://hal.archives-ouvertes.fr/hal-02488402

Preprint submitted on 22 Feb 2020

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are pub- lished or not. The documents may come from

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de

Adaptive and non-adaptive estimation for degenerate diffusion processes

Arnaud Gloter, Nakahiro Yoshida

To cite this version:

Arnaud Gloter, Nakahiro Yoshida. Adaptive and non-adaptive estimation for degenerate diffusion processes. 2020. �hal-02488402�

(2)

Adaptive and non-adaptive estimation for degenerate diffusion processes ^∗

Arnaud Gloter

¹

and Nakahiro Yoshida

^2,3

1

Laboratoire de Mathématiques et Modélisation d’Evry, Université d’Evry

^†

2

Graduate School of Mathematical Sciences, University of Tokyo

^‡

3

CREST, Japan Science and Technology Agency February 22, 2020

Summary We discuss parametric estimation of a degenerate diffusion system from time-discrete observations. The first component of the degenerate diffusion system has a parameter θ

1

in a non-degenerate diffusion coefficient and a parameter θ

2

in the drift term. The second component has a drift term parameterized by θ

3

and no diffusion term. Asymptotic normality is proved in three different situations for an adaptive estimator for θ

3

with some initial estimators for (θ

1

, θ

2

), an adaptive one-step estimator for (θ

1

, θ

2

, θ

3

) with some initial estimators for them, and a joint quasi-maximum likelihood estimator for (θ

1

, θ

2

, θ

3

) without any initial estimator.

Our estimators incorporate information of the increments of both components. Thanks to this construction, the asymptotic variance of the estimators for θ

1

is smaller than the standard one based only on the first component. The convergence of the estimators for θ

3

is much faster than the other parameters. The resulting asymptotic variance is smaller than that of an estimator only using the increments of the second component.

Keywords and phrases Degenerate diffusion, one-step estimator, quasi-maximum likelihood estimator.

∗This work was in part supported by Japan Science and Technology Agency CREST JPMJCR14D7; Japan Society for the Promotion of Science Grants-in-Aid for Scientific Research No. 17H01702 (Scientific Research);

and by a Cooperative Research Program of the Institute of Statistical Mathematics.

†Laboratoire de Mathématiques et Modélisation d’Evry, CNRS, Univ Evry, Université Paris-Saclay, 91037, Evry, France. e-mail: arnaud.gloter@univ-evry.fr

‡Graduate School of Mathematical Sciences, University of Tokyo: 3-8-1 Komaba, Meguro-ku, Tokyo 153- 8914, Japan. e-mail: nakahiro@ms.u-tokyo.ac.jp

(3)

1 Introduction

In this article, we will discuss parametric estimation for a hypo-elliptic diffusion process. More precisely, given a stochastic basis (Ω, F , F, P ) with a right-continuous filtration F = ( F

^t

)

t∈R+

, R

+

= [0, ∞ ), suppose that an F-adapted process Z

t

= (X

t

, Y

t

) satisfies the stochastic differential equation

( dX

t

= A(Z

t

, θ

2

)dt + B (Z

t

, θ

1

)dw

t

dY

t

= H(Z

t

, θ

3

)dt (1.1)

Here A : R

^d^Z

× Θ

2

→ R

^d^X

, B : R

^d^Z

× Θ

1

→ R

^d^X

⊗ R

^r

, H : R

^d^Z

× Θ

3

→ R

^d^Y

, and w = (w

t

)

t∈R+

is an r-dimensional F-Wiener process. The spaces Θ

i

(i = 1, 2, 3) are the unknown parameter spaces of the components of θ = (θ

₁

, θ

₂

, θ

₃

) to be estimated from the data (Z

_t_j

)

j=0,1,...,n

, where t

j

= jh, h = h

n

satisfying h → 0, nh → ∞ and nh

²

→ 0 as n → ∞ .

Estimation theory has been well developed for diffusion processes. Even focusing on parametric estimation for ergodic diffusions, there is huge amount of studies: Kutoyants [22, 24, 23], Prakasa Rao [29, 30], Yoshida [39, 40], Bibby and Sørensen [1], Kessler [20], K¨ uchler and Sørensen [21], Genon–Catalot et al. [11], Gloter [13, 14, 16], Sakamoto and Yoshida [31], Uchida [35], Uchida and Yoshida [36, 37, 38], Kamatani and Uchida [19], De Gregorio and Ia- cus [9], Genon–Catalot and Lar´edo [12], Suzuki and Yoshida [34] among many others. Nakakita and Uchida [28] and Nakakita et al. [27] studied estimation under measurement error; related are Gloter and Jacod [17, 18]. Non parametric estimation for the coefficients of an ergodic diffusion has also been widely studied : Dalayan and Kutoyants [8], Kutoyants [24], Dalalyan [5], Dalalyan and Reiss [6, 7], Comte and Genon–Catalot [2], Comte et al. [3], Schmisser [33], to name a few. Historically attentions were paid to inference for non-degenerate cases.

Recently there is a growing interest in hypo-elliptic diffusions, that appear in various applied fields. Examples of the hypo-elliptic diffusion include the harmonic oscillator, the Van der Pol oscillator and the FitzHugh-Nagumo neuronal model; see e.g. Le´on and Samson [25]. For parametric estimation of hypo-elliptic diffusions, we refer the reader to Gloter [15] for a discretely observed integrated diffusion process, and Samson and Thieullen [32] for a contrast estimator.

Comte et al. [4] gave adaptive estimation under partial observation. Recently, Ditlevsen and Samson [10] studied filtering and inference for hypo-elliptic diffusions from complete and partial observations. When the observations are discrete and complete, they showed asymptotic normality of their estimators under the assumption that the true value of some of parameters are known. Melnykova [26] studied the estimation problem for the model (1.1), comparing contrast functions and least square estimates. The contrast functions we propose in this paper are different from the one in [26].

In this paper, we will present several estimation schemes. Since we assume discrete-time

observations of Z= (Z

t

)

t∈R+

, quasi-likelihood estimation for θ

1

and θ

2

is known; only difference

from the standard diffusion case is the existence of the covariate Y = (Y

_t

)

_t∈R+

in the equation of

X= (X

t

)

t∈R+

but it causes no theoretical difficulty. We will give an exposition for construction

of those standard estimators in Sections 7 and 8 for selfcontainedness. Thus, our first approach

in Section 4 is toward estimation of θ

₃

with initial estimators for θ

₁

and θ

₂

. The idea for

construction of the quasi-likelihood function in the elliptic case was based on the local Gaussian

approximation of the transition density. Then it is natural to approximate the distribution of

(4)

the increments of Y by that of the principal Gaussian variable in the expansion of the increment.

However, this method causes deficiency, as we will observe there; see Remark 4.6 on p.18. We present a more efficient method by incorporating an additional Gaussian part from X. The rate of convergence attained by the estimator for θ

3

is n

^−1/2

h

^1/2

and it is much faster than the rate (nh)

^−1/2

for θ

₂

and n

^−1/2

for θ

₁

. Section 5 treats some adaptive estimators using suitable initial estimators for (θ

1

, θ

2

, θ

3

), and shows joint asymptotic normality. Then it should be remarked that the asymptotic variance of our estimator ˆ θ

1

for θ

1

has improved that of the ordinary volatility parameter estimator, e.g. ˆ θ

₁⁰

recalled in Section 7, that would be asymptotically optimal if the system consisted only of X. In Section 6, we consider a non-adaptive joint quasi-maximum likelihood estimator. This method does not require initial estimators. From computational point of view, adaptive methods often have merits by reducing dimension of parameters, but the non-adaptive method is still theoretically interesting. Section 2 collects the assumptions under which we will work. Section 3 offers several basic estimates to the increments of Z. To investigate efficiency of the presented estimators, we need the LAN property of the exact likelihood function of the hypo-elliptic diffusion. We will discuss this problem elsewhere.

2 Assumptions

We assume that Θ

i

(i = 1, 2, 3) are bounded open domain in R

^pⁱ

, respectively, and Θ = Q

3 i=1

Θ

i

has a good boundary so that Sobolev’s embedding inequality holds, that is, there exists a positive constant C

Θ

such that

sup

θ∈Θ

| f

i

(θ

i

) | ≤ C

Θ

X

1

k=0

k ∂

_θ^k

f k

^L^p^(Θ)

(2.1)

for all f ∈ C

¹

(Θ) and p > P

3

i=1

p

_i

. If Θ has a Lipschitz boundary, then this condition is satisfied. Obviously, the embedding inequality (2.1) is valid for functions depending only on a part of components of θ.

In this paper, we will propose an estimator for θ and show its consistency and asymptotic normality.

Given a finite-dimensional real vector space E, denote by C

_p^a,b

( R

^d^Z

× Θ

i

; E) the set of functions f : R

^d^Z

× Θ

i

→ E such that f is continuously differentiable a times in z ∈ R

^d^Z

and b times in θ

i

∈ Θ in any order and f and all such derivatives are continuously extended to R

^d^Z

× Θ

i

, moreover, they are of at most polynomial growth in z ∈ R

^d^Z

uniformly in θ ∈ Θ. Let C = BB

^⋆

,

⋆ denoting the matrix transpose. We suppose that the process (Z

t

)

t∈R+

that generates the data satisfies the stochastic differential equation (1.1) for a true value θ

^∗

= (θ

₁^∗

, θ

₂^∗

, θ

^∗₃

) ∈ Θ

1

× Θ

2

× Θ

3

. [A1 ] (i) A ∈ C

_pⁱ^A^,j^A

( R

^d^Z

× Θ

2

; R

^d^X

) and B ∈ C

_pⁱ^B^,j^B

( R

^d^Z

× Θ

1

; R

^d^X

⊗ R

^r

).

(ii) H ∈ C

_pⁱ^H^,j^H

( R

^d^Z

× Θ

3

; R

^d^Y

).

We will denote F

_x

for ∂

_x

F , F

_y

for ∂

_y

F , and F

_i

for ∂

_θ_i

F .

[A2 ] (i) sup

_t∈R₊

k Z

t

k

^p

< ∞ for every p > 1.

(5)

(ii) There exists a probability measure ν on R

^d^Z

such that 1

T Z

T

0

f (Z

t

) dt →

^p

Z

f (z)ν(dz) (T → ∞ ) for any bounded continuous function f : R

^d^Z

→ R .

(iii) The function θ

1

7→ C(Z

t

, θ

1

)

⁻¹

is continuous on Θ

1

a.s., and sup

θ1∈Θ1

sup

t∈R+

k det C(Z

t

, θ

1

)

⁻¹

k

^p

< ∞ for every p > 1.

(iv) For the R

^d^Y

⊗ R

^d^Y

valued function V (z, θ

1

, θ

3

) = H

x

(z, θ

3

)C(z, θ

1

)H

x

(z, θ

3

)

^⋆

, the function (θ

1

, θ

3

) 7→ V (Z

t

, θ

1

, θ

3

)

⁻¹

is continuous on Θ

1

× Θ

3

a.s., and

sup

(θ1,θ3)∈Θ1×Θ3

sup

t∈R+

k det V (Z

t

, θ

1

, θ

3

)

⁻¹

k

^p

< ∞ for every p > 1.

Remark 2.1. (a) It follows from [A2] that the convergence in [A2] (ii) holds for any continuous function f of at most polynomial growth.

(b) We implicitly assume the existence of C(Z

T

, θ

1

)

⁻¹

and V (Z

t

, θ

1

, θ

3

)

⁻¹

in (iii) and (iv) of [A2].

(c) Fatou’s lemma implies Z

| z |

^p

ν(dz) + sup

θ1∈Θ1

Z

det C(z, θ

1

)

−p

ν(dz) + sup

(θ1,θ3)∈Θ1×Θ3

Z

det V (z, θ

1

, θ

3

)

−p

ν(dz) < ∞

for any p > 0.

Let

Y

⁽¹⁾

(θ

1

) = − 1 2

Z

Tr C(z, θ

1

)

⁻¹

C(z, θ

₁^∗

)

− d

_X

+ log det C(z, θ

1

) det C(z, θ

₁^∗

)

ν(dz).

Since | log x | ≤ x + x

⁻¹

for x > 0, Y

⁽¹⁾

(θ

1

) is a continuous function on Θ

1

well defined under [A1] and [A2]. Let

Y

^(J,1)

(θ

1

) = − 1 2

Z

Tr C(z, θ

1

)

⁻¹

C(z, θ

₁^∗

)

+ Tr V (z, θ

1

, θ

^∗₃

)

⁻¹

V (z, θ

₁^∗

, θ

^∗₃

)

− d

_Z

+ log det C(z, θ

1

) det V (θ

1

, θ

₃^∗

)

det C(z, θ

₁^∗

) det V (θ

₁^∗

, θ

^∗₃

)

ν(dz)

Let

Y

⁽²⁾

(θ

₂

) = − 1 2

Z

C(z, θ

₁^∗

)

⁻¹

A(z, θ

₂

) − A(z, θ

₂^∗

)

⊗2

ν(dz) (2.2)

(6)

Let

Y

⁽³⁾

(θ

3

) = − Z

6V (z, θ

^∗₁

, θ

3

)

⁻¹

H(z, θ

3

) − H(z, θ

^∗₃

)

⊗2

ν(dz).

The random field Y

⁽³⁾

is well defined under [A1] and [A2]. Let Y

^(J,3)

(θ

1

, θ

3

) = −

Z

6V (z, θ

1

, θ

3

)

⁻¹

H(z, θ

3

) − H(z, θ

₃^∗

)

⊗2

ν(dz).

We will assume all or some of the following identifiability conditions [A3 ] (i) There exists a positive constant χ

1

such that

Y

⁽¹⁾

(θ

1

) ≤ − χ

1

| θ

1

− θ

^∗₁

|

²

(θ

1

∈ Θ

1

).

(i

^′

) There exists a positive constant χ

^′₁

such that

Y

^(J,1)

(θ

1

) ≤ − χ

^′₁

| θ

1

− θ

₁^∗

|

²

(θ

1

∈ Θ

1

).

(ii) There exists a positive constant χ

2

such that

Y

⁽²⁾

(θ

₂

) ≤ − χ

₂

| θ

₂

− θ

₂^∗

|

²

(θ

₂

∈ Θ

₂

).

(iii) There exists a positive constant χ

3

such that

Y

⁽³⁾

(θ

₃

) ≤ − χ

₃

| θ

₃

− θ

₃^∗

|

²

(θ

₃

∈ Θ

₃

).

(iii

^′

) There exists a positive constant χ

3

such that

Y

^(J,3)

(θ

1

, θ

3

) ≤ − χ

3

| θ

3

− θ

₃^∗

|

²

(θ

1

∈ Θ

1

, θ

3

∈ Θ

3

).

3 Basic estimation of the increments

We denote U

^⊗k

for U ⊗ · · ·⊗ U (k-times) for a tensor U. For tensors S

¹

= (S

_i¹_1,1_,...,i_1,d

1;j1,1,...,j1,k1

), ..., S

^m

= (S

_i^m_m,1_,...,i

m,dm;jm,1,...,j_m,km

) and and a tensor T = (T

ⁱ^1,1^,...,i¹^,d1^,...,i^m,1^,...,i^m,dm

), we write T [S

¹

, ..., S

^m

] = T [S

¹

⊗ · · · ⊗ S

^m

]

=

X

i1,1,...,i1,d1,...,im,1,...,i_m,dm

T

ⁱ^1,1^,...,i^1,d¹^,...,i^m,1^,...,i^m,dm

S

_i¹₁_,₁_,...,i_1,d

1;j1,1,...,j1,k1

· · · S

_i^m_m,1_,...,i

m,dm;jm,1,...,j_m,km

j1,1,...,j_1,k1,...,jm,1,...,j_m,km

.

This notation will be applied for a tensor-valued tensor T as well.

(7)

We have

h

^−1/2

∆

j

X = h

^−1/2

Z

tj

tj−1

B(Z

t

, θ

₁^∗

)dw

t

+ h

^−1/2

Z

tj

tj−1

A(Z

t

, θ

₂^∗

)dt

= h

^−1/2

B(Z

tj−1

, θ

₁^∗

)∆

j

w + r

_j^(3.2)

(3.1)

where

r

^(3.2)_j

= h

^−1/2

Z

tj

tj−1

(B (Z

t

, θ

^∗₁

) − B(Z

t_j−1

, θ

^∗₁

))dw

t

+ h

^−1/2

Z

tj

tj−1

A(Z

t

, θ

^∗₂

)dt (3.2) Lemma 3.1. (a) Under [A1] with (i

A

, j

A

, i

B

, j

B

, i

H

, j

H

) = (0, 0, 0, 0, 0, 0) and [A2] (i) ,

sup

s,t∈R+,|s−t|≤∆

k Z

s

− Z

t

k

^p

= O(∆

^1/2

) (∆ ↓ 0) (3.3)

for every p > 1 .

(b) Under [A1] with (i

A

, j

A

, i

B

, j

B

, i

H

, j

H

) = (0, 0, 1, 0, 0, 0) and [A2] (i), r

^(3.2)_j

= O

L^∞^–

(h

^1/2

), i.e.,

sup

n

sup

j

k r

^(3.2)_j

k

^p

= O(h

^1/2

) for every p > 1.

Proof. (a) is trivial. For (b), the first term on the right-hand side of (3.2) can be estimated by the Burkholder-Davis-Gundy inequality, Taylor’s formula for B(Z

t

, θ

^∗₁

) − B(Z

tj−1

, θ

₁^∗

) and by (3.3).

We have

h

^−1/2

∆

_j

X = h

^−1/2

Z

tj

tj−1

B(Z

_t

, θ

₁^∗

)dw

_t

+ h

^−1/2

Z

tj

tj−1

A(Z

_t

, θ

^∗₂

)dt

= h

^−1/2

Z

tj

tj−1

B(Z

t

, θ

₁^∗

)dw

t

+ h

^1/2

A(Z

tj−1

, θ

^∗₂

) + r

^(3.4)_j

where

r

_j^(3.4)

= h

^−1/2

Z

tj

tj−1

A(Z

t

, θ

^∗₂

) − A(Z

tj−1

, θ

^∗₂

)

dt (3.4)

Then

Lemma 3.2. r

^(3.4)_j

= O

L^∞^–

(h) , i.e., sup

n

sup

j

r

^(3.4)_j

p

= O(h) (3.5)

for every p > 1 if [A1] for (i

A

, j

A

, i

B

, j

B

, i

H

, j

H

) = (1, 0, 0, 0, 0, 0) and [A2] (i) hold.

(8)

Proof. Thanks to (3.3).

Let

L

H

(z, θ

1

, θ

2

, θ

3

) = H

x

(z, θ

3

)[A(z, θ

2

)] + 1

2 H

xx

(z, θ

3

)[C(z, θ

1

)] + H

y

(z, θ

3

)[H(z, θ

3

)].

Define the R

^d^Y

-valued function G

n

(z, θ

1

, θ

2

, θ

3

) by G

n

(z, θ

1

, θ

2

, θ

3

) = H(z, θ

3

) + h

2 L

H

z, θ

1

, θ

2

, θ

3

.

Write

ζ

j

= √ 3

Z

tj

tj−1

Z

t

tj−1

dw

s

dt

Then E ζ

_j^⊗2

= h

³

I

r

for the r-dimensional identity matrix I

r

. We have

∆

j

Y − hG

n

(Z

tj−1

, θ

1

, θ

2

, θ

3

)

= ∆

j

Y − hH(Z

tj−1

, θ

3

) − h

²

2 L

H

Z

tj−1

, θ

1

, θ

2

, θ

3

= hH(Z

_t_j−1

, θ

₃^∗

) − hH(Z

_t_j−1

, θ

₃

) +H

x

(Z

tj−1

, θ

₃^∗

)B(Z

tj−1

, θ

₁^∗

)

Z

t_j

t_j−1

Z

t

t_j−1

dw

s

dt

+ Z

tj

tj−1

Z

t

tj−1

H

x

(Z

s

, θ

₃^∗

)B(Z

s

, θ

^∗₁

) − H

x

(Z

tj−1

, θ

^∗₃

)B(Z

tj−1

, θ

₁^∗

) dw

s

dt

+ Z

tj

tj−1

Z

t

tj−1

L

H

(Z

s

, θ

^∗₁

, θ

₂^∗

, θ

^∗₃

) − L

H

(Z

tj−1

, θ

1

, θ

2

, θ

3

) dsdt

=

hH(Z

tj−1

, θ

^∗₃

) − hH(Z

tj−1

, θ

3

) + κ(Z

tj−1

, θ

₁^∗

, θ

₃^∗

)ζ

j

+ ρ

j

(θ

1

, θ

2

, θ

3

) (3.6) where

κ(Z

tj−1

, θ

₁^∗

, θ

₃^∗

) = 3

^−1/2

H

x

(Z

tj−1

, θ

₃^∗

)B(Z

tj−1

, θ

^∗₁

) and

ρ

j

(θ

1

, θ

2

, θ

3

) = Z

tj

tj−1

Z

t

tj−1

H

x

(Z

s

, θ

^∗₃

)B(Z

s

, θ

^∗₁

) − H

x

(Z

tj−1

, θ

^∗₃

)B(Z

tj−1

, θ

^∗₁

) dw

s

dt

+ Z

tj

tj−1

Z

t

tj−1

L

H

(Z

s

, θ

₁^∗

, θ

₂^∗

, θ

^∗₃

) − L

H

(Z

tj−1

, θ

1

, θ

2

, θ

3

)

dsdt. (3.7)

Let

D

^j

(θ

1

, θ

2

, θ

3

) = h

^−1/2

∆

j

X − hA(Z

tj−1

, θ

2

) h

^−3/2

∆

_j

Y − hG

_n

(Z

_t_j−1

, θ

₁

, θ

₂

, θ

₃

)

!

. (3.8)

(9)

Lemma 3.3. Suppose that [A1] with (i

A

, j

A

, i

B

, j

B

, i

H

, j

H

) = (1, 0, 1, 0, 3, 0) and [A2] (i) are satisfied. Then

(a) sup

n

sup

j

ρ

j

(θ

₁^∗

, θ

^∗₂

, θ

₃^∗

)

p

= O (h

²

) for every p > 1.

(b) sup

n

sup

j

D

^j

(θ

₁^∗

, θ

^∗₂

, θ

^∗₃

)

p

< ∞ for every p > 1.

Proof. It is possible to show (a) by (3.7) and using the estimate (3.3) with the help of Taylor’s formula. Additionally to the representation (3.6), by using (3.1) and (3.2), we obtain (b).

We denote by (B

x

B)(z, θ

2

) the tensor defined by (B

x

B)(z, θ

2

)[u

1

⊗ u

2

] = B

x

(z, θ

2

)[u

2

, B(z, θ

2

)[u

1

]]

for u

1

, u

2

∈ R

^r

. Moreover, we write dw

s

dw

t

for dw

s

⊗ dw

t

, and (B

x

B )(Z

tj−1

, θ

^∗₂

) R

tj

tj−1

R

t

tj−1

dw

s

dw

t

for (B

x

B )(Z

tj−1

, θ

₂^∗

) R

tj

tj−1

R

t

tj−1

dw

s

dw

t

. We will apply this rule in similar situations. Let L

B

(z, θ

1

, θ

2

, θ

3

) = B

x

(z, θ

1

)[A(z, θ

2

)] + 1

2 B

xx

(z, θ

3

)[C(z, θ

1

)] + B

y

(z, θ

3

)[H(z, θ

3

)]. (3.9) Lemma 3.4. Suppose that [A1] with (i

A

, j

A

, i

B

, j

B

, i

H

, j

H

) = (1, 1, 2, 0, 0, 0) and [A2] (i) are satisfied. Then

h

^−1/2

∆

_j

X − hA(Z

_t_j−1

, θ

₂

)

= ξ

_j^(3.11)

+ ξ

_j^(3.12)

+ r

_j^(3.13)

(θ

₂

) (3.10) where

ξ

_j^(3.11)

= h

^−1/2

B(Z

tj−1

, θ

₁^∗

)∆

j

w, (3.11) ξ

_j^(3.12)

= h

^−1/2

(B

x

B)(Z

t_j−1

, θ

₁^∗

)

Z

tj

tj−1

Z

t

tj−1

dw

s

dw

t

, (3.12)

and

r

_j^(3.13)

(θ

2

) = h

^−1/2

Z

tj

tj−1

Z

t

tj−1

((B

x

B )(Z

s

, θ

^∗₁

) − (B

x

B )(Z

tj−1

, θ

^∗₁

))dw

s

dw

t

+h

^−1/2

Z

tj

tj−1

Z

t

tj−1

L

B

(Z

s

, θ

^∗₁

, θ

₂^∗

, θ

^∗₃

)dsdw

t

+h

^−1/2

Z

tj

tj−1

A(Z

t

, θ

^∗₂

) − A(Z

tj−1

, θ

2

)

dt. (3.13)

Moreover,

sup

n

sup

j

k r

^(3.13)_j

(θ

^∗₂

) k

^p

= O(h) (3.14)

for every p > 1 , and

r

_j^(3.13)

(θ

2

) ≤ r

^(3.16)_n,j

h

^1/2

θ

2

− θ

₂^∗

+ h (3.15) with some random variables r

_n,j^(3.16)

satisfying

sup

n

sup

j

r

_n,j^(3.16)

p

< ∞ (3.16)

for every p > 1 .

(10)

Proof. The decomposition (3.10) is obtained by Itˆo’s formula. The estimate (3.14) is verified by (3.3) since ∂

_z

(B

_x

B) and ∂

_z

A are bound by a polynomial in z uniformly in θ. The estimate (3.15) uses ∂

2

A for θ

2

near θ

^∗₂

as well as ∂

z

A evaluated at θ

^∗₂

:

r

_j^(3.13)

(θ

2

)

1

{|θ2−θ^∗₂|<r}

≤ r

^(3.16)_n,j

h

^1/2

θ

2

− θ

₂^∗

+ h 1

{|θ2−θ₂^∗|<r}

with some positive constant r and some random variables r

_n,j^(3.16)

satisfying (3.16). The small number r was taken to ensure convexity of the vicinity of θ

₂^∗

. For θ

2

such that | θ

2

− θ

₂^∗

| ≥ r, the estimate (3.15) is valid by enlarging r

_n,j^(3.16)

if necessary.

Lemma 3.5. (a) Suppose that [A1] with (i

A

, j

A

, i

B

, j

B

, i

H

, j

H

) = (1, 1, 2, 1, 3, 0) and [A2] (i) are satisfied. Then

∆

j

Y − hG

n

(Z

tj−1

, θ

1

, θ

2

, θ

^∗₃

) = ξ

_j^(3.18)

+ ξ

^(3.19)_j

+ h

^3/2

r

_j^(3.20)

(θ

1

, θ

2

) + h

^3/2

r

_j^(3.21)

(θ

1

, θ

2

) (3.17) where

ξ

^(3.18)_j

= κ(Z

tj−1

, θ

₁^∗

, θ

₃^∗

)ζ

j

, (3.18)

ξ

_j^(3.19)

= ((H

x

B )

x

B )(Z

tj−1

, θ

₁^∗

, θ

₃^∗

) Z

tj

tj−1

Z

t

tj−1

Z

s

tj−1

dw

r

dw

s

dt, (3.19)

r

^(3.20)_j

(θ

1

, θ

2

) = h

^−3/2

Z

tj

tj−1

Z

t

tj−1

Z

s

tj−1

((H

x

B )

x

B)(Z

r

, θ

^∗₁

, θ

^∗₃

)

− ((H

x

B)

x

B)(Z

tj−1

, θ

₁^∗

, θ

₃^∗

) dw

r

dw

s

dt +h

^−3/2

Z

tj

tj−1

Z

t

tj−1

Z

s

tj−1

L

_H_x_B

(Z

_r

, θ

₁^∗

, θ

₂^∗

, θ

^∗₃

)drdw

_s

dt +h

^−3/2

Z

tj

tj−1

Z

t

tj−1

L

H

(Z

s

, θ

1

, θ

2

, θ

₃^∗

) − L

H

(Z

tj−1

, θ

1

, θ

2

, θ

^∗₃

)

dsdt (3.20) with

L

HxB

(z, θ

1

, θ

2

, θ

3

) = (H

x

B)

x

(z, θ

1

, θ

3

)[A(z, θ

2

)] + 1

2 (H

x

B)

xx

(z, θ

1

, θ

3

)[C(z, θ

1

)]

+(H

x

B )

y

(z, θ

1

, θ

3

)[H(z, θ

3

)], and

r

_j^(3.21)

(θ

1

, θ

2

) = h

^−3/2

Z

tj

tj−1

Z

t

tj−1

L

H

(Z

s

, θ

₁^∗

, θ

₂^∗

, θ

^∗₃

) − L

H

(Z

s

, θ

1

, θ

2

, θ

^∗₃

)

dsdt. (3.21) Moreover,

sup

n

sup

j

sup

(θ1,θ2)∈Θ1×Θ2

r

^(3.20)_j

(θ

1

, θ

2

)

p

= O(h) (3.22)

(11)

for every p > 1 , and

r

_j^(3.21)

(θ

1

, θ

2

)

≤ h

^1/2

r

^(3.24)_n,j

θ

1

− θ

₁^∗

+

θ

2

− θ

^∗₂

(3.23) for all (θ

1

, θ

2

) ∈ Θ

1

× Θ

2

with some random variables r

_n,j^(3.24)

satisfying

sup

n

sup

j

r

_n,j^(3.24)

p

< ∞ (3.24)

for every p > 1 .

(b) Suppose that [A1] with (i

A

, j

A

, i

B

, j

B

, i

H

, j

H

) = (1, 1, 2, 1, 2, 0) and [A2] (i) are satisfied.

Then there exist random variables r

_n,j^(3.25)

and a number ρ such that sup

θ3∈Θ3

D

^j

(θ

1

, θ

2

, θ

3

) − D

^j

(θ

^∗₁

, θ

^∗₂

, θ

3

)

≤ h

^1/2

r

^(3.25)_n,j

θ

1

− θ

₁^∗

+

θ

2

− θ

^∗₂

for all (θ

1

, θ

2

) ∈ B ((θ

₁^∗

, θ

₂^∗

), ρ) and that sup

n

sup

j

r

_n,j^(3.25)

p

< ∞ (3.25)

for every p > 1.

Proof. By (3.6), we have

∆

j

Y − hG

n

(Z

tj−1

, θ

1

, θ

2

, θ

^∗₃

) = ξ

_j^(3.18)

+ ρ

j

(θ

1

, θ

2

, θ

₃^∗

) (3.26) and

ρ

j

(θ

1

, θ

2

, θ

^∗₃

) = Z

tj

tj−1

Z

t

tj−1

H

x

(Z

s

, θ

^∗₃

)B(Z

s

, θ

₁^∗

) − H

x

(Z

tj−1

, θ

₃^∗

)B (Z

tj−1

, θ

^∗₁

) dw

s

dt

+ Z

t_j

tj−1

Z

t

tj−1

L

H

(Z

s

, θ

^∗₁

, θ

₂^∗

, θ

^∗₃

) − L

H

(Z

tj−1

, θ

1

, θ

2

, θ

₃^∗

) dsdt.

Then the decomposition (3.17) is obvious. The first and third terms on the right-hand side of (3.20) can be estimated with Taylor’s formula and (3.3), and the second term is easy to estimate.

Thus, we obtain (3.22). Since ∂

_(θ1,θ2)

L

H

(z, θ

1

, θ

2

, θ

₃^∗

) is bound by a polynomial in z uniformly in (θ

1

, θ

2

), there exist random variables r

^(3.24)_n,j

that satisfy (3.23) and (3.24). [ First show (3.23) on the set {| (θ

1

, θ

2

) − (θ

₁^∗

, θ

₂^∗

) | < r } , next see this estimate is valid on Θ

1

× Θ

2

\{| (θ

1

, θ

2

) − (θ

₁^∗

, θ

₂^∗

) | <

r } by redefining r

_n,j^(3.24)

if necessary. We obtained (a). The assertion (b) is easy to verify with (3.6), (3.7) and Lemma 3.4.

Lemma 3.6. Suppose that [A1] with (i

A

, j

A

, i

B

, j

B

, i

H

, j

H

) = (0, 0, 0, 0, 2, 1) and [A2] (i) are satisfied. Then

sup

(θ1,θ2)∈Θ1×Θ2

D

^j

(θ

1

, θ

2

, θ

3

) − D

^j

(θ

1

, θ

2

, θ

₃^′

)

≤ h

^−1/2

r

^(3.27)_n,j

θ

3

− θ

^′₃

(θ

3

, θ

^′₃

∈ Θ

3

)

(12)

for some random variables r

^(3.27)_n,j

such that sup

n

sup

j

r

_n,j^(3.27)

p

< ∞ (3.27)

for every p > 1 . Proof.

D

^j

(θ

1

, θ

2

, θ

3

) − D

^j

(θ

1

, θ

2

, θ

^′₃

)

=

0 h

^−1/2

H(Z

tj−1

, θ

₃^′

) − H(Z

tj−1

, θ

3

)

+

^h¹₂^/²

L

H

(Z

tj−1

, θ

1

, θ

2

, θ

₃^′

) − L

H

(Z

tj−1

, θ

1

, θ

2

, θ

3

)

Therefore the lemma is obvious. Apply the Taylor formula for the argument θ

₃

if θ

₃

and θ

^′₃

are close, otherwise and if necessary, redifine r

_n,j^(3.27)

.

4 An adaptive estimator for θ ₃

We will work with some initial estimators ˆ θ

⁰₁

for θ

⁰₁

and ˆ θ

⁰₂

for θ

₂

. The following standard convergence rates, in part or fully, will be assumed for these estimators:

[A4 ] (i) θ ˆ

₁⁰

− θ

₁^∗

= O

p

(n

^−1/2

) as n → ∞ (ii) θ ˆ

₂⁰

− θ

^∗₂

= O

_p

(n

^−1/2

h

^−1/2

) as n → ∞

Sections 7 and 8 recall certain standard estimators for θ

1

and θ

2

, respectively. The ex- pansions (3.1) and (3.6) with Lemma 3.5 suggest two approaches for estimating θ

3

. The first approach is based on the likelihood of h

^−3/2

∆

_j

Y − hG

_n

(Z

_t_j−1

, θ

₁

, θ

₂

, θ

₃

)

only. The second one uses the likelihood corresponding to D

^j

(θ

1

, θ

2

, θ

3

). However, it is possible to show that the first approach gives less optimal asymptotic variance; see Remark 4.6. So, we will treat the second approach here.

4.1 Adaptive quasi-likelihood function for θ

3

Let

S(z, θ

1

, θ

3

) =

C(z, θ

1

) 2

⁻¹

C(z, θ

1

)H

x

(z, θ

3

)

^⋆

2

⁻¹

H

x

(z, θ

3

)C(z, θ

1

) 3

⁻¹

H

x

(z, θ

3

)C(z, θ

1

)H

x

(z, θ

3

)

^⋆

Then

S(z, θ

1

, θ

3

)

⁻¹

= C(z, θ

1

)

⁻¹

+ 3H

x

(z, θ

3

)

^⋆

V (z, θ

1

, θ

3

)

⁻¹

H

x

(z, θ

3

) − 6H

x

(z, θ

3

)

^⋆

V (z, θ

1

, θ

3

)

⁻¹

− 6V (z, θ

1

, θ

3

)

⁻¹

H

x

(z, θ

3

) 12V (z, θ

1

, θ

3

)

⁻¹

! .

(4.1)

Adaptive and non-adaptive estimation for degenerate diffusion processes

HAL Id: hal-02488402

https://hal.archives-ouvertes.fr/hal-02488402