Uniform deterministic equivalent of additive functionals and non-parametric drift estimation for one-dimensional recurrent diffusions

(1)

www.imstat.org/aihp 2008, Vol. 44, No. 4, 771–786

DOI: 10.1214/07-AIHP141

Uniform deterministic equivalent of additive functionals and non-parametric drift estimation for one-dimensional

recurrent diffusions

D. Loukianova

^a

and O. Loukianov

^b

aDépartement Mathématique, Université d’Evry, France. E-mail: [email protected] bDépartement Informatique, IUT de Fontainebleau, France. E-mail: [email protected]

Received 29 September 2006; revised 18 July 2007; accepted 23 August 2007

Abstract. Usually the problem of drift estimation for a diffusion process is considered under the hypothesis of ergodicity. It is less often considered under the hypothesis of null-recurrence, simply because there are fewer limit theorems and existing ones do not apply to the whole null-recurrent class.

The aim of this paper is to provide some limit theorems for additive functionals and martingales of a general (ergodic or null) recurrent diffusion which would allow us to have a somewhat unified approach to the problem of non-parametric kernel drift estimation in the one-dimensional recurrent case. As a particular example we obtain the rate of convergence of the Nadaraya–

Watson estimator in the case of a locally Hölder-continuous drift.

Résumé. Habituellement le problème de l’estimation du drift pour un processus de diffusion est considéré sous l’hypothèse de l’ergodicité. Il l’est moins souvent sous l’hypothèse de nulle-récurrence, car dans ce cas il y a moins de théorèmes limites, et ceux qui existent ne s’appliquent pas à toute la classe nulle-récurrente.

Le but de cet article est de démontrer quelques théorèmes limites pour les fonctionnelles additives et martingales dépendantes d’une diffusion récurrente générale (ergodique ou nulle). Ces théorèmes permettent de donner une approche unifiée au problème de l’estimation non-paramétrique par noyau du drift dans le cas unidimensionnel récurrent. Comme exemple on obtient la vitesse de convergence de l’estimateur de Nadaraya–Watson dans le cas d’un drift localement hölderien.

MSC:60G17; 60F10; 92B20; 68T10

Keywords:Harris recurrence; Diffusion processes; Limit theorems; Additive functionals; Non-parametric estimation; Nadaraya–Watson estimator;

Rate of convergence

1. Introduction

Consider a stochastic differential equation

dXt=σ (X_t)dWt+b(X_t)dt, (1)

whereb, σ:R→Rand(Wt)t≥0 is a Brownian motion onR. Throughout this paper we suppose thatσ is strictly positive, continuous,bis measurable, and for some constantC >0 and allx∈R

σ²(x)≤C 1+x²

and b(x)≤C 1+ |x|

.

(2)

Under these conditions for eachx₀∈Rthe Eq. (1) has a unique weak solutionX=(X_t)_t_≥₀starting fromx₀, and the corresponding semigroup is strong Feller (see e.g. [20], p. 170). Furthermore, we suppose thatXis Harris recurrent.

Recall that it means thatXadmits an invariant measureμonB(R)such that for any measurablef ≥0, μ(f ) >0 ⇒ ∀x∈R:

_∞

0

f (X_t)dt= ∞ P_x-a.s.

In our case (more exactly ifσ andbare locally bounded Borel functions andσ does not vanish), the invariant measure μis absolutely continuous with respect to the Lebesgue measure, see [19], p. 298. The density ofμ(dx)is given by

μ(dx)= 2 σ²(x)exp

_x

0

2b σ²(v)dv

dx. (2)

Recall also that a one-dimensional diffusion given by (1) is recurrent if and only if _x

0

exp y

0

2b σ²(v)dv

dy→ ±∞ asx→ ±∞. (3)

With regard to theμ-total mass ofRone subdivides Harris recurrent diffusions into two sub-classes: ifμ(R)is finite thenXis said to be ergodic, and null-recurrent otherwise.

For such a diffusion we will consider the problem of drift estimation. Throughout this paper we suppose that a sample path of diffusion is observed on[0, T]and thatT → ∞.

Since we study the asymptotic behavior of diffusion processes, it is important to have at one’s disposal limit theorems for additive functionals of the formA_t =_t

0f (X_s)dsor martingales of the formM_t=_t

0f (X_s)dWs. The limit theorems for additive functionals and martingales ofergodicdiffusions are available on each level of convergence and well known, for instance iff ≥0,μ(f ) <∞, we have a LLN

tlim→∞

1 t

_t

0

f (Xs)ds=μ(f ) Px-a.s.∀x and ifμ(f²) <∞, a CLT:

√1 t

_t

0

f (Xs)dWs−→N 0, μ

f² .

Fornull-recurrentdiffusions there is no LLN with deterministic normalization and weak convergence theorems are available only for a subclass of regularly varying null-recurrent diffusions, see [12,21] and the book by Höpfner and Löcherbach [10]. As for limit theorems working in a general recurrent setting, there is only one result: the famous Chacon–Ornstein theorem stating that all integrable additive functionals (IAF) ofX are asymptotically equivalent:

∀f ≥0,μ(f ) <∞;∀g≥0, 0< μ(g) <∞;

tlim→∞

t

0f (Xs)ds t

0g(Xs)ds =μ(f )

μ(g) P_x-a.s.∀x and its integral version yields thatμ-a.s.

tlim→∞

Ex

_t

0f (X_s)ds Ex

_t

0g(X_s)ds =μ(f )

μ(g); (4)

where the exceptional set depends onf and ong.

The literature on statistical inference for ergodic diffusions in general and drift estimation in particular is extensive and significant results can be obtained in this case, see for example the works by Dalalyan and Kutoyants [5], Dalalyan [4], Van-Zanten [22], Galtchouk and Pergamentchikov [8], Yoshida [24] to mention just a few; an extensive survey can be found in the recent book [14] by Kutoyants.

(3)

The situation is different for null-recurrent diffusions. As we have mentioned, an important class of regularly varyingnull-recurrent diffusions behaving much like the ergodic ones has been thoroughly studied by Höpfner and Löcherbach in [10]. For such diffusions the pair(M_t/√

v_t, A_t/v_t)converges in law ast→ ∞, wherev_t=t^αl(t )for some 0< α≤1 andl(t )varying slowly at infinity. Using these facts, Höpfner and Kutoyants [9] have presented one of the first examples concerning the rate of convergence and the limit distribution of MLE for null-recurrent diffusions.

It might be possible to extend their method to the whole class of regular variation, but since it is based on weak convergence it cannot be used in general cases.

Our aim in this paper is to develop some tools, for the problem of drift estimation, and more precisely, for the rate of convergence calculation, which would work in a general recurrent (null or ergodic) setting.

The first idea in this direction could be derived from the Chacon–Ornstein theorem. Because all IAF of X are equivalent, we can consider some fixed IAF as a “time” of the system and try to work with a random normalisation based on this IAF. This idea was used in [7] to study the rate of convergence of a non-parametric kernel estimator and by Loukianova, Loukianov [17] to obtain the a.s. rate of convergence of the MLE. However, applying this idea in each particular case presents a real technical difficulty, and no concise method, based on this idea, has emerged so far.

The second idea is based on the following observation: we are interested in the calculation of the rate of convergence, and the rate is a notion of tightness. All known limit theorems deal with a stronger type of convergence, and none (except Chacon–Ornstein) works for the whole recurrent class. However, tightness should be sufficient to treat the problem of the rate of convergence. Therefore the natural question arises: is there a property of tightness with deterministic norming for IAF, which would be true for all recurrent diffusions? The answer is affirmative:

For every recurrent diffusionXthere is some deterministic functionvt, called in the sequeldeterministic equivalent ofX. With this function, which will be explained shortly, the following theorem holds:

Theorem 1. For every IAFA_t ofXthe following holds

Mlim→∞lim inf

t→∞ P 1

M<At

vt

< M

=1.

Such a property was first proved for recurrent Markov chains by Chen [3], without any relation to statistics. In [16], using Chen’s method, we extended this property to one-dimensional diffusions, and recently, in [15], using a new version of the Nummelin splitting method, this property was extended to the whole class of continuous time Harris recurrent Markov processes.

To explain what a deterministic equivalentvt is, we need the notion of a special function.

Definition 1.1. A measurable bounded functionf:R→R⁺is said to be special if for everyh:R→R⁺,measurable, bounded and such thatμ(h) >0,the following functionU^hf is bounded:

U^hf (x)=Ex

_∞

0

exp − _t

0

h(Xs)ds

f (Xt)dt.

This notion can be extended to additive functionals.

Definition 1.2. A continuous AFA_t is said to be special if for everyh:R→R⁺,measurable,bounded and such that μ(h) >0,the function

U_A^h1(x)=Ex

_∞

0

exp − _t

0

h(Xs)ds

dAt

is bounded.

For instance, a diffusion local time is a special additive functional (SAF), see [2]; on the other hand, any measurable bounded function with compact support is special for an-dimensional strong Feller diffusion, see [15].

Observe that in the “integral version” (4) of the Chacon–Ornstein theorem we cannot determine whether the result holds or not for fixedx,f andg. The most important application of special functions and functionals is in the fact

(4)

that the assertion (4) holds for everyx, and moreover, the following “strong” version of Chacon–Ornstein theorem (SCO) holds:

Theorem 2. IfA,Bare two SAF such thatνB :=EμB1>0,then for every pair(π1, π2)of probability measures,

tlim→∞

Eπ1A_t

Eπ2B_t =ν_A ν_B. (Recall that ifA_t=_t

0f (X_s)dswithf∈L¹(μ), thenν_A =μ(f ).)

Now, to normalize our additive functional we take some specialgwithμ(g)=1, and put vt=Eπ

_t

0

g(Xs)ds,

whereπ is some probability measure on E. Clearly,v_t is non-negative and non-decreasing. In view of the strong Chacon–Ornstein theorem the asymptotic order ofv_tdepends only on the law ofX. IfXis an ergodic diffusion, then vt∼t, ifXis a null-recurrent diffusion of regular variation with indexα, thenvt∼t^α; in particular, ifXis a linear Brownian motion, thenv_t ∼√

t. Some less explicit examples are possible: ifX is a drifted Brownian motion with compactly supported drift, thenXis null-recurrent andv_t∼√

t. However, to evaluatev_t in the general case we need the semi-group ofX, sovt is clearly a more “abstract” object.

In [16] we have applied the deterministic equivalent to give a rate of convergence of the MLE for a class of recurrent diffusions with Hölder drift. In this parametric context, the deterministic equivalent turned out to be an appropriate tool, because if in the proof of the ergodic case, we replace the normalizationt withvt, we obtain, with very few modifications, a proof for the whole recurrent class.

In this paper we apply the deterministic equivalent to the calculation of the rate of convergence in non-parametric drift estimation. We suppose, as usual, that the drift function ofXis locallyα-Hölder near some pointx0∈E, and we estimateb(x₀)by the Nadaraya–Watson estimatorbˆ^h_x^t

0,tgiven by bˆ^h_x

0,t= t

0φ ((Xs−x0)/ h)dXs

t

0φ ((Xs−x0)/ h)ds .

Hereφ is a non-negative smooth function with compact support. The processbˆ_x^h

0,t is composed with some bandwidthht. In the ergodic settinght is a deterministic function oft which can be found using a standard optimization procedure. A similar procedure, when guessing the optimal bandwidth in the general recurrent case, givesh_t as a function ofv_t. As we have noticed above,v_t cannot be calculated in general, but we need an estimator, depending only on the observations. So we also express the bandwidth (and the rate) in terms of any random and observable IAFV_t, equivalent (in the sense of Theorem 1) tov_t.

Therefore, the objects appearing in this problem and those for which we need to study the asymptotic behaviour, are additive functionalsA^h_t and martingalesM_t^hcomposed with some deterministic or random bandwidthht, namely

A^h_t^t = t

0

φ

X_s−x₀ ht

ds and M_t^h^t = t

0

φ

X_s−x₀ ht

dWs.

To be precise, with a view of rate calculation, we want to show some tightness property for these objects, with some normalization depending onvt. When the bandwidth is random,A^h_t^t is no longer an additive functional, andM_t^h^t is no longer a martingale. It would be difficult to obtain their asymptotics directly, using stochastic calculus methods, because these objects do not have the usual stochastic structure. To avoid this difficulty, we firstly show some uniform inhtightness property forA^h_t. Namely, we show that¹_hA^h_t admitsvtas uniform deterministic equivalent (Theorem 5).

We then derive a “diagonal” tightness property forA^h_t^t andM_t^h^t, Corollaries 3.1 and 3.2. All results regarding the limit behavior ofA^h_t^t andM_t^h^t are given in Section 3.

The main technical tools of our work are two theorems on the local timeL^x_t of X, namely, the uniform in x deterministic equivalent (Theorem 3), and the uniform inx strong Chacon–Ornstein Theorem 4. This is the content of Section 2.

(5)

Statistical applications are given in Section 4. Theorem 6 gives the rater_t=v_t^−α/(2α+¹⁾specified by the bandwidth ht=v⁻_t ^1/(2α⁺¹⁾. The random rateRtand the bandwidthHt are given by the same theorem and are:Rt=V_t⁻^α/(2α⁺¹⁾; Ht=V_t⁻^1/(2α⁺¹⁾.

Expressed in deterministic terms, our result agrees with the well-known rater_t=t⁻^α/(2α⁺¹⁾ in the ergodic case of this model, as well as with those of Delattre and Hoffmann [6], where the minimax ratert=t^−α/(4α+²⁾ specified by the bandwidthh_t=t⁻⁽⁻^1/(4α⁺²⁾⁾was found for the model of drifted Brownian motion with compactly supported drift. Recall that this diffusion is null-recurrent withvt ∼√

t. Our random expressions agree with those of Delattre, Hoffmann and Kessler [7], where the Nadaraya–Watson estimator was studied in a minimax setting by different methods and both rate and bandwidth were expressed in terms of one fixed IAF: the occupation time.

Notice finally that for multi-dimensional diffusions, it is possible, using the Nummelin splitting, to get a result similar to Theorem 6, but only with deterministic rate and bandwidth [15].

2. Uniform deterministic equivalent and uniform Strong Chacon–Ornstein theorem for the local time

Recall that we consider a recurrent scalar diffusionXgiven by the Eq. (1) and subject to the same assumptions. As usual, μ denotes the invariant measure ofX and(L^x_t)t≥0 the Tanaka–Meyer local time at the pointx ∈R. Letπ be some probability on Randv_t =Eπ

_t

0g(X_s)ds be a deterministic equivalent ofX. We supposeg measurable, bounded, compactly supported and such thatμ(g)=1. In this section we derive a uniform deterministic equivalent for (L^x_t)_t_≥₀(Theorem 3), and we show that the local time obeys a uniform strong Chacon–Ornstein theorem (Theorem 4).

Proposition 2.1. For eachy∈R,ν_L^y =σ²(y)μ(y),whereμ(y)is the density of the invariant measureμ(dy)with respect to Lebesgue measure.In particular, 0<ν_L^y<∞.

Proof. Denote byp(t;x, y)the density of the transition function ofXwith respect to the invariant measureμ. This density exists and may be taken to be positive, jointly continuous in all variables and symmetric, that isp(t, x, y)= p(t, y, x), see [11], p. 149.

On the other hand, the invariant measureμhas itself a density with respect to Lebesgue measure,μ(dx)=μ(x)dx, where

μ(x)= 2 σ²(x)exp

_x

0

2b σ²(v)dv

. (5)

We have the following relation between the local time and the densityp(t;x, y):

ExL^y_t =σ²(y)μ(y) t

0

p(s;x, y)ds, (6)

see [1], p. 21 or [11], p. 175. The factor σ²(y)μ(y) arises since L^y_t is the Tanaka–Meyer local time and not the Ito–McKean local time.

ν_L^y :=EμL^y₁=σ²(y)μ(y)

Rμ(x)dx 1

0

p(s;x, y)ds

=σ²(y)μ(y) ₁

0

ds

Rp(s;y, x)μ(x)dx=σ²(y)μ(y).

Remark 2.1. The conditionνL^y>0implies,according to Brancovan[2],that∀x∈R,Px(L^y_∞= ∞)=1,and that L^yis a SAF.

Theorem 3. LetK⊂Rbe some compact set.For any initial probabilityνthe following limit holds:

Mlim→∞lim inf

t→∞ Pν

1 M ≤ inf

x∈K

L^x_t v_t ≤sup

x∈K

L^x_t v_t ≤M

=1.

(6)

Before we prove this theorem let us discuss the structure of the proof. The basic idea is of course to show some kind of uniform continuity property of the family of normalized local times. For each pointx0∈K,we do it for some small interval[x0−δ;x0+δ]withδ >0 depending onx0.Using this we will start by proving the Theorem 3 for this small interval and then for the compactK. By definition fory < z,

L^y_t −L^z_t

2 =(X_t−y)⁺−(X_t−z)⁺−

(X0−y)⁺−(X0−z)⁺

− _t

0

1_{y<Xs≤z}σ (X_s)dWs− _t

0

1_{y<Xs≤z}b(X_s)ds a.s. (7)

Fort≥0,y < zdenote M_t^y,z=

_t

0

1_{y<X_s≤z}σ (Xs)dWs. (8)

Using occupation formula and boundedness of|b|/σ²onK, withC >0, we can write L^y_t −L^z_t≤2δ+M_t^y,z+C

_z

y

L^x_t dx a.s. (9)

Now to prove Theorem 3 we need Lemmas 2.1–2.4. The first one is axillary to the second. The second deals with the martingale term of the decomposition (9), the third one with the last term of the (9) and Lemma 2.4 gives the equicontinuity property we need.

PutKδ:= [x0−δ;x0+δ].

Lemma 2.1. For any initial probabilityνthe following limit holds:

Mlim→∞lim sup

t→∞ Pν

_x₀₊_δ

x0−δ

L^x_t2

dx > Mv_t²

=0.

Proof. Fix somex < x0. Observe that with some constantC >0 we have L^x_t −L^x_t⁰2

≤C

(2δ)²+ _t

0

1_{x<Xs≤x0}σ (X_s)dWs

2

+ _t

0

1_{Xs∈Kδ}b(X_s)ds 2

.

Observe that thanks to conditions onbandσ in the Introduction they are bounded onK_δ. Fort≥0, introduce the notation

A_t = _t

0

1_{X_s_∈K_δ_}b(X_s)ds. (10)

SinceA_t is an IAF (and even SAF), with notation (10) we have L^x_t −L^x_t⁰2

1_A

t≤√⁴

Mvt ≤C

(2δ)²+ M_t^x,x⁰2

+√ Mv_t²

.

Notice that Eν

M_t^x,x⁰2

≤Eν

_t

0

1_{Xs∈Kδ}σ²(X_s)ds:= ˜v_t.

Thanks to the condition onσ the last expectation is an expectation of SAF, and we have denoted it byv˜_t to stress the fact that it is a version of the deterministic equivalent ofX. Observe that the same reasoning (with changingM_t^x,x⁰ on M_t^x⁰^,x) and the same estimations hold forx > x0. Finally, for allx∈K_δit holds

Eν

L^x_t −L^x_t⁰2

1_A

t≤√⁴

Mvt ≤C

(2δ)²+ ˜v_t+√ Mv_t²

.

(7)

Now, to prove the lemma we use the following decomposition:

Pν

x₀+δ x0−δ

L^x_t2

dx > Mv_t²

≤Pν

8δ L^x_t⁰2

> Mv²_t +Pν

4

_x₀₊_δ

x0−δ

L^x_t −L^x_t⁰2

dx > Mv²_t;At<√⁴ Mvt

+Pν

A_t>√⁴ Mv_t

. (11)

With some constantC,for the second term of the right-hand side expression of (11) we have:

Pν

4

_x₀₊_δ

x₀−δ

L^x_t −L^x_t⁰2

dx > Mv_t²;At<√⁴ Mvt

≤ 1 Mv_t²

x₀+δ x0−δ

Eν

L^x_t −L^x_t⁰2

1_A

t≤√⁴

Mvtdx≤C(2δ)²+ ˜v_t+√ Mv_t² Mv_t² .

According the strong Chacon–Ornstein theorem there is a limit of v˜_t/v_t, as t→ ∞. Using this, together with Theorem 1 for the other two terms on the right of (11) we obtain Lemma 2.1.

Let V be some IAF of X, such thatν_V>0, and for simplicity we take ν_V =1. Vt will play the role of normalization forL^y_t −L^z_t to obtain the uniform continuity property we need.

In the sequel of this section denote Ω_M,t:=

K_δ

L^x_t2

dx < M²v_t²; v_t

M ≤V_t≤v_tM

.

Let{(M_t^y,z;t≥0);x₀−δ≤y < z≤x₀+δ}be a family of martingales defined by (8).

Lemma 2.2. For everyu >0andM >0there is someτ =τ (ω, M, u, K_δ)such that a.s.fort > τit holds sup

y,z∈K_δ

|M_t^y,z|

V_t 1ΩM,t < u v^1/4_t

.

Proof. Fix someu >0 andM >0. Denote Bn=

sup

{t;eⁿ⁻¹<v_t≤eⁿ}

sup

{a,b∈Kδ}

|M_t^y,z|

V_t 1Ω_M,t ≥ u v_t^1/4

.

We will use the Nishiyama martingale inequality to bound the probability ofBnand then the Borel–Cantelli lemma to prove Lemma 2.2. The statement of the Nishiyama martingale’s inequality for our setting is given bellow, for a proof of this result see [18] and for its generalization see [23].

Let ρ be a metric on Kδ, denote byMρ,t the quadratic ρ-modulus of the family of martingales {M_t^y,z;y <

z, y, z∈K_δ}. For everyt >0, this modulus is given by

Mρ,t=sup

y=z

M_t^y,z

ρ(y, z) .

As usual, denote byN (K_δ, ρ, ε)the metric entropy ofK_δ with respect to the metricρ, i.e. the minimal number of ρ-balls of radiusε, needed to coverK_δ.The Nishiyama inequality for tail probabilities [23] states that

P

sup

t≤τ

sup

ρ(y,z)≤η

M_t^y,z1_{Mρ,τ≤κ}≥x

≤2e⁻^x²^/(cφ²^(η)κ²⁾,

(8)

whereC >0, η >0, κ >0 are constants,τ is a finite stopping time and φ (η)=

_η

0

logN (K_δ, ρ, ε)dε

is supposed to be finite.

Before we use the Nishiyama inequality we need to evaluate the quadraticρ-modulus on the setΩM,t. We have M_t^y,z

= _t

0

1_{[y;z[}(Xs)σ²(Xs)ds= _z

y

L^x_t dx≤√ z−y

_x₀₊_δ

x0−δ

L^x_t2

dx and then

M_t^y,z

1Ω_M,t ≤√

z−yMvt. If we takeρ(y, z)=√⁴

z−yas a distance onK_δ, it is easy to see thatφ (δ)is finite and we obtain a bound for quadratic ρ-modulusMρ,t restricted onΩ_M,t:

Mρ,t1Ω_M,t =sup

y=z

M_t^y,z

ρ(y, z) 1Ω_M,t≤ Mvt.

Then

P(Bn)=P

sup

{t;eⁿ⁻¹<vt≤eⁿ} {y,zsup∈Kδ}

|M_t^y,z| Vt

1ΩM,t > u v^1/4_t

≤P

sup

{t;eⁿ⁻¹<vt≤eⁿ} {y,zsup∈K_δ}

M_t^y,z> uv_t

Mv_t^1/4; Mρ,t≤ Mv_t

≤P

sup

{t;eⁿ⁻¹<v_t≤eⁿ} {y,zsup∈K_δ}

M_t^y,z> ueⁿ⁻¹

Me^n/4; Mρ,eⁿ≤√ Meⁿ

≤2 exp

− u²e²ⁿ⁻² M³e^(3/2)nφ²(δ)

.

For allu >0, for allM >0 the series

nP(Bn)converges, and the lemma follows by Borel–Cantelli.

Note thatv_t^1/4is enough for our needs, but actually one can replacev^1/4_t in the lemma withv_t^pwithout changing the proof.

Lemma 2.3. For everyM >0there is somet=t (ω, M)such that a.s.fort > t (ω, M)it holds

y=z∈Ksupδ

1 z−y

_z

y

L^x_t

V_t dx1Ω_M,t ≤A

with some constantA >0,independent ofM, t, ω.

Proof. Lety < zinK_δ. _z

y

L^x_t Vt

dx1ΩM,t ≤(z−y) L^x_t⁰

Vt +sup

y,z

|L^y_t −L^z_t| Vt

1ΩM,t

. (12)

(9)

Using the Chacon–Ornstein theorem and Proposition 2.1, almost surely ^L

x0 t

Vt → ^ν_ν^Lx_V =σ²(x)μ(x):=l₀ withl₀ independent ofω. Using the definition (7) and the notations (10) and (8) we can write:

|L^y_t −L^z_t|

V_t 1Ω_M,t ≤2δ

V_t +|M_t^y,z|

V_t 1Ω_M,t+A_t V_t.

Following Lemma 2.2 there is somet=t (ω, M), such that fort > t (ω, M)

|L^y_t −L^z_t| Vt

1ΩM,t ≤2δ Vt + δ

v^1/4_t +A_t

Vt ≤2l1, (13)

where we have denotedl₁=limt→∞At

Vt. From (12) and (13) we deduce that the lemma holds withA=2(l0+l₁).

Lemma 2.4. For everyMthere is somet=t (ω, M),such that a.s.fort > t (ω, M)it holds with some constantC >0 sup

y=z∈Kδ

|L^y_t −L^z_t| Vt

1ΩM,t≤δC.

Proof. We apply Lemmas 2.2 and 2.3 to the decomposition

|L^y_t −L^z_t| Vt

1ΩM,t ≤2δ

Vt +|M_t^y,z| Vt

1ΩM,t+ _z

y

L^x_t Vt

dx1ΩM,t,

where in Lemma 2.2 we takeu=δ.

This property is weaker than the restricted uniform continuity of normalized local time, but enough for what we want to do. Namely, it provides the necessarily “uniform” argument to show that ^inf^K_v^L^x^t

t is not far from ^L

x0 t

vt for K= [x0−δ;x0+δ]with a smallδ. Now we can prove Theorem 3:

Proof. We firstly prove the theorem forK_δ= [x0−δ;x0+δ]with some smallδdepending onx0. Lett=t (ω, M) of Lemma 2.4. As previously, letl0>0 be the almost sure limitl0=limt→∞L^x_t⁰

Vt . Lett(ω)= inf{t >0;l₀/2< L^x_t⁰/V_t<2l0}andτ =max(t, t). FixM >0 andδ=δ(x₀) >0 in such a way that the following holds withC >0 from Lemma 2.4

1 M ≤l₀

2 −δC≤2l0+δC≤M.

We have P

1 M ≤ inf

x∈K_δ

L^x_t V_t ≤ sup

x∈Kδ

L^x_t V_t ≤M

≥P 1

M ≤L^x_t⁰ Vt + inf

x∈Kδ

L^x_t −L^x_t⁰ Vt ≤L^x_t⁰

Vt + sup

x∈Kδ

L^x_t −L^x_t⁰

Vt ≤M∩Ω_M,t∩t > τ

≥P 1

M ≤L^x_t⁰

V_t −δC≤L^x_t⁰

V_t +δC≤M∩ΩM,t∩t > τ

≥P 1

M ≤l₀

2 −δC≤2l0+δC≤M∩Ω_M,t∩t > τ

≥P(ΩM,t∩t > τ ). (14)

For the last expression we have

Mlim→∞lim inf

t→∞ P

Ω_M,t∩t > τ (ω, M)

=1.

(10)

Note that we can easily changeV_t withv_t in (14), so the theorem forK_δis proven.

Now letK be some compact in the state space. For eachx ∈K we choose δ_x>0 such that the theorem with K(x)= [x−δx;x+δx]is true. Then by standard compactness arguments we obtain the theorem forK.

The strong Chacon–Ornstein (SCO) theorem states that ifAtis a SAF andvtis a deterministic equivalent associated toX_t, thenEνA_t ∼v_t,ast→ ∞for any initial distributionν. The following theorem is a uniform version of SCO theorem for a local time.

Theorem 4. The sequence of real functiong_t(y)=^E^ν_v^L_t^y^t converges uniformly onK_δto the functionl(y)=σ²(y)μ(y) ast→ ∞.

Proof. The point by point convergence follows from the strong Chacon–Ornstein theorem and Proposition 2.1:

tlim→∞

EνL^y_t

v_t =ν_L^y

1 =σ²(y)μ(y). (15)

To show the uniform convergence we will prove that the family{^E^ν_v^L^y^t

t }t >0is equicontinuous onK_δ= [x₀−δ, x₀+δ].

1

2L^y_t =(X_t−y)⁺−(X0−y)⁺− t

0

1_{Xs>y}dXs

hence 1 2

EνL^y_t −EνL^z_t

=Eν

(Xt−y)⁺−(Xt−z)⁺

−Eν

(X0−y)⁺−(X0−z)⁺

−Eν

t 0

1_{y<Xs≤z}b(X_s)ds.

So, as an initial inequality we can take 1

2EνL^y_t −EνL^z_t≤2|y−z| +Eν

_t

0

1_{y<Xs≤z}b(X_s)ds. (16)

We rewrite this using occupation formula, and divide byv_t:

|EνL^y_t −EνL^z_t|

2vt ≤ 2|y−z|

vt + z

y

|b(x)|

σ²(x) EνL^x_t

vt

dx. (17)

From (16) we have for ally, z∈K_δ 1

2EνL^y_t −EνL^z_t≤4δ+Eν

_t

0

1_{Xs∈Kδ}b(X_s)ds=4δ+ ˜v_t. (18) Sincebis locally bounded andXis strong Feller, the second term on the right-hand side in (18) is an expectation of special additive functional which was denoted byv˜_t. Hence, for allx∈K_δ

EνL^x_t

v_t ≤ |EνL^x_t −EνL^x_t⁰|

v_t +EνL^x_t⁰

v_t ≤4δ+ ˜v_t+ ˜˜v_t v_t ,

where we denotedEνL^x_t⁰ byv˜˜_t to stress the fact thatEνL^x_t⁰ is an expectation of another special additive functional.

According to the strong Chacon–Ornstein theorem the right-hand side expression converges to some constant, hence for some constantM >0 andt >0 large enough

sup

x∈[x0−δ,x0+δ]

EνL^x_t

vt ≤M. (19)

(11)

Now we insert (19) in (17): for someK >0 and a large enought

|EνL^y_t −EνL^z_t|

vt ≤|y−z|

vt +M sup

[x0−δ,x0+δ]

|b(x)|

σ²(x)|y−z| ≤K|y−z|

and uniform convergence follows from the Ascoli lemma.

Corolarry 2.1. Leth_t, t≥0,be some deterministic function with range in[0, δ]and such thatlimt→∞h_t =0.For ally∈ [−1;1]limt→∞EνL^x_t^{0+ht y}

vt =μ(x0)σ²(x0).

Proof. We use the theorem about continuous convergence, see [13], p. 194: A sequence of continuous functions(fn) defined on some metric compact X converges uniformly to f if and only if for any sequence (xn)the condition

limn→∞xn=ximplies limn→∞fn(xn)=f (x).

Corolarry 2.2. Let φbe measurable bounded and compactly supported in[−1;1].Leth:R⁺→R⁺be such that limt→∞h_t=0,andψ:R→R⁺continuous inx₀.Then for any initial probabilityνthe following limit holds:

tlim→∞

Eν

_t

0φ ((X_s−x₀)/h_t)ψ (X_s)ds

htvt =ψ (x0)μ(x0)

Rφ (y)dy.

Proof. Using the occupation formula Eν

_t

0φ ((X_s−x₀)/h_t)ψ (X_s)ds

h_tv_t = Eν

Rφ ((x−x0)/ht)(ψ (x)/σ²(x))L^x_t dx h_tv_t

=

Rφ (y)ψ (x0+hty)

σ²(x0+hty)f_t(y)dy, where we have denotedf_t(y)=^E^ν^L^x^t_v^{0+ht y}_t . Using Corollary 2.1,

∀y∈ [−1;1] lim

t→∞f_t(y)=μ(x₀)σ²(x₀).

Also using (19) for someM >0 andtlarge enough we obtain sup_[−_1,1_]f_t≤M. The dominated convergence theorem

concludes the proof.

3. Limit theorems for AF and martingales composed with deterministic or random bandwidth Letφ:R→R⁺be of classC¹, with support in[−1,1]and₁

−1φ (x)dx=1. LetXbe a Harris recurrent diffusion subject to the assumptions of (1). Denote forh≥0

1 hA^h_t =1

h _t

0

φ

Xs−x0

h

ds (forh=0 see (21)), and forh >0

M_t^h= t

0

φ

X_s−x₀ h

σ (X_s)dWs. (20)

SinceφisC¹, there is a continuous modification of _h¹A^h_t and ifσ is bounded, ofM_t^h,h∈ ]0, δ]. We adopt these modifications in what follows. As before, forδ >0,K_δ= [x0−δ, x0+δ]andv_t denotes a deterministic equivalent ofX.

In this section we obtain asymptotic results forA^h_t andM_t^hcomposed with deterministic or random bandwidthht.