North-Western European Journal of Mathematics

A representation for the derivative with respect to the initial data of the solution of an SDE with a non-regular drift

Olga Viktorovna Aryasova¹ Andrey Yurievich Pilipenko²

Received: June 10, 2016/Accepted: January 10, 2017/Online: January 25, 2017

Abstract

We consider a multidimensional SDE with a Gaussian noise and a drift vector being a vector function of bounded variation. We prove the existence of the generalized derivative of the solution with respect to the initial conditions and represent the derivative as a solution of a linear SDE with coefficients depending on the initial process. The obtained representation is a natural generalization of the expression for the derivative in the smooth case. In the proof we use results on continuous additive functionals.

Keywords: Stochastic flow; Continuous additive functional; Differentiability with respect to initial data.

MSC: 60J65, 60H10.

Introduction

Consider a $d$-dimensional nonhomogeneous stochastic differential equation (SDE)

$$
\begin{cases}
d\varphi_t(x) = a(t,\varphi_t(x))\,dt + \sum_{k=1}^{m}\sigma_k(t,\varphi_t(x))\,dw_k(t),\\
\varphi_0(x) = x,
\end{cases} \tag{1}
$$

where $x\in\mathbb{R}^d$, $d\ge 1$, $m\ge 1$, $(w(t))_{t\ge 0}=(w_1(t),\dots,w_m(t))_{t\ge 0}$ is a standard $m$-dimensional Wiener process, the drift coefficient $a:[0,\infty)\times\mathbb{R}^d\to\mathbb{R}^d$ is Borel measurable and bounded, and the diffusion coefficient $\sigma:[0,\infty)\times\mathbb{R}^d\to\mathbb{R}^d\times\mathbb{R}^m$ is bounded and continuous.
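To make the setting of Equation (1) concrete, the following minimal sketch simulates one path in the scalar case $d=m=1$ with the Euler-Maruyama scheme. The scheme, the step drift `step_drift` and the constant diffusion `unit_sigma` are illustrative assumptions and are not taken from the paper; the drift is bounded, measurable and discontinuous, which is the kind of coefficient the paper allows.

```python
import math
import random


def euler_maruyama(a, sigma, x0, T, n_steps, rng):
    """Simulate one scalar path of d(phi) = a(t, phi) dt + sigma(t, phi) dw on [0, T].

    a, sigma : callables (t, x) -> float (illustrative coefficients)
    rng      : a random.Random instance supplying the Gaussian increments
    Returns the list [phi_0, phi_dt, ..., phi_T] of length n_steps + 1.
    """
    dt = T / n_steps
    x = x0
    path = [x]
    for k in range(n_steps):
        t = k * dt
        dw = rng.gauss(0.0, math.sqrt(dt))  # Wiener increment over [t, t + dt]
        x = x + a(t, x) * dt + sigma(t, x) * dw
        path.append(x)
    return path


def step_drift(t, x):
    """Hypothetical bounded drift with a jump at x = 0."""
    return -1.0 if x >= 0.0 else 1.0


def unit_sigma(t, x):
    """Hypothetical constant (hence uniformly elliptic) diffusion coefficient."""
    return 1.0


example_path = euler_maruyama(step_drift, unit_sigma, 0.5, 1.0, 1000, random.Random(0))
```

Passing the random generator explicitly makes it possible to drive two paths started from different points with the same noise, which is what a comparison of $\varphi_t(x)$ and $\varphi_t(x+\varepsilon v)$ requires.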

¹ Institute of Geophysics, National Academy of Sciences of Ukraine, Palladin pr. 32, 03680, Kiev-142, Ukraine

² Institute of Mathematics, National Academy of Sciences of Ukraine, Tereshchenkivska str. 3, 01601, Kiev, Ukraine; National Technical University of Ukraine "KPI", Kiev, Ukraine

In what follows we assume that $\sigma$ satisfies the following conditions:

(C1) $\sigma\in W^{0,1}_{2d+2,\mathrm{loc}}([0,\infty)\times\mathbb{R}^d)$.

(C2) Uniform ellipticity. For each $T>0$, there exists an ellipticity constant $B>0$ such that for all $t\in[0,T]$, $x\in\mathbb{R}^d$, $\theta\in\mathbb{R}^d$,

$$
\theta^{*}\sigma(t,x)\sigma^{*}(t,x)\theta\ \ge\ B|\theta|^{2},
$$

where $|\cdot|$ is a norm in $\mathbb{R}^d$.

(C3) Hölder continuity. For each $T>0$, there exist $L>0$, $0<\alpha\le 1$ such that for all $t_1,t_2\in[0,T]$, $x_1,x_2\in\mathbb{R}^d$, $1\le i\le d$, $1\le k\le m$,

$$
\left|\sigma_k^{i}(t_1,x_1)-\sigma_k^{i}(t_2,x_2)\right|\ \le\ L\left(|t_1-t_2|^{\alpha/2}+|x_1-x_2|^{\alpha}\right).
$$

Under these assumptions on the coefficients there exists a unique strong solution to Equation (1)³.

It is well known⁴ that if the coefficients of Equation (1) are continuously differentiable in the spatial variable and the derivatives are bounded and Hölder continuous uniformly in $t$, then there exists a modification of $\varphi_t(x)$ (denoted by the same symbol) which is continuous in $(t,x)$ and continuously differentiable in $x$ almost surely. Moreover, the derivative $\nabla\varphi_t(x):=Y_t(x)$ is a solution of the equation

$$
dY_t(x)=\nabla a(t,\varphi_t(x))\,Y_t(x)\,dt+\sum_{k=1}^{m}\nabla\sigma_k(t,\varphi_t(x))\,Y_t(x)\,dw_k(t), \tag{2}
$$

where for a function $f:\mathbb{R}^d\to\mathbb{R}^d$ we set $\nabla f=\left(\frac{\partial f^{i}}{\partial x_j}\right)_{1\le i,j\le d}$.
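The smooth-case statement can be checked numerically. The sketch below is an illustration under stated assumptions and is not the authors' construction: it takes $d=m=1$, time-independent smooth coefficients chosen arbitrarily (`a_smooth`, `sigma_smooth` and their derivatives are hypothetical), discretizes Equation (1) and Equation (2) with the same Euler step and the same Wiener increments, and compares $Y_T(x)$ with a finite difference of the flow.

```python
import math
import random


def flow_and_derivative(a, da, sigma, dsigma, x0, T, n_steps, seed):
    """Euler scheme for the pair (phi_t(x0), Y_t(x0)) in the scalar smooth case.

    a, sigma   : smooth coefficients x -> float (time-independent for simplicity)
    da, dsigma : their derivatives
    seed       : fixes the Wiener path, so different x0 can share the same noise
    """
    rng = random.Random(seed)
    dt = T / n_steps
    x, y = x0, 1.0  # Y_0 = 1 is the scalar identity
    for _ in range(n_steps):
        dw = rng.gauss(0.0, math.sqrt(dt))
        # Update Y first: the linear equation (2) uses phi at the left endpoint.
        y = y + da(x) * y * dt + dsigma(x) * y * dw
        x = x + a(x) * dt + sigma(x) * dw
    return x, y


# Hypothetical smooth coefficients, chosen only for the illustration.
def a_smooth(x):
    return math.sin(x)


def da_smooth(x):
    return math.cos(x)


def sigma_smooth(x):
    return 1.0 + 0.5 * math.cos(x)


def dsigma_smooth(x):
    return -0.5 * math.sin(x)


h = 1e-6
x_T, y_T = flow_and_derivative(a_smooth, da_smooth, sigma_smooth, dsigma_smooth, 0.3, 1.0, 200, seed=1)
x_T_shift, _ = flow_and_derivative(a_smooth, da_smooth, sigma_smooth, dsigma_smooth, 0.3 + h, 1.0, 200, seed=1)
finite_difference = (x_T_shift - x_T) / h  # should be close to y_T
```

For the Euler scheme the recursion for `y` is exactly the derivative of the discrete flow with respect to `x0`, so the two quantities differ only by the $O(h)$ finite-difference error and rounding.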

Flandoli, Gubinelli, and Priola (2010) show that in the case of a smooth, bounded, uniformly non-degenerate noise and a bounded, uniformly in time Hölder continuous drift term the conditions on the coefficients can be essentially weakened, and the solution is continuously differentiable with respect to the spatial parameter.

The case of discontinuous drift is studied in Fedrizzi and Flandoli (2013a,b), Meyer-Brandis and Proske (2010), and Mohammed, Nilssen, and Proske (2015), where the weak differentiability of the solution to Equation (1) is proved under rather weak assumptions on the drift. Fedrizzi and Flandoli (2013a) consider Equation (1) with the identity diffusion matrix and a drift vector belonging to $L_q(0,T;L_p(\mathbb{R}^d))$ for some $p$, $q$ such that

$$
p\ge 2,\qquad q>2,\qquad \frac{d}{p}+\frac{2}{q}<1.
$$

³ See Veretennikov, 1981, “On strong solutions and explicit formulas for solutions of stochastic integral equations”.

⁴ Kunita, 1990, Stochastic Flows and Stochastic Differential Equations.

Applying a Zvonkin-type transformation they establish the existence of the Gâteaux derivative with respect to the initial data in $L_2(\Omega\times[0,T];\mathbb{R}^d)$. The authors of Meyer-Brandis and Proske (2010) and Mohammed, Nilssen, and Proske (2015) use the Malliavin calculus. In particular, Mohammed, Nilssen, and Proske (2015) prove that the solution of Equation (1) with a bounded measurable drift vector $a$ and the identity diffusion matrix belongs to the space $L^2(\Omega;W^{1,p}(U))$ for each $t\in\mathbb{R}$, $p>1$, and any open and bounded $U\subset\mathbb{R}^d$. Unfortunately, in these works no representations for the derivatives are given.

The one-dimensional case is considered in Aryasova and Pilipenko (2012) and Attanasio (2010) and explicit expressions for the Sobolev derivative are obtained.

The formulas involve the local time of the initial process. There is no direct generalization of the expressions for the Sobolev derivative to the multidimensional case because the local time at a point does not exist in the multidimensional situation.

The aim of this paper is to get a natural representation for the derivative $\nabla_x\varphi_t(x)$ of the solution to Equation (1). We assume that $\sigma$ satisfies conditions (C1) to (C3), and that for some $\rho>0$ and all $1\le k\le m$, $1\le i,j\le d$, the function $\left|\frac{\partial\sigma_k^{i}}{\partial y_j}(s,y)\right|^{2+\rho}$ belongs to the Kato-type class $K$, i.e.,

$$
\lim_{t\downarrow 0}\ \sup_{t_0\in[0,\infty),\,x_0\in\mathbb{R}^d}\ \int_{t_0}^{t_0+t}ds\int_{\mathbb{R}^d}\frac{1}{(2\pi(s-t_0))^{d/2}}\exp\left\{-\frac{|y-x_0|^{2}}{2(s-t_0)}\right\}\left|\frac{\partial\sigma_k^{i}}{\partial y_j}(s,y)\right|^{2+\rho}dy=0.
$$

We show that the derivative $Y_t(x)=\nabla_x\varphi_t(x)$ is a solution to the SDE

$$
Y_t(x)=E+\int_0^t dA_s(\varphi(x))\,Y_s(x)+\sum_{k=1}^{m}\int_0^t\nabla\sigma_k(s,\varphi_s(x))\,Y_s(x)\,dw_k(s), \tag{3}
$$

where $E$ is the $d$-dimensional identity matrix and $A_t(\varphi(x))$ is a continuous additive functional of the process $(t,\varphi_t(x))_{t\ge 0}$, which is equal to $\int_0^t\nabla a(s,\varphi_s(x))\,ds$ if $a$ is differentiable. This representation is a natural generalization of the expressions for the smooth case.

We prove the main result under the assumption that for each $t\ge 0$ and all $1\le i\le d$, $a^{i}(t,\cdot)$ is a function of bounded variation on $\mathbb{R}^d$, i.e., for each $1\le j\le d$, the generalized derivative $\mu^{ij}(t,dy)=\frac{\partial a^{i}}{\partial x_j}(t,dy)$ is a signed measure on $\mathbb{R}^d$. Besides, we suppose that for all $1\le i,j\le d$, $\mu^{ij}(t,dy)\,dt$ is of the class $K$, i.e.,

$$
\lim_{t\downarrow 0}\ \sup_{t_0\in[0,\infty),\,x_0\in\mathbb{R}^d}\ \int_{t_0}^{t_0+t}ds\int_{\mathbb{R}^d}\frac{1}{(2\pi(s-t_0))^{d/2}}\exp\left\{-\frac{|y-x_0|^{2}}{2(s-t_0)}\right\}\left|\mu^{ij}\right|(s,dy)=0,
$$

where $|\mu^{ij}|=\mu^{ij,+}+\mu^{ij,-}$ is the variation of $\mu^{ij}$, and $\mu^{ij,+}$, $\mu^{ij,-}$ are the measures from the Hahn-Jordan decomposition $\mu^{ij}=\mu^{ij,+}-\mu^{ij,-}$.

Similar results for a homogeneous SDE with the identity diffusion matrix and a drift being a vector function of bounded variation are obtained in Aryasova and Pilipenko (2014). In that case there is no martingale term on the right-hand side of Equation (3), which essentially simplifies the proof. The argument is based on the theory of additive functionals of homogeneous Markov processes developed by Dynkin (1965). In Bogachev and Pilipenko (2015) the same method is applied to a homogeneous SDE with Lévy noise and a drift being a vector function of bounded variation. The authors prove the existence of a strong solution and the differentiability of the solution with respect to the initial data.

Unfortunately, Dynkin’s theory cannot be applied directly to our problem because $(\varphi_t(x))_{t\ge 0}$ is not homogeneous.

The paper is organized as follows. In Section 1 we collect some facts from the theory of additive functionals of homogeneous Markov processes⁵. We consider the homogeneous process $(t,\varphi_t)_{t\ge 0}$ and adapt Dynkin’s theory to the functionals of this process. The main result is formulated in Section 2 and proved in Section 3. The idea of the proof is to approximate the solution of Equation (1) by solutions of equations with smooth coefficients. The key point is the convergence of continuous homogeneous additive functionals of the approximating processes to a functional of the process that solves Equation (1) (Lemma 6). The proof of the corresponding statement relies essentially on the convergence of the transition probability densities of the approximating processes, which is obtained in Section 4.

The suggested method can be considered as a generalization of the local time approach used in the one-dimensional case.

1 Preliminaries: continuous additive functionals

Let $(\xi_t,\mathcal{F}_t,P_z)$ be a càdlàg homogeneous Markov process with a phase space $(E,\mathcal{B})$, where the $\sigma$-algebra $\mathcal{B}$ contains all one-point sets⁶. Assume that $(\xi_t)_{t\ge 0}$ has infinite life-time. Denote $\mathcal{N}_t=\sigma\{\xi_s:0\le s\le t\}$.

Definition 1 – A random function $A_t$, $t\ge 0$, adapted to the filtration $\{\mathcal{N}_t\}$ is called a non-negative continuous additive functional of the process $(\xi_t)_{t\ge 0}$ if it is

• non-negative;

• continuous in $t$;

• homogeneous additive, i.e., for all $t\ge 0$, $s>0$, $z\in E$,

$$
A_{t+s}=A_s+\theta_sA_t\quad P_z\text{-almost surely}, \tag{4}
$$

where $\theta$ is the shift operator.

If in addition for each $t\ge 0$,

$$
\sup_{z\in E}E_zA_t<\infty,
$$

then $A_t$, $t\ge 0$, is called a W-functional.

⁵ Dynkin, 1965, Markov Processes.

⁶ See notations in ibid.

Remark 1 – It follows from Definition 1 that a W-functional is non-decreasing in $t$, and for all $z\in E$,

$$
P_z\{A_0=0\}=1.
$$

Definition 2 – The function

$$
f_t(z)=E_zA_t
$$

is called the characteristic of a W-functional $A_t$.

Remark 2 (Dynkin⁷) – For all $s\ge 0$, $t\ge 0$,

$$
\|f_{t+s}\|_E\le\|f_t\|_E+\|f_s\|_E,
$$

where $\|f_t\|_E=\sup_{z\in E}|f_t(z)|$.

The following theorem states the relation between the convergence of W-functionals and the convergence of their characteristics.

Theorem 1 (Dynkin⁸) – Let $A_{n,t}$, $n\ge 1$, be W-functionals of the process $(\xi_t)_{t\ge 0}$ and let $f_{n,t}(z)=E_zA_{n,t}$ be their characteristics. Suppose that for each $t>0$, a function $f_t(z)$ satisfies the condition

$$
\lim_{n\to\infty}\ \sup_{0\le u\le t}\ \sup_{z\in E}\left|f_{n,u}(z)-f_u(z)\right|=0. \tag{5}
$$

Then $f_t(z)$ is the characteristic of a W-functional $A_t$. Moreover,

$$
A_t=\operatorname*{l.i.m.}_{n\to\infty}A_{n,t},
$$

where l.i.m. denotes convergence in the mean square sense (for any initial distribution $\xi_0$).

Proposition 1 (Dynkin⁹) – If for any $t\ge 0$ the sequence of non-negative additive functionals $\{A_{n,t}:n\ge 1\}$ of the Markov process $(\xi_t)_{t\ge 0}$ converges in probability to a continuous functional $A_t$, then the convergence in probability is uniform, i.e.,

$$
\forall\,T>0\qquad \sup_{t\in[0,T]}\left|A_{n,t}-A_t\right|\to 0,\quad n\to\infty,\ \text{in probability}.
$$

⁷ Dynkin, 1965, Markov Processes, Properties 6.15.

⁸ Ibid., Theorem 6.3.

Example 1 – Let $E=\mathbb{R}^d$, let $h$ be a non-negative bounded measurable function on $E$, and suppose that the process $(\xi_t)_{t\ge 0}$ has a transition probability density $g_t(z_1,z_2)$. Then

$$
A_t:=\int_0^t h(\xi_s)\,ds
$$

is a W-functional of the process $(\xi_t)_{t\ge 0}$ and its characteristic is equal to

$$
f_t(z)=\int_E\left(\int_0^t g_s(z,v)\,ds\right)h(v)\,dv=\int_E k_t(z,v)h(v)\,dv,
$$

where

$$
k_t(z,v)=\int_0^t g_s(z,v)\,ds.
$$

Let a measure $\nu$ be such that $\int_E k_t(z,v)\,\nu(dv)$ is well defined. If we can choose a sequence of non-negative bounded continuous functions $\{h_n:n\ge 1\}$ such that for each $T>0$,

$$
\lim_{n\to\infty}\ \sup_{t\in[0,T]}\ \sup_{z\in E}\left|\int_E k_t(z,v)h_n(v)\,dv-\int_E k_t(z,v)\,\nu(dv)\right|=0,
$$

then by Theorem 1 there exists a W-functional $A^{\nu}_t$ corresponding to the measure $\nu$ with its characteristic being equal to $\int_E k_t(z,v)\,\nu(dv)$.
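The formula for the characteristic in Example 1 can be evaluated explicitly in a simple case. The sketch below is an illustration under stated assumptions (it is not part of the paper): $\xi$ is a one-dimensional standard Wiener process, $h=\mathbf{1}_{[-1,1]}$, so that $f_t(z)=\int_0^t P\{|z+w(s)|\le 1\}\,ds$, and the time integral is approximated by the midpoint rule. It also checks the subadditivity $\|f_{t+s}\|_E\le\|f_t\|_E+\|f_s\|_E$ of Remark 2 on a finite grid of starting points, which only approximates the supremum over $E$.

```python
import math


def norm_cdf(x):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def characteristic(t, z, n=2000):
    """f_t(z) = E_z int_0^t 1_{[-1,1]}(w_s) ds for a 1-D Wiener process started at z.

    The inner integral over v is available in closed form through norm_cdf;
    the outer integral over s is approximated by the midpoint rule with n cells.
    """
    if t <= 0.0:
        return 0.0
    ds = t / n
    total = 0.0
    for k in range(n):
        r = math.sqrt((k + 0.5) * ds)  # standard deviation of w_s at the midpoint
        total += (norm_cdf((1.0 - z) / r) - norm_cdf((-1.0 - z) / r)) * ds
    return total


def sup_norm(t, grid):
    """Maximum of f_t over a finite grid, a stand-in for the sup over z in E."""
    return max(characteristic(t, z) for z in grid)


z_grid = [0.25 * i for i in range(-8, 9)]
lhs = sup_norm(1.5, z_grid)
rhs = sup_norm(1.0, z_grid) + sup_norm(0.5, z_grid)  # Remark 2 predicts lhs <= rhs
```

Since $h\le 1$, the characteristic also satisfies $f_t(z)\le t$, so $\|f_t\|_E\to 0$ as $t\downarrow 0$, which is the type of condition used in Theorem 2.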

For a given measure $\nu$, we have a sufficient condition for the existence of a corresponding W-functional.

Theorem 2 (Dynkin¹⁰) – Suppose that

$$
\lim_{t\downarrow 0}\ \sup_{z\in E}f_t(z)=\lim_{t\downarrow 0}\ \sup_{z\in E}\int_E k_t(z,y)\,\nu(dy)=0. \tag{6}
$$

Then $f_t(z)$ is the characteristic of a W-functional $A^{\nu}_t$. Moreover,

$$
A^{\nu}_t=\operatorname*{l.i.m.}_{\varepsilon\downarrow 0}\int_0^t\frac{f_\varepsilon(\xi_u)}{\varepsilon}\,du,
$$

and the characteristics of the integral functionals $\int_0^t\frac{f_\varepsilon(\xi_u)}{\varepsilon}\,du$ converge to $f_t(z)$ in the sense of Equation (5).

⁹ Dynkin, 1965, Markov Processes, Lemma 6.1’.

Consider a process $\eta_t=(\eta^1_t,\eta^2_t)$, $t\ge 0$, which is a (unique) solution to the system of SDEs

$$
\begin{cases}
d\eta^1_t=dt,\\
d\eta^2_t=a(\eta^1_t,\eta^2_t)\,dt+\sum_{k=1}^{m}\sigma_k(\eta^1_t,\eta^2_t)\,dw_k(t).
\end{cases} \tag{7}
$$

For the initial condition $\eta^1_0=t_0$, $\eta^2_0=x_0$, denote the corresponding distribution of the process $(\eta_t)_{t\ge 0}$ by $P_{t_0,x_0}$.

Since $(\eta_t)_{t\ge 0}$ is a homogeneous Markov process, we can apply the theory of additive functionals to study it.

Let $h$ be a non-negative bounded measurable function on $E=[0,\infty)\times\mathbb{R}^d$. Then (cf. Example 1)

$$
A_t=\int_0^t h(\eta_s)\,ds
$$

is a W-functional of the process $(\eta_t)_{t\ge 0}$. Its characteristic is equal to

$$
\begin{aligned}
f_t(t_0,x_0)&=E_{t_0,x_0}\int_0^t h(\eta_s)\,ds=\int_0^t ds\int_{\mathbb{R}^d}G(t_0,x_0,t_0+s,y)\,h(t_0+s,y)\,dy\\
&=\int_{t_0}^{t_0+t}ds\int_{\mathbb{R}^d}G(t_0,x_0,s,y)\,h(s,y)\,dy,
\end{aligned} \tag{8}
$$

where $G(s,x,t,y)$, $0\le s\le t$, $x\in\mathbb{R}^d$, $y\in\mathbb{R}^d$, is the transition probability density of the process $(\eta^2_t)_{t\ge 0}$.

Let a measure $\nu$ on $[0,\infty)\times\mathbb{R}^d$ be such that $\int_{t_0}^{t_0+t}\int_{\mathbb{R}^d}G(t_0,x_0,s,y)\,\nu(ds,dy)<\infty$ for all $t\ge 0$, $t_0\ge 0$, $x_0\in\mathbb{R}^d$. If there exists a sequence of non-negative bounded continuous functions $\{h_n:n\ge 1\}$ such that for each $T>0$,

$$
\lim_{n\to\infty}\ \sup_{\substack{t\in[0,T],\ t_0\in[0,\infty)\\ x_0\in\mathbb{R}^d}}\left|\int_{t_0}^{t_0+t}ds\int_{\mathbb{R}^d}G(t_0,x_0,s,y)\,h_n(s,y)\,dy-\int_{t_0}^{t_0+t}\int_{\mathbb{R}^d}G(t_0,x_0,s,y)\,\nu(ds,dy)\right|=0,
$$

then by Theorem 1 there exists a W-functional corresponding to the measure $\nu$ with its characteristic being equal to $\int_{t_0}^{t_0+t}\int_{\mathbb{R}^d}G(t_0,x_0,s,y)\,\nu(ds,dy)$.

Theorem 3 (Corollary of Theorem 2) – Suppose that

$$
\lim_{t\downarrow 0}\ \sup_{t_0\in[0,\infty),\,x_0\in\mathbb{R}^d}f_t(t_0,x_0)=\lim_{t\downarrow 0}\ \sup_{t_0\in[0,\infty),\,x_0\in\mathbb{R}^d}\int_{t_0}^{t_0+t}\int_{\mathbb{R}^d}G(t_0,x_0,s,y)\,\nu(ds,dy)=0. \tag{9}
$$

Then $f_t(z)$, $z\in[0,\infty)\times\mathbb{R}^d$, is the characteristic of a W-functional $A^{\nu}_t$. Moreover,

$$
A^{\nu}_t=\operatorname*{l.i.m.}_{\varepsilon\downarrow 0}\int_0^t\frac{f_\varepsilon(\eta_u)}{\varepsilon}\,du,
$$

and the characteristics of the integral functionals $\int_0^t\frac{f_\varepsilon(\eta_u)}{\varepsilon}\,du$ converge to $f_t(z)$ in the sense of Equation (5).

¹⁰ Dynkin, 1965, Markov Processes, Theorem 6.6.

Let $\eta_t(t_0,x_0)=(\eta^1_t(t_0,x_0),\eta^2_t(t_0,x_0))$ be a solution of Equation (7) with the initial condition $(t_0,x_0)$, defined on a probability space $(\Omega,\mathcal{F},\mathcal{F}_t,P)$. Let $P_{t_0,x_0}$ be the distribution of the process $(\eta_t(t_0,x_0))_{t\ge 0}$, where $t_0\ge 0$, $x_0\in\mathbb{R}^d$. In Dynkin’s notation¹¹, $((\eta_t(t_0,x_0))_{t\ge 0},\mathcal{F}_t,P)$, $t_0\ge 0$, $x_0\in\mathbb{R}^d$, is called a Markov family of random functions.

Suppose that the measure $\nu$ satisfies the condition of Theorem 3. Then there exists a W-functional $A^{\nu}_t$ of the process $(\eta_t)_{t\ge 0}$. According to the definition of W-functionals, the functional is measurable with respect to the $\sigma$-algebra generated by the process $(\eta_t)_{t\ge 0}$. Since the process $(\eta_t)_{t\ge 0}$ is continuous and has infinite life-time, we can consider $A^{\nu}_t=A^{\nu}_t(\cdot)$ as a measurable function on $[0,\infty)\times C([0,\infty),\mathbb{R}^d)$ that depends only on the behavior of the process on $[0,t]$. The composition $A^{\nu}_t(\eta_\cdot(t_0,x_0))$, $t\ge 0$, is called a W-functional of $(\eta_t(t_0,x_0))_{t\ge 0}$ corresponding to the measure $\nu$. The function $A^{\nu}_t(\eta_\cdot(t_0,x_0))$ is defined for all $t_0\ge 0$, $x_0\in\mathbb{R}^d$.

If $t_0=0$, $x_0=x$, the process $\eta^2_t(0,x)=\eta^2_t(x)$ is the solution of Equation (1) starting from $x$ and therefore $\eta^2_t(x)=\varphi_t(x)$. Then $\eta_t(0,x)=(t,\varphi_t(x))$. Since the first coordinate $\eta^1_t(t_0,x_0)=t_0+t$ is non-random, we denote $A^{\nu}_t(\eta_\cdot(0,x))$ by $A^{\nu}_t(\varphi_\cdot(x))$.

Let us formulate a sufficient condition for Equation (9).

If $a$ and $\sigma$ are bounded and measurable, and $\sigma$ satisfies conditions (C2) and (C3), then the transition probability density of the process $(\eta^2_t)_{t\ge 0}$ satisfies the Gaussian estimate¹²

$$
G(s,x,t,y)\le\frac{C}{(t-s)^{d/2}}\exp\left\{-c\,\frac{|y-x|^{2}}{t-s}\right\}, \tag{10}
$$

valid in every domain of the form $0\le s<t\le T$, $x\in\mathbb{R}^d$, $y\in\mathbb{R}^d$, where $T>0$.

The constants $C$, $c$ are positive and depend only on $d$, $T$, $\|a\|_{T,\infty}$, $\|\sigma_k\|_{T,\infty}$, $1\le k\le m$, and the ellipticity constant $B$, where $\|a\|_{T,\infty}=\sup_{t\in[0,T]}\sup_{x\in\mathbb{R}^d}\|a(t,x)\|$.

Denote by $p_0(s,x,t,y)$ the transition probability density of a Wiener process:

$$
p_0(s,x,t,y)=\frac{1}{(2\pi(t-s))^{d/2}}\exp\left\{-\frac{|y-x|^{2}}{2(t-s)}\right\}. \tag{11}
$$

¹¹ See Dynkin, 1965, Markov Processes.

¹² See N. I. Portenko, 1990, Generalized Diffusion Processes, Ch. II.

The following definition is analogous to the definition of the Kato class¹³.

Definition 3 – A measure $\nu$ on $[0,\infty)\times\mathbb{R}^d$ is of the class $K$ if

$$
\lim_{t\downarrow 0}\ \sup_{t_0\in[0,\infty),\,x_0\in\mathbb{R}^d}\int_{t_0}^{t_0+t}\int_{\mathbb{R}^d}p_0(t_0,x_0,s,y)\,\nu(ds,dy)=0. \tag{12}
$$

Taking into account Equation (10), it is easy to see that if $\nu$ is of the class $K$ then it satisfies condition (9).

Definition 4 – A signed measure $\nu$ is of the class $K$ if the measure $|\nu|$ is of the class $K$, where $|\nu|=\nu^{+}+\nu^{-}$ is the variation of $\nu$. Here $\nu^{+}$, $\nu^{-}$ are the measures from the Hahn-Jordan decomposition $\nu=\nu^{+}-\nu^{-}$.

Let $\nu=\nu^{+}-\nu^{-}$ be a signed measure belonging to $K$. Then by Theorem 2 there exist W-functionals $A^{\nu^{\pm}}_t$. Denote $A^{\nu}_t=A^{\nu^{+}}_t-A^{\nu^{-}}_t$.

Remark 3 – Suppose that the signed measure $\nu$ can be represented in the form $\nu=\tilde\nu^{+}-\tilde\nu^{-}$, where $\tilde\nu^{+}$, $\tilde\nu^{-}$ are of the class $K$ but are not necessarily orthogonal. Then one can see that $A^{\nu^{+}}_t-A^{\nu^{-}}_t=A^{\tilde\nu^{+}}_t-A^{\tilde\nu^{-}}_t$.

In what follows we often deal with measures which have densities with respect to the Lebesgue measure on [0,∞)×R^d.

Definition 5 – A measurable function $h$ on $[0,\infty)\times\mathbb{R}^d$ is called a function of the class $K$ if the signed measure $\nu(ds,dy)=h(s,y)\,ds\,dy$ is of the class $K$.

Remark 4 – Let $\nu(ds,dx)=\mu(dx)\,ds$, where $\mu$ is a measure on $\mathbb{R}^d$. Then Equation (12) transforms into the following one:

$$
\lim_{t\downarrow 0}\ \sup_{x_0\in\mathbb{R}^d}\int_0^t ds\int_{\mathbb{R}^d}p_0(0,x_0,s,y)\,\mu(dy)=0. \tag{13}
$$

It is proved¹⁴ that $\mu$ satisfies condition (13) if and only if

$$
\sup_{x\in\mathbb{R}}\int_{|x-y|\le 1}\mu(dy)<\infty,\quad\text{when } d=1; \tag{14}
$$

$$
\lim_{\varepsilon\downarrow 0}\ \sup_{x\in\mathbb{R}^2}\int_{|x-y|\le\varepsilon}\ln\frac{1}{|x-y|}\,\mu(dy)=0,\quad\text{when } d=2; \tag{15}
$$

$$
\lim_{\varepsilon\downarrow 0}\ \sup_{x\in\mathbb{R}^d}\int_{|x-y|\le\varepsilon}|x-y|^{2-d}\,\mu(dy)=0,\quad\text{when } d\ge 3. \tag{16}
$$

¹³ Cf. Kuwae and Takahashi, 2007, “Kato class measures of symmetric Markov processes under heat kernel estimates”.

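Consider now a measure $\nu$ of the form $\nu(ds,dx)=\mu(s,dx)\,ds$. Similarly to Equations (14) to (16), one can show that if for each $T>0$, $\mu$ satisfies the condition

$$
\sup_{t\in[0,\infty)}\ \sup_{x\in\mathbb{R}}\int_{|x-y|\le 1}\mu(t,dy)<\infty,\quad\text{when } d=1, \tag{17}
$$

$$
\lim_{\varepsilon\downarrow 0}\ \sup_{t\in[0,\infty)}\ \sup_{x\in\mathbb{R}^2}\int_{|x-y|\le\varepsilon}\ln\frac{1}{|x-y|}\,\mu(t,dy)=0,\quad\text{when } d=2, \tag{18}
$$

$$
\lim_{\varepsilon\downarrow 0}\ \sup_{t\in[0,\infty)}\ \sup_{x\in\mathbb{R}^d}\int_{|x-y|\le\varepsilon}|x-y|^{2-d}\,\mu(t,dy)=0,\quad\text{when } d\ge 3, \tag{19}
$$

then $\nu\in K$.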

Remark 5 – Suppose that the measure $\nu(ds,dx)=\mu(s,dx)\,ds$ satisfies one of the conditions (17) to (19). Then it can be verified¹⁵ that for each $T>0$, $r>0$, there exists $K=K(r,T)>0$ such that for all $x\in\mathbb{R}^d$, $t\in[0,T]$,

$$
\mu(t,B(x,r))<K,
$$

where $B(x,r)$ is the ball centered at $x$ with radius $r$.

In the sequel we use the following modification of Khas’minskii’s lemma¹⁶.

Lemma 1 – Let $A_t$ be a W-functional with the characteristic $f_t$ satisfying condition (9). Then for all $p>0$, $t\ge 0$, there exists a constant $C>0$ depending on $p$, $t$, and the rate of convergence in Equation (9) such that

$$
\sup_{t_0\in[0,\infty),\,x_0\in\mathbb{R}^d}E_{t_0,x_0}\exp\{pA_t\}<C.
$$

Example 2 – Let $\nu(dt,dx)=h(t,x)\,dt\,dx$, where $h$ is a non-negative bounded measurable function. Then the measure $\nu$ is of the class $K$. The functional

$$
A_t:=\int_0^t h(\eta_s)\,ds
$$

is a W-functional of the process $(\eta_t)_{t\ge 0}$ with characteristic defined by Equation (8), and

$$
A_t(\varphi(x))=\int_0^t h(s,\varphi_s(x))\,ds.
$$

¹⁴ E.g. in Chen, 2002, “Gaugeability and Conditional Gaugeability”, Theorem 2.1.

¹⁵ Cf. Dynkin, 1965, Markov Processes, Lemma 8.3.

¹⁶ See Khasminskii, 1959, “On Positive Solutions of the Equation AU + Vu = 0”; or Sznitman, 1998, Brownian Motion, Obstacles and Random Media, Ch. 1, Lemma 2.1.

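Example 3 – Local time. Let $d=1$. It is well known that for any $x,y\in\mathbb{R}$ there exists a local time of the process $(\varphi_t(x))_{t\ge 0}$ at the point $y$, which is defined as

$$
L^{y}_t(\varphi(x))=\operatorname*{l.i.m.}_{\varepsilon\downarrow 0}\frac{1}{2\varepsilon}\int_0^t\mathbf{1}_{[y-\varepsilon,y+\varepsilon]}(\varphi_s(x))\,ds.
$$

It can be checked that $L^{y}_t(\varphi(x))$ is a W-functional of $(\varphi_t(x))_{t\ge 0}$ corresponding to the measure $\nu(ds,dx)=ds\,\delta_y(dx)$, where $\delta_y$ is the delta measure at the point $y$.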
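The pre-limit quantity in the definition of the local time is easy to compute for a sampled path. The following sketch is only an illustration: the function name `occupation_local_time`, the left Riemann sum and the deterministic test path are assumptions made here, and for a fixed $\varepsilon$ the result is the normalized occupation time of the window, not the limit itself.

```python
def occupation_local_time(path, dt, y, eps):
    """Approximate (1 / (2 eps)) * int_0^t 1_{[y - eps, y + eps]}(phi_s) ds.

    path : samples phi_0, phi_dt, ..., phi_t of a scalar path
    dt   : time step between consecutive samples
    The time integral is replaced by a left Riemann sum over the samples.
    """
    time_in_window = sum(dt for x in path[:-1] if y - eps <= x <= y + eps)
    return time_in_window / (2.0 * eps)


# Deterministic sanity check: a path moving at unit speed spends time 2 * eps
# in the window, so the normalized occupation time is close to 1.
n = 1000
unit_speed_path = [k / n for k in range(n + 1)]
estimate = occupation_local_time(unit_speed_path, 1.0 / n, y=0.5, eps=0.1)
```

For a simulated diffusion path one would pass the sampled values of $\varphi_s(x)$ and let `eps` decrease together with the time step, keeping `eps` large compared with the typical increment of the path over one step.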

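Indeed, for fixed $y\in\mathbb{R}$ and each $\varepsilon>0$, put

$$
h^{\varepsilon,y}(t,x)=h^{\varepsilon,y}(x)=\frac{1}{2\varepsilon}\mathbf{1}_{[y-\varepsilon,y+\varepsilon]}(x),\quad t\ge 0,\ x\in\mathbb{R},
$$

and $\nu^{\varepsilon,y}(dt,dx)=h^{\varepsilon,y}(t,x)\,dt\,dx$. The function $h^{\varepsilon,y}$ is bounded and measurable. Then (see Example 2) there exists a W-functional of the process $(\eta_t)_{t\ge 0}$ corresponding to the measure $\nu^{\varepsilon,y}$. This functional is defined by the formula

$$
A^{\varepsilon,y}_t:=A^{\nu^{\varepsilon,y}}_t=\int_0^t h^{\varepsilon,y}(\eta_s)\,ds=\frac{1}{2\varepsilon}\int_0^t\mathbf{1}_{[y-\varepsilon,y+\varepsilon]}(\eta^2_s)\,ds,
$$

and its characteristic is equal to

$$
f^{\varepsilon,y}_t(t_0,x_0)=E_{t_0,x_0}A^{\varepsilon,y}_t(\eta)=\int_{t_0}^{t_0+t}ds\int_{\mathbb{R}}G(t_0,x_0,s,v)\,h^{\varepsilon,y}(s,v)\,dv.
$$

One can see that $f^{\varepsilon,y}_t(t_0,x_0)$ tends to

$$
f^{y}_t(t_0,x_0)=\int_{t_0}^{t_0+t}G(t_0,x_0,s,y)\,ds=\int_{t_0}^{t_0+t}ds\int_{\mathbb{R}}G(t_0,x_0,s,v)\,\delta_y(dv)
$$

as $\varepsilon\to 0$ uniformly in $t\in[0,T]$, $t_0\in[0,\infty)$, $x_0\in\mathbb{R}$. Then by Theorem 1 there exists a functional

$$
A^{y}_t=\operatorname*{l.i.m.}_{\varepsilon\downarrow 0}A^{\varepsilon,y}_t=\operatorname*{l.i.m.}_{\varepsilon\downarrow 0}\int_0^t h^{\varepsilon,y}(\eta_s)\,ds.
$$

In particular,

$$
A^{y}_t(\varphi(x))=L^{y}_t(\varphi(x)).
$$

Note that if $d\ge 2$, the measure $\delta_y$ is not of the class $K$. This agrees with the well-known fact that the local time for a multidimensional Wiener process does not exist.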
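The dimension dependence can be seen from a one-line computation. For $\nu(ds,dx)=ds\,\delta_y(dx)$ the integral in Equation (12) equals $\int_{t_0}^{t_0+t}p_0(t_0,x_0,s,y)\,ds$, and it is largest at $x_0=y$, where it becomes $\int_0^t(2\pi s)^{-d/2}\,ds$. The sketch below, an illustration with a hypothetical helper name, evaluates this integral truncated to $[\varepsilon_0,t]$: for $d=1$ it stays close to $\sqrt{2t/\pi}$, which vanishes as $t\downarrow 0$, while for $d=2$ it equals $\ln(t/\varepsilon_0)/(2\pi)$ and grows without bound as the cutoff $\varepsilon_0$ is removed.

```python
import math


def heat_kernel_diagonal_integral(d, t, cutoff, n=20000):
    """Approximate int_cutoff^t (2 pi s)^(-d/2) ds.

    Uses the substitution s = exp(u) and the midpoint rule in u, which copes
    with the singularity of the integrand at s = 0.
    """
    lo, hi = math.log(cutoff), math.log(t)
    du = (hi - lo) / n
    total = 0.0
    for k in range(n):
        s = math.exp(lo + (k + 0.5) * du)
        total += (2.0 * math.pi * s) ** (-d / 2.0) * s * du  # ds = s du
    return total


d1_value = heat_kernel_diagonal_integral(1, 1.0, 1e-12)     # near sqrt(2 / pi)
d2_small_cut = heat_kernel_diagonal_integral(2, 1.0, 1e-6)   # ln(1e6) / (2 pi)
d2_tiny_cut = heat_kernel_diagonal_integral(2, 1.0, 1e-12)   # ln(1e12) / (2 pi)
```

The same computation for $d\ge 3$ diverges even faster, like a negative power of the cutoff.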

The following lemma deals with the convergence of W-functionals of, generally speaking, different random functions.

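Lemma 2 – Let $\{(\xi_{n,t})_{t\ge 0}:n\ge 0\}$ be a sequence of homogeneous Markov random functions defined on a common probability space $(\Omega,\mathcal{F},P)$ with the common phase space $(E,\mathcal{B})$, where $E$ is a metric space and $\mathcal{B}$ is the Borel $\sigma$-algebra. For $n\ge 0$, let $A_{n,t}=A_{n,t}(\xi_n)$ be a W-functional of the random function $(\xi_{n,t})_{t\ge 0}$ with the characteristic $f_{n,t}(z)$.

Assume that

(A1) for each $t\ge 0$, $f_{0,t}(z)$ is continuous in $z\in E$;

(A2) for each $t\ge 0$, $\xi_{n,t}\to\xi_{0,t}$, $n\to\infty$, in probability $P$;

(A3) for all $n\ge 0$, $\lim_{\delta\downarrow 0}\|f_{n,\delta}\|_E=0$, where $\|f_{n,\delta}\|_E=\sup_{z\in E}|f_{n,\delta}(z)|$;

(A4) for each $t>0$, $\|f_{n,t}-f_{0,t}\|_E\to 0$, $n\to\infty$.

Then for each $T>0$,

$$
\sup_{t\in[0,T]}\left|A_{n,t}(\xi_n)-A_{0,t}(\xi_0)\right|\to 0,\quad n\to\infty,\ \text{in probability } P.
$$

Proof. Note that $A^{\delta}_{n,t}:=\frac{1}{\delta}\int_0^t f_{n,\delta}(\xi_{n,s})\,ds$ is a W-functional of the process $(\xi_{n,t})_{t\ge 0}$. Denote its characteristic by $f^{\delta}_{n,t}$. Then by Dynkin (1965, Lemma 6.5), for all $t\ge 0$, $z\in E$,

$$
E_z\left(A_{n,t}-\frac{1}{\delta}\int_0^t f_{n,\delta}(\xi_{n,s})\,ds\right)^{2}\le 2\left(f_{n,t}(z)+f^{\delta}_{n,t}(z)\right)\sup_{0\le u\le t}\left\|f_{n,u}-f^{\delta}_{n,u}\right\|_E.
$$

Similarly to the proof of Dynkin (1965, Theorem 6.6), we get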

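$$
\begin{aligned}
\left|f^{\delta}_{n,t}(z)-f_{n,t}(z)\right|&\le\frac{1}{\delta}\int_t^{t+\delta}\left(f_{n,u}(z)-f_{n,t}(z)\right)du+\frac{1}{\delta}\int_0^{\delta}f_{n,u}(z)\,du\\
&\le\frac{1}{\delta}\int_t^{t+\delta}\left\|f_{n,u-t}\right\|_E\,du+\frac{1}{\delta}\int_0^{\delta}\left\|f_{n,u}\right\|_E\,du\le 2\left\|f_{n,\delta}\right\|_E.
\end{aligned}
$$

So for all $t\ge 0$,

$$
\sup_{0\le u\le t}\left\|f^{\delta}_{n,u}-f_{n,u}\right\|_E\le 2\left\|f_{n,\delta}\right\|_E. \tag{20}
$$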

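Using the calculations of the proof of Dynkin (1965, Theorem 6.6) once more, we obtain

$$
\begin{aligned}
f_{n,t}(z)+f^{\delta}_{n,t}(z)&=f_{n,t}(z)+\frac{1}{\delta}\int_t^{t+\delta}f_{n,u}(z)\,du-\frac{1}{\delta}\int_0^{\delta}f_{n,u}(z)\,du\\
&\le\left\|f_{n,t}\right\|_E+\left\|f_{n,t+\delta}\right\|_E\le 2\left\|f_{n,t+\delta}\right\|_E.
\end{aligned} \tag{21}
$$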

The inequalities (20) and (21) give us the estimate

$$
E_z\left(A_{n,t}(\xi_n)-\frac{1}{\delta}\int_0^t f_{n,\delta}(\xi_{n,s})\,ds\right)^{2}\le 8\left\|f_{n,\delta}\right\|_E\left\|f_{n,t+\delta}\right\|_E. \tag{22}
$$

Further, we have

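$$
\begin{aligned}
E_z\left(A_{n,t}(\xi_n)-A_{0,t}(\xi_0)\right)^{2}\le 4\Bigg[&E_z\left(A_{n,t}(\xi_n)-\frac{1}{\delta}\int_0^t f_{n,\delta}(\xi_{n,s})\,ds\right)^{2}\\
&+E_z\left(\frac{1}{\delta}\int_0^t f_{n,\delta}(\xi_{n,s})\,ds-\frac{1}{\delta}\int_0^t f_{0,\delta}(\xi_{n,s})\,ds\right)^{2}\\
&+E_z\left(\frac{1}{\delta}\int_0^t f_{0,\delta}(\xi_{n,s})\,ds-\frac{1}{\delta}\int_0^t f_{0,\delta}(\xi_{0,s})\,ds\right)^{2}\\
&+E_z\left(\frac{1}{\delta}\int_0^t f_{0,\delta}(\xi_{0,s})\,ds-A_{0,t}(\xi_0)\right)^{2}\Bigg]=4\,[I+II+III+IV].
\end{aligned} \tag{23}
$$

By assertion (A3), for any $\varepsilon>0$ we can choose $\delta>0$ such that $\|f_{0,\delta}\|_E<\varepsilon$. According to assertion (A4) there exists $n_0>0$ such that for all $n>n_0$,

$$
\left\|f_{n,\delta}-f_{0,\delta}\right\|_E<\varepsilon.
$$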

Then for all $n>n_0$,

$$
\left\|f_{n,\delta}\right\|_E\le\left\|f_{n,\delta}-f_{0,\delta}\right\|_E+\left\|f_{0,\delta}\right\|_E<2\varepsilon.
$$

Note that for each $n\ge 0$, $k\ge 1$, we have $\|f_{n,k\delta}\|_E\le k\|f_{n,\delta}\|_E$. This implies that for any $t\ge 0$, $M_t:=\sup_{n\ge 0}\|f_{n,t}\|_E<\infty$. Taking into account Equation (22), we obtain that for all $n>n_0$,

$$
I\le 16M_{t+\delta}\,\varepsilon,
$$

and the same estimate holds for $IV$. By the Hölder inequality,

$$
II\le\frac{t}{\delta^{2}}E_z\int_0^t\left(f_{n,\delta}(\xi_{n,s})-f_{0,\delta}(\xi_{n,s})\right)^{2}ds\le\frac{t^{2}}{\delta^{2}}E_z\sup_{z\in E}\left(f_{n,\delta}(z)-f_{0,\delta}(z)\right)^{2}.
$$

Assertion (A4) yields the estimate $II\le\varepsilon$, valid for all $n\ge n_1=n_1(\varepsilon,\delta)$.

Similarly,

$$
III\le\frac{t}{\delta^{2}}E_z\int_0^t\left(f_{0,\delta}(\xi_{n,s})-f_{0,\delta}(\xi_{0,s})\right)^{2}ds.
$$

The continuity of the function $f_{0,t}(\cdot)$ and assertion (A2) provide the convergence of $f_{0,\delta}(\xi_{n,s})$ to $f_{0,\delta}(\xi_{0,s})$ in probability as $n$ tends to $\infty$. This convergence together with assertion (A3) allows us to use the dominated convergence theorem and prove that $III\to 0$ as $n\to\infty$. Then the right-hand side of Equation (23) tends to 0 as $n$ tends to $\infty$. The uniform convergence follows from Proposition 1, which completes the proof.

2 The main result

In the following theorem we formulate the main result of this paper.

Theorem 4 – Suppose that $a=(a^1,\dots,a^d):[0,\infty)\times\mathbb{R}^d\to\mathbb{R}^d$ is a bounded measurable function such that for each $t\ge 0$ and all $1\le i\le d$, $a^{i}(t,\cdot)$ is a function of bounded variation on $\mathbb{R}^d$, i.e., for each $1\le j\le d$, the generalized derivative $\mu^{ij}(t,dy)=\frac{\partial a^{i}}{\partial y_j}(t,dy)$ is a signed measure on $\mathbb{R}^d$. Assume that the signed measures $\nu^{ij}(dt,dy):=\mu^{ij}(t,dy)\,dt$, $1\le i,j\le d$, are of the class $K$.

Let $\sigma:[0,\infty)\times\mathbb{R}^d\to\mathbb{R}^d\times\mathbb{R}^m$ be a bounded continuous function satisfying conditions (C1) to (C3) and the following condition:

(C4) There exists $\rho>0$ such that for all $1\le k\le m$, $1\le i,j\le d$, the function $\left|\frac{\partial\sigma_k^{i}}{\partial y_j}(s,y)\right|^{2+\rho}$ is of the class $K$.

Then there exists the derivative $Y_t(x)=\nabla\varphi_t(x)$ in the $L_p$-sense: for all $p>0$, $x\in\mathbb{R}^d$, $v\in\mathbb{R}^d$, $t\ge 0$,

$$
E\left|\frac{\varphi_t(x+\varepsilon v)-\varphi_t(x)}{\varepsilon}-Y_t(x)v\right|^{p}\to 0,\quad\varepsilon\to 0. \tag{24}
$$

This derivative is a unique solution of the integral equation

$$
Y_t(x)=E+\int_0^t dA^{\nu}_s(\varphi(x))\,Y_s(x)+\sum_{k=1}^{m}\int_0^t\nabla\sigma_k(s,\varphi_s(x))\,Y_s(x)\,dw_k(s), \tag{25}
$$

where $E$ is the $d\times d$ identity matrix, $\nabla\sigma_k(s,y)=\left(\frac{\partial\sigma_k^{i}}{\partial y_j}(s,y)\right)_{1\le i,j\le d}$, and the first integral on the right-hand side of Equation (25) is the Lebesgue-Stieltjes integral with respect to the continuous function of bounded variation $t\to A^{\nu}_t(\varphi(x))$.

Moreover,

$$
P\left\{\forall\,t\ge 0:\ \varphi_t(\cdot)\in W^{1}_{p,\mathrm{loc}}(\mathbb{R}^d,\mathbb{R}^d),\ \nabla\varphi_t(x)=Y_t(x)\ \text{for } \lambda\text{-a.a. } x\right\}=1, \tag{26}
$$

where $\lambda$ is the Lebesgue measure on $\mathbb{R}^d$.

Remark 6 – The W-functional $A^{\nu}_t=\left(A^{\nu^{ij}}_t\right)_{1\le i,j\le d}$ is well defined because the signed measure $\nu$ is of the class $K$.

Remark 7 – Recall that for all $1\le i,j\le d$, the mappings $A^{\nu^{ij,\pm}}_t$, which we will denote by $A^{ij,\pm}_t$, are continuous and monotonic in $t$. So for each $T>0$, the function $t\to A^{ij}_t$ is a continuous function of bounded variation on $[0,T]$ almost surely.
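A special case shows what Equation (25) looks like when only the Stieltjes term is present. Assume, for illustration only, that $d=m=1$ and $\sigma\equiv 1$, so that $\nabla\sigma=0$ and Equation (25) reduces to $Y_t=1+\int_0^t Y_s\,dA_s$ with $A$ continuous and of bounded variation, $A_0=0$; the solution of this linear equation is $Y_t=\exp(A_t)$. The sketch below solves the equation by an explicit product scheme on a sampled function $A$ and compares the result with the exponential. The sample function `A_samples` is a hypothetical smooth stand-in for a path of $A^{\nu}_t(\varphi(x))$, not something computed from an SDE.

```python
import math


def solve_linear_stieltjes(A):
    """Explicit scheme for Y = 1 + int Y dA on a grid.

    A : samples A_0, A_1, ..., A_n of a continuous function of bounded variation
    Returns Y_0, ..., Y_n with Y_{k+1} = Y_k * (1 + A_{k+1} - A_k).
    """
    y = 1.0
    out = [y]
    for a_prev, a_next in zip(A, A[1:]):
        y = y + y * (a_next - a_prev)
        out.append(y)
    return out


# Hypothetical non-monotone sample of A on [0, 1]: A_t = 0.7 t^2 - 0.3 t.
n = 20000
A_samples = [0.7 * (k / n) ** 2 - 0.3 * (k / n) for k in range(n + 1)]
Y_samples = solve_linear_stieltjes(A_samples)
closed_form = math.exp(A_samples[-1] - A_samples[0])  # exp(A_1) = exp(0.4)
```

In the situation of Example 3, where the generalized derivative of the drift is a multiple $c\,\delta_y$ of a delta measure, the same reduction gives $Y_t=\exp\{c\,L^{y}_t(\varphi(x))\}$, which is the kind of local-time formula known in the one-dimensional case.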

3 The proof of Theorem 4

The existence and uniqueness of the solution to Equation (25) follow from Protter (2004, Ch. V, Theorem 7). Indeed, condition (C4) ensures that for all $1\le k\le m$, $\int_0^t|\nabla\sigma_k(s,\varphi_s(x))|^{2}\,ds<\infty$ almost surely, and consequently

$$
\int_0^t dA^{\nu}_s(\varphi(x))+\sum_{k=1}^{m}\int_0^t\nabla\sigma_k(s,\varphi_s(x))\,dw_k(s),\quad t\ge 0,
$$

is a semimartingale.

It is well known that the statement of the theorem is true in the case of smooth coefficients, and the derivative satisfies Equation (2). To prove the theorem in the general case, we approximate the initial equation by equations with smooth coefficients.

The proof is divided into two steps.

3.1 First step

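In the first step, we assume that there exists $R>0$ such that for all $t\ge 0$ and all $x\in\mathbb{R}^d$ with $|x|\ge R$,

$$
a(t,x)=0,\qquad\sigma(t,x)=\tilde\sigma=\mathrm{const},\qquad\tilde\sigma\tilde\sigma^{*}>0.
$$

For $n\ge 1$, let $\omega_n\in C_0^{\infty}(\mathbb{R}^d)$ be a non-negative function such that $\int_{\mathbb{R}^d}\omega_n(z)\,dz=1$ and $\omega_n(x)=0$ for $|x|\ge 1/n$. For all $t\ge 0$, $x\in\mathbb{R}^d$, $n\ge 1$, and $1\le k\le m$, put

$$
a_n(t,x)=(\omega_n*a)(t,x)=\int_{\mathbb{R}^d}\omega_n(x-y)\,a(t,y)\,dy, \tag{27}
$$

$$
\sigma_{n,k}(t,x)=(\omega_n*\sigma_k)(t,x)=\int_{\mathbb{R}^d}\omega_n(x-y)\,\sigma_k(t,y)\,dy. \tag{28}
$$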
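The mollification in Equations (27) and (28) can be sketched numerically in one space dimension. The code below is an illustration under stated assumptions: the bump profile `bump`, the midpoint quadrature and the step drift `sign_drift` are choices made here, the time variable is suppressed because the convolution acts in $x$ only, and the normalizing constant of $\omega_n$ is obtained numerically rather than in closed form.

```python
import math


def bump(u):
    """Unnormalized smooth bump supported in (-1, 1)."""
    return math.exp(-1.0 / (1.0 - u * u)) if abs(u) < 1.0 else 0.0


def mollify(a, x, n, m=2000):
    """(omega_n * a)(x) for scalar x, with omega_n(z) proportional to bump(n z).

    The integral over the support [-1/n, 1/n] is computed by the midpoint rule
    with m cells; dividing by the sum of the weights gives omega_n unit mass.
    """
    h = 2.0 / (n * m)
    num = 0.0
    den = 0.0
    for k in range(m):
        z = -1.0 / n + (k + 0.5) * h
        w = bump(n * z)
        num += w * a(x - z)  # omega_n(z) a(x - z), same as omega_n(x - y) a(y)
        den += w
    return num / den


def sign_drift(y):
    """Hypothetical bounded drift of bounded variation with a jump at 0."""
    return 1.0 if y >= 0.0 else -1.0


away_from_jump = mollify(sign_drift, 1.0, 2)   # support misses the jump
at_jump = mollify(sign_drift, 0.0, 5)          # symmetric average across the jump
near_jump = mollify(sign_drift, 0.05, 5)       # smooth transition value
```

Because $\omega_n$ is a probability density, each value of the mollified function is an average of values of $a$, so its supremum cannot exceed that of $a$; away from the jump the mollified drift coincides with $a$ once $1/n$ is smaller than the distance to the jump.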

Note that for each $T>0$,

$$
\sup_{n\ge 1}\|a_n\|_{T,\infty}\le\|a\|_{T,\infty}, \tag{29}
$$

$$
\sup_{n\ge 1}\|\sigma_{n,k}\|_{T,\infty}\le\|\sigma_k\|_{T,\infty},\quad 1\le k\le m, \tag{30}
$$

where

$$
\|a\|_{T,\infty}=\sup_{t\in[0,T]}\ \sup_{x\in\mathbb{R}^d}|a(t,x)|.
$$

Besides, for all $n\ge 1$, $\sigma_n$ satisfies condition (C2), and the ellipticity constant can be chosen uniformly in $n$.

Remark 8 – For all $n\ge 1$ the transition probability density of the process $(\varphi_{n,t}(x))_{t\ge 0}$ satisfies the inequality (10). It follows from Equation (29) and condition (C2), which holds uniformly in $n$, that the constants in Equation (10) can be chosen uniformly in $n\ge 1$.

For each $T>0$, we have $a_n\to a$, $n\to\infty$, in $L_1([0,T]\times\mathbb{R}^d)$. Choosing subsequences, without loss of generality we can assume that $a_n(t,x)\to a(t,x)$, $n\to\infty$, for almost all $t\ge 0$ and almost all $x$ with respect to the Lebesgue measure. Note also that for all $n\ge 1$, $t\ge 0$, and $x\in\mathbb{R}^d$ such that $|x|\ge R+1$,

$$
a_n(t,x)=0,\qquad\sigma_n(t,x)=\tilde\sigma.
$$

Without loss of generality we can assume that this is true for all $x$ such that $|x|>R$.

Moreover, from condition (C3) we derive that for each $T>0$, $\sigma_n\to\sigma$, $n\to\infty$, uniformly in $(t,x)\in[0,T]\times\mathbb{R}^d$.

Consider the SDE

$$
\begin{cases}
d\varphi_{n,t}(x)=a_n(t,\varphi_{n,t}(x))\,dt+\sum_{k=1}^{m}\sigma_{n,k}(t,\varphi_{n,t}(x))\,dw_k(t),\\
\varphi_{n,0}(x)=x,\quad x\in\mathbb{R}^d.
\end{cases} \tag{31}
$$

For each $n\ge 1$ there exists a unique strong solution of Equation (31).

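Lemma 3 – For each $p\ge 1$,

1. for all $t\ge 0$ and any compact set $U\subset\mathbb{R}^d$,

$$
\sup_{x\in U,\ n\ge 1}E\left(|\varphi_{n,t}(x)|^{p}+|\varphi_t(x)|^{p}\right)<\infty;
$$

2. for all $x\in\mathbb{R}^d$, $T\ge 0$,

$$
E\left(\sup_{0\le t\le T}|\varphi_{n,t}(x)-\varphi_t(x)|^{p}\right)\to 0\quad\text{as } n\to\infty.
$$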