Radu Theodorescu (12 April 1933 – 14 August 2007), collaborator and friend
ON A CONTRACTIBILITY CONDITION FOR ITERATED RANDOM FUNCTIONS
ULRICH HERKENRATH and MARIUS IOSIFESCU
Wu and Shao [7] considered a geometric moment contractibility condition for ite- rated random functions. We show that this condition is not fulfilled for simple examples of such random functions and propose instead a more general one, able also to incorporate a local contractibility condition considered by Steinsaltz [6].
Moreover, we show that the functional central limit theorem as well as the conver- gence of empirical processes to Gaussian processes in Skorokhod space proved in Wu and Shao’s paper, still hold under our condition, with appropriate alterations of the hypotheses, which reduce to those assumed by these authors under their condition.
AMS 2000 Subject Classification: 60J05, 60F17.
Key words: iterated random function, contractibility, functional central limit theorem.
1. PRELIMINARIES
We adopt the notation from Wu and Shao [7] with just slight modifica- tions intended to better precise a few details (for, e.g., probability distributions and mean value operators).
Let (X, ρ) be a Polish (i.e. complete and separable) metric space with metric ρ, endowed with the σ-algebra of its Borel sets. As usual, define an iterated random function as Xn = Fθn(Xn−1), n ∈ N+ = {1,2, . . .}, where (i) the θn, n ∈ N+, take values in a second measurable space Θ and are independently distributed with identical marginal probability distributionH on theσ-algebra of measurable sets in Θ; (ii)Fθ(·) is theθ-section of a jointly measurable functionF :X ×Θ→ X, and (iii) X0 is either a fixed point inX or an X-valued random variable independent of the sequence (θn)n∈N+. On account of (iii), the sequence (Xn)n∈N, n ∈ N = {0} ∪N+, is supported by a probability space, say, (Ω,K, PH) or (Ω,K, Ph,H) according as X0 is a fixed point x∈ X or anX-valued random variable with probability distribution h on the Borel subsets ofX.
REV. ROUMAINE MATH. PURES APPL.,52(2007),5, 563–571
Set Xn(x) = Fθn ◦ · · · ◦Fθ1(x) and introduce the backward iteration process Zn(x) = Fθ1 ◦ · · · ◦Fθn(x), x ∈ X, n ∈ N+. It is obvious that for all x ∈ X and n∈ N+ the probability distributions of Xn(x) and Zn(x) are identical.
It has been known for long that (Xn(x))n∈N+ is a Markov process while the sequence (Zn(x))n∈N+ is a very different sort of object (actually, a chain of infinite order). Also (cf. Letac [4]), if Fθ(·) is continuous for any fixed θ ∈ Θ and if Z∞ := limn→∞Zn(x) exists PH-a.s. and does not depend on x∈ X, then the probability distribution of Z∞, say, π is the only stationary distribution of (Xn(x))n∈N+.
2. A GENERAL CONTRACTIBILITY CONDITION
Wu and Shao [7] proved the PH-a.s. convergence of Zn(x) as n → ∞ under Conditions 1 and 2 below. In what follows, EH(Eh,H) stands for the mean-value operator with respect toPH(Ph,H).
Condition 1. There exist x0 ∈ X and α∈(0,1] such that EH(ρα(x0, Fθ1(x0))) =
Θρα(x0, Fθ(x0))H(dθ)<∞.
Condition 2. Withx0 ∈ X andα ∈(0,1] as in Condition 1, there exist r(α) :=r∈(0,1) andC(α) :=C >0 such that
EH(ρα(Xn(x), Xn(x0)))≤Crnρα(x, x0) for any x∈ X and n∈N.
In a somewhat more restrictive context, where X is a convex subset of Rm for somem∈N+ withρ the Euclidean distance, Steinsaltz [6] introduced a local contractibility condition amounting to the existence of a drift function ϕ:X →(0,∞) and of some r∈(0,1) such that
(1) EH
lim sup
y→x
ρ(Xn(y), Xn(x)) ρ(y, x)
≤ϕ(x)rn
for any x∈ X and n∈N. It was proved by Steinsaltz [6, p. 1959] that under (1) in conjunction with a suitable assumption onϕ, which is a generalization of Condition 1, a more general condition than Condition 2 does hold, but not Condition 2 itself.
Our point is that it is possible to introduce a more general condition able to fully integrate both Steinsaltz’s and Wu and Shao’s ones.
To be precise, we introduce Condition C below.
Condition C.(i) There existx0 ∈ X, α∈(0,1], r∈(0,1), and measur- able functionsϕn :X →[0, ∞), n∈N+, such that bothR:= 1/lim sup
n→∞ δn1/n
andRx:= 1/lim sup
n→∞ ϕ1n/n(x) strictly exceed r for any x∈ X, where δn:=EH(ϕn(X1(x0))ρα(x0, X1(x0))), n∈N+. (ii) For anyx∈ X and n∈N+ one has
EH(ρα(Xn(x), Xn(x0)))≤rnϕn(x)ρα(x, x0).
Clearly, R and Rx, x∈ X, are the convergence radii of the power series
n∈N+δntn and
n∈N+ϕn(x)tn, t∈R, respectively.
Wu and Shao’s Conditions 1 and 2 correspond to the case where ϕn is a constant independent of n∈N+. Steinsaltz’s conditions imply that Condi- tion C holds with an arbitrary x0 ∈ X whileϕn is independent of n∈N+, it having the form
ϕn(x) = sup
0≤t≤1ϕ(x0+t(x−x0)), x∈ X, whereϕ:X →[0,∞) is the function occurring in (1).
3. A SIMPLE EXAMPLE
Doubtlessly, Condition 2 is restrictive enough, as simple examples show.
Such an example is given by
Fθ(x) =ax2+bθ
withx∈ X = [0,∞), θ∈[0,1] anda, b∈(0,1).Let (θn)n∈N+ be a sequence of i.i.d. Bernoulli variables with success probability 12, so that the corresponding iterated random function is defined recursively by the equation
(2) Xn(x) =aXn2−1(x) +bθn, n∈N+,
withX0(x) =x. It is easy to check that Xn(x) is an even 2nth degree poly- nomial with positive coefficients, part of which are random as functions of θi, 1≤i≤n, n∈N+. Hence the ratio
|Xn(x)−Xn(x0)|/|x−x0|
exceeds|Xn(x)−Xn(0)|/x, so that the choice x0 = 0 in Condition 2 appears to be the natural one. Next, as X0(·) is an increasing function, one sees by induction thatXn(·) is an increasing function for any n∈N+. Then, letting
qn(x) = Xn(x)−Xn(0)
x , n∈N+, 0=x∈ X,
so thatq1(x) =ax,equation (2) yields
qn(x) =a(Xn−1(x) +Xn−1(0))qn−1(x)≥2aXn−1(0)qn−1(x), n∈N+, whence
(3) qn(x)≥(2a)n−1ax
n−1 j=1
Xj(0), n≥2.
Now, note thatXj(0)≥bθj, j ∈N+, so that (4)
n−1 j=1
Xj(0)≥bn−1θ1· · ·θn−1, n≥2.
Since theθi are independent andEH(θi) = 12, i∈N+, it follows from (3) and (4) that
EH(qn(x))≥(ab)n−1ax, n∈N+, 0=x∈ X,
that is, withρ denoting the Euclidean distance on the real line, we have (5) EH(ρ(Xn(x), Xn(0)))≥(ab)n−1axρ(x,0)
for any n∈N+ and x∈ X.
Clearly, (5) shows thatCondition 2cannot hold for our example asX is unbounded.
On the other hand, it is easy to see that EH(ρ(X2(x), X2(0))) =a2x
ax2+b
ρ(x,0), EH(ρ(X3(x), X3(0))) =a3x
ax2+b a3x4+a2bx2+2ab2+b
ρ(x,0). These equations actually exhibit ϕ2(x) and ϕ3(x). It then appears that the mean valueEH(ρ(Xn(x), Xn(0))) has the form
anx
ax2+b fn
ax2+b
ρ(x,0)
for some functionfn: [0,∞)→R, n∈N+, showing that our example should enter the class of iterated function systems that obey Condition C under suit- able assumptions on the coefficientsaandb.We shall not go any further into this matter as our goal was a different one.
4. MAIN RESULT Our main result is stated below.
Theorem 1. Assume Condition Cholds.
(j)There exists an X-valued σ(θ1, θ2, . . .)-measurable random variable Z∞to whichZn(x)convergesPH-a.s. asn→ ∞for anyx∈ X and, moreover,
EH(ρα(Zn(x), Z∞))≤rnϕn(x)ρα(x, x0) +
m≥n
rmδm.
(jj) Let π denote the probability distribution of Z∞ and let X0, X0 be X-valued random variables independent of (θn)n∈N+ with common probability distribution π. Then
(6) Eπ,H
ρα
Xn(X0), Xn(X0)
≤2
m≥n
rmδm, n∈N+.
Proof. (j) As in the proof of Theorem 2 in Wu and Shao [7, p. 427], we deduce that
EH(ρα(Zn(x0), Zn+1(x0)))≤rnEH ϕn
Fθn+1(x0) ρα
x0, Fθn+1(x0)
=rnδn
for anyn∈N+.Fixr0 ∈(0,1) with Rr < r0 <1. Then, by Markov’s inequality, PH(ρα(Zn(x0), Zn+1(x0))≥rn0)≤
r r0
n
δn
for any n∈N+. As by Condition C(i) the series with general term (r/r0)nδn is convergent, the Borel-Cantelli lemma implies that
PH(ρα(Zn(x0), Zn+1(x0))≥rn0 infinitely often) = 0,
henceZn(x0)→Z∞, say,PH-a.s. asn→ ∞by the completeness ofX. Clearly, Z∞ isσ(θ1, θ2, . . .)-measurable. Next, by the triangle inequality,
EH(ρα(Zn(x0), Z∞))≤EH
j∈N
ρα(Zn+j(x0), Zn+j+1(x0))
≤
≤
j∈N
EH(ρα(Zn+j(x0), Zn+j+1(x0)))≤δn :=
m≥n
rmδm
for any n∈N+. Finally, by Condition C(ii), for anyx∈ X,
EH(ρα(Zn(x), Z∞))≤EH(ρα(Zn(x), Zn(x0))) +EH(ρα(Zn(x0), Z∞))≤
≤rnϕn(x)ρα(x, x0) +δn,
whence we conclude that Zn(x) → Z∞ PH-a.s. as n → ∞, by invoking the argument already used to show that Zn(x0) → Z∞ PH-a.s. as n → ∞. We should now chooser0 ∈(0,1) withr/min (R, Rx)< r0<1 and note then that both series
n∈N+ϕn(x) (r/r0)n and
n∈N+m≥n
(r/r0)mδm=
n∈N+
n(r/r0)nδn
are convergent by Condition C(i). The proof of (j) is complete.
(jj) This follows as in Wu and Shao [7, p. 428] with the corresponding new upper bound as indicated.
Remark 1. Theorem 1 allows one to estimate the rate of convergence of then-step transition probability functionPnof the Markov chain (Xn(x))n∈N+ to its stationary probability distributionπ. Recall [cf., e.g., Hofmann-Jorgensen [1, pp. 80–82]] that whateverα∈(0,1] a metricρL,α which metrizes the topol- ogy of weak convergence on the set of probability measures on the Borel sets ofX is defined by
ρL,α(µ, ν) =
= sup
Xfd (µ−ν)
: 0≤f≤1, |f(x)−f(x)| ≤ρα(x, x), x, x∈X
for two such probability measuresµandν, and note thatρL,α ≥ρL,1[cf., e.g., Iosifescu [2, p. 73]]. It then follows from Theorem 1 that
ρL,α(Pn(x,·), π)≤rnϕn(x)ρα(x, x0) +
m≥n
rmδm for any n∈N+ and x∈ X since
Xfd (Pn(x,·)−π) =EH(f(Xn(x))−f(Z∞)) =EH(f(Zn(x))−f(Z∞)) for any continuous bounded functionf :X →R.
Remark 2. As in Wu and Shao [7, p. 428], Theorem 1 above allows to defineX-valued random variables Xi, i≤0, in such a way that the equation Xi = Fθi(Xi−1) still holds. Let (θi)i∈Z be a doubly infinite sequence of i.i.d.
Θ-valued random variables with common distributionH. Then, as in the proof of Theorem 1(j), the limit
mlim→∞Fθi◦Fθi−1◦ · · · ◦Fθi−m(x) :=Xi(x), x∈ X,
exists PH-a.s. for any i ≤ 0, is a σ(. . . , θi−1, θi)-measurable function which does not actually depend onx, has probability distributionπ, and satisfies the equationXi=Fθi(Xi−1) for any i≤0.Notice that withXi, i≤0,so defined, the doubly infinite sequence (Xi)i∈Zconstructed by using the latter recurrence equation forany i∈Z,is a strictly stationary sequence, as well as a Markov process, on a suitable probability space, to be still denoted (Ω,K, Pπ,H).
5. LIMIT THEOREMS
Theorem 3 (a central limit theorem) in Wu and Shao’s paper can be restated in terms of our Condition C and its implication (6). Let (Xi)i∈Z be the doubly infinite sequence defined in Remark 2 above. For a fixed ∈ N+ consider a measurable functiong:X →Rand defineYi =g(Xi−+1, . . . , Xi), i∈Z, Sn, =
n
i=1Yi, n ∈N+, S0, = 0.For a real-valued random variable Z on (Ω,K, Pπ,H) denote by Z itsL2-norm and define ∆g(δ) = sup{||[g(W)− g(W)]1(ρ(W,W)≤δ)||:W and W areX-valued random variables distributed asY1},where
ρ w, w
=
i=1
ρ2 wi, wi
12
forw= (w1, . . . , w) andw = (w1, . . . , w), so that ρα
w, w
≤
i=1
ρα wi, wi forα∈(0,1]. Now, we can state
Theorem 2. Assume (6)holds,Eπ,H(g(Y1)) = 0, Eπ,H(|gp(Y1)|)<∞ for some p >2, that
(7)
1
0
∆g(t)
t dt <∞, and that
n∈N+
δn +· · ·+δn+−1
r1−αn(p−2)/2p <∞ for some r1 ∈(0,1).
Then there exists σg ≥0such that
Snu,/√n, 0≤u≤1
converges weakly toσgB.Here Bstands for a standard Brownian motion and for integer part.
Also, for π-almost all x∈X,the same conclusion holds for
1≤i≤nu
g(Xi(x), . . . , Xi+−1(x))/√
n, 0≤u≤1
,
that is, for the Markov process started at x.
The proof is mutatis mutandis identical to that of Theorem 3 on page 430 in Wu and Shao’s paper on account of (6), by using at appropriate places our assumptions.
Remark 3. It is an open problem whether the proviso ‘π-almost allx∈ X’ can be removed by using recent work of Peligrad and Utev [5], as this has been done in Lager˚as and Stenflo [3] in another context.
Corollary 1 and Theorem 4 in Wu and Shao’s paper concerning em- pirical processes, can be also restated. Let X = R with ρ the Euclidean distance. LetG(x) =Pπ,H(X1 ≤x), Gn(x) =n
i=11(Xi≤x)/n and Pn(x) =
√n(Gn(x)−G(x)), x∈R, n∈N+. It is then easy to check that defining gy(u) =1(u<y)−G(y), y, u∈R,
one has ∆2g(δ) ≤ 2 min (G(y+δ)−G(y), G(y)−G(y−δ)) := 2τ(y;δ). So that, condition (7) in our Theorem 2 holds if one assumes that
(8)
1
0
τ(x:t)
t dt <∞.
The result corresponding to Wu and Shao’s Corollary 1 can now be stated as follows.
Corollary 1. Assume the conditions in Theorem 1 with (7) replaced by (8). Then for any x ∈ X there exists some σ(x) <∞ such that Pn(x) ⇒ N
0, σ2(x)
(convergence in distribution).
As to Wu and Shao’s Theorem 4, the corresponding result can be stated as follows.
Theorem 3. Assume that
supx∈R|G(x+δ)−G(x)| ≤Clog−k δ−1 for any δ∈
0,12
with some k >5/2andC >0,and that δn =O
rn/2 2n−k as n→ ∞ for some 0< r2 < r21α.Then, under the assumptions of Corollary 1, {Pn(y), y∈R} converges in D(R) to a Gaussian process (Γt)t∈R with mean zero and covariance function
Eπ,H(ΓxΓy) =
k∈Z
cov(1(X0≤x) 1(Xk≤y))), x, y∈R.
Acknowledgement. Marius Iosifescu gratefully acknowledges support from the Deutsche Forschungsgemeinschaft under Grant 936 RUM 113/21/0-1 and from Con- tract 2-CEx-06-11-97 of the Romanian Authority for Research.
REFERENCES
[1] J. Hoffmann-Jorgensen,Probability with a View Toward Statistics. Vol. II. Chapman &
Hall, New York, 1994.
[2] M. Iosifescu,A simple proof of a basic theorem on iterated random functions. Proc. Rom.
Acad. Ser. A Math. Phys. Tech. Sci. Inf. Sci.4(2003), 167–174.
[3] A.N. Lager˚as and ¨O. Stenflo,Central limit theorems for contractive Markov chains. Non- linearity18(2005), 1955–1965.
[4] G. Letac,A contraction principle for certain Markov chains and its applications. In:Ran- dom Matrices and Their Applications(Brunswick, Maine,1984), pp. 263–273, Contemp.
Math.50, Amer. Math. Soc., Providence, RI, 1986.
[5] M. Peligrad and S. Utev,A new maximal inequality and invariance principle for station- ary sequences. Ann. Probab.33(2005), 798–815.
[6] D. Steinsaltz, Locally contractive iterated function systems. Ann. Probab. 27 (1999), 1952–1979.
[7] W.B. Wu and X. Shao,Limit theorems for iterated random functions. J. Appl. Probab.
41(2004), 425–436.
Received 28 April 2006 Universit¨at Duisburg-Essen, Campus Duisburg Institut f¨ur Mathematik
D-47048 Duisburg, Germany [email protected]
and Romanian Academy Institute of Mathematical Statistics
and Applied Mathematics Calea 13 Septembrie nr.13 050711 Bucharest 5, Romania