HAL Id: hal-01097276
https://hal.archives-ouvertes.fr/hal-01097276v3
Preprint submitted on 15 Jan 2021 (v3), last revised 14 Dec 2021 (v4)
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Deviation results for sparse tables in hashing with linear probing
Thierry Klein, A Lagnoux, P Petit
To cite this version:
Thierry Klein, A Lagnoux, P Petit. Deviation results for sparse tables in hashing with linear probing.
2014. �hal-01097276v3�
Deviation results for sparse tables in hashing with linear probing
1, Thierry Klein1, Agnès Lagnoux2, and Pierre Petit3
1Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse;
ENAC - Ecole Nationale de l’Aviation Civile , Université de Toulouse, France
2Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse;
CNRS. UT2J, F-31058 Toulouse, France.
3Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse;
CNRS. UT3, F-31062 Toulouse, France.
January 14, 2021
Abstract
We consider the model of hashing with linear probing and we establish the moderate and large deviations for the total displacement in sparse tables. In this context, Weibull- like-tailed random variables appear. Deviations for sums of such heavy-tailed random variables are studied in [20,21]. Here we adapt the proofs therein to deal with conditioned sums of such variables and solve the open question in [8]. By the way, we establish the deviations of the total displacement in full tables, which can be derived from the deviations of empirical processes of i.i.d. random variables established in [28].
Keywords: large deviations, hashing with linear probing, parking problem, Brownian motion, Airy distribution, Łukasiewicz random walk, empirical processes, conditioned sums of i.i.d.
random variables, triangular arrays, Weibull-like distribution.
AMS subject classification: 60F10; 60C05; 60G50; 68W40.
1 Introduction
Hashing with linear probing is a classical model in theoretical computer science that appeared in the 50’s. It has been studied from a mathematical point of view firstly by Knuth in [14].
Here is a simple description given in [6].
A table of length m, T[1..m] is set up, as well as a hash function h that maps keys from some domain to the interval [1..m] of table addresses. A collection of n elements with n ≤ m are entered sequentially into the table according to the following rule: Each element x is placed at the first unoccupied location starting from h(x) in cyclic order, namely the first of h(x), h(x) + 1, . . . , m, 1, 2, . . . , h(x)−1.
For more details on the model, we refer to [6, 9, 18, 3, 2, 11]. The length of the move of each element x is called the displacement of x and the sum of all displacements, denoted by dm,n, is called the total displacement. In its seminal papers [14, 15], Knuth assumes that
all the sequence of hash addresses h(x) are independent and uniformly distributed on J1, mK, computes exact expressions ofE[dm,n] and Var(dm,n), and provides their asymptotic behaviors.
The limit distribution ofdm,n remains unknown until 1998: in [6], Flajolet, Poblete, and Viola give the limit distribution of dm,n for full tables (n = m) and for sparse tables (n/m = µ ∈ (0,1)) using combinatorial arguments. In [4], Chassaing and Marckert recover the previous results in the full case via a probabilistic approach. They prove that dm,n is the area under the Łukasiewicz random walk (also called Breadth First Search random walk) associated to a Galton-Watson tree with Poisson progeny. Consequently, the limit distribution of the total displacement dm,m is that of the area under the Brownian excursion, which involves the Airy distribution.
In [9], reformulating the problem in terms of conditioned sums of random variables, Janson establishes the limit distribution of dm,n in all cases with probabilistic tools. In [10], Janson extends the central limit theorem in the sparse case to a general model of conditioned sums of random variables. The corresponding Berry-Esseen bounds are proved by Klein, Lagnoux, and Petit in [12]. Concerning the deviations of such conditioned models, Gamboa, Klein, and Prieur give an answer in the case of light-tailed random variables (see [8]). Unfortunately, their results cannot be applied to the model of hashing with linear probing since this model involves heavy-tailed random variables.
In this paper, we establish the moderate and large deviations for the total displacement dm,n in sparse tables. Deviations for heavy-tailed random variables are studied by several authors (e.g., [24, 17, 20, 21, 22]) and a good survey can be found in Mikosch [19]. In the context of hashing with linear probing, Weibull-like-tailed random variables appear. Deviations for sums of such variables are studied by Nagaev in [20, 21]. Here we adapt its proofs to deal with conditioned sums of such variables. We also need to establish the deviations of dm,n for full tables, which can be derived from the deviations of empirical processes of i.i.d. random variables established by Wu in [28].
The paper is organized as follows. In Section 2, we state the main results for full and sparse tables. The proofs for full tables are given in Section 3 and those for sparse tables can be found in Section 5. In Section 4, we expose Janson’s reformulation of the model and provide several useful estimates required in Section 5.
2 Setting and main results
2.1 Model
An equivalent formulation of the problem of hashing can be made in terms of the discrete version of the classical parking problem described, for instance, by Knuth [16]:
A certain one-way street has m parking spaces in a row numbered 1 to m. A man and his dozing wife drive by, and suddenly, she wakes up and orders him to park immediately. He dutifully parks at the first available space [...].
More precisely, the model describes the following experiment. Let n 6 m. n cars enter sequentially into a circular parking uniformly at random. The parking spaces are numbered clockwise. A car that intends to park at an occupied space moves to the next empty space, always moving clockwise. The length of the move is called the displacement of the car and we are interested in the sum of all displacements which is a random variable denoted bydm,n. When all cars are parked, there are N = m−n empty spaces. These divide the occupied spaces into blocks of consecutive spaces. We consider that the empty space following a block belongs to this block.
For example, assume that n= 8, m = 10, and (6,9,1,9,9,6,2,5) are the addresses where the cars land. This sequence of (hash) addresses is called a hash sequence of lengthm and size n.
Let di be the displacement of car i. Then d1 = d2 =d3 = 0. The car number 4 should park on the 9thspace which is occupied by the 2nd car; thus it moves one space ahead and parks on the 10thspace so that d4 = 1. The car number 5 should park on the 9thspace. Since the 9th, the 10th, and the 1st spaces are occupied, d5 = 3. And so on: d6 = 1, d7 = 1, d8 = 0.
Here, the total displacement is equal to d10,8 = 1 + 3 + 1 + 1 = 6. In our example, there are two blocks: the first one containing spaces 9, 10, 1, 2, 3 (occupied), and space 4 (empty), and the second one containing spaces 5, 6, 7 (occupied), and space 8 (empty).
In this paper, we are interested in the deviations of the total displacement dm,n in sparse tables. To do so, we need the large deviation behavior of dm,n in full tables. By the way, we also established the moderate deviation behavior of dm,n in full tables.
2.2 Deviations in full tables
In this section, we first recall some already existing results for the total displacement dm,m in full tables (n =m). As mentioned in the introduction, Knuth in [15] and Flajolet et al. in [6, Theorem 2]) derive the asymptotic behavior of the expectation and the variance of dm,m:
E[dm,m] ∼
m→∞
√2π
4 m3/2 and Var(dm,m) ∼
m→∞
10−3π
24 m3. (1)
The following result was first established in [6, Theorem 3].
Theorem 1 (Standard deviations). For full tables, the distribution of the total displacement dm,m/m3/2 is asymptotically distributed as the area Aunder the standard Brownian excursion, in the sense that, for all δ >0,
P(dm,m>m3/2δ)−−−→
m→∞ P(A>δ).
In this paper, we establish the probabilities of deviation for the total displacement dm,m. Theorem 2 (Moderate deviations). For all α ∈(3/2,2)and for all δ >0,
1
m2α−3 logP(dm,m>mαδ)−−−→
m→∞ −6δ2. Theorem 3 (Large deviations). For all δ >0,
−1
mlogP(dm,m>m2δ)−−−→
m→∞ J(δ) :=
(12 −δ)·λ(δ) + log(1−(12 +δ)·λ(δ)) if δ <1/2
∞ if δ>1/2,
where λ(δ) is the smallest solution of the equation in λ
λ·
δ+ 1 2
−1
(1−eλ) =λ. (2)
Observe that lower deviations are trivial: for allα∈(3/2,2] and all mlarge enough,P(dm,m− E[dm,m]6 −mαδ) = 0 because of the positiveness of dm,m and using (1). When dealing with the very large deviations, the same trivial behavior occurs both for upper and lower deviations:
for all α >2 and all m large enough, P(dm,m−E[dm,m]>mαδ) = 0 and P(dm,m−E[dm,m]6
−mαδ) = 0, since dm,m 6m(m−1)/2.
Remark 4. Ifδ = 0, λ(0) = 0 is the unique solution of Equation (2). If δ∈(0,1/2), Equation (2) has two solutions: λ(δ) < 0 and 0. Moreover, J(δ) ∼ 6δ2 as δ → 0 (we recover the rate function of the moderate deviations in Theorem 2) and J(δ)→+∞as δ→1/2.
Remark 5. The conclusions of Theorems 1, 2, and 3 are still valid replacing δ by δ+o(1), since the rate functions are continuous. In particular, one may replace dm,m bydm,n as soon as m−n mα−1 (for instance, in the almost full case where n =m−1). Indeed, naturally coupling dm,n and dm,m by adding m−n balls, one has
|dm,n−dm,m|6(m−1) + (m−2) +· · ·+n∼m(m−n)mα, whence
P(dm,n >mαδ) =P(dm,m >mα(δ+o(1))).
Moreover, using a new probabilistic approach developed in [4], Theorem 1 was extended in [9, Theorems 1.1 and 2.2] to the case (m−n)/√
m→a∈[0,∞): for all δ>0, P(dm,n >m3/2δ)−−−→
m→∞ P(Wa >δ), with
Wa=
Z 1 0
maxs6t (b(t)−b(s)−a(t−s))dt where b is a Brownian bridge b on [0,1] periodically extended to R.
2.3 Deviations in sparse tables
In this section, we consider asymptotics in (m, n) with m→ ∞ andn/m→µ∈(0,1) (sparse case). This definition of the sparse case is a slight extension of that of [6] (n/m=µ∈(0,1)).
Set N =m−n. By [6, Theorem 5], E[dm,n] ∼
m→∞
µ2
2(1−µ)2N and Var(dm,n) ∼
m→∞σ2(µ)N, (3)
where (cf. [6, Theorem 5])
σ2(µ) := 6µ2−6µ3+ 4µ4−µ5
12(1−µ)5 . (4)
The following result was first proved in [6, Theorem 6] while another probabilistic proof was given in [9].
Theorem 6 (Standard deviations). The distribution of the total displacement dm,n is asymp- totically Gaussian distributed, in the sense that, for all y,
P
dm,n−E[dm,n]6N1/2y−−−→
m→∞ P(Z 6y) where Z ∼ N(0, σ2(µ)).
In this paper, we establish the probabilities of deviation of the total displacement dm,n in the sparse case.
Theorem 7 (Lower moderate deviations). For all α∈(1/2,1) and for all y>0, 1
N2α−1 logP(dm,n−E[dm,n]6−Nαy)−−−→
m→∞ − y2
2σ2(µ). (5)
Theorem 8 (Lower large deviations). For all y>0, 1
N logP(dm,n−E[dm,n]6−N y)−−−→
m→∞ −Λ∗
1
1−µ, µ2
2(1−µ)2 −y
, (6)
where Λ∗ is the Fenchel-Legendre transform of the function Λ : R2 →(−∞,∞] defined by Λ(s, t) = log
∞
X
l=1
e(s−µ)l(µl)l−1
l! E[etdl,l−1].
For all α > 1, we have, asymptotically, P(dm,n −E[dm,n] 6 −Nαy) = 0, since dm,n > 0 and E[dm,n] is asymptotically linear inN.
Theorem 9 (Upper deviations).
(i) For all α∈(1/2,2/3) and for all y>0, 1
N2α−1 logP(dm,n−E[dm,n]>Nαy)−−−→
m→∞ − y2
2σ2(µ). (7)
(ii) For all y>0,
1
N1/3 logP(dm,n−E[dm,n]>N2/3y)−−−→
m→∞ −I(y) (8) with
I(y) :=
y2
2σ2(µ) if y6y(µ),
q(µ)(1−t(y))1/2y1/2+t(y)2σ22(µ)y2 if y > y(µ), where
q(µ) := inf
0<δ<1/2
√1
δ(κ(µ) +J(δ)),
κ(µ) :=µ−log(µ)−1∈(0,∞), J has been defined in Theorem 3, y(µ) := 3 (q(µ)σ2(µ))2/3/2, and t(y) is defined for y > y(µ) as the smallest root of the cubic equation in t ∈[0,1]
t3−t2+q2(µ)σ4(µ) 4y3 = 0.
(iii) For all α ∈(2/3,2)and for all y>0, 1
Nα/2 logP(dm,n−E[dm,n]>Nαy)−−−→
m→∞ −q(µ)y1/2. (9) (iv) For all y>0,
1
N log P(dm,n−E[dm,n]>N2y)
−−−→m→∞
−inf
δ>0
qy
δ(κ(µ) +J(δ)) + Λ∗0
1
1−µ −qyδ
if y < µ2/(2(1−µ)2)
−∞ if y>µ2/(2(1−µ)2),
(10) where Λ∗0 is the Fenchel-Legendre transform of the function Λ0: R → (−∞,∞] defined by Λ0(s) := Λ(s,0) and Λ has been defined in Theorem 8.
For allα >2, we have, asymptotically, P(dm,n−E[dm,n]>Nαy) = 0, since dm,n 6n(n−1)/2 and N is asymptotically linear in n.
Remark 10. Observe that, for α= 2/3 and for all y >0, I(y) = inf
t∈[0,1]f(t), where
f(t) =
q(µ)(1−t)1/2y1/2+ t2y2 2σ2(µ)
.
If y 6 y1(µ) = 3 (q(µ)σ2(µ))2/3/24/3, then f is decreasing and its minimum y2/(2σ2(µ)) is attained at t= 1. Ify > y1(µ),f has two local minima, at 1 and att(y), corresponding to the smallest of the two roots in [0,1] off0(t) = 0 which is equivalent tot3−t2+q2(µ)σ4(µ)/(4y3) =
0. If y1(µ) < y 6 y(µ), the minimum is attained at 1, and at t(y) otherwise. Let c = q2(µ)σ4(µ)/(4y3). One can prove thatt(y) = 2Re(z) + 1/3 where z is the only complex cube root of
1 27− c
2+i
s c 27− c2
4 having argument in (π/3,2π/3).
Remark 11 (Typical events of deviation in Theorem 9).
(i) All the displacements within the blocks are small but their sum has a Gaussian contribution.
(iii) One block has a large size δ−1/2Nα/2y1/2 and the displacement within this block is Nαy, δ being chosen by the optimization in q(µ) (two competing terms).
(ii) One block has a large size and the displacement within this block is large (Nnα(1−t(y))y) and the sum of the other displacements has a Gaussian contribution (two extra competing terms).
(iv) One block has a large size δ−1/2N y1/2 and the displacement within this block is N2y which forces the sum of the length of the other blocks to be abnormally small, that is m− δ−1/2N y1/2 ∼N((1−µ)−1−δ−1/2y1/2) (three competing terms).
3 Proofs: full tables
Here, we take n=m. All limits are considered as m→ ∞ unless stated otherwise.
3.1 Upper moderate deviations - Theorem 2
For allm>1, let (Um,i)16i6mbe a sequence of i.i.d. random variables with uniform distribution U on [0,1] and let, for all i ∈ J1, mK, Vm,i = dmUm,ie. Note that (Vm,i)16i6m is a sequence of i.i.d. random variables uniformly distributed on J1, mK and, for all i = 1, . . . , m, Vm,i corresponds to the hash address of item i. Let us define
Sm(k) :=
m
X
i=1
1Vm,i6k=Lm([0, k/m]),
where Lm is the empirical measure associated to the random sequence (Um,i)16i6m.
As in [9, Lemma 2.1], we extend the definition of Sm(k) for k ∈ Z so that the sequence (Sm(k)−k)k∈Z be m-periodic, whence
minl<k{Sm(l)−l}= min
16l6m{Sm(l)−l}.
Thus, by [9, Equation (2.1) and Lemma 2.1], the total displacementdm,m is given by dm,m=
m
X
k=1
Sm(k)−k− min
16l6m{Sm(l)−l}
=m
m
X
k=1
(Lm− U)([0, k/m])−m2 min
16l6m{(Lm− U)([0, l/m])}
=m2
Z 1
0
(Lm− U)([0, y])dy− inf
06y61, y∈Q
{(Lm− U)([0, y])}+Am m
where Am ∈[−3/2,−1/2]. Let α∈(3/2,2). Then, P(dm,m>mαδ) =P
√ m mα−3/2
Z 1
0
(Lm− U)([0, y])dy− inf
06y61, y∈Q
{(Lm− U)([0, y])}+Am
m
>δ
=P
ϕ
√ m
mα−3/2(Lm− U)
>δm
,
where δm = δ−Am/m−(α−1) → δ and, for any measure ν ∈ M([0,1]) (the space of signed measures on [0,1]),
ϕ(ν) :=
Z 1
0
ν([0, y])dy− inf
06y61, y∈Q
ν([0, y]). (11)
Since
ϕ(ν) =
Z 1 0
(1−x)dν(x)− inf
06y61, y∈Q
Z 1
0 1[0,y](x)dν(x)
by Fubini’s theorem,ϕis a measurable function whenM([0,1]) is equipped with the σ-algebra generated by the applications ν 7→ν(f) for all bounded measurable function f on [0,1]. By [28, Equation (1.3)], we have
1
m2α−3logP
ϕ
√ m
mα−3/2(Lm− U)
>δ
→ −inf
(1 2
Z 1 0
dν dy(y)
2
dy; ν ∈ M([0,1]), ν U, ϕ(ν)>δ, ν([0,1]) = 0
)
=−inf
1 2
Z 1 0
G0(y)2dy ; G∈AC([0,1]),
Z 1 0
G(y)dy−minG>δ, G(0) =G(1) = 0
, (12) where AC([0,1]) is the space of absolutely continuous functions on [0,1]. We have used the fact that inf{G(y) ; 0 6y61, y ∈Q} = minG by the continuity of G on [0,1]. If G is a minimizer in (12) and if ˜y is a minimizer of G, then ˜G(y) = G(˜y +y)−G(˜y) is also a a minimizer in (12). The addition ˜y+y has to be understood as the addition on the torus T. Moreover, min ˜G= 0 and R01G˜0(y)2dy=R01G0(y)2dy. So
inf
1 2
Z 1 0
G0(y)2dy ; G∈AC([0,1]),
Z 1 0
G(y)dy−minG>δ, G(0) =G(1) = 0
= inf
1 2
Z 1 0
G0(y)2dy; G∈AC([0,1]),
Z 1 0
G(y)dy>δ, G(0) =G(1) = 0
. (13)
Using the method of Lagrange multipliers, if G is a minimizer of
1 2
Z 1 0
G0(y)2dy; G∈AC([0,1]),
Z 1 0
G(y)dy =δ
, (14)
then there exists λ∈R such that 1 2
Z 1 0
G0(y)2dy+λ
Z 1 0
G(y)dy = 0.
Differentiating with respect to G, we get
∀h∈ Cc∞((0,1))
Z 1 0
G0(y)h0(y) +λh(y)dy= 0.
Integrating by parts, one has
∀h∈ Cc∞((0,1))
Z 1 0
G0(y)−λyh0(y)dy= 0.
By Du Bois-Reymond’s lemma in [5, p.184], the function y7→G0(y)−λy is constant, so G is a quadratic polynomial. Getting back to (14), the minimizer is given by
G(y) = 6δy(1−y)
and the infimum is 6δ2. Moreover, we check that G(0) = G(1) = 0 as required in (13).
Consequently,
1
m2α−3 logP ϕ
√m
mα−3/2 (Lm− U)
!
>δ
!
→ −6δ2. Using the fact that, for Φ = ϕ((√
m/mα−3/2) (Lm− U)),
P(Φ>δ+ε)6P(Φ>δm)6P(Φ>δ−ε) (15) for all ε >0 and all m large enough, we conclude to the result in Theorem 2 by continuity of the function δ 7→ −6δ2.
3.2 Upper large deviations - Theorem 3
The result for δ = 0 is trivial. Suppose thatδ >0. We have P
dm,m >m2δ=P
Z 1
0
(Lm− U)([0, y])dy− inf
06y61, y∈Q
{(Lm− U)([0, y])}+ Am m >δ
=P
ϕ(Lm− U)>δ− Am m
,
where ϕis the measurable function defined in (11). By Sanov’s theorem (see, e.g., [28, Equa- tion (1.1)]), we have
1
mlogP(ϕ(Lm− U)>δ)
→ −inf
(Z 1 0
dν dy(y)
log
dν dy(y)
dy; ν ∈ M+1([0,1]), ν U, ϕ(ν− U)>δ
)
=−inf
(Z 1 0
F0(y) logF0(y)dy ; F ∈AC([0,1]), F0 >0,
Z 1 0
(F −id)(y)dy−min(F −id)>δ, F(0) = 0, F(1) = 1
)
, (16)
where M+1([0,1]) is the space of probability measures on [0,1]. If F is a minimizer in (16) and if ˜y is a minimizer of F −id, then ˜F(y) =F(˜y+y)−F(˜y) is also a a minimizer in (16).
Moreover, min( ˜F −id) = 0 and R01F˜0(y) log ˜F0(y)dy =R01F0(y) logF0(y)dy. So
−inf
(Z 1 0
F0(y) logF0(y)dy ; F ∈AC([0,1]), F0 >0,
Z 1 0
(F −id)(y)dy−min(F −id)>δ, F(0) = 0, F(1) = 1
)
=−inf
(Z 1 0
F0(y) logF0(y)dy ; F ∈AC([0,1]), F0 >0,
Z 1 0
F(y)dy>δ+1 2, F(0) = 0, F(1) = 1
)
=−infK,
where K: AC([0,1])→[0,∞] is the convex function defined by K(F) =
R1
0 F0(y) log(F0(y))dy if F0 >0,R01F(y)dy>δ+12, F(0) = 0,F(1) = 1
∞ otherwise.
It is a standard convex optimization problem, a minimizer of which is F¯(y) =a(1−eλy), where
a(1−eλ) = 1 a=δ+ 12 − 1λ.
(One can see that a > 1 and λ 6 0.) Indeed, by the definition of a convex function and the subdifferential, it suffices to check that 0 belongs to the subdifferential of K at ¯F. For all h∈AC([0,1]) and for all t >0,
K( ¯F +th) =
R1
0( ¯F0+th0) log( ¯F0+th0) if F0+th0 >0, R01h>0,h(0) =h(1) = 0
∞ otherwise.
Differentiating under the integral sign with respect to t and integrating by parts gives K0( ¯F;h) =
Z 1 0
h0(y)(log ¯F0(y) + 1)dy=−λ
Z 1 0
h(y)dy>0
since h(0) =h(1) = 0, R01h(y)dy> 0, and λ <0. It remains to compute the value of K at ¯F and to apply the same argument of continuity as in (15) to get the required result.
4 Interlude
4.1 Janson’s reformulation
Here, we consider (m, n) withm → ∞andn/m→µ∈(0,1). As a consequence,N =m−n →
∞. To make notation clearer, we make quantities depend on N and all limits are considered as N → ∞unless stated otherwise. In the next section, we are interested in the deviations of dmN,nN in that regime, which is called the sparse case (see [6, 9] for this denomination with slight variants). In this section, we introduce a reformulation of the model of hashing with linear probing due to Janson in [9] and prove some preliminary results.
For all N > 1, we consider a vector of random variables (XN, YN) defined as follows. We assume that XN is distributed according to the Borel distribution with parameter µN :=
nN/mN ∈(0,1), i.e.
∀l ∈J1,∞J P(XN =l) =e−µNl(µNl)l−1
l! (17)
(see, e.g., [6] or [9] for more details). In some places, for the ease of computation, we may also use the parametrizationλN =e−µNµN to get an equivalent definition of the Borel distribution:
P(XN =l) = 1 T(λN)
ll−1λlN
l! , (18)
where T is the tree function (see, e.g., [7, p. 127]). Furthermore, we assume that YN given {XN =l} is distributed as dl,l−1.
Let (XN,i, YN,i)16i6N be an i.i.d. sample distributed as (XN, YN) and define, for allk ∈J1, NK, SN,k :=
k
X
i=1
XN,i and TN,k :=
k
X
i=1
YN,i.
To lighten notation, let SN := SN,N and TN := TN,N. Notice that, for all N > 1, E[XN] = (1−µN)−1 =mN/N, so E[SN] =mN. Moreover,P(SN =mN)>0 and we have the following identity (see [9, Lemma 4.1]):
L(dmN,nN) = L(TN|SN =mN).