• Aucun résultat trouvé

Nonparametric estimation of a shot-noise process

N/A
N/A
Protected

Academic year: 2021

Partager "Nonparametric estimation of a shot-noise process"

Copied!
5
0
0

Texte intégral

(1)

HAL Id: hal-01418963

https://hal.inria.fr/hal-01418963

Submitted on 13 Jan 2020

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

To cite this version:

Paul Ilhe, François Roueff, Éric Moulines, Antoine Souloumiac. Nonparametric estimation of a

shot-noise process. SSP 16 - Statistical Signal Processing Workshop, IEEE Signal Processing Society, Jun

2016, Palma de Mallorca, Spain. pp.7551709, �10.1109/SSP.2016.7551709�. �hal-01418963�

(2)

NONPARAMETRIC ESTIMATION OF A SHOT-NOISE PROCESS

Paul Ilhe

, Franc¸ois Roueff

T´el´ecom ParisTech-T´el´ecom ParisTech

Eric Moulines, Antoine Souloumiac

Ecole Polytechnique-CEA List

ABSTRACT

We propose an efficient method to estimate in a nonpara-metric fashion the marks’ density of a shot-noise process in presence of pileup from a sample of low-frequency observa-tions. Based on a functional equation linking the marks’ den-sity to the characteristic function of the observations and its derivative, we propose a new time-efficient method using B-splines to estimate the density of the underlyingγ-ray spec-trum which is able to handle large datasets used in nuclear physics. A discussion on the numerical computation of the al-gorithm and its performances on simulated data are provided to support our findings.

Index Terms— Shot-noise, nonparametric estimation,

B-splines,γ-spectroscopy, pile-up correction

1. INTRODUCTION

In this paper, we are focusing on a nonlinear inverse prob-lem that arises in nuclear science. In the specific context of

γ-spectroscopy, a photon beam stemming from a radioactive

source hits a scintillator detector. Assuming that the nuclear reactions at the origin of the emitted photons are independent, the arrival times of interaction between particles and matter can be modeled by a homogeneous Poisson point process. Each interaction between a particle and the detector gener-ates an electric current called impulse response whose ampli-tude depends on the partial or total transfer of the incident photon energy. Indeed, photons interact with matter follow-ing three different phenomena: the Compton scatterfollow-ing, the photoelectric absorption and the pair production (for supple-mentary details, see [6]). We particularly focus our attention on shot-noise processes that produce pileup, which is the phe-nomenon that occurs when two or more electric currents gen-erated by different photons overlap. For instance, this is the case when the mean inter-arrival time between two photons is shorter than the support of the impulse response, as illustrated in Figure 1. The form of the impulse response mainly depends on the type of detector and the data acquisition setup. Since the instrumentation chain is usually set by the physicists, we

This research was partially supported by Labex DigiCosme (project

ANR-11-LABEX-0045-DIGICOSME) operated by ANR as part of the pro-gram “Investissement d’Avenir” Idex Paris-Saclay (ANR-11-IDEX-0003-02).

assume, in this article, that we exactly know the impulse re-sponse. The measured electric current can be modeled by a shot-noise process(Xt)t≥0defined by

Xt:=



k:Tk≤t

Ykh(t − Tk) , (1)

under the assumptions

(SN-1) kδTk,Ykis a real-valued homogeneous Poisson point process with times Tk arriving with intensity λ > 0 and independent real-valued i.i.d. marksYk with prob-ability density functionf. In addition, f is compactly supported in[0, M ] for some M > 0 and belongs to the spaceH2([0, M ]) defined by

H2([0, M ]) := {f : [0, M ] → R, f, fare

absolutely continuous and

 M

0 |f

(t)|2

dt < ∞} . (SN-2) the impulse responseh of the detector is causal,

inte-grable and satisfies

h(t) =

t→∞O

 e−t .

Such a process is well defined as soon as the densityf and the impulse responseh jointly satisfy the condition



min(1, |y h(s)|) f (y)dyds < ∞ .

Figure 1 displays a simulated shot-noise sample path with intensity λ = 3, impulse response of the form h : t → 10te−10t1t≥0and marks’ density following a Gaussian

mix-ture.

Inγ-spectroscopy, the marks (Yk)k representγ-ray

emit-ted energies that depend on the underlying nucleus. In order to identify which nucleus are present and to quantify the in-tensityλ, one needs to estimate the flow

fλ:= λf

by comparing it to knownγ-ray spectrums.

Shot-noise processes belong to the family of stochastic processes well-suited to model dynamical systems that can

(3)

Fig. 1: Red: simulated shot-noise process and sample points Blue: corresponding singular electrical impulsions.

be interpreted as the convolution between a function (called a filter) and a marked point process. Among the long list of applications, one can, for instance, use it in pharmacokinetics [4], to model membrane potential dynamics over time in [10] and, more recently, to model pile-up distortion in astrophysics [2].

From a statistical standpoint, we are looking at the fol-lowing problem: based on n low-frequency observations

Xδ, . . . , Xnδ of the shot-noise process defined by (1) and

under the assumption that the impulse responseh is known, we aim to estimate the functionfλ:= λf .

The statistical inference methods for shot-noise processes were initiated in [7], and, more recently, in [5]. Moreover, methods based on the likelihood can not be used because the marginalX0may not have a density (with respect to the Lebesgue measure) or,if it exists, the function can be analyt-ically intractable. In addition, the low sampling rate of the signal precludes the use of techniques involving the joint laws of(Xt)t≥0, so that we only rely on methods exploiting the

marginal law of the process. Since X0 is infinitely divisi-ble (see [11]), one might be tempted to estimate the associ-ated characteristic L´evy triplet (see [9] for statistical inference of L´evy processes). However, in our case, the observations

X1, . . . , Xn are neither i.i.d. nor Markovian as in [5] for the

function

h(t) = e−αt1 t≥0.

In this particular case, it is possible to establish a formula linking the characteristic function of the markY0to a function involving the characteristic function ofX0and its derivative. For every realu, we have

ϕY0(u) = 1 + α u λ ϕ X0(u) ϕX0(u) .

Under the assumption that the marks’ density f belongs to some functional Sobolev space, they derive an estimator that converges at a polynomial speed. However, this pa-per exploits that the shot-noise process is Markovian, which is not true for general impulse responsesh. This property allows to control the rates of convergence of the estimator using theβ-mixing coefficients (see [3]) of the observations

(X1, . . . , Xn) associated to the shot-noise process given by

(1).

2. ESTIMATION PROCEDURE

In this article, we propose a method encompassing [5] that es-timate the functionfλ. Without loss of generality, we assume that the sampling frequencyδ−1= 1 that f is compactly sup-ported in [0, 1]. Since the marginal law of X0 is infinitely

divisible, we can derive the analytic expression of the charac-teristic functionϕ of the marginal X0. For every realu, we have ϕ(u) : = EeiuX0 = exp  0  1 0 eiuxh(s)− 1 fλ(x)dxds . (2)

Under the mild assumption |x|fλ(x)dx < ∞, the differen-tiation of (2) with respect tou leads to the functional equation

g(u) = Kh[fλ] (u) , u ∈ R ,

whereg = ϕ/ϕ and Kh is a bounded linear operator only depending onh, which is defined by

Kh: f →  u →  0  1 0 ixh(s)e iuxh(s)f(x)dxds .

However, we only have access to a noisy versionˆgn of the functiong that we compute by a “plug-in” method

ˆgn(u) := ˆϕn(u)/ ˆϕn(u), u ∈ R ,

whereϕˆn(u) := n−1nk=1eiuXkcorresponds to the empiri-cal characteristic function associated to the shot-noise obser-vationsX1, . . . , Xn. The general framework of those inverse problems is developed in [8] where the authors assume that one can construct unbiased estimators of the functiong and explain the continuous-time singular value decomposition of the operatorKh. In general, these two hypothesis are not sat-isfied. Consequently, we propose to build an estimator offλ from a finite grid ofN evaluation grid {ui, i ∈ 1, . . . , N} for which we consider the equations

ˆgn(ui) = Kh[fλ] (ui) + n(ui), i ∈ {1, . . . , N} ,

where(u1), . . . , (uN) correspond to the estimation errors

of g(ui) by ˆϕn(ui)/ ˆϕn(ui) for i = 1, . . . , N . In

particu-lar, the following result ensures that the normalized vector of errors can be estimated by a correlated centered Gaussian vector.

Proposition 1. Letλ > 0 and X1, · · · , Xn observations of the shot-noise process given by (1) under Assumptions ( SN-1) and (SN-2). Furthermore, assume there exists some η > 0 such thatE|Y1|4+η< ∞. Let ϕ and ϕrespectively denote

(4)

the characteristic function ofX1 and its derivative. Then, for any integer N and any N-tuple of real numbers u =

(u1, · · · , uN), we have

n (n(u1), . . . , n(uN)) ⇒ ˜Z(u) ,

where ˜Z(u) is a centered complex-valued Gaussian vector with covariance matrix denoted byW (u).

For the sake of brevity, we omit here to give the explicit formula ofW (u). However, one can estimate it using several methods: ”plug-in”, generalized method of moments or boot-strap. In the sequel, we will use the bootstrap method for the numerical applications.

B-splines parameterization

Sincefλlies in the functional spaceH = H2([0, 1]), the es-timation problem can be translated as the following

ˆ fn,λ(α) := argmin f∈H ˆgn(u) − Kh[f ](u) 2 ˆ W−1 n + αf  2 H, (3)

wheref2H := 01|f(t)|2dt. In order to tackle this prob-lem, we propose to estimate fλ using the B-splines basis. This approach is suggested by [1], which studies a closely related problem. Given two positive integersq, k and a set of real numbers t = (t1, . . . , tk) called knots that satisfy

t1 < · · · < tk , the space Stq of B-splines of degreeq

as-sociated to the knotst is the set of functions v that can be

written v(t) = k+q  j=1 ωjBq j,k(t) =: ωTBqk(t) ,

for some vector1, . . . , ωk+q) of real numbers and where

the basis functionsBqk:= (B1,kq , . . . , Bk+q,kq ) are defined by

Bj,0(t) :=



1 if tj  t < tj+1

0 sinon

forq = 0 and satisfy the recursive relationship

Bj,l(t) := tt − tj

j+l− tjBj,l−1(t) +

tj+l+1− t

tj+l+1− tj+1Bj+1,l−1(t) ,

forl ∈ {1, . . . , k + q}. We consider such a parameteriza-tion for at least three reasons. On the one hand, the func-tions(Bj,q)1≤j≤k+q(for givenk, q and t) are compactly

sup-ported: this allows us to model the detector’s resolution in nuclear applications, which is known`a priori. On the other

hand, if a functionv belongs to the B-splines vector space S, then there exists a linear relationship betweenv and its deriva-tives. More precisely, if for some integersq ≥ 3, k ≥ 1, and some vectorω, v = ωTBq k, then v= ω(2)TBq−2k , with ω(2)= Aq kω and Aqk ∈ R(k+q−2)×(k+q).

Moreover, the B-splines vector spaces are of particular inter-est to approximate the functions belonging toH. Thus, it is possible to construct an estimator offλ able to adapt to the local regularity of the target function by

ˆ

fn,λ(α) := ˆωn,λ(α)TB .

whereωˆn,λ(α) is the solution of the quadratic

minimiza-tion problem

minˆgn(u) − Kh[B](u)Tω2Wˆ−1 n + αω

TDk+q

2 ω , (4)

forω ∈ Rk+qand whereDk+q2 is a matrix only depending on

k and q.

Penalty cross-validation

The choice of the penalty coefficientα in the minimization problem (4) is related to the quadratic problems considered in [12]. The author proposes a practical and efficient way to choose, for a given number of penalty termsα1, . . . , αp, which one is the closest to the usual leave-one out cross-validation score. Coarsely speaking, it boils down to choose the real numberα ∈ {α1, . . . , αp} that minimizes the func-tion CV(α) := ˆgn(u) − An(α)ˆgn(u) 2 ˆ W−1 n (1 − Tr(An(α))/N )2 (5) with An(α) := Kh[Bqk]  Kh[Bqk]Wˆn−1Kh[Bqk] + αDk+q2  Kh[Bqk]Wˆn−1.

3. ALGORITHM AND NUMERICAL RESULTS The detailed estimation procedure described in the previous section is summed up in the following pseudo-code. In par-ticular, we propose a practical manner to set hyperparameters and illustrate the performances of this algorithm on simulated datasets.

We illustrate the performance of our algorithm on simu-lated observations of a shot-noise process with intensity and impulse response given by

λ = 3 and h : t → 10 t e−10t1 t≥0,

and marks(Yk)kfollowing a Gaussian mixture with density

f = 0,7 N0,3; 2,5.10−3+ 0,3 N0,6; 2,5.10−3 , whereN (μ; σ2) is the probability density function of Gaus-sian random variable with mean μ and variance σ2. Fig-ures display the densities obtained using Algorithm 1 for sev-eral number of observations and a boxplot of the integrated

(5)

Algorithm 1 B-splines estimator

Inputs : Shot-noise observationsX1, · · · , Xn.

1. Choice ofΔn>0. Computation of the histogram (Hi)i∈Zbased

on the observationsX1,· · · , Xn:

Hi := {Xk ∈ [Δni; Δn(i + 1)]}/n and (dHi)i∈Z defined by dHi= ΔiHi

2. Choice ofN := 2log(n)and computation ofXφ= FFT(H, N), dXφ= FFT(dH, N). Fork= 1, · · · , N, we set ˆgn 2π(k−1) N Δ  := dXφk Xφk

3. Choice of the knots’ numberk= N, the degree q = 3 and

com-putation of the design matrix(Kh[Bj,q](2π(i − 1)/NΔ))i,jfor

i∈ {1, . . . , N}, j ∈ {1, . . . k + q}

4. Estimation of ˆWnby bootstrap.

5. Estimation ofˆωn(α) for α ∈ {2−15, . . . ,215} via (4) and choice

by the cross-validation score using (5).

0.0 0.2 0.4 0.6 0.8 1.0 0 2 4 6 8 10 12 14 16 18 fn, λ, n = 105 fn, λ, n = 106 fn, λ, n = 107 fλ

Fig. 2: Estimation offλforn ∈ {105, 106, 107}

n = 105 n = 106 n = 5. 106 Sample size 10-2 10-1 100 101 102 Integrated squared error ˆ fn,λ − fλ 2 2

Fig. 3: Boxplot of the Integrated Squared Error ˆfn,λ− fλ22 forn ∈ {105, 106, 5.106} and 100 independent runs of Algo-rithm 1 over independent input data sets.

squared error ˆfn,λandfλfor 100 independent runs of Algo-rithm 1 and three different sample sizes.

As shown in Figure 2 and 3, the estimator ˆfn,λconverges

tofλ. In addition, we see that the estimator well retrieves the modes offλ, even with a restricted number of observations . This presents an interest forγ-spectroscopy because it permits to identify the characteristic peaks of a given radionuclide.

4. REFERENCES

[1] Cardot, Herv´e, Spatially adaptive splines for statistical linear inverse problems, Journal of Mutivariate

Analy-sis, (2002)

[2] Chaplin, V., Bhat, N., Briggs, M. S., Connaughton, V. Analytical modeling of pulse-pileup distortion using the true pulse shape; applications to Fermi-GBM. Nuclear

Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associ-ated Equipment, 717, 21-36. (2013)

[3] Doukhan, Paul, Mixing, Springer, (1994)

[4] Helfenstein, U., Steiner, M. . Random drug excess.

Eu-ropean journal of drug metabolism and pharmacokinet-ics, 15(4), 309-315. (1990)

[5] Ilhe, Paul, Moulines, Eric, Roueff, Franc¸ois and Souloumiac Antoine. Nonparametric estimation of mark’s distribution of an exponential Shot-noise pro-cess. Electronic Journal of Statistics, 9(2). (2015) [6] Knoll, Glenn F, Radiation detection and measurement,

Wiley New York, (1989)

[7] Macchi, Odile and Picinbono, Bernard C., Estimation and detection of weak optical signals, Information

The-ory, IEEE Transactions on,18(5),562-573, (1972)

[8] Mair, B. A., Ruymgaart, F. H., Statistical inverse estima-tion in Hilbert scales. SIAM Journal on Applied

Mathe-matics, 56(5), 1424-1444. (1996)

[9] Neumann, M. H., Reiss, M. Nonparametric estimation for Lvy processes from low-frequency observations.

Bernoulli, 15(1), 223-248. (2009)

[10] Rudolph, M., Pospischil, M., Timofeev, I., Destexhe, A. Inhibition determines membrane potential dynam-ics and controls action potential generation in awake and sleeping cat cortex. The Journal of neuroscience, 27(20), 5280-5290, (2007)

[11] Sato, Ken-iti, L´evy processes and infinitely divisible dis-tributions, Cambridge University Press, (1999)

[12] Wood, Simon N, Modelling and smoothing parameter estimation with multiple quadratic penalties, Journal

of the Royal Statistical Society: Series B (Statistical Methodology), (2000)

Figure

Fig. 1: Red: simulated shot-noise process and sample points Blue: corresponding singular electrical impulsions.
Fig. 3: Boxplot of the Integrated Squared Error  f ˆ n,λ − f λ  2 2 for n ∈ {10 5 , 10 6 , 5

Références

Documents relatifs

Applications are given for the biased density model, the nonparametric regression model and a GARCH-type model under some mixing dependence conditions (α-mixing or β- mixing)... , Z

Gerold Alsmeyer, Bastien Mallein. A simple method to find all solutions to the functional equation of the smoothing transform.. In order to characterize all solutions within this

Regarding attentional maintenance, some studies reported a higher viewing time on negative pictures than on neutral pictures at early processing stages in healthy volunteers 48

La dernière famille de mots, qui se rencontre presque uniquement en poésie, a pour point de départ un verbe qui se trouve dans l’Odyssée, ἑψιάομαι « s’amuser, jouer

We have established the uniform strong consistency of the conditional distribution function estimator under very weak conditions.. It is reasonable to expect that the best rate we

However, U^Th data (Bourdon et al., 2000; Hora et al., 2009) and plagioclase chemical zoning (Ginibre &amp; Wo«rner, 2007) strongly suggest that all Parinacota lava compositions

To our knowledge, few state of the art methods can deal with such a general noise model as the one we propose here, so we estimate the noise parameters with the different

Lastly, we propose a model selection strategy relying on a Goldenschluger and Lepski [21] method and prove a risk bound for the nal estimator: this result holds for