• Aucun résultat trouvé

Additional File 3 of ”Goodness-of-fit tests and nonparamet- ric adaptive estimation for spike train analysis”: adaptive properties of the Lasso estimate for Hawkes processes Patricia Reynaud-Bouret

N/A
N/A
Protected

Academic year: 2022

Partager "Additional File 3 of ”Goodness-of-fit tests and nonparamet- ric adaptive estimation for spike train analysis”: adaptive properties of the Lasso estimate for Hawkes processes Patricia Reynaud-Bouret"

Copied!
3
0
0

Texte intégral

(1)

Additional File 3 of ”Goodness-of-fit tests and nonparamet- ric adaptive estimation for spike train analysis”: adaptive properties of the Lasso estimate for Hawkes processes

Patricia Reynaud-Bouret

∗1

and Vincent Rivoirard

2

and Franck Grammont

1

and Christine Tuleau- Malot

1

1Univ. Nice Sophia Antipolis, CNRS, LJAD, UMR 7351, 06100 Nice, France

2CEREMADE UMR CNRS 7534, Universit´e Paris Dauphine, Place du Mar´echal De Lattre De Tassigny, 75775 PARIS Cedex 16, France

Email: Patricia Reynaud-Bouret- [email protected]; Vincent Rivoirard - [email protected]; Franck Grammont - [email protected]; Christine Tuleau-Malot - [email protected];

Corresponding author

Theorem 1 of [1] does not explicit particular basis on which the interaction functions are expansed. It is stated in its general form as follows. Note that [1] is a proceedings and therefore no proof has been given of the result in [1]. Ifni.i.d. trials are recorded, each trialicorresponds to the observation ofNi= (Ni(1), ..., Ni(M)), the multivariate Hawkes process whose intensity is given by the predictable transformation denoted ψi. Furthermore, to each triali, we can associate an intensity λi and a contrastγ(i). The global least-squares contrast over the ntrials can also be seen as

γn(f) =

n

X

i=1

γ(i)(f). (1)

We use the following notation: for any predictable processes H = (Hi(1), ..., Hi(M))i=1,...n, K = (Ki(1), ..., Ki(M))i=1,...,n, set

H•N =

n

X

i=1 M

X

m=1

Z T2

T1

Hi,t(m)dNi(m)(t), (2)

H⋄K=

n

X

i=1 M

X

m=1

Z T2 T1

Hi,t(m)Ki,t(m)dt, (3) andH⋄2=H⋄H.

In general, we use a dictionary Φ of known functions ofHand we only consider linear combinations of functions of Φ for estimatingf:

fa=X

ϕ∈Φ

aϕϕ, for a∈RΦ. (4)

1

(2)

Then, by linearity ofψ, one can rewrite (1) as

γn(fa) =−2abn+aGna, (5) where for anyϕand ˜ϕin Φ,

(bn)ϕ=ψ(ϕ)•N and (Gn)ϕ,ϕ˜=ψ(ϕ)⋄ψ( ˜ϕ).

Given a vector of positive weightsd, the Lasso estimate of f is ˜fn :=fa˜n where ˜an is a minimizer of the followingℓ1-penalized least-square contrast:

˜

an ∈arg min

aRΦ{−2abn+aGna+ 2d|a|}. (6) Then Theorem 1 of [1] is stated as follows:

Theorem 1. We introduce the following two events:

V,B ={∀ϕ∈Φ, sup

t∈[T1,T2],m,i

(m)i,t (ϕ)| ≤Bϕ and(ψ(ϕ))2•N≤Vϕ}, for positive deterministic constantsBϕ andVϕ and

c=n

∀a∈RΦ, aGna≥c aao

, (7)

for a positive constantc. Letxandεbe strictly positive constants and for all ϕ∈Φ, dϕ=

q

2(1 +ε) ˆVϕµx+Bϕx

3 , (8)

with

ϕµ= µ

µ−φ(µ) (ψ(ϕ))2•N+ B2ϕx µ−φ(µ)

for a real number µsuch thatµ > φ(µ), whereφ(µ) = exp(µ)−µ−1. Then, with probability larger than

1−4X

ϕ∈Φ

 log

1 + µVB2ϕ ϕx

log(1 +ε) + 1

e−x−P((ΩV,B∪Ωc)c), the following inequality holds

[ψ( ˜fn)−λ]⋄2≤C inf

aRΦ

[ψ(fa)−λ]⋄2+1 c

X

ϕ∈S(a)

d2ϕ

 ,

whereC is an absolute positive constant and whereS(a)is the support ofa, i.e. its coordinates with non-zero coefficients.

2

(3)

Proof. We use the notation of [2] and transposition of this notation. First by scaling the data, it is always possible to assume thatA= 1. We have at handn×M point processesNm(i). In the more general case, we need to model eachλ(m,i), intensity of Nm(i), by a

ψf(m,i)

n (t) =µ(m,i)+X

ℓ,j

Z t

−∞

g(m,i)ℓ,j (t−u)dNj(ℓ)(u),

wherefn belongs toHn which replaces the spaceH:

Hn= (R×L2((0,1])nM)nM=n fn=

(m,i),(gℓ,j(m,i))ℓ=1,...,M, j=1,...,n)m=1,...,M, i=1,...n

:

gℓ,j(m,i)with support in (0,1] andkfnk2=X

m,i

(m,i))2+X

m,i

X

ℓ,j

Z 1 0

g(m,i)ℓ,j (t)2dt <∞

 .

For everyfn=

(m,i),(g(m,i)ℓ,j )ℓ=1,...,M, j=1,...,n)m=1,...,M, i=1,...n

in Hn, we denote for eachm andi, fn(m,i)= (µ(m,i),(g(m,i)ℓ,j )ℓ=1,...,M, j=1,...,n.

In the same way for everyf =

(m),(gℓ,j(m))ℓ=1,...,M, j=1,...,n

m=1,...,M inH,we denote for eachmandi, f(m)= (µ(m),(gℓ,j(m))ℓ=1,...,M, j=1,...,n.

Now our dictionary Φ of Hcan be transformed into a dictionary Φn ofHn by stating that for anyϕin Φ we associate aϕn inHn such that for alli, m,ϕ(m,i)n(m). Therefore it is easy to see that the vectorb of [2] associated to fn is actually our vectorbn and that the matrixGof [2] is our matrixGn. The V and B are in the same way translated and the present result is a pure application of Theorem 2 of [2]

References

1. Reynaud-Bouret P, Rivoirard V, Tuleau-Malot C: Inference of functional connectivity in Neurosciences via Hawkes processes. In 1st IEEE Global Conference on Signal and Information Processing 2013, Austin Texas.

2. Hansen N, Reynaud-Bouret P, Rivoirard V: Lasso and probabilistic inequalities for multivariate point processes.to appear in Bernoulli 2013. [Http://arxiv.org/abs/1208.0570].

3

Références

Documents relatifs

In this case, standard homogeneous or inhomogeneous Poisson processes cannot be used and models based on univariate or multivariate Hawkes processes or variations of them are

More speci…cally, we are concerned with the correct speci…cation (or model selection) of the dynamic structure with time series and/or spatial stationary processes fx (t)gt 2Z de…ned

The EDMFT formalism has an even more basic deficiency: since it is based on a local approximation to the self-energy, it does not include even the first-order nonlocal interaction

Peptide binding is dominated by the α chain CDR3 loop (blue) with two hydrogen bonds and 32 van der Waals contacts, and β chain CDR3 and CDR1 loops (green) with a total of four

Our goal is to investigate the asymptotic properties of a kernel density estimator asso- ciated with the driven noise of a linear regression in adaptive tracking and to propose

We recover the optimal adaptive rates of convergence over Besov spaces (that is the same as in the independent framework) for τ -mixing processes, which is new as far as we know..

For para- metric estimation of hypo-elliptic diffusions, we refer the reader to Gloter [15] for a discretely observed integrated diffusion process, and Samson and Thieullen [32] for

In the case of Sobolev null density f 0 , the fact that our testing procedure attains the minimax rate of testing (upper bound of the testing rate (4)), relies on a very sharp