• Aucun résultat trouvé

Discussion of the paper: "Sampling schemes for generalized linear Dirichlet process random effects models” by M. Kyung, J. Gill, and G. Casella

N/A
N/A
Protected

Academic year: 2021

Partager "Discussion of the paper: "Sampling schemes for generalized linear Dirichlet process random effects models” by M. Kyung, J. Gill, and G. Casella"

Copied!
3
0
0

Texte intégral

(1)

Stat Methods Appl (2011) 20:295–297 DOI 10.1007/s10260-011-0170-3

D I S C U S S I O N

Discussion of the paper: “Sampling schemes

for generalized linear Dirichlet process random effects

models” by M. Kyung, J. Gill, and G. Casella

M. Filippone · A. Mira · M. Girolami

Published online: 12 August 2011 © Springer-Verlag 2011

We congratulate the Authors for an interesting and thought provoking paper that com-pares different MCMC strategies for the class of generalized linear Dirichlet pro-cess random effects models. In our discussion we focus on the logistic regression model presented in Section 3.2 and would like to propose two alternative sampling schemes related to manifold methods that are quite general and could be employed every time a Metropolis-Hastings step is used within an MCMC simulation and the target distribution has closed form Fisher Information (FI). In particular we have imple-mented the same sampling scheme as presented by the Authors, but have sampled the parametersβ using Simplified Manifold Metropolis Adjusted Langevin Algorithm (S-MMALA) and Riemann Manifold Hamiltonian Monte Carlo (RM-HMC), both presented inGirolami and Calderhead(2011).

The starting point to apply these methods is the joint density of the observations and the parameters of interest, given by the likelihood of the model along with the prior distribution forβ. The gradient of the logarithm of this joint density and the FI (the expectation, taken with respect to the observed variables y, of the negative Hessian of the log-likelihood) are then computed.

In order to keep the notation uncluttered, we define the logistic elements

li+= logistic(Xiβ + (Aη)i) =

1

1+ exp[−Xiβ − (Aη)i]

and similarly

M. Filippone· M. Girolami

Department of Statistical Science, University College London, London, UK

A. Mira (

B

)

Institute of Finance, University of Lugano, Lugano, Switzerland e-mail: Antonietta.Mira@uninsubria.it

(2)

296 M. Filippone et al.

li = 1 − li+= 1

1+ exp[Xiβ + (Aη)i].

The logarithm of the joint density of observations y and parametersβ is:

Lβ= n  i=1  yilog(li+) + (1 − yi) log(li)− β Tβ 2dσ2.

Using the property of the logistic function, for which∇βli+= li+liXi, the gradient

with respect toβ is easily evaluated as

βLβ= n  i=1 Xi  yi− li+− β dσ2.

The Hessian ofLβreads

ββLβ = − n  i=1  li+li−XiXTi − 1 dσ2,

where there is no dependence from y anymore. As a consequence, the expectation with respect to y of the negative Hessian is equal to the negative Hessian itself, and therefore the FI, along with the negative Hessian of the prior, is

G= n  i=1  li+li−XiXTi + 1 dσ2

with derivatives with respect to theβr, given by:

∂G ∂βr = n  i=1 XiXiT  li+li− li − li+Xir.

In S-MMALA we set an integration step size ofε = 1, whereas for RM-HMC we set a maximum number of leapfrog steps to 10 with integration stepε = 0.75. The autocorrelation for the components of β are reported in Fig. 1, where it can be seen that the autocorrelation is lower than the one obtained by the Authors and reported in the right panel of Fig. 2 in the paper. The use of a Metropolis-Hastings step within the Gibbs sampler avoids the use of any auxiliary variable in samplingβ, that in the expression of the likelihood are effectively integrated out. As pointed out by the Authors in Section 3.3.2, one of the challenges in setting up efficient MCMC methods using Metropolis-Hastings steps, however, is how to tune the parameters of the proposal. Here, by using the idea presented inGirolami and Calderhead(2011), the proposal is automatically tuned based on the geometry of the underlying statistical model, whereby only the integration stepε and/or the number of leapfrog steps need

(3)

Generalized linear Dirichlet process random effects models 297 0 5 10 15 20 25 30 35 0.0 0.2 0.4 0.6 0.8 1.0 Lag AC F 0 5 10 15 20 25 30 35 0.0 0.2 0.4 0.6 0.8 1.0 Lag AC F 0 5 10 15 20 25 30 35 0.0 0.2 0.4 0.6 0.8 1.0 Lag AC F 0 5 10 15 20 25 30 35 0.0 0.2 0.4 0.6 0.8 1.0 Lag AC F 0 5 10 15 20 25 30 35 0.0 0.2 0.4 0.6 0.8 1.0 Lag AC F 0 5 10 15 20 25 30 35 0.0 0.2 0.4 0.6 0.8 1.0 Lag AC F 0 5 10 15 20 25 30 35 0.0 0.2 0 .4 0.6 0.8 1.0 Lag AC F 0 5 10 15 20 25 30 35 0.0 0.2 0 .4 0.6 0 .8 1.0 Lag AC F 0 5 10 15 20 25 30 35 0.0 0 .2 0.4 0 .6 0.8 1 .0 Lag AC F

Fig. 1 Autocorrelation of the samples forβ obtained by the KS mixture representation by the Authors

(first row or right panel of Fig. 2 in the paper), S-MMALA (second row), and RM-HMC (third row)

to be tuned. Such a proposal mechanism allows to efficiently deal with multivariate correlated posterior distributions as confirmed by the results.

Reference

Girolami M., Calderhead B. (2011) Riemann Manifold Langevin and Hamiltonian Monte Carlo methods. J R Stat Soc Series B Stat Methodol 73(2):123–214

Figure

Fig. 1 Autocorrelation of the samples for β obtained by the KS mixture representation by the Authors (first row or right panel of Fig

Références

Documents relatifs

Holding meduc and f educ xed, by how much does sibs have to increase to reduce expected years of education by one year?. (A noninteger answer is acceptable

With these strengths, helicity has become widely applied to topics as diverse as magnetohydrodynamic ( MHD ) turbulence, magnetic dynamos, magnetic reconnection, turbulent relaxation

Concomitantly to these developments, we have proposed a few results in [4, 14, 5] to explain the success of structured acquisitions by using spar- sity assumptions on the signal to

We extend the so-called Generalized Linear Sampling Method (GLSM) to the case of limited aperture 4.. data at a

A crucial step is to apply a FPCA (functional principal components analysis) on ob- servations of the explanatory random function, X i , in this selected stratified sample in order

complex priors on these individual parameters or models, Monte Carlo methods must be used to approximate the conditional distribution of the individual parameters given

Bayesian methods / mixed linear model / Dirichlet process prior / correlated random effects / Gibbs

Then, there are two natural ways to define a uniform infinite quadrangulation of the sphere: one as the local limit of uniform finite quadrangulations as their size goes to