• Aucun résultat trouvé

Measures of model adequacy only and of model adequacy and model

1.3 Measures

1.3.1 Measures of model adequacy only and of model adequacy and model

We introduce and compare in this Section the measures belonging to categories A, A&S and F&S (cf. Table 1.1).

1.3.1.1 Gelman and Pardoe (2006)

For a LMM with L variance components, Gelman and Pardoe (2006) presented two Bayesian measures that summarize information in the data at each “level” of the model.

We write level in quotation marks because it corresponds to the separate variance com-ponents, rather than to the more usual definition based on the hierarchy of the data. For instance, consider this varying-intercept model:

yij =β0+bi0+β1xij+ij, i= 1, . . . , m, j = 1, . . . , ni

bi0 ∼ N(0, τ2), ij ∼ N(0, σ2), (1.2)

with a predictor xij. (1.2) can be written hierarchically withβ0i =β0+bi0, as yij ∼ N(β0i+β1xij, σ2), β0i ∼ N(β0, τ2).

This model has two “levels” (data yij, interceptsβ0i) with a different variance component at each “level” (σ2, τ2).

The model is defined at each “level” l= 1, . . . , L as ζk(l) =νk(l)+e(l)k

for k = 1, . . . , K(l). ζk(1) corresponds to yij in (1.1) and for l > 1, ζk(l) are the random coefficients. νk(l) are the linear predictors and e(l)k are the errors that follow a distribution with mean 0 and standard deviation σ(l). For instance, for model (1.2) and for l = 1, we have ζk(1) = yij, νk(1) = β0 +bi0 +β1xij, e(1)k = ij and σ(1) = σ. For l = 2, we have ζk(2) =β0i, νk(2) =β0, e(2)k =bi0 and σ(2) =τ.

Subsequently, we suppress the superscripts (l), as in Gelman and Pardoe (2006), be-cause we work with each “level” separately. The variation explained by the linear pre-dictors νk for each “level” is defined in the population by 1−[E (Var(ek))][E (Var(ζk))]−1 and is computed as

R2“level” = 1− EVKk=1ek) EVKk=1ζk),

where “E” is the posterior mean, “Var” is the posterior variance, “V” is the finite-sample variance operator (Vmi=1(xi) = (m−1)−1Pmi=1(xix)¯ 2), and ˆek and ˆζk are the estimates of ek and ζk, respectively. The expectations are estimated by averaging over posterior simulation draws, which gives rise to a “Bayesian adjusted R2”, that is a generalization of the classical adjusted R2 in regression. R2“level” usually takes values between 0 and 1.

For each “level,” the values of 0 and 1 indicate, respectively, a poor and a perfect fit with respect to the error variance explained at each “level.” IfR2“level” is negative, the prediction is so poor that the estimated error variance is larger than the variance of the data.

The measure that summarizes the average amount of pooling at each “level” is the pooling factorλ“level”, which is defined in the population by 1−[Var (E (ek))][E (Var(ek))]−1 and is computed as

λ“level”= 1− VKk=1(E (ˆek)) EVKk=1ek).

This measure ranges from 0 to 1, where 0 and 1 correspond to no and, respectively, complete pooling. For model (1.2), the complete-pooling model is yij = β0+β1xij +ij

with common estimates ∀iand the no-pooling model is yij =β0i+β1xij+ij withm β0i’s estimated by least squares. A low pooling factor λ“level” < 0.5 indicates a higher degree of within-group information than population-“level” information. A high pooling factor λ“level”>0.5 indicates a higher degree of population-“level” information than within-group information.

1.3.1.2 Snijders and Bosker (1994)

Two measures of modeled variation, one at each level of a two-level LMM, are defined.

This model is equivalent to model (1.1), in which levels 1 and 2 correspond to the subject level j and group level i, respectively, Ri = σ2Ini and D = τ2Iq, where Iq is the q×q identity matrix. These measures are defined for two-level models only and they require a null model, which contains a fixed intercept and a random intercept, with variance τ02Iq, and for which Ri0 =σ20Ini.

The level-1 modeled proportion of variation is defined in the population as the propor-tional reduction in mean squared prediction error foryij, 1−(var(yijXijβ)) (var(yij))−1. The corresponding criterion is

R12 = 1− var(yd ijXijβ) var(yd ij) ,

where var is obtained by plugging-in the parameter estimates in the population formulad

and Xij is the 1×p vector of fixed effects for subjectj in group i.

The level-2 modeled proportion of variation is defined in the population as the propor-tional reduction in mean squared prediction error for ¯yi., 1−var(¯yi.Xi.β)(var(¯yi.))−1.

The corresponding criterion is

R22 = 1− var(¯d yi.Xi.β) var(¯d yi.) ,

where ¯yi. and Xi. are the group means of yij and Xij, respectively.

To estimate population parameters, the random effects and the errors are assumed normally distributed. For R21 and for R22 for balanced data (ni =n ∀i), the numerator is the sample variance of the model of interest and the denominator is the sample variance of the null model. For example, for model (1.2), the criteria that estimate the population parameters are R21 = 1−(ˆσ2+ ˆτ2)/(ˆσ02+ ˆτ02) andR22 = 1−(ˆσ2/n+ ˆτ2)/(ˆσ02/n+ ˆτ02), with ˆ

σ2, ˆτ2, ˆσ02 and ˆτ02, that are the estimates ofσ2, τ2, σ02 and τ02 respectively.

In the case of unbalanced data, the authors advise to use a representative value ofni, such as the harmonic mean m−1Pin−1i −1. The interpretation of the R21 and R22 is the same as the traditional coefficient of determination (Draper and Smith, 1998). R12 andR22 identify which predictors are useful to predict yij and ¯yi., respectively. Population values lie between 0 and 1. Negative values of the estimates are possible when the fixed part of the model is misspecified.

1.3.1.3 Vonesh et al. (1996)

The model concordance correlation coefficient for generalized nonlinear mixed-effects mod-els is defined as

rc= 1−

Pm

i=1(yiyˆi)0(yiyˆi)

Pm

i=1(yiy1¯ ni)0(yiy1¯ ni) +Pmi=1yiy1ˆ ni)0yiy1ˆ ni) +Nyy)ˆ 2. Initially introduced by L. I. Lin (1989) to measure the degree of agreement between pairs of observations, rcis interpretable as a concordance correlation coefficient between observed and predicted values.

To assess the GOF associated with the fixed effects and to select the best set of fixed effects, a marginal model concordance correlation is obtained by setting ˆbi = 0 in ˆyi. If bˆi are not set to zero, as in the original definition, rc is referred to as the conditional model concordance correlation and it assesses the GOF associated with fixed and random effects. The range of values of rc is between −1 and 1, as the usual Pearson correlation, but with a slightly different interpretation. Indeed, rc measures the level of agreement, or concordance, between yi and ˆyi, and a value of 1 indicates perfect fit, while a value smaller than, or equal to, zero indicates lack of fit. Adjusted values for the number of parameters in β are defined as rc,a= 1−N(N −p)−1(1−rc), which allows for using the conditional rc,a for model selection.

1.3.1.4 Vonesh and Chinchilli (1996)

Another measure of explained residual variation for generalized nonlinear mixed-effects models, that requires the specification of a null model is introduced as follows:

R2V C = 1−

Pm

i=1(yiyˆi)0V−1i (yi−ˆyi)

Pm

i=1(yiyˆi0)0V−1i (yi−ˆyi0),

for any positive definite matrix Vi. Sensible choices for Vi are either ˆRi or ˆRi0, the covariance estimates of Ri or Ri0, respectively obtained by plugging-in the parameter

estimates. For a measure that can be compared across different candidate models, the most reasonable choice is ˆRi0.

This measure can be interpreted as the proportional decrease in residual variability compared with the residual variability of a null model, which is either one containing only a fixed intercept, or one with a fixed intercept and a random intercept. As for rc, a marginal and a conditional R2V C can be defined, depending whether ˆbi in ˆyi are set to zero or not (original definition). The adjusted values are denoted R2V C,a and are defined as 1−N(N −p)−1(1−R2V C). As for rc, the marginal R2V C can be used for fixed effects selection and the adjusted conditional R2V C, R2V C,a, allows one for adopting it for model selection.

1.3.1.5 Zheng (2000)

For generalized LMMs with normally distributed random effects, we consider three mea-sures for assessing model adequacy among a set proposed by Zheng (2000). The first measure is the proportional reduction in deviance Drand, defined as

Drand = 1−

Pm

i=1di(yi,yˆi)

Pm

i=1di(yi,y1¯ ni),

where the numerator is the deviance under the model of interest and the denominator is the deviance under the null model that fits only a fixed intercept. For a fixed dispersion parameterφ, theN×1 vector of responsesy= (y1, . . . ,ym)0, theN×pdesign matrix for fixed effects X = (X1, . . . ,Xm)0, the mq×1 vector of random effects b = (b1, . . . ,bm)0, µ=E(y|X,b), and the log of the joint likelihood function given Xandb,l(µ, φ;y), the devianced(y,µ) is defined asd(y,µ) =−2φ[l(µ, φ;y)l(y, φ;y)]. Drandis an extension of the coefficient of determination, as, in the LMM with normal random effects and errors,

Pm

i=1di(yi,yˆi) = Pmi=1(yiyˆi)0(yiyˆi) and Pmi=1di(yi,y1¯ ni) = Pmi=1(yiy1¯ ni)0(yi

¯

y1ni). The measure Drand ranges between 0 and 1, with 0 meaning that the model of interest provides no improvement in prediction over the null model and 1 meaning perfect prediction.

The second measure is the proportional reduction in Penalized Quasi-Likelihood (PQL) Prand, defined as

Prand = 1−

Pm

i=1di(yi,yˆi)/(2 ˆφ) + ˆb0( ˆDIm)−1b/2ˆ

Pm

i=1di(yi,y1¯ ni)/(2 ˆφ) ,

where Im is the m×m identity matrix and ˆb = (ˆb1, . . . ,bˆm)0 is the mq×1 vector of predicted random effects. Prand ranges between 0 and 1, equals 0 when the prediction is poor and/or the random effects are large, and equals 1 when the prediction is perfect and the random effects are 0. Due to the penalty for a large number of random effects, Prand allows for model selection.

The third measure by Zheng (2000) is the concordance index c, a measure of cross-sectional agreement. It equals the proportion of pairs of observations with unequal y values for which the ranks of ˆy and of y are concordant. As c depends on ranking information only, it cannot distinguish between models that yield the same ranking of the fitted values.

In order to identify the best set of fixed effects, we further consider the marginal ver-sions of these three measures obtained by setting ˆbi = 0 in ˆyi for Drand and c, and, for

Prand, by setting ˆbi = 0 in ˆyi and in the second term of its numerator, which simpli-fies to the marginal Drand, showing that Prand is also an extension of the coefficient of determination.

1.3.1.6 Xu (2003)

For LMMs with normal errors with constant varianceRi =σ2Ini and with normal random effects, Xu (2003) presented three measures that require a null model specified either with only a fixed intercept, or with also a random intercept. For both null models, Ri0 is equal to σ20Ini.

The two measures of explained variation are defined as r2 = 1− σˆ2

ˆ σ02,

where ˆσ2 and ˆσ20 are the estimates of σ2 and σ20, and R2 = 1−

Pm

i=1(yiyˆi)0(yiyˆi)

Pm

i=1(yiyˆi0)0(yiyˆi0).

A measure of explained randomness using the conditional likelihood of the observed data given the predicted random effects is also proposed as

ρ2 = 1−σˆ2 ˆ σ02 exp

Pm

i=1(yiyˆi)0(yiyˆi)

ˆ2

Pm

i=1(yiyˆi0)0(yiyˆi0) ˆ02

!

.

These measures are interpretable as the classical coefficient of determination, with range [0,1]. Xu (2003) emphasized that in ordinary linear regression, if the REML esti-mates are used for the variance components, r2 is the adjusted R2. As an extension of the classical adjusted coefficient of determination, r2 can be used for model selection.

For fixed effects selection, we further consider the marginal version of R2 by setting bˆi = 0 in ˆyi.

1.3.1.7 H. Liu et al. (2008)

To estimate the portion of explained variation of the modeled data by the fitted LMM, H. Liu et al. (2008) introduced three R2 statistics. They assumed that the errors and the random effects are normally distributed with a constant variance Ri =σ2Ini for the errors. The measures compare the residuals of an intercept-only model with the residuals of the model of interest considering, respectively, the fixed effects only (R2F), the fixed and random effects (R2T), and the total fixed effects, where all variables and individuals are treated as fixed (R2TF):

R2F = 1−

Pm

i=1(yiXiβ)ˆ 0(yiXiβ)ˆ

Pm

i=1(yiy1¯ ni)0(yiy1¯ ni), RT2 = 1−

Pm

i=1(yiyˆi)0(yiyˆi)

Pm

i=1(yiy1¯ ni)0(yiy1¯ ni), R2TF = 1−

Pm

i=1(yi−(Xi,Zi) ˆη)0(yi−(Xi,Zi) ˆη)

Pm

i=1(yiy1¯ ni)0(yiy1¯ ni) ,

where ˆη =Pmi=1((Xi,Zi)0(Xi,Zi))−1(Xi,Zi)0yi. R2F is obtained by setting ˆbi = 0 in R2T. Thus, it can be seen as a marginal measure and can be used for fixed effects selection. And R2T can be seen as a conditional measure, as in Sections 1.3.1.3 and 1.3.1.4. H. Liu et al.

(2008) noted thatR2TF is defined more as being a theoretical measure of the upper bound of R2 rather than a practical R2 measure for modeling with the LMMs. We nevertheless consider it for completeness.

To adjust for the dimensions of the design matrices, adjusted values for R2F and R2TF, but not for R2T, are proposed: R2F,a = 1 − N(N − (p +q))−1(1− R2F) and R2TF,a = 1−N(N −rank((Xi,Zi)))−1(1−R2TF). R2TF,a can thus be used for model selection.

These measures usually range between 0, indicating complete lack of fit, and 1, indi-cating perfect fit. Negative values are also possible and would indicate that the model of interest provides no improvement, in terms of explained variation, over the intercept-only model.

1.3.1.8 Measures comparison

When the data yij are normally distributed in the LMM, some of the measures discussed above are equivalent. This is the case for (a) the marginal R2V C, the marginal Drand, the marginal Prand, the marginal R2 and R2F; (b) the conditional R2V C and R2 based on the null model with a fixed intercept, Drand and R2T; and (c) the conditional R2V C and R2 based on the null model with a fixed intercept and a random intercept. When we consider the adjusted values for the marginalR2V C, the conditionalR2V C’s andR2F, these equalities no longer hold.

Some other measures, while they are not equivalent, are defined similarly and will thus give close values. This is the case for (a) the two measures of Snijders and Bosker (1994); (b) the marginal rc, the marginal R2V C, the marginal Drand, the marginal Prand, the marginal R2 and R2F; (c) the conditional rc, the conditionalR2V C,R2 and ρ2 based on the null model with a fixed intercept, Drand,Prand,R2T and R2TF; and (d) the conditional R2V C,R2 and ρ2 based on the null model with a fixed intercept and a random intercept.