HAL Id: hal-00894160
https://hal.archives-ouvertes.fr/hal-00894160
Submitted on 1 Jan 1997
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
On biased inferences about variance components in the
binary threshold model
C Moreno, D Sorensen, La García-Cortés, L Varona, J Altarriba
To cite this version:
Original
article
On biased inferences about variance
components
in the
binary
threshold
model
C Moreno
1
D
Sorensen
LA García-Cortés
1
L Varona
J
Altarriba
1Unidad de Genética Cuantitativa y
Mejora Animal,
Facultad deVeterinaria,
Universidad de
Zaragoza, Miguel
Servet177,
50013Zaragoza, Spain;
2
Danish
Instituteof
AnimaLScience,
Research CenterFoulum,
PO Box39,
DK-8830Tjele,
Denmark(Received
2September
1996;accepted
6February
1997)
Summary - A simulation
study
was conducted tostudy frequentist
properties of three estimators of the variance component in a mixed effectbinary
threshold model. The three estimators were: the mode of a normal approximation to themarginal posterior
distribution of the component, which is denoted in the literature as
marginal
maximum likelihood(MML);
the mean of themarginal
posterior distribution of the component,using
the Gibbssampler
toperform
themarginalisations
(GSR);
andthird,
the mode of thejoint
posterior
distributions of location and the variance parameter, used inconjunction
with the iterativebootstrap
bias correction(MJP-IBC).
The latter wasrecently proposed
in the literature as a method to obtainnearly
unbiased estimators. The results of thisstudy
confirm that MML canyield
biased inferences about the variance component, and that thesign
of the biasdepends
on the amount of information associated with either fixed effects or with random effects. GSR canproduce positively
biased inferences when the amountof data per fixed effect is small. When fixed effects are
poorly estimated,
the biaspersists,
despite
the fact thatposterior
distributions areguaranteed
to be proper, and that the amount of information about the variance component islarge.
In this case, themarginal
posterior
distribution of the variance component ishighly peaked
andsymmetric,
but it shows a shift towards theright
with respect to the true(simulated)
value. This bias can be reducedby assigning
a Gaussianprobability density
function to theprior
distribution of the fixedeffects,
but this strategy does not work with very sparse data structures. The method based on MJP-IBCyielded
unbiased inferences about the variance component in all the cases studied. This estimator iscomputationally simple,
but contrary to GSR with normalpriors
for the fixedeffects,
can lead to estimates that fall outside the parameter space.binary traits
/
Gibbs sampling/
biased estimators/
bootstrap*
Résumé - À propos des biais d’estimation de composantes de variance dans le modèle binaire à seuil. Une étude de simulation a été conduite pour étudier les
propriétés
fréquentistes
de trois estimateurs de la composante de variance dans un modèle mixte à seuils pour variable binaire. Les trois estimateurs ont été : le mode del’approximation
normale de la distribution
marginale
aposteriori
de la composantequi
estappelée
dans la littérature « maximum de vraisemblancemarginale
» (MML),
l’espérance
de distributionmarginale
a posteriori de la composante, en utilisantl’échantillonnage
de Gibbs(GSR)
et
finalement,
le mode de la distributionconjointe
a posteriori desparamètres
de positionet de variance, utilisé avec une méthode itérative de correction des biais par
bootstrap
(MJP-IBC).
Cette dernière a été récemmentproposée
dans la littérature comme uneméthode d’obtention d’estimateurs approximativement non biaisés. Les résultats de l’étude
confirment
que le MML peut donner desinférences
biaisées à propos de la composantede variance et que le
signe
du biaisdépend
de laquantité
d’information
attachée auxeffets fixes
ou auxeffets
aléatoires. Le GSR peutproduire
des estimées à biaispositif
quand il
y a peud’information
par niveaud’effet fixe.
Dans ce cas, le biais persiste mêmequand l’information
sur la composante de variance est importante etquand
la distributionpostérieure
est propre. La distributionmarginale
aposteriori
est hautement concentrée etsymétrique
mais montre undécalage
à droite de la vraie valeur(simulée).
Ce biais peut être réduit en considérant que la distribution a priori deseffets fixes
est de typegaussien
mais cettestratégie
n’est pase,!cace
pour les structures de données oùl’information
est trèsdispersée.
La méthode MJB-IBC aproduit
des estimées non biaisées de la composante de variance dans tous les cas étudiés. Cet estimateur estsimple
à calculer mais contrairement à GSR avec deseffets fixes
à distributiongaussienne,
elle peut mener à des estimées situées en dehors del’espace
desparamètres.
caractères binaires
/
échantillonnage de Gibbs/
estimateurs biaisés / bootstrapINTRODUCTION
Ordered
categorical
traits are oftenanalysed
by
means of the thresholdliability
model,
first usedby Wright
(1934)
in studies of the number ofdigits
inguinea
pigs,
andby
Bliss(1935)
intoxicology experiments.
In the thresholdmodel,
it ispostulated
that there exists a latent orunderlying
variable(liability)
which hasa continuous distribution. A response in a
given category
is observed if the actualvalue of
liability
falls between the thresholdsdefining
theappropriate
category.
Herewe will consider the situation where the
liability
is modelled as a linear function offixed effects and
possibly
correlated random effects. In atypical
likelihoodsetting,
one may be interested inmaking
joint
inferences about estimable functions of fixedeffects and on variance
components
associated with the random effects. Within aBayesian framework,
one may concentrate inferences on themarginal
posterior
distributions of contrasts of ’fixed’ effects and of the variance
components,
andpossibly
on themarginal posterior
distributions of the random effects. In agenetic
context,
the latter involvetypically
additivegenetic
values andknowledge
about these isimportant
to carry out selection decisions and to describe response to selection. Other inferences of interest mayrequire computing probabilities
that observations fall in agiven
category
forgiven
individuals or combinations ofparameters.
All these inferential
problems,
whether in a likelihood or in aBayesian
setting,
require
marginalisations.
Forexample,
computation
of the likelihood or of thejoint
with
respect
to the random effects.Although
theseintegrals
do not have a closedform
solution,
numericalintegration using quadrature
is feasible when the random effects are uncorrelated(Anderson
andAitkin,
1985).
When random effects arecorrelated,
as is the case with animalmodels,
these numericalintegration
techniques
cannot be used. To circumvent this
problem,
alarge
number ofapproximations
and different
approaches
has beensuggested
in the literature(see,
forexample,
McCullagh
and Nelder(1989)
for a discussion of some ofthese,
andFoulley
et al(1990)
andFoulley
and Manfredi(1991)
for a review of theanalysis
ofcategorical
traits in an animal
breeding
context).
The introduction of Markov chain Monte Carlo methods allows
computation
of multidimensionalintegrals
andanalytic
approximations
can therefore be avoided.The use of the Gibbs
sampler
toanalyse
orderedcategorical
traits has beenreported
by Zeger
and Karim(1991)
andby
Albert and Chib(1993),
using
aBayesian
perspective.
McCulloch(1994)
applies
the Gibbssampler
to estimate variancecomponents
forbinary
data in a likelihoodsetting.
In an animalbreeding
context,
the Gibbs
sampler
wasapplied
to theanalysis
ofcategorical
traitsby
Moreno(1993)
andby
Sorensen et al(1995),
both from aBayesian
perspective.
Zeger
et al(1988)
studied non-linear models for theanalysis
oflongitudinal
data,
and drew attention to thedependence
between themarginal
expectation
of the data and the variance of the random effects. In the context of ageneralized estimating
equations approach,
they
showed that estimators of the fixed effects and of the variancecomponent
associated with the random effects are not evenasymptotically
orthogonal,
asthey
are in the Gaussian model. A similardependency
in the caseof the
logistic
mixed model was shownby
Drum andMcCullagh
(1993)
in the context of restricted maximumlikelihood,
where the latter was constructedusing
error contrasts. Thisapproach
toconstructing
the restricted likelihoodgenerates
anequation
that is not a function of the fixed effects in the case of the Gaussianmodel,
but not in the case of non-linear models. The association between the fixed effectsand the variance
component
indicates that inference about the latter isdependent
upon the amount of information in the data with which fixed effects are estimated.
The
study
of Moreno(1993)
reported
frequentist
properties
of the mean ofthe
marginal
posterior
distribution of the variancecomponent
in abinary
mixedthreshold model. In this
study,
a sire model was assumed. Moreno(1993)
notedthat when there is little information about fixed
effects,
the mean of themarginal
posterior
distribution of the variancecomponent
is biasedupwards.
This is so evenin cases where there are very
large
numbers of sires andoffspring
persire,
andconsequently
thismarginal
posterior
distribution ispeaked
andsymmetric.
It is well known that when the data associated with aparticular
fixed effect fall allwithin the same
category
(known
as the extremecategory
problem
(ECP),
the likelihood is underidentified(eg,
Misztal etal,
1989;
Gelman etal,
1995).
In thissituation,
in aBayesian
setting,
theassignment
ofimproper prior
distributions tothe ’fixed effects’ can lead to
improper
posterior
distributions.However,
we showthat the bias related to inference about the variance
component
when there is little information about fixed effectspersists,
when all theparameters
of the model areThis bias
problem
has been thesubject
of an extensive simulationstudy by
Hoeschele and Tier
(1995).
Theseauthors,
who used the Gibbssampler
with dataaugmentation
(Tanner
andWong,
1987),
confirm the results of Moreno(1993),
and indicate thatpart
of theproblem
can be alleviatedby
assuming
that ’fixed effects’follow a
priori
a Gaussian distribution. As we showbelow,
treating ’fixed
effects’ asif
they
were ’random’ can reduce the bias associated with the mean of themarginal
posterior
distribution of the variancecomponent,
but this is not astrategy
thatalways
works.An alternative
approach
to inferences in the mixed threshold model waspre-sented
by
Tempelman
(1994).
The method isquite
general
and is based on anapproximation
to themarginal
posterior
distribution of the variancecomponent,
where the
marginalisation
is obtainedusing Laplacian integration.
Thecomputa-tional burden of this method amounts to
obtaining
repeatedly
the determinant of the Hessian of thejoint posterior distribution,
conditional on the current value ofthe variance
component.
The dimension of the Hessian isequal
to the number of ’fixed’ and random effects in the model. In a small simulationstudy,
Tempelman
(1994)
showed that the methodperformed
well in a siremodel,
butyielded
biased estimates of the variancecomponent
in an animal model.Breslow and Lin
(1995)
haverecently developed
methods toadjust
for the bias inapproximate
estimators of the variancecomponent
andregression
coefficients forgeneralised
linear mixed modelshaving
one source of random variation. The biascorrection
applies
toapproximations
based onpenalised
quasilikelihood
and holdsfor
generalised
linear mixed modelshaving
canonical linkfunctions,
the latterbeing
a condition that is not satisfied in the case of the
probit
mixed model studied inthis work.
Kuk
(1995)
proposed
an alternative iterative methodusing
thebootstrap
to obtainasymptotically
unbiased estimators. Theappealing
aspect
of thisapproach
is that in
principle
it can beapplied
to any estimator.Though computationally
intensive,
the method can beimplemented
in astraightforward
manner.The
objective
of this work is tostudy
frequentist
properties
of threeapproaches
tomaking
inferences about variancecomponents
in mixedbinary
threshold models. One of theseapproaches,
which does notrely
onanalytic
approximations,
is basedon the mean of the
marginal posterior
distribution of the variancecomponents.
Westudy
itsproperties using
different data structures andassuming
either proper orimproper prior
distributions for the ’fixed’ effects and for the variancecomponent.
Wereport
alsofrequentist
properties
of acomputationally simple
estimator basedon the mode of the
joint posterior
distribution of location and scaleparameters,
used in
conjunction
with the iterative bias correctionusing
thebootstrap proposed
by
Kuk(1995).
MATERIAL AND METHODS
Model for the
analysis
ofbinary
responsesA Bernoulli random variable yg is recorded for observation
j(j
=1,...,!,)
insubclass
i(i
=1, ... ,
s),
and takes the value y2! = 1 if!i!
> t ory2! = 0
otherwise,
Conditionally
on fixed effects(herds) (b,
of orderp)
and on random(sire)
effects(u,
of orderq),
the!z!
areindependently
andnormally distributed,
ie,
N(x’b
+z!u, 1),
and the Yij areindependent
withPr(yg
=O!b, u) _
4l(t -
xib - z!u).
The
symbol
4l(.)
denotes the standard normal cumulative distribution function andx!(z!)
denotes the ith row of the knowndesign
matrixX(Z).
The subclass i is defined as the herd-sire combination i. We will denote theprior
distribution forthe fixed effects as
p(b).
Invoking
an infinitesimalmodel,
the vector of sire effectsis multivariate
normally
distributed:ulA, a 2 -
N(O,
A
U2
),
where A is a known qby
q matrix
af additivegenetic relationships
amongsires,
andQ
u
is
the unknown variance between sires. Theprior probability density
forQ
u
will
be denotedp(o, u 2).
Under themodel,
the conditional distribution of the data y,given
b and u is:where nil is the number of observations in class i with !2! = 0 and n22is the number
of observations in class i with y2! = 1. The
joint posterior
distribution isgiven
by:
Unless otherwise
stated,
p(b)
is assumed uniform in the interval[t
-5,
t +5!,
andp(
Q
u)
is assumed uniform in the interval!0,1/3!.
Consequently,
in these cases,posterior
distributions are proper.As is well established
(Cox
andSnell,
1989),
in thebinary
threshold model neither the residual variance nor the threshold can be estimated. In thiswork,
thesimulated data are
analysed
using
a model in which the residual variance is setequal
to1,
and the value of the threshold isarbitrarily
setequal
to the simulated value.Methods of inference
In this section we
briefly
describe the three methods of inference studied in thiswork. These are,
first,
marginal
maximum likelihood(MML)
(Foulley
etal,
1987).
The second method is based on an estimator of the mean of themarginal
posterior
distribution of the sire variance.
Here,
marginalisations
were obtainedusing
the Gibbssampler applying
anadaptive rejection sampling algorithm
due to Gilks andWild
(1992) (GSR).
The third method is based on the mode of thejoint posterior
distribution
!2!,
used inconjunction
with the iterativebootstrap
bias correction(MJP-IBC).
The method based on MML was chosen because it is one of the most common
methods used in the
analysis
ofcategorical
traits. Method GSR does notinvolve
analytic
approximations
and therefore it isinteresting
to compare it to MML undera
variety
of data structures. As we showbelow,
both MML and GSR in the formimplemented
in thiswork,
can lead to biasedinferences,
even in cases with alarge
Marginal
maximum likelihood(MML)
This method is based on
locating
the modal value of anapproximation
to Inp(QuIY)!
the
logarithm
of themarginal
posterior
distribution of the variancecomponent.
As shown in
Foulley
et al(1987)
andapplying
their result for the caseQ
u
u 2 _
Uniform [0, 1/3]:
The
expression
on theright
hand side of[3]
involvesfinding
theexpected
value ofu’A-l
u,
where thisexpectation
is taken withrespect
to the distribution ofuly,
afl .
That is:In the Gaussian
model,
uJy, (7!
is
normal,
and theexpectation
in[4]
involves the solution of Henderson’s(1973)
mixed modelequations.
Thisyields
E(uly, (7!),
and the inverse of the coefficient matrix of the mixed modelequations,
yields
Var(uly,
(7!).
In the context of thepresent
binary
model, the
form of the distribution ofuly, (7!
is
not known. MML consists ofapproximating
E(uly, (7!)
by
the mode ofp(b, uJy, (7!)
andVar(uly, (7!)
by
the inverse of minus theexpected
value of the second derivatives ofp(b, uly, (7!)
withrespect
to b and u.Setting
[3]
equal
to zero and
solving
for theunknown,
leads to an iterative scheme which involvesthe solution of a non-linear
system
reminiscent of Henderson’s(1973)
mixed modelequations.
Theequation
for(7! requires
thecomputation
of:where
C
uu
is thepart
of thecomplete
inverse of the coefficient matrixcorresponding
to the uby
u block.This method is known to
yield
biased inference in smallsamples
withrespect
to the variancecomponent
(Hoeschele
etal, 1987; Dellaportas
andSmith,
1993;
Engel
etal,
1995).
Thesign
of the biasdepends
on the data structure(Hoeschele
andTier,
1995;
Engel
etal,
1995).
With alarge
amount of information on fixed effectsand little information on random
effects,
the bias isnegative.
When the amount ofinformation on the random effects is
large,
but fixed effects arepoorly
estimated,
the bias is
positive.
This behaviour of the estimator can lead to situations whereno bias is detected.
Mean of the
marginal posterior
distribution of sirevariance,
using
theGibbs
sampler
(GSR)
of all the
parameters
of the model. The literature on Markov chain Monte Carlomethods is very
large
and many theoretical andapplied
issues can be found in the recentpublication
of Gilks et al(1996).
In many
applications,
thefully
conditionalposterior
distributions areknown,
and the
implementation
of the Gibbssampler
isstraightforward.
This is the case ofthe
binary
threshold mixedmodel,
when the Gibbssampler
is used inconjuction
with dataaugmentation,
as in Albert and Chib(1993).
If thetechnique
of dataaugmentation
is notused,
thepresent
model leads tofully
conditionalposterior
distributions of the locationparameters
that are not of standard form. In this case, othersampling
schemes can beimplemented,
such as theadaptive rejection
sampling
proposed
by
Gilks and Wild(1992).
This is theapproach
chosen in this paper. Details of thefully
conditionalposterior
distributions can be found in theAppendix.
We followclosely
theadaptive rejection
sampling
algorithm
as describedin
Dellaportas
and Smith(1993).
The
implementation
of GSRrequires
draws from thefully
conditionalposterior
distributions of the
parameters.
One of these is thefully
conditionalposterior
distribution of the sire
variance,
p (
0
,
2
u 1 b,
u,y),
which is in the form of a scaledinverted
chi-square.
Theheritability,
h
2
=40&dquo;!
u / (0,2
+
1) ,
was constructed for each value drawn fromp(a u
2
lb,
u, y).
The Monte Carlo estimate of the mean of themarginal
posterior
distribution ofheritability
wascomputed
as the meanh
2
over the Gibbssamples.
In thepresent
study,
differences between mean, mode and median of themarginal
posterior
distribution ofheritability
werenegligible
and results basedon the mean
only
arereported.
The Gibbssampler
wasimplemented
using
severalchains,
and all thesamples
in each werekept.
Burn inperiods
were determined on apragmatic
basis. The criterion of Gelman and Rubin(1992)
wasadopted
to determine convergence.The Gibbs
sampler
allows thedrawing
of inferencesusing
afully Bayesian
approach,
withoutresorting
toanalytical
approximations.
Asymptotically,
posterior
distributions arenormal,
with meanequal
to the maximum likelihood estimator. It seems relevant tostudy
the behaviour of thisapproach
under different datastructures,
in view of the fact thatanalytic approximations
are not involved.Mode of
the
joint posterior
distribution followedby
the iterative boost-rap bias correction(MJP-IBC)
The first
step
of this method is to find the mode of thejoint posterior
distribution of location anddispersion
parameters
!2!,
under theassumption
that theprior
ofp(
0
,2)
follows a scaled invertedchi-square
distribution. The latter is of the form:where
q
*
andu*
2
areparameters
of theprior
distribution. In thepresent
study,
q
*
the
resulting
joint posterior
withrespect
too, u 2,
setting
to zero andsolving
foror2 U )
produces
the iterative scheme:as shown
by
Hoeschele et al(1987),
whereu
is the mode ofp(b,
ula!,y)
as inMML. Notice that in contrast to
!5!, [7]
does notrequire
knowledge
of the inverse of any matrix and is very easy tocompute.
Use of a properprior
distribution foro
l2
is
important,
because with animproper
uniformprior,
this method is known toyield
zero estimates forQ
u
(Lindley
andSmith,
1972).
The second
step,
consists ofapplying
the iterative bias correction that wasproposed by
Kuk(1995).
Thisprocedure
can be described as follows. Let 0 bethe
parameter
ofinterest,
and lete
represent
an initial estimate obtained as asolution to a
system
ofestimating equations
!(0!y),
where y are the data that areassumed to follow a distribution
f(y!0).
Thesystem
ofequations
!(9!y)
could bethe score
function,
forexample,
or someapproximation
to it. In thepresent
case,ljJ(ely)
is thesystem
ofequations
that results from thejoint
maximization of theposterior
distribution withrespect
to location anddispersion
parameters.
LetA
*
,
theexpected
value of the estimator ofe,
satisfy:
If the estimator of 0 is
biased,
theasymptotic
bias isgiven
by:
where
g(e)
=e
*
.
Kuk(1995)
proposes thefollowing
correction for this bias. Let(3!°!
be an initial estimate of the bias of0.
Then define anupdated
estimate of the bias as:... I- 1 .1 I- . 11
and define an
updated
bias-corrected estimate of 0 as:Kuk
(1995)
shows that this estimator isasymptotically
consistent. Since the functiong(8)
is notusually known,
Kuk(1995)
proposes toapproximate
g(8)
by
g
M
(8)
!
which is the average ofe
over simulatedsamples.
That is:The simulated values yi,..., ym are obtained from
f (y!0),
where in the kth round ofiteration,
e is set to 8 -(3!).
The
implementation
of the aboveprocedure using
method
(MJP-IBC)
was asfollows.
First,
estimates of location and scaleparameters,
e,
are obtained from thejoint
maximization of theposterior
distribution!2!,
conditional on theoriginal
data.
Second,
simulatedsamples
are generated
fromf(ylO),
where e is set to the value of these initial estimates6.
Third,
6(y
i
)
are obtained from thejoint
maximization of theposterior
distribution[2],
conditional on the ith simulatedsample
(i
=1, ... ,
m).
Fourth,
anaverage of
8(
Yi
)
over the msamples
iscomputed,
which is denotedby
g
M
(9),
and the estimate of the bias is obtained from[13].
Finally,
6
is correctedusing
!14!,
which is used togenerate
a new set of simulatedsamples.
This iterative scheme isrepeated
until convergence is achieved.In the presence of the extreme
category
problem
(ECP),
it wasrequired
thatthe
proportion
of ECPs In the simulatedsamples
was the same as in theoriginal
data. This was achieved
by adjusting
the values of the fixed effects associated with ECPs. Anotherpoint
tohighlight
in theapplication
of MJP-IBC in models withECPs is that the correction in
expression
[14]
involves thecomponent
of variance and the non-ECP fixed effectsonly.
Moreover,
if in a simulatedsample
yi, an ECPis
generated
for afixed
effect that was a non-ECP fixed effect in theoriginal data,
then the element in6(y
i
)
associated with this fixedeffect,
is set to the value that ’was used to simulate this fixed effect in the
sample
yi, and not to its estimated value. This ad hocstrategy
was arrived at on a trial and errorbasis,
and it wasfollowed because it avoided
problems
of lack of convergence. Otherstrategies
could beenvisaged,
one suchbeing
based onassigning
mildly
informativepriors
to the ’fixed’ effects.Simulation
study
A Monte Carlo simulation
study
was carried out in order to evaluate the threemethods of inference under a
variety
of data structures. The criteria used toassess the
performance
of the methods were bias and mean square error. The data were simulated as follows.Liability
was drawn from a normaldistribution,
withmean
equal
to the fixed effectcorresponding
to the individual inquestion
and variancegiven by
(72
+
o,,2,
whereU2
is
the sire variance andU2
was setequal
to 1.Only
one set of fixed effects were simulated(herds),
and these were drawn fromuniform distributions within the interval
(-2, 2).
Sires were drawn from a normaldistribution,
with mean(1/2)v,P
and variance(3/4)
Q
u,
where up is the realizedvalue of the sire of the individual. Base
population
sires were drawnassuming
a mean of zero and varianceu U. 2
Threenon-overlapping
generations
of data(with
noThe data at the
phenotypic
scale weregenerated
as follows:where the value of the known threshold
t,
was setdepending
on the desired level ofincidence.
The simulations of the
original
data were carried out such that theresulting
structures led to cases with different amounts of information associated with the
parameters
of the model. The simulation results were all based on 50replicates.
RESULTS
Estimates of
heritability
using
MML and GSR(mean
of themarginal
posterior
distribution ofheritability)
for three data structures are shown in table I. Theanalyses
based on GSR assumed the proper uniformpriors
for the ’fixed’ effect andthe variance
component,
as described in Modelfor
theanalysis
of binary
responses.The structure Al is one in which there is
adequate
information in the data tomake inferences about all the
parameters
of the model. There is no ECP in this structure and the incidence is set at60%.
Both MML and GSRyield
estimates that on average are ingood
agreement
with the true value. In structure A2 there isadequate
information related to fixedeffects,
but there isrelatively
less informationassociated with the random effects. In A2 there are no ECPs. In structure A3 there
are
ECPs,
but there is alarge
amount of data associated with the non-ECP fixedeffects and there is little information about the random effects. As
expected,
the MMLyields
estimates ofheritability
that are biased downwards. GSR does not showsigns
of bias. Structures A4 and A5represent
cases where there is little informationabout fixed effects. The
percentage
of ECPs ishigher
under structure A5(65%
inA5 versus
25%
inA4).
Both methodsproduce positively
biasedresults;
the biasTable II shows estimates of the mean of the
marginal posterior
distribution of theheritability
using
GSR under avariety
of datastructures,
andperforming
theBayesian analysis
on the basis of proper andimproper posterior
distributions. Thepoint
of the results in the table is to draw attention to thetype
of data structures that can cause GSR toyield
biased inferences about the variancecomponent
and to illustrate that the bias is notnecessarily
related to theimpropriety
of theposterior
distribution. In cases Bl and
B3,
the datacomprise
unrelated sire families andan
improper
uniformprior
wasassigned
to the fixed effects and to the variancecomponent
in such a way that the conditionsrequired
toguarantee
that theposterior
distribution is proper(Natarajan
andMcCullogh,
1995)
are not met.Although
the ECP ispresent
at a level of27%
inBl,
with alarge
number offamilies and a
large
number of observations perfamily
and per fixedeffect,
the estimate of the mean of themarginal
posterior
distribution ofheritability
over the 50 Monte Carloreplicates
showsgood
agreement
with the true value. This result isdisturbing
since thesampler
retrieved ameaningful
estimate from aposterior
distribution which is not
guaranteed
to beproper!
Thispoint
is discussed atlength
in Casella(1996).
In cases B2 and B3 there is a limited amount of information per random effect and per fixed effect. The ECP is not
present.
The difference between these two casesis that under
B2,
GSR wasimplemented
with properpriors
for all theparameters,
which leads to a proper
joint posterior
distribution.However,
the estimatesusing
GSR are biased in both cases and the size of the bias is similar. Thus the presence
of bias does not seem to be
necessarily
related to the fact that theposterior
distribution isimproper.
In
B4,
theanalysis
was carried outassigning
properpriors
to all theparameters
and there are no ECPs(the
marginal
posterior
distribution ofheritability
isguaranteed
to beproper).
In common with cases B2 andB3,
there arerelatively
few observations per fixed and random effect. Relative to cases B2 and
B3,
caseof the
heritability
is almost three times smaller than in cases B2 and B3.Here,
the
marginal
posterior
distribution ofheritability
ishighly peaked
andsymmetric,
but is shifted to theright
relative to the true value of 0.5. One of the 50replicates
wasarbitrarily
chosen and this distribution is shown infigure
1.Despite
thelarge
number of sire
families,
the absence of ECPs and the fact that themarginal
posterior
distribution is proper, the bias
using
GSRpersists.
Table III shows
comparisons
of GSR and MJP-IBC. Method GSR wasimple-mented
assigning
normally
distributedpriors
for the fixed effects and the proper uniform distribution in the interval!0, 1/3!
for theprior
sire variance.Cases Cl and C2 have the same data structure as B2 in table
II,
where there arerelatively
few observations per ’fixed’ effect but no ECPs.Using
GSR andassigning
a normal
prior
to the ’fixed’ effects is effective inremoving
the bias. Data structures C3 and C4 are identical to A4 and A5. These are characterisedby
few observationsper ’fixed’
effect,
but with ECP.Again,
assigning
a normalprior
to the ’fixed’ effectsproduces
estimates ofheritability
in betteragreement
with the true value. In all these four cases, MJP-IBCyields
estimates ofheritability
ingood
agreement
with the true value. The mean square errors of estimates based on GSR and on MJP-IBCare of similar
magnitude.
In data structure
C5,
thefrequency
of ECP is80%
and the incidence is90%.
The
strategy
ofusing
GSRtreating
fixed effects as randomyields
estimates that areDISCUSSION
The purpose of this work has been to draw attention to the poor
frequentist
properties
of the estimator of the mean of themarginal posterior
distribution ofvariance
components
in mixed threshold models when there is little information associated with the fixed effects. The simulation results indicate that this bias is notnecessarily
related to theimpropriety
ofposterior
distributions.Indeed,
we haveshown that biased inferences result in cases when the
posterior
isguaranteed
to beproper, and
agreement
with the true value was obtained when thepropriety
of theposterior
distributions cannot beguaranteed.
Further,
this is seen as aproblem,
because the bias
persists
in situations when thismarginal
posterior
distribution ishighly peaked,
symmetric
andguaranteed
to be proper. In otherwords,
this is not the bias to beexpected
from a non-linear estimator in a smallsample setting.
The
type
of data structures that can lead to thisproblem
isubiquitous
in animalbreeding.
This is characterisedby
models with alarge
number of fixedeffects,
withrelatively
few observations per fixed effect. As we haveshown,
the biased inferencesassociated with the variance
component
are notnecessarily
related to the extremecategory
problem, although
this can be conducive to then.The
inability
to obtain a closed form solution to themarginal
posterior
distri-bution of the variance
component
hasprecluded
us fromfully understanding
thecause of the bias. The lack of
asymptotic independence
between the estimator of thefixed effect and of the variance
component,
either in the context of thegeneralized
estimating equation
studiedby
Zeger
et al(1988)
or in the context of the likelihood of error contrastsreported
by
Drum andMcCullagh
(1993),
does notnecessarily
castlight
on thepresent
problem.
In aBayesian setting,
one ismarginalising
withrespect
to the fixedeffects,
andprovided
that one does not stumble intoproblems
of
impropriety
ofdistributions,
it is not clear to us how assertions that are validA
strategy
that can beinteresting
toexplore
is to run the Gibbssampler
after
having integrated
the fixed effects outanalytically
from thejoint posterior
distribution. The relevance of this is based on the observation that
running
thesampler
conditioning
on the true values of the fixedeffects, yields
correct inferencesabout
heritability
(Moreno, unpublished).
Using
GSR buttreating
fixed effects asrandom,
removes much of the bias ininference about variance
components,
as shownby
Hoeschele and Tier(1995).
However,
we have illustrated that this alternative does notalways
work. In suchcases, it is useful for the researcher to have a choice of other methods to solve the
problem.
The iterativebootstrap
bias correction is analternative,
computationally
simple method,
that showsgood
frequentist
behaviour. In thiswork,
we chose touse it in
conjunction
with the mode of thejoint posterior distribution,
due to itscomputational
simplicity.
Westress,
however,
that the iterative bias correction canin
principle
beapplied
to anymethod,
but if it isused,
forexample, together
withthe Gibbs
sampler,
computing
time can become alimiting
factor. Animportant
point
to stress is that theprice
to pay for unbiasedness is that estimates can falloutside the
parameter
space. This will never be the caseusing
GSRtreating
fixedeffects as
random,
orusing approximations
to themarginal posterior
distributionof the variance
component.
It wouldcertainly
beinteresting
to extend thestudy
ofTempelman
(1994)
and toexplore
the behaviour of alternativecomputational
forms of theLaplacian approximation
further.The
present
simulationstudy
was based on the sire model where each sireeffect is
represented by
several observations. A more relevant model in an animalbreeding
context is the individual animalmodel,
where each additivegenetic
value is associated at the most with one observation. In the case of the animalmodel,
the bias
problems
discussed in this paper areaccentuated,
as illustrated in thesimulation
study
of Hoeschele and Tier(1995).
Thestudy
ofTempelman
(1994)
also showed poor behaviour of theLaplacian approximation
in the case of theanimal model. It is
precisely
in such a situation that methods which control thisbias such as the MJP-IBC
implemented
here may be useful.ACKNOWLEDGEMENTS
The authors are very
grateful
to JOcana,
whosuggested
thebootstrap
tostudy
theproblem
of bias on inferences about variance components in threshold models. Wegratefully acknowledge
the support of theDiputaci6n
General deArag6n
(Grant 2-2-94)
and ofCaja
de Ahorros de la Inmaculada(Grant
CA396).
Three anonymous reviewers made usefulsuggestions.
REFERENCES
Albert
JH,
Chib S(1993)
Bayesian analysis
ofbinary
andpolychotomous
response data. J Am Stat Assoc88,
669-679Anderson
DA,
Aitkin M(1985)
Variance component models withbinary
response: inter-viewervariability.
J R Stat Soc B47,
203-210Bliss CI
(1935)
The calculation of thedosage-mortality
curve. AnnA
P
pd
Biol 22, 134-167 BreslowNE,
Lin X(1995)
Bias correction ingeneralised
linear mixed models with asingle
Casella G
(1996)
Statistical inference and Monte-Carloalgorithms.
Test 5, 1-30 CoxDR,
Snell EJ(1989)
Analysis of Binary
Data.Chapman
andHall,
LondonDellaportas P,
Smith AFM(1993)
Bayesian
inference forgeneralized
linear and propor-tional hazards models via Gibbssampling. Appl
Stat42,
443-459Drum
ML, McCullagh
P(1993)
REML estimation with exact covariance in thelogistic
mixed model. Biometrics49,
677-689Engel B,
BuistW,
Visscher A(1995)
Inference for threshold models with variancecomponents from the
generalized
linear mixed modelperspective.
Genet Sel Evol27,
15-32Foulley JL,
ImS,
GianolaD,
Hoeschele I(1987)
Empirical Bayes
estimation of parametersfor n
polygenic binary
traits. Genet SelEvol 19,
197-224Foulley JL,
GianolaD,
Im S(1990)
Genetic evaluation for discretepolygenic
traits in animalbreeding.
In: Advances in Statistical Methodsfor
GeneticImprovement of
Livestock(D
Gianola,
KHammond,
eds),
Springer-Verlag,
NewYork,
361-409Foulley JL,
Manfredi E(1991)
Approches statistiques
de 1’evaluationg6n6tique
desreproducteurs
pour des caract6res binaires a seuils. Genet Sel Evol 23, 309-338 GelmanA,
Rubin DB(1992)
Asingle
series from the Gibbssampler provides
a falsesense of
security.
In:Bayesian
Statistics IV(JM
Bernardo,
JOBerger,
APDawid,
AFM
Smith,
eds),
OxfordUniversity Press, Oxford,
625-631Gelman
A,
CarlinJB,
SternHS,
Rubin DB(1995)
Bayesian
DataAnalysis. Chapman
andHall, London,
1st edGilks
WR,
Wild P(1992)
Adaptive rejection sampling
for Gibbssampling. Appl
Stat 41,337-348
Gilks
WR,
RichardsonS, Spiegelhalter
DJ(1996)
Markow Chain Monte-Carlo in Practice.Chapman
andHall, London,
1st edHenderson CR
(1973)
Sire evaluation andgenetic
trends. In:Proceedings of
the AnimalBreeding
and GeneticsSymposium
in Honorof
Dr DLLush, Blacksburg, Virginia,
10-41Hoeschele I,
Foulley
JL, Gianola D(1987)
Estimation of variance components withquasi-continuous data
using Bayesian
methods. J Anim Breed Genet104,
334-349Hoeschele
I,
Tier B(1995)
Estimation of variance components of threshold charactersby
marginal posterior
modes and means via Gibbssampling.
Genet Sel Evol27,
519-540Kuk AYC
(1995)
Asymptotically
unbiased estimation ingeneralized
linear models with random effects. J R Stat Soc B57,
395-407Lindley DV,
Smith AFM(1972)
Bayes
estimates for the linear model. JRoy
Stat SocB 34, 1-41
McCullagh P,
Nelder JA(1989)
Generalized Linear Models.Chapman
andHall, London,
2nd edMcCulloch CE
(1994)
Maximun likelihood variance components estimation forbinary
data. J Am Stat Assoc 89, 330-335
Misztal
I,
GianolaD, Foulley
JL(1989)
Computing
aspects of a nonlinear method of sireevaluation for
categorical
data. JDairy
Sci 72, 1557-1568Moreno C
(1993)
Estudio de los componentesgen6ticos
del modelo umbral. Tesis doctoral. Universidad deZaragoza, Spain
Natarajan
R,McCullogh
CE(1995)
A note on the existence of theposterior
distribution for a class of mixed models for binomial responses. Bio!rrzetrika 82, 639-643Sorensen D, Andersen
S,
GianolaD, Korsgaard
I(1995)
Bayesian
inference in threshold modelsusing
Gibbssampling.
Genet SelEvol 27,
229-249Tempelman
RJ(1994)
Evaluation of derivative-freemarginal
maximum likelihood esti-mation of variance components in threshold mixed models. In: Procof
the 5th WorldCong
GenetAppl
to LivestockProd, University
ofGuelph, Guelph,
22, 15-18Wright
S(1934)
Ananalysis
ofvariability
in number ofdigits
in an inbred strain ofguinea
pigs.
Genetics19,
506-536Zeger SL, Liang K,
Albert PS(1988)
Models forlongitudinal
data: ageneralized estimating
equation approach.
Biometrics44,
1049-1060Zeger SL,
Karim MR(1991)
Generalized linear models with random effects: a Gibbssampling approach.
J Am Stat Assoc 86, 79-86APPENDIX
This
appendix
shows thelog-conditional
posterior
distributional forms for the fixed and random effects and the variancecomponent
of thebinary
threshold model. Theadaptive
rejection sampling
algorithm
wasimplemented using
these distributionalforms.
The
log-fully
conditionalposterior
distributional form for the xth fixed effect is:where ci