HAL Id: hal-00894114
https://hal.archives-ouvertes.fr/hal-00894114
Submitted on 1 Jan 1995
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Estimation of variance components of threshold
characters by marginal posterior modes and means via
Gibbs sampling
I Hoeschele, B Tier
To cite this version:
Original
article
Estimation of variance
components
of threshold
characters
by
marginal
posterior
modes and
means
via
Gibbs
sampling
I
Hoeschele
B Tier
Virginia Polytechnic
Institute and StateUniversity,
Department of Dairy Science, Blacksburg,
VA24061-0315,
USA(Received
19 October 1994;accepted
24August
1995)
Summary - A Gibbs
sampling
scheme forBayesian analysis
ofbinary
threshold datawas derived. A simulation
study
was conducted to evaluate the accuracy of 3 variancecomponent estimators, deterministic
approximate marginal
maximum likelihood(AMML),
Monte-Carlomarginal posterior
mode(MCMML),
and Monte-Carlomarginal posterior
mean
(MCMPM).
Severaldesigns
with different numbers ofgenetic
groups,herd-year-seasons
(HYS),
sires and progeny per sire were simulated. HYS weregenerated
asfixed,
normally
distributed or drawn from a proper uniform distribution. The downward biasof the AMML estimator for small
family
sizes(50
sires,
average of 40progeny)
waseliminated with the MCMML estimator. For
designs
with manyHYS,
0.9incidence,
50 sires and 40 progeny on average, the
marginal posterior
distribution ofheritability
was
non-normal;
MCMML and MCMPMsignificantly
overestimatedheritability
underthe sire
mode,
while under the animal model the Gibbssampler
did not converge. Fordesigns
with 100 sires and 200 progeny persire,
themarginal posterior
distribution ofheritability
became more normal and thediscrepancy
among MCMML and MCMPMestimates vanished.
Heritability
estimates under the animal model were less accurate than those under the sire model. For the smallerdesigns,
the MCMML estimates were veryclose to the true value when
using
a normalprior
for HYSeffects, irrespective
of the true state of nature of the HYS effects. For extremeincidence,
small data sets and manyHYS,
observations within an HYS willfrequently
fall into the same category of response. With flatpriors
for the HYSeffects,
theposterior density
islikely improper, supported by
ananalytical proof
for asimplified
model andanalyses
from Gibbs output. Inanalyses
oflimited
binary
data with extremeincidence,
effects of a factor with many levels should begiven
a normalprior.
Assigning
a proper uniformprior
orfixing
values of such levels was*
On leave from Animal Genetics and
Breeding Unit, University
of NewEngland,
not useful. Most accurate estimation of
genetic
parametersrequires
verylarge
data sets.Further work is needed on
diagnosis
ofimproperness
and on alternative properpriors.
Bayesian estimation/
Gibbs sampling/
categorical data/
marginal maximum likelihood/
variance component estimationRésumé - Estimation des composantes de variance de caractères à seuil par les modes et les moyennes marginales a posteriori à l’aide de l’échantillonnage de Gibbs.
Un
plan d’échantillonnage
de Gibbs pourl’analyse bayésienne
de caractères binaires à seuil a été établi. Parsimulation,
on a pu comparer laprécision
de 3 estimateursde composante de variance, un maximum de vraisemblance
marginale
déterministe etapproximatif
(MVMA),
le modemarginal
aposteriori
de Monte Carlo(MVMMC)
et lamoyenne
marginale
aposteriori
de Monte Carlo(MMPMC).
Plusieursplans d’expérience
avec des nombresdifférents
de groupesgénétiques,
de cellulestroupeau-année-saison
(TAS),
depères
et de descendants parpère
ont été simulés. Les TAS ont été établiscomme des
effets fixes,
ou tirés d’une distributionnormale,
ou tirés d’une distributionuniforme.
L’erreur pardéfaut
du MVMA pour de petites tailles defamille
(50
pères,
40
descendants parpère)
a été éliminée par le MVMMC. Pour desdispositifs
incluant denombreux
TAS,
avec une incidence de0,9,
50pères et
4
0
descendants parpère
en moyenne,la distribution
marginale
aposteriori
de l’héritabilité n’est pasnormale ;
MVMMC etMMPMC surestiment
significativement
l’héritabilité dans un modèlepaternel,
alorsqu’avec
le modèle individuel la
procédure
de Gibbs ne converge pas. Avec 100pères
et 200 descendants parpère,
la distributionmarginale
aposteriori
de l’héritabilité serapproche
de la normale et les discordances entre les estimées MVMMC et MMPMC
disparaissent.
Les estimées d’héritabilités avec le modèle individuel sont moins
précises qu’avec
le modèlepaternel.
Pour les petitsdispositifs,
les estimées MVMMC sont trèsproches
de leur vraievaleur
quand
la distribution apriori
deseffets
TAS estnormale, quelle
que soit la réalitédes TAS. Pour des incidences
extrêmes,
de petits échantillons et ungrand
nombre deTAS,
les observations à l’intérieur d’une cellule TAS tombent souvent dans la mêmecatégorie
de
réponse.
Avec des a prioriuniformes
pour leseffets TAS,
la densité a posteriori estprobablement
impropre, comme tend àl’indiquer l’analyse
des résultats d’uneprocédure
Gibbs
appliquée
à un modèlesimplifié.
Dansl’analyse
de données binaires en nombre limitéet avec une incidence extrême, une distribution normale a
priori
devrait êtreassignée
auxeffets
desfacteurs
ayant de nombreux niveaux,plutôt qu’une
distributionuniforme
oudes valeurs
fixées.
Des estimationsprécises
desparamètres génétiques requièrent
dans ce cas de trèsgrands
ensembles de données. Il reste encore à étudier la manière de décelerl’impropriété
des distributions apriori
et de choisir de meilleurs apriori.
estimation bayésienne / échantillonnage de Gibbs
/
données catégorielles/
maximum de vraisemblance marginale / composante de varianceINTRODUCTION
Bayesian
analysis
ofbinary
orpolychotomous
threshold traits via Gibbssampling
has
recently
been describedby
Albert and Chib(1993),
Sorensen et al(1995)
and Jensen(1994).
In all 3 papers, the Gibbssampler
wasimplemented
incombina-tion with data
augmentation,
ieparameters
andmissing
data weresampled
fromeffects and variance
components,
while themissing
data consisted of the liabilitiesor latent continuous variables in the threshold model. In contrast with the afore-mentioned papers,
Zeger
and Karim(1991)
implemented Bayesian analysis
with aGibbs
sampler
based on theposterior
density
of theparameters
rather than thejoint posterior
ofparameters
andmissing
data.Zeger
and Karim(1991)
also used a differentprior
for thedispersion
parameters.
McCulloch(1994)
derived maximumlikelihood rather than
Bayesian
estimation via anexpectation-maximization
(EM)
algorithm
with Gibbssampling
of the liabilities within eachE step.
In this
contribution,
a Gibbssampling
schemeapplied
toparameters
andliabil-ities was
implemented
forBayesian analysis
of abinary
trait. A simulationstudy
was conducted to evaluate the accuracy of 3 estimators of variance
components,
deterministicapproximate
marginal
maximum likelihood(AMML)
(Foulley
etal,
1987;
Hoeschele etal,
1987),
the Monte-Carlo evaluatedmarginal
posterior
mode(MCMML),
and the Monte-Carlo evaluatedmarginal
posterior
mean(MCMPM).
The
analysis
was restricted to a threshold model with one variancecomponent
and was
performed
under sire and animal models. InMML,
themarginal
poste-rior
density
of the variance parameters is maximized withrespect
to theseparame-ters. ’Fixed’ effects and variance parameters have
improper,
flatprior
distributions.Thus,
themarginal posterior
density
of the variancecomponents
isproportional
totheir
marginal
likelihood. InBayesian
analysis
implemented
via Gibbssampling
applied
to allparameters
(or
allparameters
andmissing
data),
theparameter
sam-ples provide
inferences about anyparameter
from itsmarginal posterior
density.
For asingle
variancecomponent,
themarginal
posterior
mode isequivalent
to the MML estimator.Therefore,
the Monte-Carlo evaluatedmarginal
posterior
modewas termed the MCMML estimator.
With conventional deterministic
algorithms,
MML estimates arecomputed using
a normal
approximation,
and severe biases of variance and covariance estimates have beenreported
(Gilmour
etal,
1985;
Hoeschele etal, 1987;
Hoeschele andGianola, 1989;
Simianer andSchaeffer,
1989).
While Gilmour et al(1985)
observed an underestimationof heritability
with smallfamily
sizes,
Hoeschele et al(1987)
and Hoeschele and Gianola(1989)
found an overstimation ofheritability
for abinary
trait with extreme incidence and in the presence of a fixed factor with many levels. The
objective
of thisstudy
was to compare the 3 variancecomponent
estimatorsAMML, MCMML,
and MCMPM in terms of theirfrequentist properties, and,
inparticular,
toinvestigate
whether the biases observed for the AMML estimator canbe eliminated
by
computing
exact MML estimates via MCMCalgorithms.
A related sideobjective
was toinvestigate potential
causes of the biases.MATERIALS AND METHODS
Methodology
The
Bayesian analysis
described below is identical to that of Sorensen et al(1995),
whenapplied
to abinary
trait,
but differs from Albert and Chib(1993)
in thesampling
of the variancecomponents.
The Gibbssampling
scheme differs from thesampler
of Sorensen et al(1995)
only
when run under an animal model whereLet y
represent
the observed dichotomous variable and w theliability
variable. In the threshold model for 2categories
of response, yi = 1 ifw
i > 0.0 and
y
2 = 0 otherwise.
Conditionally
on the fixed(!3)
and random effects(u),
thew
i are
independent
N(x!fJ
+z!u, 1),
and the yi areindependent
Bernouilli withProb(y
i
=1)
=4
i(x
i
#
+ziu),
where4
i(.)
denotes the standard normal cumulativedistribution function. Matrices X and Z are the usual incidence matrices with row
i denoted
by
x!
(zD.
Theparameter
vector(0)
includes#,
u =[u! ..., u!,...,
u9!’,
and the
af
j
(I, j
=1, ... ,
q),
withCov(u
i
,
u) )
=A
ij
a
ij
and the A matrices known.The
joint posterior
density
of e and w ={w
i
}
iswhere c is a
constant,
0(
p,;a
2
)
is thedensity
function ofN(!.;
a
2
),
andI(
XE
S)
is the indicator functionequal
to 1 if variable x is contained in set S and zerootherwise. For
independent
u
j
s’,
theprior
densities of the uj and the <7,!simplify
to
products
of thef(u
j
la
J
),
thedensity
of the MVN(0;
A!Q! ),
andproducts
of thef (a?) 3
orprior
densities of theaJ.
Samples
from thejoint posterior
distribution canbe obtained
by sampling
in turn fromf(Blw, y) f
(0 1 w)
andf (w 10, y).
These 2conditional distributions are of standard
forms,
and thefully
conditionalparameter
densities,
derived from/(!w),
are identical to those in the standard mixed linearmodel
(eg,
Gelfand etal, 1990;
Wang
etal,
1993).
Then,
from standard mixed linear model results(eg,
Gianola etal,
1990),
and with q =1,
Uj = u and
a
§
=Q!,
where
(!Z
is element i of vectorf
l, (
i
3-
is this vector with element iomitted,
xi iscolumn i of matrix x, x-i is x with column
i deleted, a
ii
is element(i, i)
of the inverse additivegenetic relationship
matrixA-
1
,
andAi
is
row i ofA-
1
without element i.Under the animal
model,
breeding
values(u)
of a sire and hisoffspring
weresampled jointly by sampling
a sire’s u from itsmarginal
normal distribution whilesampling
the u of eachoffspring
from its full conditional distribution in(3!.
Mean and variance of themarginal
distribution were obtained as the BLUP of the sire’su and its
prediction
error variance afterabsorbing
theoffspring
u into the sire’s uin MME for this sire and his
offspring.
For q = 1 andf(u u 2)
=constant,
where the inverse
chi-square
distribution has n - 2df
with nequal
to the numberreported problems
inestimating
variancecomponents
due to the Gibbssampler
&dquo;being
trapped
at zero&dquo;. Theirprior
resulted from an inversechi-square
(or
inverseWishart)
distribution with zeroprior degrees
offreedom,
yielding
theprior density
f(afl) =
(afl)!!
andchanging
thedf
in[4]
to n. Thisproblem
vanishes whenusing
the flatprior,
as also observedby
other researchers(D
Sorensen,
personal
communication).
Note also that with animproper prior
distribution theresulting
posterior
distribution can be proper orimproper
(eg,
Berger
andBernardo, 1992a,
b;
Hobert andCasella,
1994).
Hober and Casella(1994)
established necessaryand sufficient conditions for the
joint posterior
density
of the fixed and randomeffects and the variance
components
in ahierarchical
Bayes
LMM to be proper,ie
integrable.
One condition was that for any variancecomponent
j,
besides theresidual,
withprior
f (!! )
=(!! )-!a!+1>,
we must have aj < 0.Setting
aj = -1yields
the flatprior
f ( ?)
= 1 used in this paper, while aj = 0produces
theprior
used
by
Zeger
and Karim(1991).
Morespecifically,
in thisstudy,
a bounded flatprior
forafl
was used under a sire model as in Sorensen et al(1995),
while animproper
flatprior
was used under the animal model.The
marginal
posterior
mode ofheritability
wascomputed by
agrid
search of the Rao-Blackwell estimate(eg,
Gelfand etal,
1990)
of themarginal
posterior density
ofa
2
and
achange
of variable toh
2
,
orwhere 6 = 4 or 6 = 1 for the sire or animal
model, respectively,
k denotes the Gibbssample,
and J is the Jacobian of the transformationafl -
h
2
,
Conditional on the
parameters,
the latent data weresampled
from truncated normaldistributions,
orwith
To
implement
the Gibbssampler, starting
values for theparameters
wereobtained
by
computing
the maximum apriori
(MAP)
estimates of/
3
and u(eg,
Gianola and
Foulley,
1983)
evaluated at theapproximate
MML estimates(Foulley
etal,
1987)
of the0’!.
Given initialparameter
values,
a firstsample
of the latentdata was drawn from
!5!,
followedby
thesampling
of newparameter
values from!2!, [3]
and!4!,
and further Gibbscycles.
Because the
prior
distributions for the fixed effects and for the variancecompo-nent under the animal model are
improper,
theposterior
distributionmight
alsoposterior
distribution whichonly
hold for hierarchical linear mixed models.Fur-thermore,
a reviewerpointed
out that for asimple
fixed model with one factor and thelogit
linkfunction,
thejoint posterior
distribution of the fixed effects isim-proper, if in at least one factor level all observations fall into the same
category
ofresponse. An
analytical investigation
of whetherf(0)y)
is proper for the nonlinearmixed model considered
here, however,
is very difficult or intractable.Therefore,
it is desirable to be able to detect an
improper posterior
from Gibbsoutput.
Acandidate
approach
is the MC evaluation of themarginal
likelihoodf (y).
With Brepresenting
theparameter vector,
themarginal
likelihood isThe
reciprocal
off (y)
is anormalizing
constantensuring
that theposterior density
f (9!y)
integrates
to one, if it is proper. If it is not,[7]
is infinite.While the
computation
ofBayesian
point
estimates and related inferences(eg,
marginal posterior density plots,
highest
posterior
density
regions)
from Gibbsoutput
isstraightforward,
computation
ofmarginal
likelihoods orBayes
factors hasproved
to be verychallenging
(eg,
Chib, 1994;
Newton andRaftery,
1994).
Because all conditional distributions(ie !2!, [3]
and!4!)
arestandard,
theapproach
of Chib(1994)
formarginal
likelihood estimation from Gibbsoutput
wasadopted
here. The
marginal
likelihood can be written asand
is estimatedby
evaluating
[8]
at aparticular point,
eg, theposterior
mean8 =
.E(<!y).
With dataaugmentation,
the denominator of[8]
can be written aswhere w is the vector of
missing
data. The Monte-Carlo estimate of[9]
iswhere N is Gibbs
sample
size and thew
i
aresamples
fromf (B, w!y)
and hence arealso
samples
fromf(wly).
To evaluate the densities in theright-hand
side of!10!,
let B =[0’,
u’,
u
Then,
one way offactoring
thejoint posterior density
evaluated at the vector ofposterior
means of theparameters
isNote that with the parameter vector
partitioned
into 3subvectors,
[12a,
b,
c]
represents
one of 6possible
factorizations of theposterior
density.
Furthermorenote that
density
[12a]
can be evaluatedimmediately
at the termination of the Gibbschain,
while the evaluation of[12b]
requires output
from a second Gibbs chain withafl fixed
at itsposterior
mean estimate obtained from the firstchain,
and evaluation of[12c]
is from a third Gibbs chain with u andQ
u
fixed
at theirposterior
mean estimates from the first Gibbs chain. Simulated data andanalyses
Data were simulated on a
binary
threshold trait.Eight
differentdesigns
wereinvestigated,
whichfirstly
differed in the numbers ofgenetic
groups,herd-year-season
(HYS)
effects,
sires,
and progeny per sire.Secondly,
designs
different in theway HYS effects were simulated: as fixed
(generated
once from a normal distribution and held constant in allreplicates),
as random andnormally
distributed,
or asrandom and drawn from a proper uniform distribution. Genetic groups were
always
fixed and sires or animals were random. The
designs
are defined in table I.Genetic group means for
liability
were-0.4, -0.15, 0.15,
and 0.4. Sire or animal effects onliability
weregenerated
fromN(0,
0D
.
for aheritability
ofh
2
= 0.25(or2= h 21(6 -h 2)
for HYS fixed and72
=h
2
(1
+(}!Ys)/(8 - h
2
)
for HYSrandom,
with 6 = 4(6
=1)
under the siregenerated
fromN(0, 1).
HYS effects weregenerated
fromN(0,
0’ Hys ! 2
0.46)
or fromU [a, b],
where a =—0.5(12cr!Ys)!! !
-b was set such that HYS effects had thesame variance under both distributions. The truncation
point
used to dichotomize the liabilities was!-1
(p)*(1 +a!Ys+a;)O.5,
where p was the desired incidenceequal
to 0.9 for all
designs
except
VI-F with p = 0.5. With 135HYS,
theprobabilities
ofhaving
0,
1,
or 2offspring
in any HYS were0.8, 0.1,
and0.1,
respectively,
for eachsire;
with 32HYS,
sires had0,
6 or 7offspring
per HYS with the sameprobabilities;
and with 320
HYS,
sires had0,
2 or 4 progeny per HYS withprobabilities 0.8,
0.09,
and0.11,
yielding approximately
the average progeny group sizesgiven
in table I. The numbers of sires in the 4genetic
groups were12,
14,
13,
and 11.Designs
withaverage progeny group size of 40
(200)
werereplicated
40(20)
times.All data sets were
analyzed
with the siremodel,
and data setsII-F,
V-F and VI-F were alsoanalyzed
with the animal model. Note that for thesedesigns
and if a linear mixed model wereused,
sire andanimal,
models would be(linearly)
equivalent
(Henderson, 1985),
ieyield
the same estimates of fixed and sire effects.HYS effects were treated as fixed
( f )
in theanalysis,
ie hadimproper
uniformpriors,
for alldesigns
where HYS effects were fixed(F).
Additionally,
HYS were treated as random with normalprior
(n)
in theanalyses
of data sets fordesigns
II-F,
II-Nand II-U where HYS were
fixed,
random andnormally
distributed,
and random anddrawn from the proper uniform
distribution,
respectively. Treating
HYS as random with normalprior required
to also estimate HYS variancea
H
y
s
.
For some of the
designs,
data sets contained HYS levels with the so-calledextreme
category
problem
(ECP)
(Misztal
andGianola,
1989).
Extremecategories
are the first and last
category,
hence for abinary
trait bothcategories
are extreme.In an HYS
exhibiting
theECP,
all records are in the same extremecategory.
Onaverage,
frequency
of HYS with the ECP was45%
fordesigns
I-F andII-F/N/U,
22%
fordesign
III-F,
17%
fordesign
IV-F,
and0%
for all otherdesigns.
Solutionsfor such HYS classes do not converge but tend toward
infinity
in deterministic AMMLalgorithms.
Therefore,
in AMML solutions for these HYS were fixed atI 10
(Misztal
andGianola,
1989).
For the Gibbssampler, options
were considered for the treatment of HYS in the presence of the ECP:(i)
to use a proper normalprior
in the
analysis;
(ii)
to use a proper uniform distribution asprior,
eg,U(—10., 10.);
and
(iii)
to fix HYS effects with the ECP at ± 10 or at ± 3 in all Gibbscycles,
denoted
by
f10 andf3, respectively.
RESULTS AND DISCUSSION
Estimates of
heritability
(h
2
)
were obtained under the sire model for alldesigns
and under the animal model fordesigns
II-F, V-F,
and VI-F. Fordesigns
II-F andIV-F,
HYS variance was also estimated when HYS effects had aprior
normal distributionin the
analysis.
Estimators weredeterministic, AMML,
MCMML and MCMPM. MC estimates werecomputed
from 20 000 consecutive Gibbssamples
for thesire model and 200 000
samples
for the animalmodel,
with a burn-inperiod
of an additional 2000cycles.
The burn-inperiod
was determined based onplots
ofsample
values forheritability
versuscycle
numbershowing consistently
that 2 000cycles
were more than needed.Typical plots
fordesign-analysis
combinations II-F-fdue to the use of
good starting
values for both location and varianceparameters,
which were the estimates obtained from the AMML
procedure.
The number of Gibbs
cycles
of 20 000(200 000)
was determined as sufficientbased on the autocovariance structure of the
sample
of heritabilities. This entailedcalculating
an effectivesample
size as the ratio of variance ofsamples
to the variance of thesample
meancomputed
from the estimated autocorrelations(Sorensen
etal,
1995).
Autocovariance forlag
t was estimated asVariance of
sample
meangiven
the estimated autocovariances was estimated withwhere m is the
largest
integer
satisfying
the conditionand effective
sample
size ESS wasFor the sire model with 20 000
cycles,
the minimum ESS was 580 and was found in one of thedesign
I-F data setsanalyzed
with HYS treated as fixed( f ).
For atypical
replicate
with ESS =1435,
autocorrelations atlags
1, 10,
20 and 50 were0.66,
0.34,
0.13 and
0.007,
respectively.
For most data sets of thisdesign,
ESS exceeded 1000. For otherdesigns
(II-VI),
ESS was in the 2 000 to 6 000 range,and,
forexample,
autocorrelations at the abovelags
were0.39,
0.14, 0.03,
and 0.006 with ESS = 3 705for a
design
IV-F data set with HYS treated as fixed. For the animal model andMean estimates and
empirical
SE across 40replicates
arepresented
in table II. The first column of table IIspecifies
thedesign
(eg, I-F)
in combination with thetype
ofanalysis
(eg, I-F-f)
according
to the treatment of HYS effects.Progeny
group sizestypically ranged
from 25 to 55 for average progeny group size of 40 and from120 to 280 for an average of 200.
All estimators
strongly
overestimatedheritability
for thedesign-analysis
com-binations I-F-f and II-F-f under the sire model. Theupward
bias waslargest
forMCMPM,
followedby
MCMML and AMML. Thedesigns
were characterizedby
true value than the mean. A
plot
of themarginal posterior density
ofh
2
based on[5]
for II-F-f can be found infigure
3.For
design-analysis
combinationIII-F-f,
with thedesign containing
the samenumber of sires and records as, but a smaller number of HYS levels
than,
I-Fand
II-F,
theupward
biases of the MCMPM and MCMML estimators decreased while the AMML estimator tended to underestimateh
2
.
Overestimation(Hoeschele
et
al,
1987;
Hoeschele andGianola,
1989;
Simianer andSchaeffer,
1989)
and underestimation(Gilmour
etal,
1985;
Thompson,
1990)
with AMML are ingood
agreement
with the literature. It appears that the AMML estimates are closest tothe true value of
h
2
.
However,
acomparison
between the results for I-F-f and II-F-fversus III-F-f
strongly
suggests
that thisfinding
is due to apartial counterbalancing
ofupward
and downward biases of the AMML estimator for data sets with a small number of records and progeny per sire.For the
larger designs
with 100 sires and average progeny group size of200,
biases of and
discrepancies
among estimators werestrongly
reduced(design IV-F)
andnegligible
fordesign
VI-F with the mean as theonly
fixed effect and an incidencelevels were increased
by
a factor of 10 relative to III-F-f. As aresult,
theAMML,
estimator nolonger
underestimatedh
2
,
while the overstimations obtained with andthe
discrepancy
between the MCMPM and MCMML estimatorsstrongly
decreased. Aplot
of themarginal posterior
distribution ofh
2
fordesign-analysis
combinationIV-F-f is
presented
infigure
4,
which shows a moresymmetric
distribution withsmaller variance relative to
figure
3.It may be concluded that the
discrepancies
found between the trueh
2
,
the MCMPM and the MCMML estimates are due to a lack of information in the datacausing
theposterior
distribution to be non-normal and its mean to be a biased estimator ofh
2
.
Anotherexplanation
might
be that thejoint posterior
distribution of theparameters
isimproper.
As mentionedearlier,
for thelogit
link and asimple
fixedmodel, improperness
can be shownanalytically
when levels of the fixed factorexhibit the ECP.
Consequently,
use of a properprior
distribution for the fixedeffects, resulting
in a properposterior,
should eliminate thisproblem. Therefore,
2 proper uniform distributions were
employed
aspriors,
U(—3, 3)
andU(—10, 10)
of
sample
value versus Gibbscycle
for an HYS with the ECP. In thisanalysis,
animproper
uniformprior
wasused,
andsample
values drifted towardextremely
small numbers far below the lower limits of -3 and -10 in the proper uniformpriors.
Next,
values of HYS effects with the ECP(all
recordsequal
to0)
were fixed at -3 or at -10 across all Gibbscycles
while the other HYS effects weresampled
and had animproper prior.
Results from theseanalyses
are in table II in the rows fordesign-analysis
combinations II-F-f3 and 11-F-flO.Upward
biases were stillsubstantial and
only slightly
smaller than those for II-F-f. Theposterior
distribution for theparameters
sampled
should have been properand,
if so, the results indicatethat biases were
(mostly)
due to limited information in the data.Because a proper uniform
prior
could not be used for technical reasons, a normalprior
distribution waspostulated
for the HYS. Thedesigns analyzed
with a normalprior
wereII-F,
II-N,
andII-U,
where the true state of nature of the HYS effectswas
fixed,
normally
distributed and drawn from a properuniform, respectively.
The results for all 3designs
were very similar. The AMML estimatorsignificantly
The
empirical
SE of the MC estimators tended to beslightly higher
than the SE of the deterministic AMML estimator for the smallerdesigns. Doubling
the number of Gibbscycles
hadvirtually
noeffect, suggesting
that the cause was notthe Monte-Carlo error.
Possibly
the AMML estimator had smaller variance in caseswhere it exhibited a
large
downward bias.When an animal model was used for
design-analysis
combinationII-F-f,
the Gibbssampler
’blewup’
in each of severalreplicates
analyzed,
ie the additivegenetic
variance continued to increase insubsequent
Gibbscycles,
soonreaching
unreasonably high
values. Whendesign
V-F withonly
one fixed effect wasanalyzed
under the animal
model,
the Gibbssampler ’converged’,
but estimates hadlarger
biases and were more variable
compared
to those obtained under the sire model.’Convergence’
of the Gibbssampler
refers to thesampler
reaching
stationarity,
wheresample
values fluctuate around a constant(see
figure
6 for presence of andfigure
5 for lack of’convergence’;
thesefigures
are discussed furtherbelow).
Toverify
this result and ensure the correctness of the software under the animal modeloption,
thelarger design
VI-F was alsoanalyzed
under the animal model.Then,
the average estimates were much closer to the true values and almost identical toThe results discussed so far
strongly
suggest
that the cause of theupward
biasin the MCMPM and MCMML estimators is a
highly non-symmetrical marginal
posterior
distribution ofheritability
in situations where the data contain little information. For thelarger
designs
(IV-F, VI-F),
the biasesstrongly
decreased.However,
it appears to belikely
that theposterior density
of theparameters
isalso
improper
in the presence of HYS levels with the ECP. As mentionedearlier,
improperness
can be shownanalytically
for asimple
fixed model and thelogit
link forbinary
data when at least one level of the fixed factor exhibits the ECP. Thefact that the Gibbs
sampler
’blewup’
for the smallerdesigns
under the animal model alsosupports
thehypothesis
of animproper posterior. Moreover,
when ajoint posterior
density
of theparameters
isimproper,
’marginal
posterior
mean ormode’ estimates or
’marginal posterior density plots’
from Gibbsoutput
may look’reasonable’,
although
these were not obtained from a Markov chain with knownstationarity
properties
(Hobert
andCasella,
1994).
In thelogit
linkexample,
effects of fixed levels notshowing
ECP are still’reasonably
well estimated’ in the presenceof other levels with the ECP
causing
theposterior
to beimproper.
Because an
analytical
proof
of animproper
posterior
under the nonlinear mixedmodel for
binary
dataemployed
in thisinvestigation appeared
to be very difficult orintractable,
themarginal
likelihood in!7J,
or thereciprocal
of theintegration
constant of the
posterior
density,
was considered as apotential
criterion fordetecting
improperness
from Gibbsoutput.
It was estimated from several modifieddesign
II-F data setsusing expression
[11]
andrequired
3 consecutive Gibbs chains which were run atlengths
of 10 000 and 20 000(after
burn-in)
cycles.
Two situationswere considered in which the
posterior
density
wasprobably
improper:
in thepresence of the ECP and when
using
theprior
1/
0
,
for
the variancecomponent.
To examine theECP,
4 data sets were created. Data set 1 had 70 HYS with theECP,
data set 2 hadonly
1 HYS with the ECP and data set 3 had none. Data sets2 and 3 were obtained
by
reducing
the incidence of the trait from 0.9 to 0.5 andby reducing
the size of the differences among HYS differences. Data set 4 was fromdesign
II-N with random HYS effects.Table III contains the estimated
normalizing
constants(reciprocals
ofmarginal
likelihoods)
and the MCMPM estimates ofheritability
for each of the 5 data setsand the 2
lengths
of thesampler.
The estimates of theintegration
constant for data sets 1 and 2 of table III differedstrongly
betweenlengths
of 10 000 and 20 000cycles,
ie did not’converge’.
The estimate of theintegration
constant,
however,
did converge for data set 3 notcontaining
HYS with ECP and for data set 4treating
HYS as random. For all 4 datasets,
the estimate ofheritability
appeared
to have
’converged’,
becausevirtually
the same means were obtained after 10 000 and 20 000cycles. ’Convergence’
was also indicatedby
theplot
ofsample
value forheritability
versus Gibbscycle
infigure
7although
it showed morevariability
than acorresponding plot
infigure
8 for data set 3 notcontaining
any HYS with ECP.Note that the
numbering
of the Gibbscycles
in thefigures
begins
after burn-in.Expectedly,
aplot
of HYSsample
value versus Gibbscycle
(fig 5)
showednon-convergence for an ECP HYS in data set 1 of table III. As a
control,
figure
6presents
theplot
ofsample
value versus Gibbscycle
for an HYS withoutECP,
which demonstrated convergence. Plots of themarginal
posterior
density
for anTo
verify
the effect of theparticular
factorization of theposterior
density
evaluated at the vector ofposterior
means of theparameters
in[11],
theanalysis
of data set 1 in table II wasrepeated
for the other 5factorizations,
which all indicatednon-convergence of the
integration
constant(results
notpresented).
To examine the second case of an
improper posterior
causedby
the1/<T!
prior,
improper posteriors
in the linear model(Hobert
andCasella,
1994).
Use of thisprior
in estimation underdesign
V-Foccasionally yielded
zeroh
2
estimates
or underestimatedh
2
.
Although heritability
was underestimated(h
2
=0.09),
the Monte-Carlo estimates of theintegration
constant were almostunchanged
after 10 000 and 20000 Gibbscycles indicating
that this form ofimproperness,
if itexists,
was not detectedby
Monte-Carlo estimation of theintegration
constantbased on
!11).
CONCLUSIONS
This
study
confirmed that the AMML estimator of theheritability
of a thresholdcharacter has a downward bias if
family
sizes are small(Gilmour
etal,
1985),
inthis case if sire progeny group size is small for a
binary
trait withhigh
(or
low)
incidence. This bias is due to theapproximation
in the AMML estimator(Foulley
et
al,
1987;
Hoeschele etal,
1987),
because it is eliminatedby
computing
exactMML estimates via Gibbs
sampling.
For small data sets with extreme incidence and many fixed effects
( eg,
design
II-F),
ie with little information about theheritability,
themarginal posterior density
ofheritability
ishighly non-normal,
its mean and mode differ and overestimateh
2
2effect and the incidence was 0.5. The
phenomenon
ofobtaining
biased estimates when random effects areincorporated
into ageneralized
linear model isquite
well
known,
and an iterative bias correction method forparametric
modelsusing
thebootstrap
has beendeveloped
(Kuk,
1995).
Moreno(personal communication)
obtainedpromising
results whenapplying
Kuk’s method to abinary
thresholdtrait,
however,
he noted that thebootstrap
iscomputationally
extremely
demanding.
Without biascorrection,
accurate estimation ofgenetic
parameters
forbinary
threshold traitsrequires
alarge
amount of data in absolute terms and relative to the number of fixed effects. This islikely
even moreimportant
forgenetic
correlations than for heritabilities with the former known to bepoorly
estimated frombinary
data
(Simianer
andSchaeffer,
1989).
Anencouraging result, however,
is that for situations whereonly
a small data setcontaining
many HYS levels is available foranalysis,
the MCMML estimator ofheritability
appears to be unbiased if a normalprior
is used for the HYSeffects,
irrespective
of the true state of nature of the HYSeffects
(fixed,
normally
or boundeduniformly
distributed).
Ascategorical
traits areusually
not under intenseselection,
a non-random association of sires and herds seemsunlikely
whichjustifies treating
HYS effects as random inpractice.
Otherapproaches
toimproving
heritability
estimation for suchdata,
eg,fixing
values ofprior,
were not successful. The use of alternativeprior
distributions( eg,
Berger
andBernardo, 1992a,
b)
for fixed factors with many levelsexhibiting
the ECP shouldbe
investigated
further.The accuracy of
heritability
estimation was also found to differ among sire andanimal models. For data with extreme
incidence,
a limited number of sires(50)
and small progeny group size(40),
estimates obtained with the animal model were lessaccurate than those from the sire model. When the number of HYS effects
(treated
as
fixed)
wasadditionally large,
the Gibbssampler
did not’converge’.
Therefore,
the animal model should be used
only
when there is sufficient information in thedata;
otherwise theapparently
more robust sire model should bepreferred.
ACKNOWLEDGMENTS
Financial support for this work was
provided by
the National Science Foundation grantBIR-94-07862,
the Australian Meat ResearchCorporation,
and the AustralianDepartment
ofIndustry, Science,
andTechnology.
This research was conductedusing
the resourcesof the center’s
Corporate
Research Institute. The authorsacknowledge
discussions with MATanner,
D Sorensen and D Gianola. We also thank MA Tanner forproviding
us withtechnical reports on the Monte-Carlo estimation of
Bayes
factors andmarginal
likelihoods. Two anonymous reviewers are thanked for valuablesuggestions
and forimproving
theclarity
of this paper.REFERENCES
Albert
JH,
Chib S(1993)
Bayesian analysis
ofbinary
andpolychotomous
response data.J Am Stat Assoc 88, 669-679
Berger JO,
Bernardo JM(1992a)
On thedevelopment
of referencepriors.
In:Bayesian
Statistics VI
(JM
Bernardo,
JOBerger,
APDawid,
AFMSmith,
eds),
OxfordUniver-sity Press,
35-60Berger JO,
Bernardo JM(1992b)
Referencepriors
in a variance componentsproblem.
In:Proceedings of
the Ircdo-USAWorkshop
onBayesian Analysis
in Statistics(P
Goel,
ed),
Springer Verlag,
NewYork,
177-194Chib S
(1994)
Marginal
Likelihoodfrom
the GibbsOutput.
TechnicalReport,
Olin School of Business,Washington University, Washington,
DCFoulley JL,
ImS,
GianolaD,
Hoeschele I(1987)
Empirical Bayes
estimation of parametersfor n
polygenic binary
traits. Genet Sel Evol 19, 197-224Gelfand
AE,
HillsSE,
Racine-PoonA,
Smith AFM(1990)
Illustration ofBayesian
inference in normal data modelsusing
Gibbssampling.
J Am Stat Assoc85,
972-985Geyer
CJ(1992)
Practical Markov chain Monte-Carlo(with discussion).
Stat Sci7,
467-511
Gianola
D, Foulley
JL(1983)
Sire evaluation for orderedcategorical
data with a threshold model. Genet Sel Evol 15, 201-224Gianola
D,
ImS,
Macedo FW(1990)
A framework forprediction
ofbreeding
value. In:Advances in Statistical Methods
for
GeneticImprovement of
Livestock(D
Gianola,
KHammond,
eds),
Springer Verlag, Berlin,
210-238Gilmour
AR,
AndersonRD,
Rae AL(1985)
Theanalysis
of binomial data in ageneralised
linear mixed model. Biometrika72,
593-599Henderson CR
(1985)
Equivalent
linear models to reducecomputations.
JDairy
Sci 68,2267-2277
Hobert
JP,
Casella G(1994)
Gibbssampling
withimproper prior
distributions. TechnicalReport BU-1221-M,
Biometrics Unit, CornellUniversity, Ithaca,
NYHoeschele I
(1989)
A note on local maxima in MaximumLikelihood,
Restricted MaximumLikelihood,
andBayesian
estimation of variance components. J StatComp
Simul 33, 149-160Hoeschele
I,
Foulley JL,
Gianola D(1987)
Estimation of variance components withquasi-continuous data
using Bayesian
methods. J Anim Breed Genet104,
334-339Hoeschele
I,
Gianola D(1989)
Bayesian
versus MaximumQuasi
Likelihood methods for sire evaluation withcategorical
data. JDairy
Sci72,
1569-1577Janss
LLG, Thompson R,
van Arendonk JAM(1994)
Application
of Gibbssampling
forinference in a mixed
major gene-polygenic
inheritance model in animalpopulations.
Theor
Appl
Genet(in
press)
Jensen J
(1994)
Bayesian analysis
of bivariate mixed models with one continuous and onebinary
traitusing
the Gibbssampler. Proc
5th WorldCongr
GenetAp
P
I
Livest Prod Vol 18, 333Mc Culloch CE
(1994)
Maximum likelihood variance components estimation forbinary
data. J Am Stat Assoc 89, 330-335
Misztal
I,
Gianola D(1989)
Computing
aspects of a nonlinear method of sire evaluation forcategorical
data. JDairy
Sci 72, 1557-1568Newton
MA, Raftery
AE(1994)
Approximate Bayesian
inference with theweighted
likelihood bootstrap. J R Stat Soc B 56, 3-48Simianer H, Schaeffer LR
(1989)
Estimation of covariance components between one continuous and onebinary
trait. Genet Sel Evol 21, 303-315Sorensen
DA,
AndersenS,
GianolaD, Korsgaard
I(1995)
Bayesian
inference in threshold modelsusing
Gibbssampling.
Genet Sel Evol 27, 229-249Thompson
R(1990)
Generalized linear models andapplications
to animalbreeding.
In: Advances in Statistical Methods
for
GeneticImprovement
of
Livestock(D
Gianola,
K