HAL Id: hal-00894193
https://hal.archives-ouvertes.fr/hal-00894193
Submitted on 1 Jan 1998
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
A link function approach to heterogeneous variance
components
Jean-Louis Foulley, Richard L. Quaas, Caroline Thaon d’Arnoldi
To cite this version:
Original
article
A link
function
approach
to
heterogeneous
variance
components
Jean-Louis
Foulley
Richard
L.
Quaas
b
Thaon
d’Arnoldi
aStation de
génétique quantitative
etappliquée,
Institut national de la rechercheagronomique,
CR deJouy,
78352Jouy-en-Josas Cedex,
France bDepartment
of AnimalScience,
CornellUniversity, Ithaca,
NY 14853, USA(Received
20 June1997; accepted
4 November1997)
Abstract - This paper presents
techniques
of parameter estimation in heteroskedastic mixed modelshaving
i)
heterogeneous log
residual variances which are describedby
alinear model of
explanatory
covariates andii)
log
residual andlog
u-componentslinearly
related. This makes the intraclass correlation a monotonic function of the residual variance. Cases of a
homogeneous
variance ratio and of ahomogeneous
u-component of variance are also included in this parameterization. Estimation andtesting procedures
of the
corresponding dispersion
parameters are based on restricted maximum likelihoodprocedures. Estimating equations
are derivedusing
the standard andgradient
EM. Theanalysis
of a smallexample
is outlined to illustrate thetheory. ©
Inra/Elsevier,
Parisheteroskedasticity
/
mixed model/
maximum likelihood/
EM algorithmRésumé - Une approche des composantes de variance hétérogènes par les fonctions de lien. Cet article
présente
destechniques
d’estimation desparamètres
intervenant dans des modèles mixtes caractérisési)
par deslogvariances
résiduelles décrites par un modèle linéaire de covariablesexplicatives
etii)
par des composantes u et e liées par une fonction affine. Cela conduit à un coefficient de corrélation intraclassequi
varie comme une fonction monotone de la variance résiduelle. Le cas d’une corrélation constante et celui d’unecomposante u constante sont
également
inclus dans cetteparamétrisation.
L’estimation et les tests relatifs auxparamètres
dedispersion correspondants
sont basés sur les méthodes du maximum de vraisemblance restreint(REML).
Leséquations
à résoudre pour obtenirces estimations sont établies à
partir
del’algorithme
EM standard etgradient.
La théorie est illustrée parl’analyse numérique
d’un petitexemple. ©
Inra/Elsevier,
Parishétéroscédasticité
/
modèle mixte/
maximum de vraisemblance/
algorithme EM*
1. INTRODUCTION
A
previous
paper of this series[4],
presented
an EM-REML(or ML)
approach
to
estimating dispersion
parameters
for heteroskedastic mixed models. We assumedi)
a linear model onlog
residual(or e)
variances,
and/or
ii)
constant u to e varianceratios.
There are different ways to relax this last
assumption.
The first one is toproceed
as with residual
variances,
i.e.hypothesize
that the variation inlog
u-components
or of the u to e-ratiodepends
onexplanatory
covariates observed in theexperiment,
e.g.
region,
herd, parity,
management
conditions,
etc. This is the so-called structuralapproach
describedby
San Cristobal et al.[23],
andapplied by Weigel
et al.[28]
and De Stefano[2]
to milk traits ofdairy
cattle.Another
procedure
consists inassuming
that the residual andu-components
aredirectly
linked via arelationship
which is less restrictive than a constant ratio. Abasic motivation for this is that the
assumption
ofhomogeneous
variance ratios or intra class correlations(e.g.
heritability
for animalbreeders)
might
be unrealistic[19]
although
very convenient to set up for theoretical andcomputational
reasons(see
theprocedure
by
Meuwissen et al.[16]).
As a matter offact,
the power ofstatistical tests for
detecting
suchheterogeneous
heritabilities isexpected
to be low[25]
which may alsoexplain why
homogeneity
ispreferred.
The purpose of this second paper is anattempt
to describe aprocedure
of thistype
which we will call a link functionapproach
referring
to its close connection with theparameterization
used in GLMtheory
[3, 14].
The paper will be
organized along
similar lines as theprevious
paper[4]
including
i)
an initial section ontheory,
with a brief summary of the models and apresentation
of the
estimating equations
andtesting procedures,
andii)
a numericalapplication
based on a small data set with the same structure as the one used in theprevious
paper[4].
2. THEORY
2.1. Statistical model
It is assumed that the data set can be stratified into several strata indexed
by
(i
=1, 2, ... ,
I)
representing
apotential
source ofheteroskedasticity.
For the sake ofsimplicity,
we will consider a standardized one-way random(e.g.
sire)
model as inFoulley
[4]
andFoulley
andQuaas
[5].
where yi is the
(n
i
x1)
data vector for stratumi;
j3
is a(p
x1)
vector of unknown fixed effects with incidence matrixX
i
,
and ei is the(n
2
x1)
vector of residuals. The contribution of thesystematic
randompart
isrepresented
by
O
&dquo;uiZ
iU
*
where u*
is a
(q
x1)
vector of standardizeddeviations,
Z
i
is thecorresponding
incidencematrix and <7u, is the square root of the
u-component
ofvariance,
the value of whichThe influence of factors
causing
theheteroskedasticity
of residual variances is modelledalong
the linespresented
in Leonard[13]
andFoulley
et al.[6, 7]
via alinear
regression
onlog-variances:
where 5 is an unknown
(r
x1)
real-valued vector ofparameters
andp’
is thecorresponding
(1
xr)
row incidence vector ofqualitative
or continuous covariates.Residual and
u-component parameters
are linked via a functionalrelationship
or
equivalently
where the constant T
equals
exp(a).
The differential
equation
pertaining
to[3ab],
i.e.(d
C7u
jC
7uJ
-
b(dC7ejC7eJ
= 0 is a scale-freerelationship
which showsclearly
that theparameter
of interest in thismodel is b. Notice the close connection between the
parameterization
inequations
[2]
and[3ab]
with that used in theapproach
of the’composite
link function’proposed
by
Thompson
and Baker[24]
whosesteps
can be summarized as follows:i
) (C7ui,C7eJ’
=f(a,b,C7e
J
;
ii)
C7ei =
exp(?
7
i),
and qi
=(112)p’
6
.
Ascompared
toThompson
andBarker,
theonly
difference is that the functionf
ini)
is not linear and involves extraparameters,
i.e. a and b.The intraclass correlation
(proportional
toheritability
for animalbreeders)
is an
increasing
function of the varianceratio p
i
=ou
i
/!e..
In turn pi increases or decreases withu,
2
depending
on b > 1 or b <1,
respectively,
or remains constant(b
=1)
sincedpi/p
i
=2(b - l)do’e!/o’e!.
Consequently
the intraclass correlationincreases or decreases with the residual variance or remains constant
(b
=1).
For2.2. EM-REML estimation
The basic EM-REML
procedure
[1, 18]
proposed
by
Foulley
andQuaas
(1995)
for
heterogeneous
variances isapplied
here.Letting
g/ ’ ’ y’ )’ e=(e’ e’ e’ .. e’ )’
i 1 1 2 i 1 > and’y = (6’, T, b)’,
1
> > >
the EM
algorithm
is based on acomplete
data set definedby
x =(p ,
u
*
,
e’)’
andits
loglikelihood
L(y; x).
The iterative process takesplace
as in thefollowing.
TheE-step
is defined asusual,
i.e. at iteration[t],
calculate the conditionalexpectation
ofL(y; x)
given
the data y and y =y’
l
as shown in
Foulley
andQuaas
[5],
reduces towhere
E1
t
] (.)
is a condensed notation for a conditionalexpectation
taken withrespect
to the distribution of x inQ given
the data vector y and y =1
’[
t]
.
Given the current estimate
1
’[
t]
of y, theM-step
consists incalculating
the nextvalue
1
’[
tH
]
by
maximizing
)
t]
1’[
I
’
1
Q(
inequation
(4)
withrespect
to the elements of the vector y of unknowns. This can beaccomplished
efficiently
via theNewton-Raphson algorithm.
Thesystem
ofequations
to solveiteratively
can be written inmatrix form as:
where
P(
rxl
) _
(P1!P2,...,Pi,...,P1)i
Vó
[
Ix1
]
=f a!la!n!e!J! vT -
fi9QIa-rl,
v
b =
{8Q/ab!;
W
a
p
=åQ/åaå/3’,
for a andj3
being
components
of y =
(5’,
T
, bl’.
Note that for this
algorithm
to be a trueEM,
one would have to iterate the NRalgorithm
inequation
(5)
within an innercycle
(index
£)
until convergence tothe conditional maximizer
y
[t+1]
_
y
l
’,’
]
at each Mstep.
Inpractice
it may beadvantageous
to reduce the number of inneriterations,
even up toonly
one. This is anapplication
of the so called’gradient
EM’algorithm
the convergenceproperties
of which are almost identical to standard EM
[12].
The
algebra
for the first and second derivatives isgiven
in theAppendix.
These derivatives are functions of the current estimates of theparameters
y =’
Yl
’l,
andof the
components
ofE!t](eiei)
defined at theE-step.
with a cap for their conditional
expectations,
e.g.These last
quantities
arejust
functions of the sumsX’y
i
,
Z’yi,
the sums of squaresy§y
i
withinstrata,
and the GLS-BLUP solutions of the Henderson mixedmodel
equations
and of their accuracy[11],
i.e.where ’ 1
Thus,
deleting
[t]
for the sake ofsimplicity,
one has:r <&dquo;* f i where
j3
and u* are solutions of the mixed modelequations,
and C _[
CO
,
3
CUO
C
O
.
Cuu
Ju
1
is the
partitioned
inverse of the coefficient matrix inequation
(7).
Forgrouped
data(n
i
observations in subclass i with the same incidence matricesX
i
=l
ni
x’
andZi
=1,,iz’),
formulae(8)
reduce to:where
2.3.
Hypothesis testing
Tests of
hypotheses
aboutdispersion
parameters
y =(Õ’,
7
, b)’
can be carriedout via the likelihood ratio statistic
(LRS)
asproposed
by
Foulley
et al.[6, 7].
LetH
o
:
y E.f2
o
be the nullhypothesis,
andH
1
:
y E ,f2 -,f2
o
its alternative where,f2
o
and Q refer to the restricted and unrestrictedparameter
spaces,respectively,
where
y
and
y
are
the REML estimators of y under the restricted(Ho)
and unrestricted(H
o
UH
1
)
models. Under standard conditions forH
o
(excluding
hypotheses allowing
the trueparameter
to be on theboundary
of theparameter
space as addressedby
Robert et al.[22], A
has anasymptotic
chi-square
distribution with r = dim ,f2 - dimS
-2
0
degrees
of freedom. Under model(1),
anexpression
of -2L(y; y)
is:The theoretical and
practical
aspects
of thehypotheses
to be tested about the structuralcomponent
5 have beenalready
discussedby
Foulley
et al.[6, 7],
SanCristobal et al.
[23]
andFoulley
[4].
As far as the functional
relationship
between the residual andu-components
isconcerned,
interest focusesprimarily
on thehypotheses
i)
a constant variance ratio(b
=1),
andii)
a constantu-component
of variance(b
=0) [2,
16, 22,
28].
Note that the structural functional model can be tested
against
the doublestruc-tural model:
fn o, ei
2
= pi b
e
,
andfn o, u 2i
=p§ 5
u
with the same covariates. The reasonfor that is as follows. Let P =
[11P’],
5
e
=,
e
[ry
6*]
andð
u
=(r!!, 6*]
pertaining
to a traditionalparameterization
involving
intercepts
qe and ?7u fordescribing
the residual andu-components
ofvariance,
respectively,
of a referencepopulation.
The structural functional model reduces to the nullhypothesis
6*
=2bbe,
thusresult-ing
in anasymptotic
chi-square
distribution of the LRScontrasting
the two models withRank(P)-2
degrees
of freedom.2.4. Numerical
example
For readers interested in a test
example,
a numerical illustration ispresented
based on a small data set obtained
by
simulation. Forpedagogical
reasons, thisexample
has the same structure as thatpresented
inFoulley
[4],
i.e. it includes twocrossclassified fixed factors
(A
andB)
and one random factor(sire).
The model used to
generate
records is:where a
is ageneral
mean, ai,1
3
j
are the fixed effects of factorsA(i = 1, 2)
andB(j
=1,2,3),
sk
the standardized contribution of male k as a sire and1/2se )
that of male as a maternalgrand
sire.Except
for T= 0.001016 and b =1.75,
the values chosen for theparameters
areTable
II presents
-2Lvalues,
LR statistics and P-valuescontrasting
thefollowing
different models:1)
additive for bothlog
Q
e
andlog or
2
;
2)
additive forlog u2
and
log as
= a + blog a,;
3)
constant variance ratio(b
=1);
4)
constant sire variance(b = 0).
In this
example,
models(3)
and(4)
wererejected
asexpected
whatever thealternatives,
i.e. models(1)
or(2).
Model(2)
wasacceptable
whencompared
to(1)
thus
illustrating
that there is room between thecomplete
structuralapproach
andthe constant variance ratio model.
The
corresponding
estimates ofparameters
are shown in table IIL Estimatesof the functional
relationship
are T = 0.001143 and b =3.0121,
this last valuebeing higher
than the true one, but - notsurprisingly
in this smallsample -
notsignificantly
different(A
= 1.5364 and P-value =0.215).
3. DISCUSSION AND CONCLUSION
This has been an issue for many years in the animal
breeding community.
Forinstance for milk
yield,
theassumption
of a constantheritability
across levels ofenvironmental factors
(e.g.
countries, regions, herds,
years,management
conditions)
has
generated
considerablecontroversy:
see Garrick and Van Vleck[8],
Wiggans
and VanRaden
[29];
Visscher and Hill[26],
Weigel
et al.[28]
and DeStefano[2].
Maximum likelihoodcomputations
arebased, here,
on the EMalgorithm
anddifferent
simplified
versions of it(gradient
EM,
ECM).
This is apowerful
toolfor
addressing
problems
of variancecomponent
estimation,
inparticular
those ofheterogeneous
variances[4,
5,
7, 20,
21].
It is notonly
an easyprocedure
toimplement
but also a flexible one. Forinstance,
ML rather than REML estimators can be obtained after aslight
modification of theE-step
resulting
forgrouped
data in
where
M
uu
is the u x u block of the coefficient matrix of the Henderson mixedmodel
equations.
Moreover the
procedure
can be extended to models withseveral(k
= 1, 2, ... ,
K)
uncorrelated u randomfactors,
e.g.Such an extension will be easy to make via the ECM
(expectation
conditionalmaximization)
algorithm
[15]
in its standard orgradient
versionalong
the samelines as those described in
Foulley
[4].
However caution should be exercised inapplying
thegradient ECM,
for thisalgorithm
nolonger
guarantees
convergence inlikelihood values. Other alternatives
might
be considered as well such as the average information-REMLprocedure
[10, 17].
In
conclusion,
the likelihood frameworkprovides
apowerful
tool both forestimation and
hypothesis
testing
of differentcompeting
modelsregarding
thoseproblems. However,
additional research work is still needed tostudy
someproperties
of these
procedures
especially
from apractical
point
ofview,
forexample
the powerof
testing
suchassumptions
as b = 1.ACKNOWLEDGEMENTS
This work was conducted while Caroline Thaon was on a
’stage
de fin d’6tudes’ at the Station deg6n6tique quantitative
etappliqu6e
(SG(aA),
Inra,
Jouy-en-Josas
as a student from the Ecole
Polytechnique,
Palaiseau. Shegreatly acknowledges
thesupport
of both institutions inmaking
thisstay
feasible. Thanks areexpressed
toElinor
Thompson
(Inra-Jouy
enJosas)
for theEnglish
revision of themanuscript.
REFERENCES
[1]
Dempster
A.P.,
LairdN.M.,
RubinD.B.,
Maximum likelihood fromincomplete
datavia the EM
algorithm,
J. R. Statist. Soc. B 39(1977)
1-38.[2]
DeStefanoA.L., Identifying
andquantifying
sources ofheterogeneous
residual and sire variances indairy production data,
Ph.D.thesis,
CornellUniversity, Ithaca,
NewYork.
[3]
FahrmeirL.,
TutzG.,
Multivariate StatisticalModelling
Based on Generalized LinearModels.
Springer Verlag, Berlin,
1994.[4]
Foulley J.L.,
ECMapproaches
to heteroskedastic mixed models with constantvariance
ratios,
Genet. Sel. Evol. 29(1997)
297-318.[5]
Foulley J.L., Quaas R.L., Heterogeneous
variances in Gaussian linear mixedmodels,
Genet. Sel. Evol. 27
(1995)
211-228.[6]
Foulley J.L.,
GianolaD.,
San CristobalM.,
ImS.,
A method forassessing
extent and sources ofheterogeneity
of residual variances in mixed linearmodels,
J.Dairy
Sci. 73(1990)
1612-1624.[7]
Foulley J.L.,
San CristobalM.,
GianolaD.,
ImS., Marginal
likelihood andBayesian
approaches
to theanalysis
ofheterogeneous
residual variances in mixed linear Gaussianmodels, Comput
Stat. Data Anal. 13(1992)
291-305.[8]
GarrickD.J.,
Van VleckL.D.,
Aspects
of selection forperformance
in several[9]
Gianola D.,Foulley
J.L., FernandoR.L.,
HendersonC.R., Weigel K.A.,
Estimation ofheterogeneous
variancesusing empirical Bayes
methods: theoreticalconsiderations,
J.
Dairy
Sci. 75(1992)
2805-2823.[10]
GilmourA.R., Thompson R.,
CullisB.R., Average
information REML: an efficientalgorithm
for variance parameter estimation in linear mixedmodels,
Biometrics 51(1995)
1440-1450.[11]
HendersonC.R., Applications
of Linear Models in AnimalBreeding, University
ofGuelph, Guelph, Ontario, Canada,
1984.[12]
Lange K.,
Agradient
algorithm locally equivalent
to the EMalgorithm,
J. R. Statist. Soc. B 57(1995)
425-437.[13]
LeonardT.,
ABayesian approach
to the linear model withunequal
variances,Technometrics 17
(1975)
95-102.[14]
McCullagh P.,
NelderJ.,
Generalized LinearModels,
2nded.,
Chapman
andHall,
London, 1989.
[15]
Meng X.L.,
RubinD.B.,
Maximum likelihood estimation via the ECMalgorithm:
Ageneral framework,
Biometrika 80(1993)
267-278.[16]
MeuwissenT.H.E.,
DeJong G., Engel B.,
Joint estimation ofbreeding
values andheterogenous
variances oflarge
datafiles,
J.Dairy
Sci. 79(1996)
310-316.[17]
Meyer K.,
An average information restricted maximum likelihoodalgorithm
forestimating
reduced rankgenetic
covariance matrices or covariance functions for animal models withequal design
matrices, Genet. Sel. Evol. 29(1997)
97-116.[18]
PattersonH.D., Thompson
R.,Recovery
of interblock information when block sizesare
unequal,
Biometrika 58(1971)
545-554.[19]
RobertC.,
Etude dequelques problèmes
lies à la mise en oeuvre du REML engénétique quantitative,
these deDoctorat,
Universite PaulSabatier, Toulouse,
France.[20]
RobertC., Foulley J.L., Ducrocq V.,
Inference onhomogeneity
of intra-class correla-tions among environmentsusing
heteroskedasticmodels,
Genet. Sel. Evol. 27(1995)
51-65.[21]
RobertC., Foulley J.L., Ducrocq V.,
Estimation andtesting
of constantgenetic
and intra-class correlation coefficients amongenvironments,
Genet. Sel. Evol. 27(1995)
125-134.[22]
RobertC., Ducrocq
V.,Foulley J.L., Heterogeneity
of variance for type traits in the Montbéliardecattle,
Genet. Sel Evol 29,(1997)
545-570.[23]
San CristobalM.,
Foulley J.L.,
ManfrediE.,
Inference aboutmultiplicative
het-eroskedastic components of variance in a mixed linear Gaussian model with anap-plication
to beef cattlebreeding,
Genet. Sel. Evol. 25(1993)
3-30.[24]
Thompson R.,
BakerR.J.,
Composite
link functions ingeneralized
linear models.Appl.
Stat. 30(1981)
125-131.[25]
Visscher P.M., On the power of likelihood ratio tests fordetecting heterogeneity
of intra-class correlations and variances in balanced half-sibdesigns,
J.Dairy
Sci. 75(1992)
1320-1330.[26]
VisscherP.M.,
HillW.G., Heterogeneity
of variance anddairy
cattlebreeding,
Anim. Prod. 55(1992)
321-329.[27]
Weigel K.A.,
GianolaD.,
Acomputationally simple Bayesian
method for estimationof
heterogeneous
within-herdphenotypic
variances, J.Dairy
Sci. 76(1993)
1455-1465.[28]
Weigel K.A.,
GianolaD.,
YandelB.S.,
KeownJ.F.,
Identification of factorscausing
heterogeneous
within-herd variance componentsusing
a structural model for vari-ances, J.Dairy
Sci. 76(1993)
1466-1478.[29]
Wiggans G.R.,
VanRadenP.M.,
Method and effect ofadjustment
forheterogeneous
Al. APPENDIX: Derivatives for the EM
algorithm
TheQ
function to be maximized is(in
condensednotation)
with
and,
Al.l. Derivative with
respect
to 5(log
residualparameters)
According
to the chainrule,
one hasNow
That is
and
I
Letting
v5 =8Q /8£n ufl
so that!! =
L
v5ipi =P’
V
,5,
then! °! i = 1 Let us define
and,
the samesymbols
with a hat for their conditionalexpectations,
i.e.an alternative
expression
forcomputing
(A4)
isalready
reported by Foulley
et al.[6]
andFoulley
[4]
for models with ahomogeneous
u-component
ofvariance,
and a constant u to e varianceratio,
respectively.
Al.2. Derivative with
respect
to Twith
and
or, more
explicitly
Al.3. Derivative with
respect
to band
After
developing
andrearranging,
one obtainsLetting
b = 0 and b = 1 in(All)
leads toand
Again
these are the sameexpressions
as thosegiven by Foulley
et al.[6]
andFoulley
[4]
for a constantu-component
of variance and a constant varianceratio,
respectively.
Al.5. b - T derivatives
where
Now
Al.6. 5 - b derivatives where Now and so that A1.7. T - T derivatives
Differentiating
(A7)
onceagain
withrespect
to T leads toAl.8. T - b derivatives
From
(A7),
one hasFinally,
theNewton-Raphson algorithm
toimplement
for theM-step
of the EMalgorithm
can be written in condensed form as:where at iteration