HAL Id: hal-00893972
https://hal.archives-ouvertes.fr/hal-00893972
Submitted on 1 Jan 1992
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Bias and sampling covariances of estimates of variance
components due to maternal effects
K Meyer
To cite this version:
Original
article
Bias
and
sampling
covariances
of estimates of variance
components
due
to
maternal
effects
K
Meyer
Edinburgh University,
Institutefor Cell,
Animal andPopulation Biology,
West MainsRoad, Edinburgh
EH93JT, Scotland, UK;
Unibersity
of
NewEngland,
Animal Genetics andBreeding Unit,
Armidale,
NSW2351,
Australia(Received
13September
1991;accepted
26August 1992)
Summary - The
sampling
behaviour of Restricted Maximum Likelihood estimates of(co)variance
components due to additivegenetic
and environmental maternal effects is examined for balanced data with differentfamily
structures. It is shown thatsampling
correlations between estimates arehigh
and that sizeable data sets arerequired
to allowreasonably
accurate estimates to beobtained,
even fordesigns specifically
formulated for the estimation of maternal effects. Bias andresulting
mean square error whenfitting
the wrong model ofanalysis
areinvestigated, showing
that an environmentaldam-offspring
covariance,
which is oftenignored
in theanalysis
ofgrowth
data for beefcattle,
has tobe
quite
large
before its effect isstatistically
significant.
Theefficacy
ofembryo
transfer inreducing sampling
correlations direct and maternalgenetic
(co)variance
components is illustrated.maternal effect
/
variance component/
sampling covarianceRésumé - Biais et covariances d’échantillonnage des estimées de composantes de variance dues à des effets maternels. Les
propriétés d’échantillonnage
des estimées du maximum de vraisemblance restreint des variances-covariances dues à deseffets
maternelsgénétiques additifs
et de milieu sont examinées sur des données d’undMpo!!<!/ e!MtK&re
et avecdifférentes
structuresfamiliales.
On montre que les corrélationsd’échantillonnage
en-tre les estimées sont élevées etqu’un
volume de donnéesimportant
estrequis
pour obtenirdes estimées raisonnablement
précises,
même avec desdispositifs
établisspécifiquement
pour estimer deseffets
maternels. L’étude du biais et de l’erreurquadratique
moyennemilieu,
souventignorée
dansl’analyse
des données de croissance des bovins àviande,
doit être trèsgrande
pour que soneffet
soitstatistiquement significatif. L’efficacité
dutransfert
d’embryon
pour réduire les corrélationsd’échantillonnage
entre les variances-covariancesgénétiques
directes et maternelles est illustrée.effet maternel
/
composante de variance/
covariance d’échantillonnageINTRODUCTION
The
importance
of maternaleffects,
bothgenetic
andenvironmental,
for theearly
growth
anddevelopment
of mammals haslong
beenrecognised.
Forpost-natal
growth,
theserepresent
mainly
the dam’s milkproduction
andmothering ability,
though
effects of the uterine environment and extra-chromosomal inheritance may contribute. Detailed biometrical models have beensuggested.
Willham(1963)
distinguished
between the animal’s and itsmother’s,
ie direct andmaternal,
additivegenetic,
dominance and environmental effectsaffecting
the individual’sphenotype.
Allowing
for direct-maternal covariances between each of the 3effects,
this gave atotal of 9 causal
(co)variance
components
contributing
to the resemblance betweenrelatives. Willham
(1972)
described an extension to includegrand-maternal
effectsand recombination loss.
Estimation of maternal effects and the
pertaining genetic
parameters
isinher-ently problematic.
Unlessembryo
transfer orcrossfostering
has takenplace,
direct and maternal effects aregenerally
confounded.Moreover,
theexpression
ofmater-nal effects is
sex-limited,
occurs late in life of the female andlags by
onegeneration
(Willham, 1980).
Methods to estimate(co)variances
due to maternal effects havebeen reviewed
by Foulley
and Lefort(1978).
Early
work relied onestimating
co-variances between relatives
separately,
equating
these to theirexpectations
andsolving
theresulting
system
of linearequations.
However,
thisignored
the factthat the same animal
might
have contributed to differenttypes
of covariances and that different observationalcomponents
might
have differentsampling
variances,
ie combined information in a
non-optimal
way. Inaddition,
sampling
variances of estimates could not be derived(Foulley
andLefort,
1978).
Thompson
(1976)
presented
a maximum likelihood(ML)
procedure
which over-comes theseproblems
and showed how it could beapplied
todesigns
found in the literature. He considered the ML method most useful when data were balanced dueto
computational requirements
in the unbalanced case. Over the lastdecade,
MLes-timation,
inparticular
Restricted Maximum Likelihood(REML)
as first describedby
Patterson andThompson
(1971),
has foundincreasing
use in the estimation of(co)variance
components
andgenetic
parameters.
Especially
for animalbreed-ing applications
this almostinvariably
involves unbalanced data.Recently, analyses
under the so-called animal
model,
fitting
a random effect for the additivegenetic
value of each
animal,
have become a standardprocedure.
To alarge
extent,
thiswas facilitated
by
theavailability
of a derivative-free REMLalgorithm
(Graser
etMaternal
effects,
bothgenetic
andenvironmental,
can be accommodated inanimal model
analyses
by
fitting
appropriate
random effects for each animal oreach dam with progeny in the data.
Conceptually,
thissimplifies
the estimation ofgenetic
parameters
for maternal effects. Rather thanhaving
to determine thetypes
of covariances between relatives
arising
from the data and theirexpectations,
to estimate each of them and toequate
them to theirexpectations,
we can estimate maternal(co)variance
components
in the same way as additivegenetic
(co)variances
with the animal
model, namely
as variances due to random effects in the modelof
analysis
(or
covariances betweenthem).
The derivative-free REMLalgorithm
extendsreadily
to thistype
ofanalyses
(Meyer, 1989).
As
emphasised by Foulley
and Lefort(1978),
estimates ofgenetic
parameters
arelikely
to beimprecise.
Thompson
(1976)
suggested
that in the presence of maternaleffects,
sampling
variances of estimates of the directheritability
would be increased 3-5-fold over those which would be obtained ifonly
direct additivegenetic
effectsexisted.
Special
experimental
designs
to estimate(co)variances
due to maternaleffects have been
described,
forinstance,
by
Eisen(1967)
and Bondari et al(1978).
Thompson
(1976)
applied
his MLprocedure
to thesedesigns
and showed that for Bondari et al’s(1978)
data,
estimates of maternalcomponents
had notonly
large
standard errors but also
high sampling
correlations.In the estimation of maternal effects for data from livestock
improvement
schemes,
non-additivegenetic
effects and a direct-maternal environmental covari-ance havelargely
beenignored.
Inpart,
this has been due to the fact that often thetypes
of covariances between relatives available in the data do not havesufficiently
differentexpectations
to allow allcomponents
of Willham’s(1963)
model to beestimated. Even for Bondari et al’s
(1978)
experiment,
providing
11types
ofrela-tionships
betweenanimals, Thompson
(1976)
emphasised
thatonly
7parameters,
6(co)variances
and a linear function of the direct and maternal dominance varianceand the maternal environmental
variance,
could be estimated. In fielddata,
thecontrasts between relatives available are
likely
to befewer,
thuslimiting
the scopeto
separate
the various maternalcomponents.
In the
analysis
ofpre-weaning
growth
traits in beefcattle,
components
estimatedhave
generally
been restricted to the direct additivegenetic
variance(o, A 2 ),
the maternal additivegenetic
variance(0-2 m ),
the direct-maternal additivegenetic
covariance(0-
AM
),
the maternal environmental variance(o-b)
and the residual errorvariance
(a
5
)
or a subsetthereof;
seeMeyer
(1992)
for arecent
summary.Using
data from an
experimental
herd whichsupplied
various &dquo;unusual&dquo;relationships,
Cantet et al
(1988)
attempted
to estimate allcomponents.
There has been concernabout a
negative
direct-maternal environmental covariance(0-
EC
)
in this case(Koch,
1972)
which,
ifignored,
islikely
to bias estimates of the othercomponents
and
corresponding
genetic
parameters,
inparticular
the direct-maternalgenetic
correlation(rA,!r).
Summarising
literature results in- andexcluding
information from thedam-offspring
covariance,
theonly
observationalcomponent
affectedby
LTEC
, Baker
(1980)
reported
mean values of rAM of -0.42 and 0.0 for birthweight,
- 0.45 and -0.05 for
daily gain
from birth toweaning
and -0.72 and -0.07 forweaning weight, respectively.
While the modern methods of
analysis together
with theavailability
ofhigh
parameters
due to maternaleffects,
they might
make it all too easy toignore
the inherent
problems
of this kind ofanalyses
and to ensure that allparameters
fitted can be estimated
accurately.
Unexpected
or inconsistent estimates have been attributed tohigh sampling
correlations betweenparameters
or bias dueto some
component
not taken into account without anyquantification
of theirmagnitude
(eg
Meyer,
1992).
Theobjective
of this paper was to examine REML estimates ofgenetic
parameters
due to maternaleffects, investigating
bothsampling
(co)variances
andpotential
bias due tofitting
the wrong model ofanalysis.
MATERIAL AND METHODS
Theory
Consider a mixed liner
model,
where y,
b,
u and e denote the vector ofobservations,
fixedeffects,
random effects and residual errors,respectively,
and X and Z are the incidence matricespertaining
to b and u. Let V denote the variance matrix of y. The REMLlog
likelihood(.C)
is then
For the
majority
of REMLalgorithms
employed
in theanalysis
of animalbreeding
data,
[2]
and its derivatives have beenre-expressed
in termsarising
in themixed model
equations pertaining
to!1!.
Analternative,
based on theprinciple
ofconstructing
independent
sums of squares(SS)
andcrossproducts
(CP)
of the dataas for
analyses
of(co)variances,
has been describedby Thompson
(1976,
1977).
Asa
simple example,
he considered data with a balanced hierarchical full-sib structureand records available on both parents and
offspring, showing
that the SS withindams,
between dams within sires and betweensires,
as utilised in ananalysis
ofvariance
(for
data onoffspring
only),
could be extended to include information onparents.
This wasaccomplished by augmenting
the later 2by
rows and columnsfor dams and
sires,
yielding
a 2 x 2 and a 3 x 3matrix,
respectively,
with the additional elementsrepresenting
offspring
parent
CP,
andSS/CP
amongparents;
see
Thompson
(1977)
for a detaileddescription.
More
generally,
let the data berepresented by
pindependent
matrices ofSS/CP
S!.,
each with associateddegrees
of freedomd! (k
=1, ... , P).
Thecorresponding
matrices of mean squares and
products
are thenM!;
=S!/d!
withexpected
valuesIn the estimation of
(co)variance
components,
V and the matricesV!
areusually
linear functions of the
parameters
to beestimated,
A =f Oi with
i =1, ... , t,
ieREML estimates of 0 can then be determined as iterative solutions to
(Thompson, 1976)
with B =lb
ij
and
q =fq
i
for
i, j = 1, ... , t,
andThis is an
algorithm utilising
second derivativesof
log
G. At convergence,an estimate of the
large sample
covariance matrix of6
isgiven by
-2B-
1
.
Asemphasised
by Thompson
(1976),
B issingular
if a linear combination of thematrices
F!i
is zero for allk,
whichimplies
that not allparameters
can be estimated.This
methodology
can beemployed readily
to examine theproperties
for REML estimates for various models. Consider dataconsisting
of records forf independent
families. HenceV!;, M!;
and theF!i
can be evaluated for onefamily
at a time. Ifthe data are
&dquo;balanced&dquo;,
ie all families are of size n and have the samestructure,
these
calculations, involving
matrices of size n x n, arerequired only
once, ie p = 1.Fitting
an overall mean as theonly
fixedeffect,
the associateddegrees
of freedomof
S¡
are thenf —
1.’
Let a
record y
j
foranimal j
with
damj’
be determinedby
the animal’s(direct)
additivegenetic
value a!, its dam’s maternalgenetic
effect mj,, its dam’s maternal environmental effect Cj’ and a residual error e!, ie:with all
remaining
covariancesequal
to zero.Letting,
inturn,
maternal effects m!’and Cj’ be
present
or absent and covariances O’Ayl and UEC be zero ornot,
yields
atotal of 9 models of
analysis
as summarised in table I.Clearly,
M
k
in[4]
above represents the contribution of the data tolog G,
ie relates to the &dquo;true&dquo; modeldescribing
the data.Conversely,
V!
is determinedby
the &dquo;assumed&dquo; model of
analysis,
ie the effect offitting
aninappropriate
model canbe examined
deriving
V!.
under the wrong model.Furthermore,
the informationcontributed
by
individual records can be assessedby &dquo;omitting&dquo;
these records fromthe
analysis
whichoperationally
issimply
achievedby setting
thecorresponding
rows and columns in
V!
andM
k
to zero.Analyses
In
total,
6family
structures were considered. Thefirst,
denotedby
FS1,
was asimple
hierarchical full-sibdesign
with records for bothparents
andoffspring
forf
sires mated to d dams each with moffspring
perdam,
ief
families of sizen = 1 +
d(l
+m).
As shown in tableII,
thisyielded only
5types
of covariances betweenrelatives,
ie not all 9 models ofanalysis
could be fitted.Linking
pairs
ofsuch families
by assuming
the sire offamily
1 to be a full sib(FS
2
F)
orpaternal
half sib
(FS
2
H)
to one of the dams mated to sire 2 then added up to 3 furtherrelationships
(see
tableII).
With s = 2 siresper
family,
this gave afamily
size ofn = 2(1 + d(i
+
m)).
The fourth
design
examined wasdesign
I of Bondari et al(1978) .
Asdepicted
infigure
1,
this was createdby
mating
2 unrelatedgrand-dams
to the same2 mated to a
random,
unrelated animal. From each of thesematings,
2offspring
were recorded. For Bondari’sdesign
I(B1),
records ongrand-parents
and random mates were assumedunknown,
yielding
afamily
size of n = 8 and 10types
ofrelationships
betweenanimals. Assuming,
for thisstudy,
the former to be known then increased thefamily
size fordesign
B1P to n = 13 and addedgrand-parent
offspring
covariances to the observationalcomponents
available.The last
design
chosen was Eisen’sdesign
1(E1).
Forthis,
eachfamily
consistedof s sires which were full-sibs and each sire was mated to
d,
dams from an unrelated full-sibfamily
and tod
2
dams from an unrelated half-sibfamily.
Each dam had moffspring
whichyielded
afamily
size of n =s(l
+d
l
+d
2
)(1
+n)).
As shown intable
II,
thisproduced
a total of 13 differenttypes
ofrelationships
between animals.Figure
2 illustrates themating
structure for thisdesign.
For each
design
and set ofgenetic
parameters
considered,
the matrix of meansquares and
products,
M!
was constructedassuming
thepopulation
(co)variance
components
to be known and &dquo;estimates&dquo; under various models ofanalysis
wereResults obtained in this way are
equivalent
to those obtained as means overmany
replicates. Large sample
values ofsampling
errors andsampling
correlationsbetween
parameter
estimates were then obtained from the inverse of the informationmatrix,
F =—2B!.
This iscommonly
referred to as the formation matrix(Edwards, 1966).
Simulation was carried out
by
sampling
matricesM*
from
anappropriate
Wishart distribution with covariance matrix
M!;
andf —
1degrees
of freedom andobtaining
estimates of(co)variance
components
and theirsampling
variances andcorrelations
using
the MSCalgorithm.
However,
this did notguarantee
estimates tobe within parameter space.
Hence,
if estimates out of boundsoccurred,
estimationwas
repeated using
a derivative-free(DF)
algorithm, calculating log
G asgiven
in
[4]
andlocating
its maximumusing
theSimplex procedure
due to Nelder andMead
(1965).
This allowed estimates to be restrained to theparameter
spacesimply
by
assigning
a verylarge, negative
value tolog
G fornon-permissible
vectors ofparameters
(Meyer, 1989).
Large sample
95%
confidence intervals were calculated as estimate !1.96x thelower bound
sampling
error obtained from the information matrix.Corresponding
likelihood based confidence limits
(Cox
andHinkley,
1974)
weredetermined,
asdescribed
by Meyer
and Hill(1992),
as thepoints
on theprofile
likelihood curvefor each
parameter
for which thelog profile
likelihood differed from the maximumby —1.92,
ie thepoints
for which the likelihood ratio test criterion would beequal
to the
X
Z
valuepertaining
to onedegree
of freedom and an errorprobability
of5%
RESULTS AND DISCUSSION
Sampling
covariancesSampling
errors(SE)
of(co)variance
component
estimates based on 2 000 recordsfrom
analyses
under Model6,
ie when bothgenetic
and environmental maternal effects arepresent
and there is a direct-maternalgenetic covariance,
are summarisedin table III for data sets of 3
designs,
and 2 sets ofpopulation
(co)variances.
Forcomparison,
values which would be obtained forequal heritability
andphenotypic
variance in the absence of maternal effects(Model 1)
aregiven.
The most
striking
feature of table III is themagnitude
ofsampling
errors even for aquite
large
data set and fordesigns
like B1 and El which have beenespecially
formulated for the estimation of maternal effectscomponents.
In allcases,
SE((j!)
under Model 6 is about twice that under Model 1.FS2F
and Elyield considerably
more accurate estimates than B1 underModel 1,
withvirtually
ie a
high
directheritability
and lownegative
direct-maternalcorrelation,
and arecomparatively
even less variable forparameter
setII,
ie a low direct and mediummaternal
heritability
and a moderate tohigh
positive genetic
correlation.Table IV
gives
means andempirical
deviations of estimates of(co)variance
components
and theirsampling
errors under Model 6 for 1000replicates
for a dataset of size 2 000 for
parameter
set 1. While MSC estimates agreeclosely
with thepopulation values,
corresponding
mean DF estimates are,by
definition,
biased due to the restriction on theparameter
spaceimposed.
This isparticularly
noticeablefor
designs
FS2F and B1 with 355 and 258replicates
for which estimates neededto be constrained.
Overall, however, corresponding
estimates of theasymptotic
lower bound errors appear little
affected:
means over allreplicate
andconsidering
replicates
within theparameter
spaceonly
(MSC
*
)
showonly
smalldifferences,
except for
FS2F,
and agree with thepopulation
valuesgiven
in table III.Moreover,
standard deviations overreplicates
for these(not
shown)
are small andvirtually
the same for MSC and
MSC!‘, ranging
from 0.22(SE(â
1
)
forB1)
to 1.19(SE(âÄ
1
)
for
FS2F).
Inturn,
empirical
standard deviations of MSC estimates agree well with theirexpected values,
being
on averageslightly higher.
Those of the DFestimates,
however,
are inparts
substantially
lower,
demonstrating
clearly
thatconstraining
estimates alters their
distribution,
ie thatlarge
sample
theory
does not hold at the bounds of theparameter
space.Table V
presents
bothlarge sample
(LS)
andprofile
likelihood(PL)
derivedfor the
population
(co)variances.
As noted for otherexamples by Meyer
and Hill(1992),
unless bounds of theparameter
space areexceeded, predicted
lengths
of theinterval from the 2 methods agree
consistently
better than values for theposition
of the confidence bounds. Lower PL limits for
a2
m
anda
2
c
fordesigns
FS2F andB1 could not be determined
(as
thelog profile
likelihood curve to the left of the estimates was so flat that it did not deviate from the maximumby
-1.92),
andwere thus set to zero, the bound of the parameter space. While differences between
PL and
LS
intervals are small for alldesigns
forlarger
data sets(not shown),
considerable deviations occur for the 2 000 record case,
particularly
forQ
AM
andCorresponding
empirical
andexpected
sampling
correlations between DFesti-mates of
(co)variance
components
are contrasted in table VI. Meanexpected
valuesover
replicates
were in all casesequal
(to
the second or third decimalplace)
to thosederived from the information matrix of the
population
parameters.
Whileempiri-cal values for
larger
data sets(not shown)
again
agree well with their theoreticalcounterparts,
those based on 2 000 recordsdeviate,
again
reflecting
the effect ofconstraining
estimates to theparameter
space on theirsampling
distribution.De-viations are in
places
considerable forparameter
setII,
for which estimates from773,
726 and 553replicates
forFS2F,
B1 andE1, respectively,
needed to becons-trained to the parameter space.
Overall,
however,
some of thesampling
correlations(expected values)
showremarkably
little variation betweendesigns
or differences betweenparameter
setsconsidered.
6;1
andû1
areconsistently highly negatively
correlated,
with values of! -0.8 to
-0.9; while
5!
and the maternal effectscomponents,
6’
andiT- 2 ,
showcomparatively
little(though
morevariable)
association,
correlationsranging
from0 to m 0.4 and 0 to m
-0.3,
respectively. Similarly,
correlations betweenû1
andû
1
are low andnegative
and between<7!.
and3%
are low andpositive
ornegative
and close to zero.
Differences between
designs
and the amount of information available toseparate
not
only
direct and maternal but also maternalgenetic
and environmentalcom-ponents
asapparent
in tableIII, however,
areclearly
exhibited in the correlationsamong
û1, a’ m
and9!..
While the correlation betweenû
1
and6
AM
is ashigh
as -0.9 forFS2F,
it is reducedin
magnitude
to -0.8 for B1 and -0.7 to -0.6 for El.Correspondingly,
ahigh
positive
correlation between%
AM
andQC
for FS2F(!
0.9)
and Bl(! 0.7)
is reducedsubstantially
for El(to m
0.4).
FAMILY STRUCTURE
The
relationship
betweenfamily
structure andsampling
(co)variances
for agiven
number of records is further
investigated
in tableVII,
foranalyses
under Models1 and 4. The total
genetic
variance is defined asa5
=92
A
+1/2
QM
+3/2
QAM
(Willham,
1963),
ie is the same aso- A 2
for Model 1. As in tableIII,
differencesbetween
designs
are small forModel 1,
but increase with the number ofparameters
estimated. In
particular,
including
the direct-maternalgenetic
covariance has apronounced
effect.For FS1 with
only
oneoffspring
per dam(column 6),
there are no full-sibs in thedata.
Though
theremaining
4 observational covariances between relatives still allowall 4
components
under Model 4 to beestimated,
this causes an almostcomplete
sampling
correlation betweenû
1
and both6’ E
andû!M’
andcorrespondingly
high
sampling
variances.Conversely,
with the same number ofoffspring
butonly
one damper sire
(column 3)
thereare
nopaternal
half-sibs.However,
as theexpectation
ofthe
pertaining
covariance involvesonly
a A 2
estimates of the maternalcomponents
are thus much less
affected, though
SE(al A
and thesampling
correlation betweena2 A
andû
1
arelargest
amongst
those for the FS1designs.
As noted above for
Model 1,
Bondari’sdesign
1gives
less accurate estimatesthan most full-sib
family
structures(unless
d = 1 or m =1)
for thesimpler
modelsinvolve data on
only
2generations,
B1P includes records ongrand-parents,
ie 3generations
in total.Though
the coefficients ofa A 2
and aAM in theexpectation
of thegrand-parent offspring
covariances arecomparatively
small(see
tableII),
thisclearly
reduces thesampling
errors of allcomponents
estimated and themagnitude
of
sampling
correlations between62
A
and bothlTiI
and%5
.
As forFS1, sampling
errors for E1 are
markedly
increased when one or several of the covariances betweenrelatives are
missing
(s
= 1 ord
i
= 1 ord
2
=
1),
the more the moreparameters
are estimated.
Sampling
correlations follow a similarpattern
as for FS1. Based on8 000
records, design
12provides
the most accurate estimates among the 12 data structures examined. Some discussion on theoptimal
choice ofs, d
l
andd
2
forEisen’s
(1967)
designs
isgiven
by
Thompson
(1976).
BIAS AND MEAN
SQUARE
ERRORSo
far, only analyses
under the &dquo;true&dquo; modeldescribing
the data have beenconsidered. In some
instances,
analyses
are carried out,however,
fitting
the wrong model. Aparticular
example
as discussed above is theanalysis
ofgrowth
traits inbeef cattle where an environmental correlation between a dam and her
daughter,
though
assumed toexist,
isgenerally
ignored. Figure
3shows
the effect of such environmental covariance(b
EC
=lTEG/lT!)
on the estimates of(co)variance
components
and the direct-maternalgenetic
correlation(r
AM
)
under Model 6 when the true modeldescribing
the data is Model9,
forparameter
set I and 3designs.
While
(7!
and%5
aregenerally
littleaffected,
even forlarge
(absolute)
values ofb
EC
,
all the maternalcomponents
aresubstantially
biased. Thepattern
of biasesdiffers between
designs,
reflecting
clearly
the differences in covariances betweenrelatives available and the information contributed
by
each of them. Fordesign
FS1
(not shown),
estimates ofo, A 2 ,Or
Am
anda5
were unbiased whileô
’
1-
andô’!
were biasedby -2a
E
c
and+2o-
EC
,
respectively
(unless
estimates exceeded thebounds of the
parameter
space and wereconstrained).
Figure
4 shows thecorresponding
differences inlog £
fromanalyses
under models 6 and 9. For theparameter
setexamined,
themagnitude
ofb
E
c
needs to exceed 0.3 fordesign
El before a likelihood ratio test would beexpected
toidentify
asignificantly
better fit of Model 9 than of Model 6(at
an errorprobability
of5%;
the dashed line infigure
2 marks thesignificance level).
While estimates from EFS2Hand EFS2F
(not
shown)
differlittle,
thehigher
coefficients in the observationalcovariances due to the across
family
relationships
for FS2Fclearly
increase thescope to
identifiy
a non-zero QEC.The effect of an over- or
underparameterized
model ofanalysis
on estimates of(co)variances,
their lower boundsampling
errors and theresulting
mean square error(MSE),
defined as biassquared plus prediction
errorvariance,
is furtherillustrated in table VIII.
Clearly,
estimating
a(co)variance
when it is notpresent
increases the
sampling
errors of allcomponents
unnecessarily. Similarly,
when thebias introduced
by ignoring
acomponent
issmall,
MSEs under the wrong modelmay be
considerably
smaller than under the correct model. As the deviations inlog
G from the value under Model 9
show,
none of theanalyses
would beexpected
toEMBRYO TRANSFER
With a dam
affecting
thephenotype
of heroffspring
boththrough
half herdirect additive
genetic
value and her maternalgenotype
as well as her maternalenvironmental
effect, high sampling
correlations among thegenetic
and maternal(co)variance
components
areinvariable,
even with the bestexperimental design.
Fortunately,
modernreproductive
technology
allows some of these correlations tobe reduced. As a
simple illustration,
consider the hierarchical full-sibdesign
(FS1)
with one sire per
family.
Assume now that the sire has been mated toonly
oneout of the d dams with md full-sib
offspring resulting
from thismating. Further,
assume that each dam raises m of these
offspring
(design FS1ET).
This
gives
riseto 3 different
dam-offspring
covariances,
namely:
- the &dquo;usual&dquo; covariance between
a dam and her
offspring
raisedby her,
withexpectation
afl /2
+5
UAM/
4
+or2 /2
+ aEc I- the covariance between
a dam and her
offspring
raisedby another, recipient
dam,
withexpectation
a fl /2 + a
AA
f /4,
ie the same as thesire-offspring
covariance;
and
- the covariance between
a
recipient
dam and theoffspring
(of
anotherdam)
which she
raised,
withexpectation
aAM +(jÄ
I
/2
+ o-EC.Similarly,
we now need todistinguish
between 4types
of covariances between full-sibs:- the &dquo;usual&dquo; covariance between full-sibs raised
by
theirgenetic
dam,
withexpectation
(j!/2
+ (jAM+
U
2 m +
0
,2
c
- the covariance between full-sibs raised
by
the samerecipient
dam(not
theirgenetic
dam),
withexpectation
(j!/2
+U2
m
+o, C
2
’ - the covariance between full-sibs raised
by
differentdams,
with one of thembeing
theirgenetic
dam,
withexpectation
QA/
2
+O
-
A
M /2;
and- the covariance between full-sibs raised
by
differentrecipient dams,
none ofwhich is their
genetic
dam,
withexpectation
!A/2.
Table IX compares the
expected sampling
errors for FS1 and FS1ET for 3family
structures and Table X contains the
corresponding sampling
correlations. Results fromanalyses
under Models3, 4,
5(not shown)
and 6 were contrasted. For Model3,
with low correlations between
&2
m
and the othercomponents,
FS1ETyields slightly
less accurate estimates than FS1.
However,
as soon as a direct-maternalgenetic
covariance is
fitted,
FS1ETgives considerably
smallersampling
errors than FS1 asit reduces the
high sampling
correlations between&A 2
andQM
(Model
4,
5 and6),
(j2
and62
(Model
4 and5),
a2
and%5
(Model 5),
62
A
and%
AM
(Model
4 and6),
Q
N1
andâ
AM
(Model
4 andslightly
for Model6),
or&2 A
andûb
(Model
5 and6).
Clearly,
however,
FS1ET does not allowgenetic
and environmental maternal effectsto be
separated
any better thanFS1,
andsampling
correlations between& m 2
anda
2
c
(Model
5 and6)
are stilllarge
andnegative.
Other
designs involving genetically
more diverse &dquo;litter mates&dquo; and relatedparents
orrecipients
willprovide
more types of covariances between relativesand thus allow even better
separation
ofgenetic
andenvironmental,
and directand maternal effects. While the
expectation
of all observationalcomponents
intable II which involve
o, m 2
also includeU2 A
and 0-!!, the covariance between 2them)
which are full-sibs or maternalhalf-sibs,
issolely
due to maternalgenetic
effects
(expectation
(TiE /4).
CONCLUSIONS
It has been shown that estimates of
(co)variance
components
aresubject
tolarge
sampling
variances andhigh sampling
correlations,
even for a &dquo;reduced&dquo;model
ignoring
dominance effects andfamily
structuresproviding
numeroustypes
of covariances between relatives which have been
specifically designed
for the estimation of maternal effects. For small data sets and models ofanalysis
fitting
bothgenetic
and maternal environmental effects or a direct-maternalcovariance,
this
frequently
induces the need to constrain estimates to the parameter space.Consequently, large
sample
theory
predictions
ofsampling
errors and correlationsestimates do not agree with
empirical
results. Further research isrequired
to evaluate theimplications
of suchlarge sampling
(co)variances
on the accuracy of selection indexesincluding
both direct and maternaleffects,
ie theexpected
loss in selection response becauseinaccurately
estimatedparameters
have been usedThe
efficiency
of searchprocedures
used in derivative-free REMLalgorithms
ishighly dependent
on the correlation structure of theparameters
to beestimated,
being
most effective if these are uncorrelated. The fact thatexpected
sampling
correlations between some
components
for agiven
model ofanalysis
varied littlebetween
designs
(see
tableVI)
suggested
that areparameterisation
to linearfunctions of the
(co)variance
components
might
improve
the convergence rateof such
algorithms.
Inspection
ofeigenvalues
andeigenvectors
of the formationmatrices, however,
failed toidentify
anygeneral guidelines.
Examination of
bias,
sampling
variances andresulting
mean square errors whenfitting
the wrong model ofanalysis
showedthat,
in someinstances,
ignoring
somecomponent(s)
can lead toconsiderably
smaller MSE withoutbiasing
the(co)variances
estimatedsubstantially
orreducing
the likelihoodsignificantly
overthat under the true model. In
particular, investigating
the effect ofignoring
anappropriate
for agrowth
trait in beefcattle, suggested
that for a data set of size8 000,
the covariance should amount to at least30%
of thepermanent
environmentalvariance due to the dam before a likelihood ratio test would be
expected
todistinguish
it from zero(at
5%
errorprobability).
Results
presented
here reinforce earlierwarnings
about theinaccuracy
ofes-timates of maternal effects and the
pertaining
variancecomponents
(Thompson,
1976;
Foulley
andLefort,
1978).
Clearly,
use of an estimationprocedure
with &dquo;built-in&dquo;optimality
characteristics like REML will not alleviate the need forlarge
datasets
supplying
numeroustypes
of covariances between relatives whenattempting
toestimate these
components.
Use of modernreproductive
techniques
such asembryo
transfer may
provide
data where direct and maternal effects are less confounded.includ-ing
severalgenerations
wouldprovide
further contrast whichmight help
to reducethe
biologically
inducedhigh sampling
correlations.Implications
for the scope offitting
more detailedmodels, accounting,
forinstance,
for dominanceeffects,
recom-bination loss or variance due to new
mutation,
and ofestimating
theappropriate
(co)variance
components
are somewhatdiscouraging.
ACKNOWLEDGMENTS
Financial support for this
study
wasprovided
under the MRC(Australia)
grant UNE15 andby
theAgricultural
and Food Research Council(UK).
I amgrateful
to WG Hill forREFERENCES
Baker RL
(1980)
The role of maternal effects on theefficiency
of selection in beefcattle: a review. Proc NZ Soc Anim Prod
40,
285-303Bondari
K,
WillhamRL,
Freeman AE(1978)
Estimates of direct and maternalgenetic
correlations for pupaweight
andfamily
size ofTriboleum.
J Anim Sci47,
358-365
Cantet
RJC,
KressDD,
AndersonDC,
DoornbosDE,
Burfening
PJ,
Blackwell RL(1988)
Direct and maternalphenotypic
effects onpreweaning
growth
of beef cattle.J Anim Sci
66,
648-660Cox
DR, Hinkley
DV(1974)
Theoretical Statistics.Chapman
andHall,
London Edwards AWF(1966)
Likelihood.Cambridge
University
PressEisen EJ
(1967)
Mating
designs
forestimating
direct and maternalgenetic
variances and direct-maternal covariances. Can J GenetCytol 9,
13-22Foulley JL,
Lefort G(1978)
M6thodes d’estimation des effects directs et maternelsen selection animale. Ann Genet Sed Anim
10,
475-496Graser
HU,
SmithSP,
Tier B(1987)
A derivative-freeapproach
forestimating
variance
components
in animal modelsby
Restricted Maximum Likelihood. J Anim Sci64,
1362-1370Koch RM
(1972)
The role of maternal effects in animalbreeding.
VI. Maternal effects in beef cattle. J Anim Sci35,
1316-1323Meyer
K(1989)
Restricted Maximum Likelihood to estimate variancecomponents
for animal models with several random effects
using
a derivative-freealgorithm.
Genet Sel
Evod 21,
317-340Meyer
K(1992)
Variancecomponents
due to direct and maternal effects forgrowth
traits of Australian beef cattle. Livest Prod Sci
31,
179-204Meyer K,
Hill WG(1992)
Approximation
ofsampling
variances and confidenceintervals for maximum likelihood estimates of variance
components.
J Anim BreedGenet
109,
264-280Nelder
JA,
Mead R(1965)
Asimplex
method for function minimization.Computer
J7,
147-151Patterson
HD,
Thompson
R(1971)
Recovery
of inter-block information when blocksizes are
unequal.
Biometrika58,
545-554Thompson
R(1976)
The estimation of maternalgenetic
variances. Biometrics32,
903-917
Thompson
R(1977)
Estimation ofquantitative genetic
parameters.
In: Proc IntConf Quantitative
Genetics,
(Pollak
E, Kempthorne 0, Bailey TB,
eds)
Iowa StateUniv
Press,
Ames,
639-657Willham RL
(1963)
The covariance between relatives for characterscomposed
ofcomponents
contributedby
related individuals. Biometrics19,
18-27Willham RL
(1972)
The role of maternal effects in animalbreeding:
III. Biometricalaspects
of maternal effects in animals. J Anim Sci35,
1288-1293Willham RL