HAL Id: hal-00894001
https://hal.archives-ouvertes.fr/hal-00894001
Submitted on 1 Jan 1993
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Use of covariances between predicted breeding values to
assess the genetic correlation between expressions of a
trait in 2 environments
Dr Notter, C Diaz
To cite this version:
Original
article
Use
of
covariances between
predicted
breeding
values
to
assessthe
genetic
correlation between
expressions
of
atrait in
2
environments
DR
Notter,
C Diaz
Virginia Polytechnic Institute and State University, Department of Animal Science, Blacksburg, VA 24061, USA
(Received
23 June 1992; accepted 23 March1993)
Summary - Procedures to interpret correlation and regression coefficients involving
pre-dicted breeding values
(BV)
calculated for the same animals in different environmentshave been developed. Observed correlations are a function of the additive genetic
cor-relation between performances in the 2 environments but are also affected by selection
of animals that produce data in both environments, the accuracy of BV predictions in
each environment, relationships among animals within and across environments and
co-variances among BV predictions within an environment arising from estimation of fixed effects in best linear unbiased prediction
(BLUP)
of animal BV. Methods to account for effects of selection and variable accuracy and experimental designs to minimize effects ofrelationships and covariances among BV predictions from estimation of fixed effects have been described. The regression of predicted BV in environment 2 on predicted BV in
envi-ronment 1 is generally not affected by selection in environment
1,
but both correlation andregression coefficients are sensitive to covariances among
breeding
value predictions within environments. In general, caution must be exercised in interpreting observed associationsbetween predicted breeding values in different environments.
predicted breeding value / genetic correlation / regression / selection index / best
linear unbiased prediction
Résumé - Utilisation des covariances entre les valeurs génétiques prédites pour estimer la corrélation génétique entre les expressions d’un caractère dans 2 milieux. Des procédures sont établies pour interpréter les coefficients de corrélation et de répression
impliquant des valeurs génétiques prédites
(VG)
calculées pour les mêmes animaux dansdifférents milieux. Les corrélations observées sont fonction de la corrélation génétique
entre les performances dans les 2 milieux, mais elles dépendent aussi de la sélection des
animaux sur lesquels des données sont recueillies dans les 2 milieux, de la précision des
prédictions de VG dans chaque milieu, des parentés entre animaux intra-milieu et entre
milieux et des covariances entre les prédictions de VG dans un milieu qui résultent de
VG. L’article présente des méthodes pour prendre en compte les effets de la sélection et
de la précision variable des prédictions et des plans d’expérience pour minimiser les effets
de la parenté et des covariances entre les prédictions de VG à partir de l’estimation des
effets fixés. La régression des VG prédites dans le milieu 2 en fonction des VG prédites dans le milieu 1 n’est généralement pas affectée par la sélection dans le milieu 1, mais
la corrélation et la régression sont toutes deux influencées par les covariances entre les
prédictions de VG intrn-milieu. D’une manière générale, une grande prudence est requise
dans l’interprétation d’associations entre des valeurs génétiques prédites dans différents milieux.
prédiction de valeur génétique / corrélation génétique / régression / indice de sélection / meilleure prédiction linéaire sans biais
INTRODUCTION
Procedures to estimate additive
genetic
correlation(r
G
)
betweenexpressions
of thesame trait in different environments were introduced
by
Falconer(1952),
Robertson(1959),
Dickerson(1962)
and Yamada(1962).
Theprocedures
areanalogoes
to those for estimation ofgenetic
correlation between 2 traits in the same environment, butrecognize
thatperformance
isnormally
not measured on the same animal inmultiple
environments.Instead,
related animals(often half-sibs)
areproduced
ineach environment and rG is derived
by
comparing
the resemblance among relatives in different environments to that observed among relatives in the same environment. Insingle-generation
experiments
utilizing
half-sibs,
sires canproduce
progeny inpairs
of environments. If sires are evaluated in environment 1 beforebeing
usedin environment 2,
divergent
selection of sires can increaseprecision
of estimates of thegenetic regression
of one trait on the other when a fixed number of progenyis measured
(Hill,
1970;
Hill andThompson,
1977).
This strategy makes use ofthe fact that ’selection of sires biases correlation between parent
predicted breeding
value(BV)
andoffspring performance
but does not affect theregression
of progenyperformance
on parentpredicted
BV solong
as there is no selection of progeny records.Data from
industry
performance-recording
programs often include records ofrelatives evaluated in different environments, but the data structure is not under
experimental
control. Animals differ in the amount of informationavailable,
and unknownnon-genetic
sources of resemblance among relatives can exist.Likewise,
little information may exist on
procedures
used to select parents in each environ-ment. Procedures to estimate additivegenetic
covariances from theseindustry
datasets exist
(Meyer, 1991)
and have been used to estimate covariances betweenex-pressions
of the same trait in different environments(eg,
Dijkstra
etal,
1990).
Theseanalyses require
that the model includes allgenetic
andnongenetic
sources ofre-semblance among relatives and that information used to select parents be included
in the data.
Large
numbers of records often exist, butonly
a fraction of them meyIf
predicted
BV for the same animals in different environments were derivedusing
only
data from within each environment, correlations amongpredicted
BVacross environments should
provide
information about rc. Observed correlations in such situations(Oldenbroek
andMeijering,
1986;
DeNise andRay,
1987;
Tilsch etal,
1989a,b;
Mahrt etal,
1990)
wereusually
< 1,but,
as notedby
Calo et al(1973)
and Blanchard et al(1983),
theexpected
value of the correlations is also < 1, evenif the
underlying
genetic
correlation isunity.
Thus correlations betweenpredicted
BV in different environments must beinterpreted
relative to theirexpected
value. This paper will consider theexpected
values of observed correlations and consider alternativeexperimental designs. Expected
values of correlation andregression
coefficients under ideal conditions will first be reviewed. Effects of non-random
selection,
variation in the accuracy of BVpredictions, relationships
among animals within and acrossenvironments,
and covariances among BVpredictions arising
from estimation of fixed effects under best linear unbiased
prediction
(BLUP)
will then each be considered.RELATIONSHIP BETWEEN PREDICTED BV IN 2 ENVIRONMENTS
Let a
population
in environment 1 have additivegenetic
varianceQ
u
l
for sometrait. Predict BV
(Mi)
in that environment, and choose m sires toproduce
progenyin environment 2. Predict BV in environment 2
(Û
2
)
using
only
data from that environment. Let the additivegenetic
variance for the trait in environment 2 beol u
2
2and
thegenetic
correlation between BV in environment 1(u
l
)
and 2(u
2
)
be rG. Let the accuracy of BVprediction,
aland a2 for environments 1 and 2,respectively,
be the correlation between actual andpredicted
BV and be constant within each environment.Under certain
conditions,
theexpected
correlation betweenpredicted
BV in the 2 environments(Tulu2!
is a,rGa2 and rG can be estimated as rG =r— !
/a
l
a
2
-ul!!2)
Ul 2The conditions include
(Taylor, 1983): 1)
no environmental correlation betweenperformance
in the different environments;2)
norelationships
among parents ofmeasured
animals;
and3)
no other covariances amongpredicted
BV within eitherenvironment. An additional assumption
(4)
is that sires are chosen at random. For sire evaluation with theseassumptions,
a ij 2
=n
ij/
(n
ij
+A)
where nij is the number of progeny for sire i in
environment j
and A is the ratio of residual to sire variance.Assumption
(1)
isnormally
met if different animals are measuredin different environments.
Assumptions
(2)
and(4)
can be metthrough
choice of sires.Assumption
(3)
will notnormally
hold forBLUP,
but mayapproximately
hold under some conditions.
The
regression
(b)
ofu
2
onill
has the expectation:such that TO =
b!2u1
(Q!1 /a2!!z ).
Taylor’s
(1983)
assumptions
arerequired
forthis
expectation,
but random selection of sires is not.Knowledge
ofa!l
andQ
u
2
isu 2
. Effects of
using
incorrect values ofo,2
on estimates of rG will not be consideredfurther,
but may beimportant.
Expected
confidence limits for observed correlation(F)
andregression
coefficients(b)
can be used to evaluateexperimental
designs
in terms of theirability
to detectsignificant
departures
ofr and
from theirexpectations.
For correlationanalysis,
Fischer’s z =0.5[ln
(1 +r) -In
(1 - r)] (Snedecor
andCochran,
1967)
has variance of!
(m-3)-
1
where m is the number of sires in thesample.
For ) r)
6
0.65, confidence bounds on F are of similar width at fixed m, whereas forIrl
>0.65,
confidence bounds narrow withincreasing
r.Large
numbers of sires are thusrequired
ifaccuracies are low to avoid confidence limits that
overlap
0. Forregression
analysis,
varianceof
b [V(b)]
is:(Snedecor
andCochran,
1967)
whereQ!2I!1
is the variance inÛ
2
at a fixed value of!
U2 UtÛ
l
(ie,
the mean square for deviations fromregression)
andSS(ul)
is the sum ofsquares for
u
l
.
GivenT_aylor’s
(1983)
assumptions
for m siressampled
at random from environment 1,V(b)
is:Numbers of sires and progeny
required
to detectsignificant
departures
of
6 fromits
expected
value aregiven
infigure
1 for several values of rc, , , Oau&dquo;U2 and al = a2or al = 0.95. When al = a2
(eg,
when sires arebeing
provensimultaneously
in 2environments),
numbers of progenyrequired
in each environment toreject
the nullhypothesis
that rG = 1 are minimized at aj = 0.7 to 0.8 for rG between 0.5 and0.8. For al = 0.95
(eg,
whenproven sires are chosen from environment
1),
progenynumbers in environment 2 are minimized with one progeny per sire, but increase
little until a2exceeds 0.5 to 0.6. Thus
relatively
efficientdesigns
at al = 0.95 wouldinclude 35 to 45 sires with 400-500 progeny at rG = 0.6, but 250-400 sires with
1200-1300 progeny at rG = 0.8.
Critical numbers
required
for correlationanalysis
were similar to those forregression
analysis
at low accuracies and rG, but lower athigher
accuracies due toasymptotic
declines in the width of confidence intervals asexpected
r increased.The ratio of the critical number of sires for correlation
analysis
to critical number forregression
analysis
(SRAT)
waspredictable
(R
2
=0.983)
as a functionof q =
ala2 and resuch that SRAT = 1.115-0.101q-0.667
q
2
-0.161
1 rG . This ratioadjustment
can be
applied directly
to values infigure
1 toapproximate
critical sire and progeny numbers for correlationanalysis.
The above derivations assume that accuracies are calculated
correctly
in bothenvironments. Under
BLUP,
accuracies ofu
i
for non-inbred animals aregiven
by
(1 —
Ci2/Q!)’S
whereC
ii
is the ithdiagonal
element ofC
22
,
theprediction
errorcovariance matrix of u
(Henderson, 1973).
If the model iscomplete
andproperly
parameterized,
accuracies areexpected
toequal
correlations between actual andpredicted
BV. In mostapplications,
u is derivedby
iterative solution of Henderson’selements of
C
22
areapproximated
butoff-diagonal
elements ofC
22
areusually
not estimated. To
date,
nocompletely satisfactory procedures
to obtaindiagonal
elements ofC
22
exist. Alternative methods have beenpresented by
Van Raden and Freeman(1985),
Greenhalgh
et al(1986),
Robinson and Jones(1987),
Meyer
(1989)
and Van Raden andWiggans
(1991).
Evaluation ofprocedures
to estimate accuracyis
beyond
the scope of thisstudy,
but theassumption
that accuracies are estimatedcorrectly
is critical to the discussion.Effects of
departures
from the ideal conditions described above will now be discussed.Effects of non-random selection from environment 1
Let sires be
non-randomly
selected based onÛ
l
and accuracies be constant within each environment. Let unselectedpopulation
variances, covariances, correlations andregressions
besymbolized by
Q
2,
0&dquo;, r andb,
respectively,
and letV, Cov,
Corr andRegr respectively
represent observed values for somesample
from thepopulation.
For truncation selection onu
l
,
where w = 1 -
V(Û¡)/
0
&dquo;1
(Robertson, 1966).
For directional truncationselection,
mw =
i(i — x)
and fordivergent
truncation selection w = -ix where i is the standardized selection differential(Becker, 1984)
and x is the truncationpoint
on a standard normal curve(Snedecor
andCochran,
1967)
corresponding
to random selection of sires from the upper or lowerfraction,
p, of theu
l
distribution for directional selection or from the upper and lower fractionp/2
fordivergent
selection.Also:
(Hill,
1970;
Johnson andKotz,
1970;Robertson,
1977).
The observed correlation is thus biasedby
selection but the observedregression
is not, and the deviation ofRegr
(u
2
u
1
)
from itsexpected
valueprovides
a test of thehypothesis
that rG = 1.0.If selection is non-random but not
clearly
directional ordivergent
or not based ontruncation, additional
complications
arise. To account for suchselection,
letV(Ei )
be calculated for the selectedsample
and define wempirically
as the observed value1-
V(iil)lo,!! .
Use of thisempirical
value of w topredict
rGusing equation
[3]
wasU¡ i
evaluated
by
computer simulation. Predicted BV for the ith sire in environment 1where
61i
is a random normal deviate(SAS, 1985),
0
&dquo;;B
= 315 and al = 0.7.Predicted BV in environment 2 were then simulated for a2 = 0.7 and
Q!! oru2
as:Three selection scenarios
(SS)
were considered:SS1. 80%
divergent,
20% random: 80% of the bulls chosen suchthat lxl
>1.282(i
=1.755)
and 20% chosen atrandom;
SS2. 50%
high,
50% random: 50% of the bulls had x > 0.842(i
=1.400);
SS3. 50%
high,
50%stabilizing: lxl
< 0.5 for 50% of the bulls and x > 1.282 for50% of the bulls.
Each scenario was
repeated
for rG = 1.0 or 0.5 andreplicated
10 times. Eachreplicate
contained 5 000 selected animals.Agreement
betweenpredicted
and simulated values ofV(ii
2
)
andCorr(ic
l
u
2
)
(table I)
was within theoretical 95% confidence limits of theexpected
value(Snedecor
andCochran,
1967).
Thusequation
[3]
predicted
Corr(û
l
û
2
)
satisfac-torily
in bulls selectednon-randomly
onu
l
with fixed accuracy al.With selected sires, the
expected
V(b)
is:using
values fromequations
[1]
and[3].
The SD of isinversely proportional
tovi --w and
varies from 48 to 243% of its value when w = 0 as w varies from- 3.39
(divergent
selection from the top and bottom 5% of thepopulation)
to 0.83departures
ofRegr(u
2
u1)
from itsexpectation using
selected sires can be derivedfrom
figure
1by
dividing
sire and total animal numbersby
1 - w.Effects of variation among animals in accuracy of
predicted
BVCalo et at
(1973)
and Blanchard et at(1983)
derived theexpected
correlation betweenpredicted
BV for 2 traits when BV for each trait were estimated inseparate
single-trait analyses
and individuals differed in accuracy of BVprediction
as
C
’
OT’r(ui
M2
)
= rGal2 for:(see
Appendix)
and recommendedusing
thisexpression
to estimate rG from ob-servedC
O
rr(iil
i
i
2
).
Similarly:
Taylor
(1983)
criticizedequation
[5]
asunstable, however,
asserting
that itmay
yield
estimates of rG that are outside the parameter space, andpresented
assumptions required
to allow estimation of rG with thisequation.
Taylor
(1983)
concluded
that,
if all assumptions are met,equation
[5]
isappropriate
to estimater
G so
long
as thea!
are derived from MME as(1 -
C,,/
0
,2).
The
equations
of Calo et al(1973)
and Blanchard et al(1983)
do not consider selection onÛl.
Withselection, equation
[5]
appearsappropriate
to estimateCorr(u
l
u
2
)
for the selectedsample,
but not within the unselectedpopulation.
Equations
[3]
and[5]
could,
however,
perhaps
be combined togive
theexpected
correlation in a selectedsample
of animals with variable accuracy as:To evaluate
equation
[7],
several accuracy scenarios(AS)
were consideredby
simulating
samples
from theu
i
distribution.AS1: 20 000 animals from the upper 10% of the
u
l
distribution. al varieduniformly
over the interval 0.7 to 0.95.
AS2: 20 000 animals from the upper and lower 10% of the
ui
distribution. al varieduniformly
over the interval 0.7 to 0.95.AS3: 10 000 animals from the upper 10% of the
Û
l
distribution with aluniformly
distributed over the interval 0.7 to 0.95 and 10 000 animals selected from the lower 10% of the distribution with aluniformly
distributed over the interval0.5 to 0.7.
the bottom 80% of the distribution with al
uniformly
distributed over theinterval 0.5 to 0.7.
ASS: 15 000 animals from the upper 10% and 5 000 animals from the lower 80%
of the
Û
l
distribution with aluniformly
distributed over the interval 0.7 to0.95.
AS6: 5 000 animals from the upper 10% and 15 000 animals from the bottom 80% of the
Û
l
distribution with aluniformly
distributed over the interval 0.5 to 0.995.AS7: 10 000 animals from the upper 10% of the
Û
l
distribution with aluniformly
distributed over the interval 0.795 to 0.995 and 10 000 animals from the lower80% of the distribution with al
uniformly
distributed over the interval 0 to0.50. a
2 was
uniformly
distributed over the interval 0.5 to 0.7 for all scenarios. Each scenario wasrepeated
for rc = 0.5 or 1.0 andreplicated
(table II).
Empirical
calculation of w
requires
use ofa2 ,
which varies with accuracy. Simulated values! ul
of
Mi
were thus standardizedby dividing by
aliaul andempirical
w calculated as1 —
V(u
l
)
using
standardizedu
l
.
For all accuracy scenarios, observed
Regr(u
2
u1)
agreed closely
withpredicted
values fromequation
[6]
(table II).
Observed values ofCorr(u
l
u
2
)
wereusually
also close to
expectations
fromequation
[7],
but with somesystematic
departures
Thus
equation
[7]
produced
a smallnegative
bias under directional selection withvariable accuracy. This result was confirmed
by producing
10 morereplicates
atr
G =
1;
the results were identical.For
divergent
selection,
differences between observed andpredicted
correlationswere
again
small,
but sometimessignificant
and nownegative
forAS2,
3,
5 and6,
ranging
from -0.001 to -0.008(!0.002).
However,
with bothnon-symmetrical
selection fromhigh
and low groups and different accuracy distributions betweengroups
(AS4
andAS7),
observed correlations wereconsiderably larger
thanpre-dicted,
especially
for rG = 1(table II).
Theappendix
shows exactexpectations
forcorrelations and
regressions
involving u
i
andÛ
2
under non-random selection from environment 1 and variable accuracies within environments. Correlation betweenmeans and accuracies of
divergently
selected groups violate some of theassump-tions used to derive equation
[7]
andpresumably
account for thedepartures
frompredicted
values in AS4 and AS7.Equation
[7]
thusproduced slightly
biasedpredictors
ofCorr(Ei
E2
)
but stillappears
useful,
especially
when exact selection rules are unknown.However,
biasesin
predicted
values ofCorr(û
l
Û2
)
inequation
[7]
will bemultiplied by
the inverse of the coefficient of rG inequation
[7]
to estimate rG. Potential bias in re thus islarger
with lower aj or more directional selection. IfV(u
i
)
islarger
thanexpected
from random selection and greater
precision
than thatprovided
by
equation
[7]
isdesired, Ap
P
endix
equations
can be used.The
expected
correlation betweenÛ
l
andÛ
2
thusdepends
on the distribution of accuracies within each environment, the selectionapplied
onÛ
l
(quantified
by
w).
and rG. To evaluate net effects of thesevariables,
values ofCorr(û
l
û
2
)
fromequation
[7]
were calculated for rG = 1 and when a12 varied from 0.10 to 0.90 and wvaried from -3.3 to 0.9
(table III).
For re = 1.0 and a12 = 0.90,Corr(u
l
u
2
)
variedfrom 0.55 to 0.97 due to
selection,
although
rG exceeded 0.79 solong
as directionalselection was not intense
(w
60.6):
For w = 0 and rG = 1.0,Corr(u
lu2
)
equalled
a l2
, but still varied from 0.10 to
0.90,
depending
on observed values of al and a2.Effects of
relationships
Previous results assume
independence
ofpredicted
BV within each environment.However,
relationships
among animals lead to covariances amongpredicted
BV within and across environments. Ifui
andU
2
arepredicted
by
BLUP,
covariancesamong
predicted
BV within environments also arise from estimation of fixedeffects. These covariances affect
expectations
of bothCorr(u
l
u
2
)
andRegr(u
2
u1).
Theirimpact
is difficult togeneralize, depending
upon the extent and nature ofrelationships
in the data and the distribution of records among fixed effect classes. Covariances amongpredicted
BV associated withrelationships
and estimation of fixed effects arise simultaneous in BLUP solutions toMME,
but effects ofrelationships
alone can be seen under selectionindex,
or best linearprediction
(BLP),
assumptions
of known mean and variance for both ul and u2. In that case:where
Û
j
is a vector ofbreeding
valuepredictions
forenvironment j
and yj ismatrix between yj and
u’..
For non-inbredanimals,
H
j
=Z
j
G
=Z
j
Ao, 2
whereZ
j
is the incidence matrixrelating
yj to Uj, and G and A are additive covarianceand numerator
relationship
matrices,respectively,
for animals in Uj. The covariance matrix ofGj (Qj )
and the covariance matrix betweenÛ
l
andu2
(Q
i2
)
are thus:Expectations
ofsample
variances ofU
j
(s3- )
and covariance betweenM
i
and_ Uj
u2(Sulu2)
are functions of elements ofQ
j
andQ
12
:
where tr is the trace of the matrix and sum is them sum of all elements. If animals
recorded in the 2 environments resemble one another
only
because ofrelationships
This
assumption
is warranted if animals are evaluatedusing
unrelatedpopula-tions of mates in the 2 environments but may not be correct if mates are
poten-tially
related across environments. If selected animals are likewiseunrelated,
andrelationships
among recorded animals within each environment ariseonly through
relationships
to selectedanimals,
Q
j
will bediagonal
with elementsa? . 2
for the ith animal in thejth
environment, A is anidentity
matrix of size m andQ
lz
isdi-agonal
with elementsaiiaiirGaul!u2.
Sample
correlation andregression
coefficients then haveexpectations
equal
to thosepreviously
discussed.In most
applications,
Q
j
are notdiagonal
andoff-diagonal
elements are not calculated due to the nature and size ofV! 1.
Thusexplicit
consideration ofoff-diagonal
elements ofQ
j
may not bepossible
unless the data set is small orhighly
structured.
However,
if the accuracy of allÛ
ij
approaches
1,Q
j
approaches
Ao, &dquo;j 2
(ie, Qu --!
j
)
or2 u
andQ
12
approaches
utO
u2
&dquo;
&dquo;
rcO
A
such thatCorr(û
l
û
2
)
= TG and°/J ’
Regr(û
2
û¡)
=r
C
O&dquo;U
2/
O
&dquo;
U
t’
A more realistic situation is one in which sires from environment 1 have a,
approaching
1 but are evaluated in environment 2 with a2 < 1. In this case,Q
l
=AQ!I,Qz
=H2VZ
l
Hz,
andQ
12
=&dquo;
’
)
U2
(rc
Ut/O
&dquo;
O
2
Q
Thesequantities,
andassociated
expected sample
variances and covariances, could be obtained if the sizeor structure of the data allows calculation of all elements of
Q
2
and would allow derivation of an exactpredicted
value forC’orr(uiU2)-A small
example
will demonstrate theimpact
ofrelationships
onCorr(ûlû2)
and
Regr(û
2
ûl),
which areequal
in theseexamples.
Leth
z
= 0.25, Qu, =O &dquo;U 2 = 1
and rG = 1. Let 3 sires
produce
8 progeny each in each of 2 environments. If sires are unrelated and progeny are relatedonly
through
the sires,sample
variances and covariances
involving Û
l
andÛ
2
equal expected population
values,
andCorr(û
l
Û
2
)
= . zalaIf all 3 sires are fullsibs,
Q
j
andQ
lz
are nolonger diagonal
and al = a2 = 0.643. Theexpected
Corr(ûlû2)
is 0.211 versus ala2 = 0.414. If sires are halfsibs,
Corr(û
l
Û
2
)
= 0.286 versus ala2 = 0.365,reducing
biasby
onehalf as
relationships
among
sires decline.Still,
with many closerelationships
amongsires, ala2 may
considerably
overestimate theexpected
correlation.If
only
2 of the 3 sires are fullsibs,
bias is reduced. al = a2 = 0.621 for related sires and 0.590 for the unrelated sire, and a12 = 0.374(equation
[5]).
Now a12overestimates the observed correlation
by only
3.9%(0.374
versus0.360).
Thus ifsampled
animals represent a reasonable number of unrelatedfamilies,
Q
j
andQ
12
are
correspondingly
sparse and little bias inCorr(u
i
U
2
)
isexpected.
Turning
to effects ofrelationships
withinenvironments,
let the 3 sires be unrelated but cross-classified within each environment withonly
2 dams. In thatcase, al =az =0.566 and
Corr(ii
IU2
)
= 0.364 which is 13%larger
than ala2= 0.321.However,
under the more realisticassumption
of cross-classification with 2 maternalgrandsires,
Corr(ûl Û2)
= 0.352 versus alaz =0.338,
yielding
little bias. When sireswere cross-classified with 8
dams,
Corr(û
l
û
2
)
= 0.364 versus ala2 =0.349, again
yielding
little bias. Theseexamples
suggest selection ofwidely
proven,lowly
relatedanimals from environment 1 followed
by
evaluation in environment 2using
a broadEffects of covariances
arising
from estimation of fixed effectsIf fixed effects are estimated
simultaneously
with BLUP ofiij,
Mallinckrodt(1990)
noted that
off-diagonal
elements ofQ
j
andQ
12
are not zero, even in the absenceof
relationships. By
BLUP:f
or
# =
(X!V! 1X!)-1X!V! ly!
andP_,
=X!(X!V! 1X!)-1X!
andwherep
is the vector of fixed effects with incidence matrix X.The covariance matrix of
Û
j
is:composed
of a term due to BLPrelationships
minus a term due to estimation of fixed effects. Ifrelationships
across environments ariseonly through
siressampled
from environment 1 such that
Cov(y
l
y2)
isgiven
by
equation
(10],
Q
12
is stillgiven
by
equation
[11],
butregardless
of therelationship
structure of thedata,
Q
j
nowapproaches
adiagonal
matrixonly
as ajapproach
1.Also,
for agiven
number ofprogeny, aijwill be less for BLUP than for BLP and
depends
on the number of siresand their distribution among fixed effect classes. The above solutions are identical
to those obtained from MME
(Henderson,
1963,
1984)
such that:The
impact
of fixed effect estimation onCorr(û
l
û
2
)
can be seen mostreadily
using
MME for a sire model withoutrelationships
among animals in u and whereP
includesonly
contemporary group effects. Note that in allremaining
examples,
Corr(£i
E2
)
=Regr(u
2
u
1
)
when al = a2. For such amodel,
afterabsorption
offixed effects into u
equations
andfactoring
of residual variance from both sides of theequation,
the coefficient matrix for u has:(Do, 1991)
where A is the ratio of residual to sire variance and ni.,n.k andn
ik are numbers of records for sire
i,
contemporary group k(of g)
and sire xcontemporary
group subclassik,
respectively.
For balanceddata,
nik = n foroff-diagonals
reduce to(-gn/m)
[vs
0 forBLP].
Thecorresponding
inverse of the coefficient matrix hasdiagonal
elements of(gn
+mA)/[m(gn
+A)]
andoff-diagonals
ofgn/[mA(gn
+A)] (Searle,
1966).
Q
will havecorresponding diagonal
andoff-diagonal
elements ofgn(m —
1)/!m(gn
+A)]
=a
2
and
-gn/[m(gn
+A)],
respectively.
s
2
is thusgn/(gn
+A)
= a!(m -
l)/
7
7t].
Ifdesign
matrices are thesame for both environments and rG = 1,
Q
12
will havediagonal
elements of9
2
n
2
(
M
_ 1)/[,rn(gn+A)
2
1
=
a
4
m/
(m-1)
andoff-diagonals
of -g
2
n
2
/[m(gn+À?
]
=
-a9m/(m
-
1)
togive
S12 =g2
n
2/(g
n
+A)
2
=a
4
m
2
/(m -
1)
2
andCorr(ûlû2) =
gn/(gn
+A)
=a
2
m/(m -
1).
With balanceddesigns,
Corr(u
l
u
2
)
has the sameexpectation
under both BLUP andBLP,
but accuracies are lower under BLUP such that la2a from BLUP underestimatesexpected
2
).
l
û
Corr(û
The extent of bias isproportional
tom/(m—1)
and decreases from 20% at m = 5 to 11% at m = 10 and2.6% at m = 40. This
expectation
is maintained ifdesign
matrices differ betweenenvironments
provided designs
are balanced within each environment.The situation is more
complicated
for unbalanceddesigns,
butgeneral
conclu-sions are similar in that the number of sires
compared
ascontemporaries
needs tobe
large enough
to minimizeconfounding
between sire BVpredictions
and fixed effects estimates.Otherwise,
BLUP accuracies are reduced and theirproduct
un-derestimatesexpected
Corr(u
l
u
2
).
Forexample,
consider a block of 8 sires withprogeny distributed over 4 contemporary groups
(eg,
4 yr of anexperimental
eval-uation in some
environment)
as shown in table IV. The size of the experiment may be variedby
increasing
the number of sire blocks(to
16,
24, etc,sires),
by
varying
the number of progeny per sire and contemporary group
(n),
by replicating
the sire block over additional contemporary groups(8,
12, 16,etc),
orby
a combination ofthese
approaches.
The samedesign
is assumed for each environment.Define bias
(fig
2)
as the difference between theexpected
Corr(u
l
u
2
)
calculatedfrom
equations
[8]
and[9]
and theproduct
ala2 which is constant for all sires in thisdesign.
With n = 6 andonly
8 sires, bias wasrelatively large
withCorr(£1 £2 )
= 0.41 vs ala2= 0.35. Bias decreased as number of sire blocks increased and was < 0.03 with 24 sires(12/contemporary
group).
Doubling
n orreplicating
pattern of bias.
Expected
values ofCorr(ûlû2)
were alsocompared
to theproduct
of BLP accuracies calculatedignoring
contemporary group effects. Theproduct
of BLP accuracies overestimatedCorr(u
l
u
2
)
as shownby negative
bias infigure
2,
but theproduct
of BLP accuracies wassuperior
to theproduct
of BLUP accuraciesas an estimator of
Corr(u
l
u
2
).
Design
like that in table IV can be used underexperimental conditions,
but are less feasible when sires arecompared
on cooperator farms. Itparticular,
useof
large
numbers ofexperimental
sires on individual farms may not be feasible.Instead,
sires from environment 1 may be testedtogether
on several farms in aloosely
connecteddesign
but withonly
a few siresrepresented
on any one farm.If
only
data from introduced sires are used in theevaluation,
considerable bias inCorr(u
l
u
2
)
may result.However,
this bias can be reduced if introduced sires areevaluated with sires
represented only
in environment 2 and data from all sires areincluded in the evaluation. To demonstrate this
effect,
two additional sires wereadded to each contemporary group in table IV to
give
16sires/sire
block withresulting
values ofCorr(ul
u2
)
for introduced sires was reduced(fig 2).
Thus ifsires from one environment are introduced into another and if evaluation occurs
primarily
in cooperatorherds,
sires should be evaluated in contemporary groupscontaining
reasonably large
numbers of sires(either
introduced ornative)
to increaseprecision
of estimates of contemporary groupeffects,
and data from all sires should be included in the evaluation.CONCLUSIONS
Interpretation
of correlations betweenpredicted
BV in different environments isnot
straightforward. Expectations
of such correlations are influencedby
accuracyof evaluation of animals in both environments,
by
selection of animals chosen forevaluation,
by relationships
among chosen animals andby
thedesign
of the evalua-tion in both environments. If animals are chosen from environment 1 for evaluationin environment 2,
Regr(£2£1 )
may be a more useful statistic thanCorr(u
l
u
2
)
be-cause it is unbiasedby
selection oniij.
However,
bothRegr(û
2
û¡)
andCorr(£
1£
2 )
are biased
by
covariances amongpredicted
BV within environments.Also,
use ofproven sires
(a
-!1.0)
from environment 1simplifies
interpretations
and reducesthe number of sires
required
to attain aspecific
level ofsignificance
for measures ofassociation.
Equations
in this paper allow calculation orapproximation
ofexpected
values ofCorr(£
1
£
2
)
under various sorts of selection and with variable accuracies in each environment(equation
[7]).
Evidence for selection can be obtainedempirically
if necessary
by
comparing
observedV(Û¡)
to its expectation from Blanchard etal
(1983) (see Appendix).
Expected
values of bothCorr(£
1
£
2
)
andRegr(û
2
û¡)
may involve
off-diagonal
elements ofQ
j
matrices which are often not available for BLUP BVpredictions.
Effects ofoff-diagonals
may be minimizedby
ensuring
that anumber of families are
represented
in both theexperimental
animals and their mates andby
using
several(eg 8-16)
sires per contemporary group. Use of small numbers of sires per contemporary group can lead to considerable underestimation of theexpected
value ofCorr(£1 £2 )
ifoff-diagonal
elements are not considered. Prediction ofexpected
values ofCorr(£
1
£2 )
and2
iii)
g
r(ii
Re
from observed accuracies may besuperior
when selection index(BLP)
rather than BLUP accuracies are used.A number of
potential
difficulties inderiving
andinterpreting
Corr(û
l
Û
2
)
have not beenexplicitly
considered. Accuracies are assumed to beproperly
calculated,
even
though
approximations
arenormally
used and areprobably
notcompletely
satisfactory.
Additivegenetic
variances must be known for both environments inor-der to calculate
u
l
andU
2
correctly
and tointerpret
Regr(u
2
u
1
).
If sires introduced into environment 2 for evaluation are a selectedsample
from environment 1 and BLUP evaluations areused, grouping strategies
and(or)
adjustment
of covariances may berequired
to derive unbiasedii
2
-
Values ofo,2 U2
for animals selected from environment 1 willdepend
both on selectionapplied
and on rG. Thus resultsgiven
for correlationsinvolving
BLUPpredictions
ofÛ
j
areprobably striclty
correctonly
for random selection of sires. See Diaz(1992)
for additional discussion of effects of selection andgrouping
onCorr(£
1
£
2
)
with BLUPpredictions. Despite
theseenvironments will often be
relatively
easy to obtainand,
ifproperly interpreted,
can
provide
information on rG.APPENDIX
This
appendix
addressesgeneral
expectations
for variances and covariances ofill
andM
2
with non-random selection onÛ
l
and variable accuracies in bothenvironments.
Sample
m animals from environment 1. Ifsampling
is non-randomand selection rules are not
specified
or if accuraciesdiffer,
theresulting
distributionsof
u
j
andM
z
are mixtures of munique
distributions. Thesample
sum of squares(SS)
ofu
i
is:The
expected
value(E)
and variance ofÛ
2i
with accuracy a2i are:The
resulting expected
SS(u
2
)
and sum ofcross-products
(SCP)
are:Expected
correlation andregression
coefficients can be derived from these SSand SCP. If animals are chosen at
random,
l
u
andU
2
have mean 0 andexpected
Corr(£
1£2
)
isgiven
by
textequation
[5]
(Blanchard
etal,
1983).
If selection is non-random but accuracies are constant within environments, formulae for
SS(u
2
)
andSCP(u
l
u
2
)
become consistent with text formulae 2 and 3. For directional selection with constant accuracies, w can bereplaced by
i(i — x)
without loss of
generality.
Fordivergent
selection,
let fraction wH of the selectedstandardized mean and truncation
point of i
H
and xH,respectively.
Letfraction W
L come from the lower fractionø
L
.
Let accuracies withinhigh
and low groups, and within environment2,
be constant at alH, alL and a2,respectively.
V(û¡)
within selected groups are[1 -
i
H
(i
H
-
x
H
)]aî
H
O&dquo;î
and[1 -
i!(iL -
x
L
)]aî
L
O&dquo;I.
Theexpected
value and variance ofu
l
andu
2
,
and theirexpected
covariance are:The
resulting
Corr(û
l
û
2
)
reduces to textequation
[4]
only
if alH = alL and selection issymmetrical
(ie,
wH = WLand i
H
=-i
L
).
ACKNOWLEDGMENT
This research was conducted while C Diaz was recipient of an INIA
(Spain)
graduate fellowship.REFERENCES
Becker WA
(1984)
Manualof Quantitative
Genetics. AcademicEnterprises,
Pull-man,
WA,
4th edn’
Blanchard
JP,
EverettRW,
Searle SR(1983)
Estimation ofgenetic
trends and correlations forJersey
cattle. JDairy
Sci 66, 1947-1954Calo
LL,
McDowellRE,
VanVIeckLD,
Miller PD(1973)
Genetic aspects of beefproduction
among Holstein-Friesianspedigree
selected for milkproduction.
J AnimSci
37,
676-682DeNise
RSK, Ray
DE(1987)
Postweaning weights
andgains
of cattle raised underrange and
gain
test environments. J Anim Sci 64, 969-976Diaz C
(1992)
The effect of selection on the covariance between direct andmaternal
breeding
valuepredictions
in different environments. Ph DDiss,
Virginia
Polytechnic
Institute and StateUniv,
Blacksburg, Virginia
Dickerson GE
(1962)
Implications
ofgenetic-environmental
interaction in animalbreeding.
Anim Prod4,
47-64Dijkstra
J,
OldenbroekJK,
KorverS,
VanderWerf JHJ(1990)
Breeding
for veal and beefproduction
in Dutch Red and White cattle. Livest Prod Sci 25, 183-198 Do C(1991)
Improvement
in accuracyusing
recordslacking
sire information inthe animal model. Ph D
Diss,
Virginia Polytechnic
Institute and StateUniv,
Blacksburg, Virginia
Greenhalgh
SA,
Quaas
RL,
VanVIeck LD(1986)
Approximating prediction
errorvariances for
multiple
trait sire evaluations. JDairy
Sci 69, 2877-2883Henderson CR
(1963)
Selection Index AndExpected
Genetic Advance. Natl Acad Sci Natl Res Council Publ 982,Washington,
DCHenderson CR
(1973)
Sire evaluation andgenetic
trands. In: Proc Anim Breed GenetSymp
in Honorof
DrJay
L Lush. Am Soc AnimSci, Champaign, IL,
10-41 Henderson CR(1984)
Applications
of linear models in animalbreeding. University
ofGuelph, Guelph,
OntarioHill WG
(1970)
Design
ofexperiments
to estimateheritability by
regression
ofoffspring
on selected parents. Biometrics 26, 566-571Hill
WG,
Thompson
R(1977)
Design
ofexperiments
to estimateoffspring
parentregression using
selected parents. Anim Prod24,
163-168Johnson
NL,
Kotz S(1970)
Distributions in Statistics: Continuous Univariate Distributions-1. JohnWiley
andSons,
New YorkMahrt
GS,
NotterDR,
BealWE,
McClureWH,
Bettison LG(1990)
Growth of crossbred progeny of Polled Hereford siresdivergently
selected foryearling weight
and maternal
ability.
J Anim Sci68,
1889-1898Mallinckrodt CH
(1990)
Relationship
of milkexpected
progeny differences to actual milkproduction
and calfweaning
weight.
M Scthesis,
Colorado StateUniv,
FortCollins,
COMeyer
K(1989)
Approximate
accuracy ofgenetic
evaluation under an animalmodel,
Livest Prod Sci 21, 87-100
Meyer
K(1991)
Estimating
variances and covariances for multivariate animal modelsby
restricted maximum likelihood. Genet SelEvol 23,
67-83Oldenbroek
JK,
Meijering
A(1986)
Breeding
for vealproduction
in Black and Whitedairy
cattle. Livest Prod Sci 15, 325-336Robertson A
(1959)
Thesampling
variance of thegenetic
correlation coefficient. Biometrics 15, 469-485Robertson A
(1966)
A mathematical model of theculling
process indairy
cattle.Anim Prod 8, 95-108
Robertson A
(1977)
The effect of selection on the estimation ofgenetic
parameters.Z
Tierxuchtg Zuchtgsbiol
94, 131-135Robinson
GK,
Jones LP(1987)
Approximations
forprediction
error variances.J
Dairy
Sci70,
1623-1632SAS
(1985)
SAS User’s Guide: Statistics. SAS InstituteInc,
Cary,
NCSearle SR
(1966)
MatrixAlgebra for
theBiological
Sciences. JohnWiley
andSons,
NYSnedecor
GW,
Cochran WG(1967)
Statistical Methods. The Iowa State UnivPress,
Ames,
IA,
6th ednTaylor
J(1983)
Assumptions required
toapproximate
unbiased estimates ofgenetic
(co)variance
by
the method of Calo et al(1973).
In: Genetics Research 1982-1983.Rep
Eastern Artificial InseminationCoop, Dept
Anim Sci CornellUniv, Ithaca,
NY,
256-261Tilsch
K,
WollertJ,
Nurnberg
G(1989b)
Studies on sire xsex/environment
interactions and their effect of response to selection in beef siresprogeny-tested
forfattening
performance
and carcassyield.
Livest Prod Sci21,
287-302VanRaden
PM,
Freeman AE(1985)
Rapid
method to obtain bounds on accuraciesand
prediction
error variances in mixed models. JDairy
Sci68,
2123-2133VanRaden
PM,
Wiggans
GR(1991)
Derivation,
calculation,
and use of animalmodel information. J