HAL Id: hal-00893968
https://hal.archives-ouvertes.fr/hal-00893968
Submitted on 1 Jan 1992
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Univariate and multivariate parameter estimates for
milk production traits using an animal model. II.
Efficiency of selection when using simplified covariance
structures
Pm Visscher, Wg Hill, R Thompson
To cite this version:
Original
article
Univariate and multivariate
parameter
estimates
for milk
production
traits
using
an
animal model.
II.
Efficiency
of
selection
when
using
simplified
covariance
structures
PM Visscher
WG
Hill
R
Thompson
1
University
of Edinburgh,
Instituteof Cell,
Animal andPopulation Biology,
West Mains
Road, Edinburgh
EH9 3JT;2
AFRC
Instituteof
AnimalPhysiology
and GeneticsResearch,
. Roslin EH259PS, Midlothian,
UK(Received
5August
1991; accepted 3 June1992)
Summary - The effect of
simplifying
covariance structures formilk,
fat and proteinyield
in lactations 1-3 on accuracy of selection for lifetimeyield
wasinvestigated using
selection index
theory. Previously
estimated(co)variances
were assumed to be the truepopulation
parameters. A modifiedrepeatability
model, ieassuming
agenetic
correlationof
unity
betweenperformances
across lactations butallowing
for different heritabilitiesand variances in different
lactations,
was used toinvestigate
theefficiency
ofselecting
sireon their progeny
performance.
For most combinations of heritabilities in first and laterlactations, selecting
siresusing
therepeatability
model wasnearly
100% efficient in termsof accuracy of selection
compared
to ageneral
multivariate model. Thepredicted
response to selectionusing
a modifiedrepeatability
model wasapproximately
10% toohigh.
It wasfound that
applying
a transformation to make new traits in lactation 1 uncorrelated atthe
phenotypic
andgenetic
level to milk, fat and proteinyield
in laterlactations,
and assuming that 3 new uncorrelated variates were formed, washighly
efficient in terms of accuracy of selection whencompared
to the accuracy of ageneral
multivariate model. This transformation was recommended for a national BLUPevaluation,
since it may takeaccount of selection to a
larger
extent than whenperforming
separateanalyses
formilk,
fat and protein
yield.
’dairy cattle / milk production
/
selection index/
genetic parameters / repeatability/
canonical transformation*
Résumé - Utilisation du modèle animal pour l’estimation des paramètres univariates et multivariates concernant les caractères de production laitière. II. Efficacité de la sélection après utilisation d’une structure simplifiée de covariance.
L’efj&dquo;et
d’unesimplification
de la structure des covariances pour laproduction
delait,
de matière grasseet de matière
protéique
a étéanalysé
en utilisant la théorie des index de sélection. Les estimées antérieures de(co)variances
ont étésupposées
être les vraisparamètres
de lapopulation.
Un modèle&dquo;répétabilité&dquo; modifié,
c’est-à-dire supposant une corrélationgénétique
de 1 entre lesperformances
desdifférentes
lactations mais permettant des héritabilités et des variancesdifférentes
àchaque lactation,
a été utilisé pouranalyser
l’e,f,jîcacité
de la sélection des taureaux àpartir
de laperformance
de leur descendance. Pour laplupart
des combinaisons d’héritabilité enpremière
lactation ou lactations ultérieures, la sélection des taureaux en utilisant le modèlerépétabilité
apermis
à peuprès
la mêmeprécision
de la sélection que le modèlegénéral
multivariate. Laréponse
prédite
à lasélection selon le modèle
répétabilité modifié
a été surévaluée d’environ 10%. On aappliqué
aux lactations ultérieures unetransformation
rendantindépendants
enpremière lactation,
tant au niveauphénotypique
quegénétique,
les caractères deproduction
laitière, de matièregrasse, de matière
protéique
et on asupposé
que les 3 nouvelles variables étaienttoujours
indépendantes.
Une telleprocédure
a été hautemente,f!cace
en termes deprécision
de la sélectioncomparée
à laprécision
d’un modèlegénéral
multivariate. Cettetransformation
est recommandée pour une évaluation nationale selon le
BLUP, puisqu’elle
peut mieux tenir compte de la sélection que desanalyses séparées
desproductions
delait,
matière grasse et matièreprotéique.
bovin laitier
/
production laitière/
index de sélection/
paramètre génétique répétabilité/
transformation canoniqueINTRODUCTION
In a
previous
paper(Visscher
andThompson,
1992),
genetic
and environmentalparameters
werepresented
for milkyield
(M),
fatyield
(F)
andprotein
yield
(P)
in lactations 1-3. If the
breeding
goal
fordairy
cattlebreeding
is some(linear)
combination of these
production
traits in alllactations,
anoptimal
way to combine all available information topredict breeding
values is a multivariate(MV)
BLUPanalysis.
For a national animal model(AM)
breeding
valueprediction,
however,
a
general
MV BLUPanalysis
iscomputationally
notyet
feasible. Inpractice,
therefore,
simplified
assumptions
are made whenpredicting breeding
values forlarge populations
using
an AM. Indairy
cattle AMprediction,
milk,
fat andprotein
yield
areusually
evaluatedseparately
using
arepeatability
model with somescaling
for observations in later lactations to account for
heterogeneity
of variance acrosslactations
(Wiggans
etal,
1988a,
b; Ducrocq
etal, 1990;
Jones andGoddard,
1990).
Selection indextheory
is used toinvestigate
the loss in accuracy of selection whensimplified
covariance structures are used topredict
breeding
values. A second aim is toinvestigate
how to reduce thedimensionality
of the above MVprediction
MATERIAL
As
reported
previously
(Visscher
andThompson,
1992),
the 9 x9 genetic
covariancematrix of
M,
F and P in lactations 1-3 was found to be notpositive
definite. To create a(semi)
positive
definite matrix thesingle
negative eigenvalue
was set to10-’,
and covariance matrices were recalculated. These matrices were thenused for
subsequent
(index)
calculations. Without loss ofgenerality, phenotypic
variances forM, F,
and P in lactation one were set to 1.0. Theparameters
aresummarised in table I. An alternative way to summarise the covariance matrices is
to calculate
eigenvalues
andeigenvectors
of the matrixP-
1
G.
Eigenvectors
thenrepresent
linear combinations of theoriginal
traits which areindependent
of eachother at the
genetic
andphenotypic
level(Hayes
andHill, 1980;
Meyer,
1985).
Thecorresponding
eigenvalues
of thenewly
createdtraits,
called canonical variates(eg
Anderson, 1958;
ch12),
have been termed canonical heritabilities(Hayes
andHill,
1980;
Meyer,
1985).
Table II shows theeigenvalues
andeigenvectors
of matrixP -
1
G,
where P and G are the 9 x9 phenotypic
andgenetic
covariance matrices formilk,
fat andprotein yield
in lactations 1-3P
),
3
P
3
F
3
M
2
2
2
F
M
1
P
1
F
1
(M
calculated fromparameters
in table I. A numberfollowing M,
F or P indicates the lactationnumber. The smallest
eigenvalue
from theoriginal
P-’G
was-0.03,
and thecorresponding eigenvector
was:Therefore,
one canonicaltrait,
{-0.04M
1
+.’..
i
0.15F
+0.09P
31
,
was found to have aheritability
of -0.03. Thenegative
eigenvalue
resultedmainly
from therow
of tableII).
This wasexpected, given
the veryhigh genetic
correlations foryield
traits in lactations 2 and 3(see
tableI).
ANALYSIS
Selection index
theory
Let:
u = q x 1 vector of
breeding
values of qtraits;
a = q x 1 vector of
(marginal)
economic values for qtraits;
H = u’a =aggregate
breeding
value;
x =
p x 1 vector of sources of information on an individual
(for
example phenotypic
observations,
daughter
averages,predicted
breeding
values);
b =p x 1 vector of index
weights;
I = b’x = index value used topredict
H ;
P =
v(x);
G =cov(x’u)
and thesymbol A
added to a scalar or matrix indicates an estimate thereof.For index calculations standard results were used
(see,
forexample,
Sales andHill,
1976).
Equations
for the responses areusing
b =P-’Ga
for theoptimal index,
andb =
P-
1
Ga
for an indexusing
estimates of P and G :Where
R,
R and R* are theoptimal,
predicted
and achieved response to selection inthe
aggregate
breeding
value(=H)
respectively, expressed
as a ratio of the selectionIf new traits are created which are linear combinations of the
observations,
y =
W’x,
ie variables in vectory are a linear combination of the variables in
vector x, then
It was assumed that the
marginal
economic value for any of theproduction
traitsin later lactations was the
product
of the relativeexpression
of that trait and thephenotypic
standarddeviation,
thusreflecting
survival to later lactations and theeconomic
importance
of alarger
standard deviation(and mean)
in later lactations.a z =
e ZQi
, where ai, si and aj are relative economic
value,
relativeexpression
and standard deviation for lactation i. Relative
expression
was assumed to followa
geometric
series,
Ei =(0.8)?!! ,
assuming
a relative survival of80%
from onelactation to the next and
setting
theexpression
in lactation one to 1.0.Phenotypic
standard deviations were assumed to be
1.0,
1.20 and 1.25 for lactations1-3,
and 1.25 for allsubsequent
lactations(from
tableI).
If it is further assumed that the covariance of any observation with thebreeding
value in lactation 3 isequal
to the covariance of that observation withbreeding
values in laterlactations,
ie thecorresponding
rows of matrix G areidentical,
then the economic valuesfor lactations 1-3 are
[1.0
1.04.0],
since the sum of economic values for thirdand
subsequent
lactations is4.0(E(l.25)(0.8)’
=4.0,
for i =2,3...).
Similarly
for the case where
only
2 lactations are considered andassuming
second andlater
lactations,
breeding
values haveequal
covariances with observedphenotypes,
a’ =[1.0 5.0!.
Economic values for traits within a lactation were varied to reflect differentbreeding goals.
For all
subsequent
calculations it is assumed that observations in later lactationsare included in the
genetic
evaluationonly
if observations onprevious
lactationsare available.
Single
traitmultiple
lactations considerationsMeyer
(1983)
investigated
thepotential
gain
in response to selection fromincluding
multiple
lactation information on progeny of sires for sire evaluation. The accuracyof selection was increased
directly
through
more(genetic)
information about thetrait(s)
ofinterest,
since thegenetic
correlation betweenperformances
acrosslactations
(r
9
)
is less than one, andindirectly through
a better data structure(better &dquo;connectedness&dquo;).
Assuming
a’ =
!1 14!
for eithermilk,
fat orprotein
yield
in lactations
1-3,
andusing
the relevantparameters
for any of these traits fromtable
I,
it was found(using
standard selectiontheory)
that for sires the increase in accuracythrough
including
second(and third)
lactationdaughter
information inthe selection index was
approximately
6-10%.
The number of progeny per sire forfirst and second lactations was varied from 25 to 50 and 5 to 35
respectively.
SeeMeyer
(1983)
for moreexamples.
Perhaps
a moreinteresting question
regarding
the use ofmultiple
lactationinformation on a
single
trait is how much accuracy is lost when a modifiedrepeatability
model(eg
assuming
r9 =1)
is assumed forbreeding
valueprediction
instead of the &dquo;true&dquo; MV covariance structure. This was
investigated
for 3 selection1
1
=phenotypic
index,
ie sources of information arephenotypic
observations onindividuals;
1
2
= sireindex;
sources of information aredaughter
averages of sires in different
lactations;
1
3
= cowindex;
sources of information are thepredicted breeding
value(index)
of the cow’s sire and dam and the cow’s own records.The
largest
reduction in response to selection isexpected
when selection isacross age
classes,
eg across cohorts with different amounts ofinformation,
since animproper
weighting
of later lactations then would have thelargest impact.
In thefollowing
examples, only
2 lactations and 2 cohorts wereconsidered,
but theresults are
thought
to be similar for more lactations(given
the veryhigh genetic
correlation
between second and later lactationyields)
and more age groups. Thegenetic
means for the cohorts were assumed to be zero, hence the consequences ofthe error in
predicting genetic
trend wereignored.
(A
thorough study
of long-term
losses in response
through
incorrect estimation ofgenetic trend,
thuscreating
ansuboptimal ranking
of young vs oldanimals,
was outside the scope of thisstudy.)
For each index there were different amounts of information on the 2
cohorts,
I
1
:
Cohort 1:
phenotypic
observation in lactation 1Cohort 2:
phenotypic
observations in lactations 1 and 2I
2
:
Cohort 1: first lactation
daughter
average based on nldaughters
Cohort
2:n
1
first lactationdaughter
records and n2 second lactation records(n
2
<n
l
)
I
3
:
1Cohort 1: sire index based on n first lactation progeny, dam index based on sire index of dam and dam’s first lactation
record,
cow’s first lactation recordCohort 2: sire index based on nl + n2 progeny
records,
dam index based on sireindex of dam and dam’s records in first and second
lactation,
cow’s first and second lactation records.Parameters used for the
example
with 2 lactations and 2 cohorts were(from
table
I):
a’ =!15!;
7-g =0.85;
7-p =
0.55;
phenotypic
variances were 1.0 and 1.45 andheritabilities were 0.40 and 0.30 for first and second lactations
respectively;
nl =50;
n2 = 35. The &dquo;estimated&dquo;(assumed)
parameters
were:r
9
= 1.0(repeatability
model);
the &dquo;true&dquo;phenotypic
covariance matrix(from
tableI)
was used andheritabilities for lactation
1(!2 )
and for lactation 2(h2)
were varied. Aproportion
of
10%
of the total number of animals available was selected. The definition ofthis
repeatability
model differs from thatusually
used because heritabilities andphenotypic
variances are notnecessarily equal
in different lactations.Responses
to selection were calculatedusing equations
(1), (2)
and(3).
Givenany set of
parameters
theoptimal proportion
of animals to be s from each cohortwas determined
using
analgorithm
fromDucrocq
andQuaas
(1988),
assuming
theparameters
used were the truepopulation
parameters.
Results arepresented
intable III.
These results may be
expected,
since the &dquo;true&dquo;genetic
correlation(=
0.85)
betweenperformance
in lactations 1 and 2 washigh
and an observation for lactationperformance
isalways
conditional on the presence of a first lactation observation.The ratio of achieved to
predicted
response was less robust tochanges
inparameters.
Even when the correct heritabilities
(0.40
and0.30)
were used the achieved response(accuracy)
wasapproximately
10%
below the maximumexpected
response. Thismay be seen as a very
simple
illustration that one should be cautious whenusing
predicted
values(whether
from selection indices orBLUP)
to estimategenetic
trend when the parameters used in theprediction
aresubject
tolarge
sampling
errors orwhen
they
are apriori
incorrect(as
in the case of arepeatability
model when it isknown that r9 G
1).
Since results were similar for the 3 indicesused, subsequent
calculations were
only performed
for the case of mass selection.Multiple
traitmultiple
lactation considerationsSuppose
thebreeding goal
is a linear combination of 9 traits,
1
(M
F
1
,
P
I
,
M
2
,
F
2
,
P
2
,
M
3
,
F
3
,
),
3
P
which isthought
to be agood
indicator of lifetime economicproduction
since third and later lactationperformances
are assumed to behighly
correlated.Then,
choosing
a set of economic values andusing
parameters
from tableI,
the relative accuracy of selection for different indices which use differentamounts of information can be
investigated.
For 3 different sets of economic values these relative accuracies werecalculated,
and results arepresented
in table IV. Theto first lactation economic
weightings
used inpractical
selection indices inEurope
(eg
Wilmink,
1988).
H
3
reflects a more&dquo;progressive&dquo;
breeding
goal
with selectiononly
onprotein production.
Results from table IV show thatapproximately
20%
accuracy is lost whenonly
one observation is used topredict
theaggregate
breeding
value. Results forH
2
andH
3
were similar sincebreeding
values for thesecomposite
traits werehighly
correlated.
Ifonly
accuracy isconsidered, using
milk and fatyield
in a selection index does not contributesubstantially
to increase responseto selection for lifetime
protein
yield
(H
3
).
For all 3breeding goals,
theexpected
gain
in accuracy for sire selection whenadding
progeny means onmultiple
traits isexpected
to be smaller than for mass selection.Although
these calculations are anoversimplification
ofbreeding
valueprediction
and selection in
practice,
they
are useful whencomparing
the accuracies from tableIV with accuracies when
simplified assumptions
are maderegarding
the covarianceProportionality
considerationsOne
possible
way to reduce thedimensionality
of a MVprediction problem
isto
investigate
whether some traits may beapproximately
expressed
as linearcombinations of other
traits,
or if some linear combination of the traitsexplains
most of the variation in the
aggregate
breeding
value. To reducecomputations
(further)
it would be of interest to find a minimum number ofindependent
traits
which would
provide
all the necessary information(see
forexample
Lin andSmith,
1990).
Inparticular,
it would be convenient if one linear transformation could be found that reduced theprediction problem
of 9highly
correlated traits(milk,
fat andprotein yield
in lactations1-3)
to that of 3independent
newtraits,
since in thatcase 3 separate
analyses
would contain all the information and would(therefore)
account for selection. Oneapproach
is toinvestigate
if submatrices of the 9 x 9covariance matrix are
proportional
to eachother,
since that would indicate thata suitable transformation may exist to reduce the
dimensionality
of theprediction
problem.
With observation on p traits in Ilactations,
thissuggests
testing
if the covariance matrix oflp
traits,
V,
may be written as V = K Q9V
h
,
whereV
h
is asubmatrix
(or
a transformationthereof)
describing
a(co)variance
block of p traitswithin or between
lactations,
K is a I x Isymmetric
matrix ofproportionality
orscaling
constants, and (9
is the directproduct
operator
(Searle, 1966).
For the caseof
M,
F and P in lactations1-3,
thehypothesis
is that thecomplete
covariancematrix may be
expressed
as aproportionality
(or scaling)
matrixmultiplied
by
atransformation matrix. In the
appendix
ageneral
framework ispresented
to test forproportionality
of covariance matricesusing
likelihood ratio tests and to estimateconstants of
proportionality, using
multivariate normaltheory applied
to observed andexpectations
of moment matrices.Unfortunately,
the data from table I were found unsuitable for a likelihood ratiotest.
Obviously
the additivegenetic
covariance matrix(A)
and the environmental covariance matrix(E)
from table I are notindependent
momentmatrices;
A and E arehighly
correlated and the determinant of A is zero. An alternative is to transform A and E into a between and within sire covariance matrix(B
andW),
assuming
these matrices are from a balanced half-sibdesign
based on s sires andn progeny per sire.
However,
there was insufficient information about thesampling
variances of the estimated E and A matrices to determine the
appropriate degrees
of freedom.Furthermore,
the exact distribution of the likelihood ratio test statisticbased on
empirically
deriveddegrees
of freedom andusing
animal model estimatesmay differe
substantially
from aChi-square
distribution.
Therefore,
significance
testing
forproportionality
was notpursued.
Using
parameter
estimates from tableI, however,
some inference withrespect
toproportionality
may be drawn. One(obvious)
choice for the transformation matrixV
h
is a canonical transformation onM
1
,
,
F
1
andP
l
.
This transformation wascalculated
(see
Hayes
andHill,
1980 ;
andlMeycr,
1985,
forcomputations)
and thethe 3 x
3 genetic
(Vg
ll
)
andphenotypic
(V
Pll
)
covariance matrices ofmilk,
fat andprotein yield
in lactation 1. Then the transformation matrix ofmilk,
fat andprotein
yield
in lactation one,Q
l
,
was chosen such that:with elements of
diagonal
matrix Dequal
toeigenvalues
of the matrix(V-
l
vgil).
Using Q
1
,
the vector ofobservations,
was transformed
using:
The
eigenvectors
for the 3 canonical variates in lactation 1 were(2.96 -
0.72-2.09], (-0.85 -1.85 2.61!
and[-0.08
0.420.69]
respectively,
which form the rows ofmatrix
Q
1
.
The 9 x 9 correlation matrices and the heritabilities of the 9 new traits(y
c
)
are shown in table V.Off-diagonals
in all 3 x 3 blocks weresmall, indicating
that one transformation matrix created 3nearly independent
variates with for each newvariate
highly
correlated &dquo;observations &dquo; in later lactations.Using
the covariance matrix ofmilk,
fat andprotein
yield
in lactation 1 asV
h
,
proportionality
matricesfor additive
genetic
and environmental effects were calculated fromequation
(A7].
This assumed the observed covariance matrices E and A were moment
matrices,
butdegrees
of freedom did not need to bespecified.
For A andE,
the estimates ofproportionality
matrixK,
K
a
andK
e
respectively,
were:For
example,
the estimate of the averagescaling
factor forgenetic
variances forM,
F and P on the transformed scale(transformation
such that new variates areuncorrelated)
in lactation 3 was1.74,
and the estimate of the averagescaling
factor for environmental covariances between the transformed variates in lactations 1 and 3 was 0.58. Ifproportionality
isassumed,
the 9 x 9 MVprediction problem
maybe reduced to 3
independent
3 x 3 multivariatepredictions
or to 3independent
evaluations with arepeatability
model.Using
thebreeding goals
definedpreviously
the
efficiency
of the reduction indimensionality
was calculated for massselection,
conditional on the
parameters
in table Ibeing
the truepopulation
parameters.
Thus theparameters
from table V were used for index calculations with alloff-diagonals
of all 3 x 3 covariance blocks set to zero. In the case of a
repeatability
model on thecanonical
variates,
genetic
correlations between canonical variates across lactationswere set to
unity.
Results of selection index calculations forphenotypic
selection areis
slightly
lower than thecorresponding
accuracyusing
theoriginal
first 3 variates(M
1
,
F
1
andP
1
)
from table III because thegenetic
covariance structure between the canonical traits in lactation one and transformed variates in later lactationswas
simplified
(off-diagonals
of 3 x 3 blocks in matrix G were set tozero).
Clearly
little accuracy is lostassuming proportionality
of the covariance structure formilk,
fat andprotein
yield
across lactations.Simplification
to arepeatability
model on3 canonical variates was
approximately
97%
as efficientcompared
to a multivariateanalysis
on 9 traits. Whenusing
the canonical variates there was noadvantage
ofa MV
analysis
over ananalysis
with arepeatability
model.Analysing
linear combinations of the observationsA final reduction in
dimensionality
is achievedby analysing
a reduced set of traits which are linear combinations of the available observations. Onesuggestion
is to create new traits for each lactation which are the sum of thephenotypic
observationsweighted by
thecorresponding
economic values in theaggregate
breeding
value.Using
the notation fromequation
(4!, !
y =a’x,
where variables in x are, forexample,
observations for
M
1
,
F,
andP
1
.
Relative accuracies were calculatedusing equation
(4!,
fitting
first lactationyield
traits,
first and second lactationyield
traits,
and allyield
traits in vector x,respectively.
Results arepresented
in table VII.Another
suggestion
is to use linear combinations of theyield
traits within alactation as new traits and to
perform
ananalysis
on those new traits. Forexample,
if yl =
w
’
x
l
,
forx
’
=(M1F1P1!,
and!2 =
W’X
2
,
forx’
=[M
2
F
2
P
Z
),
then in theselection index framework this would be
fitting
y = W’x as used forequation
(4).
Using
Xl and x2 asabove,
andx3
=[M
3
F
3
P3!,
3 new traits were createdusing
the economic values for each trait in the
aggregate
breeding
value as elements formatrix W. Accuracies for
fitting
combinations of these new traits arepresented
intable VII.
Finally,
3separate
selection indices could be calculated fromusing
(M
l
M
2
M
3
),
(F
way selection indices for
dairy
cattle areusually
calculated:breeding
values arecalculated for
milk,
fat andprotein
yield
separately,
and the 3estimated
breeding
values are
subsequently
combined in a selection index( eg
Dommerholt andWilmink,
1986; Wilmink,
1988).
Separate
selection indices for milkyield,
fatyield
andprotein
yield
were calculatedusing
either the &dquo;true&dquo;(multivariate)
covariance structure or the modifiedrepeatability
model(r
.
across lactations set tounity),
and the accuracy ofusing
the 3 indices topredict
theaggregate
breeding
value wascalculated. For the construction of each of the 3 indices it was assumed that the
traits in the index were identical to the traits in the
aggregate
breeding
values. Forexample,
the indexusing M
I
,
M
2
andM
3
was calculated asPn; G&dquo;,a!&dquo;
whereP
m
and
GI&dquo;
are the 3 x3 phenotypic
andgenetic
covariance matrices for milkyield
inlactations
1-3,
and am is a vector of economic values for these traits. The relativeaccuracies for this
approach
are shown in table VII.Comparing
results from tablesIV,
VI and VII shows that littleefficiency
waslost when
analysing
linear combinations of the observationsusing
economic valuesas
weights.
For the case ofjust using
observations for,
i
M
F
1
andP
I
this is notsurprising,
since these traits were sohighly
correlated and had similarheritabilities,
hence their index values resembled the economic values. Forbreeding goals
H
2
andH
3
approximately
8%
accuracy was lost whenusing
all observationsweighted by
their economic values as asingle
trait,
andapproximately
4%
accuracy was lostwhen
analysing
3 newtraits,
each traitbeing
a linear combination of observationsand economic values within a lactation
(table VII).
The last 2 rows of table VII indicate what accuracy was lost
by ignoring
separately.
In effectonly
average correlations between the traits within lactationsare taken into account when
combining
separate
indices for milkyield,
fatyield
andprotein
yield.
The loss in accuracy was small and very similar to losses whenusing
a
repeatability
model on the canonical variates(see
tableVI).
DISCUSSION
Only
oneaspect
ofefficiency
ofselection,
namely
accuracy ofpredicting
someaggregate
breeding
valueassuming
fixed effects wereknown,
was considered in thisstudy. Meyer
(1983)
found that for BLUPprediction
ofbreeding
values increasein accuracy from
including
later lactation observations waslargely through
animproved
data structure. Results fromincluding
information from relatives in thecalculations,
andincluding
comparisons
between young and old animals should haveUse of a
repeatability
model for milkproduction
traits across lactations seemedto have little effect on accuracy of
selection, although
thepredicted
gain/accuracy
may be
approximately
10%
toohigh.
More research is needed toinvestigate
long-term losses in response to selection when incorrect models are used to
predict
breeding
values.More information on the
sampling
variance of theparameter
estimates are neededfor
testing
theproportionality
hypothesis. Ideally,
one MV REMLanalysis
on the9 traits should be carried out, with an
algorithm
that wouldproduce
(second)
derivatives.Still,
calculations would then involve a 90 x 90(45
genetic
and 45environmental)
sampling
variance matrix which wouldprobably
besubject
tolarge
sampling
errors itself.As
pointed
outby
Meyer
(1985),
the canonical variates fromcreating
indepen-dent variates in lactation 1 may have abiological explanation.
Theeigenvectors
show that canonical variate 1
corresponds approximately
topercentage protein
(and
fat content to a lesserextent)
and canonical variate 2 to the difference be-tween fat andprotein
content. Canonical variate 3 seemsjust
to be the sum of fatand
protein
yield.
Heritabilities for canonical variates were consistent withheri-tabilities found for
fat
andprotein
contentpreviously reported.
Canonical variatesfrom
diagonalising
thecomplete
9 x 9P-
1
G
matrix have similarbiological
expla-nations to the canonical variates from lactation
1,
but nowincluding
comparisons
between lactations
(see
tableII).
Given that
parameter
estimates from table I aresubject
tosampling
error,ma-trices
describing
covariances betweenM, F,
and P within and between lactationswere
remarkably
proportional
to each other. Calculations for mass selectioncon-firmed that little information is lost if
proportionality
is assumed. Arepeatability
model on canonical variates from lactation one should account for selection bias andonly
losesapproximately
3%
in accuracycompared
to ageneral
multivariateprediction
ofbreeding
values ofmilk,
fat andprotein
yield
in lactations 1-3. The loss in accuracy is very similar to the loss when information between traits within each lactationseparately
isignored,
as is commonpractice
indairy
cattle selection indices. Forpractical
(national)
animal model BLUPevaluations,
theproposed
transformation is easy toimplement.
Reducing
thedimensionality
of theprediction problem by analysing
linear combinations of observations and economic values ofcorresponding breeding
valueswas found to be very efficient.
However,
no information from relatives was included in thecalculations,
and for the traits considered heritabilities andphenotypic
andgenetic
correlations were similar betweenpairs
of traits. Whenusing
traitsTesting
forproportionality
of covariance matricesNotation:
M = Moment
matrix;
asymmetric positive
definite(PD)
matrix of orderlp,
with mean squares and meancross-products
based ondf degrees
of freedomI = number of
lactations,
p = number of traits per lactation
V =
E(M) ;
unknown PD covariance matrix oflp
traitsK =
symmetric
matrix ofproportionality
constants of order 1Q
9 = direct
product
operator
(see
eg
Searle,
1966),
tr = traceoperator
L = natural
logarithm
of likelihood.Using
standard multivariatetheory
(eg
Anderson, 1958;
ch10),
with
A
j
=eigenvalue
ofV,
y
Z =
eigenvalue
ofMV-’.
The maximum likelihood
(ML)
is obtained for V =M,
Suppose
a moment matrixM
o
isobserved,
and the nullhypothesis
is,
Ho :
V
o
=E(M
o
)
= kV with Vspecified
and k a constant.Then,
and the ML estimate of
k,
k
=(’
2
:,Îi)/lp.
HenceFor the trivial case of
M
o
= kM where M is the ML estimate ofV,
alleigenvalues
ofM
o
V-
1
=M-’
O
M
are constant andequal
to theproportionality
constant
(= k).
The likelihood ratio(LR)
teststatistic,
t =2(ML — ML
o
),
asymptotically
has aX
2
distribution
withdegrees
of freedom[1/21p(lp
+1) -
1].
With observations on p traits in I
lactations,
onesuggestion
is to testV
o
=K I8i V
h
,
where
V
h
is a(transformation
ofa)
submatrixdescribing
a(co)variance
block of ptraits within or between lactations.
V
o
may be written as,the trace in
[A5]
may be written as:for P a
permutation
matrix andM!!
a I xI diagonal
block of M*. Thus[A5]
becomes:Using
(A6!,
the ML estimate of K is:ACKNOWLEDGMENT
PM Visscher
acknowledges
financial support from the MilkMarketing
Boards in the UK.REFERENCES
Anderson TW
(1958)
An Introduction to Multivariate StatisticalAnalysis.
JohnWiley
andSons, NY,
374 ppDommerholt
J,
Wilmink JBM(1986)
Optimal
selection response undervarying
milkprices
andmargins
for milkprices
andmargins
for milkproduction.
Livest Prod Sci14,
109-121 1 ..Ducrocq V,
Quaas
RL(1988)
Prediction ofgenetic
response
to truncation selectionacross
generations.
JDairy
Sci71,
2543-2553Ducrocq V,
BoichardD,
BonaitiB,
BarbatA,
Briend M(1990)
Apseudo-absorption
strategy
forsolving
animal modelequations
forlarge
data files. JDairy
Sci73,
1945-1955Hayes
JF,
Hill WG(1980)
Areparameterisation
of agenetic
index tolocate
itssampling properties.
Biometrics36,
237-248’ ’
’ .
Jones
LP,
Goddard ME(1990)
Five yearsexperience
with the animal model fordairy
cattle evaluations in Australia. In: Proc4th
WorldCongr
GenetAppl
LivestProd,
Edinburgh,
ScotlandJuly !J-27, 13,
382-385Lin
CY,
Smith SP(1990)
Transformation of multitrait to unitrait mixed modelanalysis
of data withmultiple
random effects. JDairy
Sci73,
2494-2502Meyer
K(1983)
Scope
forevaluating
dairy
siresusing
first and second lactation records. Livest Prod Sci10,
531-553Meyer
K(1985)
Maximum likelihood estimation of variance components for aSearle SR
(1966)
MatrixAlgebra
for
theBiological
Sciences. JohnWiley
andSons,
NYVisscher
PM, Thompson
R(1992)
Univariate and multivariateparameter
estimates for milkproduction
traitsusing
an animal model. I:Description
and results ofREML
analyses.
Genet SelEvol,
24,
415-430 ’ ’Wiggans GR,
MisztalI,
Van Vleck LD(1988a)
Implementation
ofan
animal model forgenetic
evaluation ofdairy
cattle in the United States. JDairy
Sci71
(suppl 2)
54-69