HAL Id: hal-00894154
https://hal.archives-ouvertes.fr/hal-00894154
Submitted on 1 Jan 1997
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
A reduced animal model approach to predicting total
additive genetic merit for marker-assisted selection
S Saito, H Iwaisaki
To cite this version:
Original
article
A reduced animal model
approach
to
predicting
total
additive
genetic
merit
for marker-assisted selection
S Saito
H Iwaisaki
1
Graduate School
of
Science andTechnology;
2
Department
of
AnimalScience, Faculty of Agriculture,
Niigata University, Niigata 950-21, Japan
(Received
22February 1996; accepted
15 October1996)
Summary -
Using
a system of recurrenceequations,
best linear unbiasedprediction
applied
to a reduced animal model(RAM)
ispresented
for marker-assisted selection. Thisapproach
is a RAM version of the method with the animal model to reduce the number ofequations
per animal to one. The current RAMapproach
allows simultaneous evaluation of fixed effects and total additivegenetic
merit which isexpressed
as the sum of theadditive
genetic
effects due toquantitative
trait loci(QTL)
unlinked to the marker locus(ML)
and the additive effects due to theQTL
linked to the ML. The total additivegenetic
merits for animals with no progeny arepredicted by
the formulae derived forbacksolving.
A numerical
example
isgiven
to illustrate the current RAMapproach.
marker-assisted selection
/
reduced animalmodel / best
linear unbiased prediction/
total additive genetic merit/
combined numerator relationship matrixRésumé - Utilisation d’un modèle animal réduit pour prédire la valeur génétique
globale dans la sélection assistée par marqueur. Sur la base d’un
système d’équations
derécurrence,
la méthode du meilleurprédicteur
linéaire sans biaisappliquée
à un modèle animal réduit(MAR)
estprésentée
pour la sélection assistée par marqueur. Cette méthodeest une version MAR de celle du modèle animal pour réduire à un le nombre
d’équations
par animal. Cette méthode MAR permet d’estimer simultanément les
effets fixés
et lavaleur
génétique globale, qui
est la somme deseffets génétiques additifs
des locus decaractère
quantitatif
(QTL)
non liés au locus marqueur et deseffets additifs
desQTL
liés au locus marqueur. La valeurgénétique globale
des animaux sans descendance estprédite
par unsystème d’équations
reconstitué àpartir
dusystème
principal.
Unexemple
n2imëriqué
est donné pour illustrer la méthode MARprésentée
ici.sélection assistée par marqueur
/
modèle animal réduit/
meilleur prédicteur linéairesans biais
/
valeur génétique additive totale/
matrice de parenté combinée*
INTRODUCTION
Marker-assisted selection
(MAS)
isexpected
to contribute togenetic
progressby
increasing
accuracy ofselection,
by
reducing generation
interval andby
increasing
selection differential
(eg,
Soller, 1978;
Soller andBeckmann, 1983;
Smith andSimpson,
1986;
Kashi etal, 1990;
Meuwissen and vanArendonk,
1992),
especially
forlowly
heritable traits(Ruane
andColleau,
1995).
Fernando and Grossman
(1989)
presented methodology
for theapplication
ofbest linear unbiased
prediction
(BLUP;
Henderson, 1973,
1975,
1984)
to MAS in animalbreeding.
Using
an animal model(AM)
with additive effects for alleles at a markedquantitative
trait locus(MQTL)
linked to a marker locus(ML)
and additive effects for alleles at theremaining quantitative
trait loci(QTL)
which arenot linked to the
ML,
they
showed theapproach
to simultaneous evaluation of fixedeffects,
effects ofMQTL
alleles,
and effects of alleles at theremaining
QTL.
The number ofequations required
in the AMapproach
isf
+q(2m
+1)
wheref,
q andm are the number of fixed
effects,
the number of animals in thepedigree
file and the number ofMQTLs,
respectively.
Therefore,
theapplication
of the AMapproach
may be limited to smaller data sets.Accordingly,
Cantet and Smith(1991)
deriveda reduced animal model
(RAM)
version of Fernando and Grossman’sapproach, by
which the total number ofequations
to be solved could beconsiderably
reduced. The total additivegenetic merit, ie,
the sum of the value forpolygenic
effects andgametic
effects can bepredicted directly
by
an AMprocedure
(van
Arendonk etal,
1994).
The number ofequations
required
in theprocedure
isf +q
since
the number ofequations
per animal is reduced to oneby combining
information on theMQTL
and theremaining
QTL
into one numeratorrelationship
matrix.BLUP methods for MAS
require computation
of the inverse of the conditionalcovariance matrix of additive effects for the
MQTL
alleles. Fernando and Grossman(1989)
derived analgorithm
tocompute
theinverse,
whichrequires
notonly
information on markergenotypes
but also information on theparental
origin
of marker alleles.Wang
et al(1995)
extended Fernando and Grossman’s work tosituations where
paternal
or maternalorigin
of marker alleles can not be determinedand where some marker
genotypes
are uninformative.In this paper, a RAM
approach
to theprediction
of total additivegenetic
merit ispresented.
The number ofequations
in thesystem
for this RAMapproach
becomes of the orderf
+ ql where ql is the number ofparental
animals.Also,
a small numericalexample
isgiven
to illustrate the currentapproach.
THEORY
In the
derivations,
oneMQTL
and one observation per animal are assumed forsimplicity.
The conditional covariance matrix between additive effects of theMQTL
alleles, given
the markerinformation,
is based on the recursiveequation
which wasAMs for MAS
An AM discussed
by
Fernando and Grossman(1989)
is written aswhere y is the n x 1 vector of
observations, 8
is thef
x 1 unknown vector of fixedeffects,
u isthe
q x 1 random vector with the additivegenetic
effects due toQTL
not linked to theML,
v is the2q
x 1 random vector with the additive effects of theMQTL
alleles,
e is the n x 1 random vector of residualeffects,
andX,
Z and P are n xf, n
x q and q x2q
known incidencematrices, respectively.
Theexpectation
and
dispersion
matrices for the random effects are assumed to bewhere
A,!
is the numeratorrelationship
matrix for theQTL
not linked to theML,
A
v
is thegametic relationship
matrix for theMQTL,
I is anidentity
matrix,
anda
u
2,a2
and
or
2
are the variancecomponents
for the additivegenetic
effects due toQTL
unlinked to theML,
for the additive effects of theMQTL
alleles and for the residualeffects,
respectively.
The mixed modelequations
forequation
[1]
arewhere Œu =
a£ la£
and Œv=af
l
l
afl.
On the other
hand,
the total additivegenetic
merit isexpressed
as the sum of the additivegenetic
effects due toQTL
not linked to the ML and the additive effects of theMQTL alleles,
or a = u + Pv.Then,
as discussedby
van Arendonk et al(1994),
equation
[1]
can be written asWith the model
!3!,
the variance-covariance structure for the total additivegenetic
merit a is
given
by
where
A
a
is the combined numeratorrelationship
matrix,
anda 2(= o!
+2a!)
is the variancecomponent
of the total additivegenetic
merit.Assumptions
on theexpectation
anddispersion
parameters
for the random effects in the model[3]
areAs described
by
van Arendonk et al(1994),
the mixed modelequations
areThe
proposed
RAMapproach
for MASThe vectors y, u and v in
equation
[1] can
bepartitioned
as y =[yp’
Yo! ! ,
u =
[up’
uo’ !’
and v =( v
P
’
’ ]’,
V
0
respectively,
where thesubscripts
p and orefer to animals with progeny and without progeny,
respectively.
Then Cantet and Smith(1991)
discussed the RAM version of the model of Fernando and Grossman(1989).
In the AM
given
asequation
(3!,
the vector a ispartitioned
as a= [ap’
a
o
’] ,
where
ap
and aorepresent
the total additivegenetic
merit for theparents
and for thenon-parents,
respectively.
With the similar idea used in y,X,
Z and e, theRAM of
equation
[3]
can be written asFor the
RAM,
it isnecessary
that ao isexpressed
as a linear function ofap.
Then we utilize asystem
of recurrenceequations,
as followswhere K is a matrix
relating
ao toap
and is definedby
and (p
is the vector of the residual effects. The vectorsap
and ao areexpressed
asap
=up
+Ppvp
and ao = Uo +o
,
o
v
P
respectively.
Moreover,
Uo and vo canbe
represented by
linear functions of up and vp,respectively
(Cantet
andSmith,
1991).
The additivegenetic
effects due toQTL
not linked to the ML of an animal can be described as the sum of the average of those of itsparents
and a Mendeliansampling effect,
orwhere the matrix T has zero elements
except
for 0.5 in the columnpertaining
to a knownparent,
and m is a vector of the Mendeliansampling
effects.The
relationship
between Vo and vp is written aswhere B is a matrix
relating
the additiveMQTL
effects of animals to those ofparents,
and E is a vector of thesegregation
residuals. If the situations where theparental
origin
of marker alleles is not determined areconsidered,
as discussedby
and d stand for the sire and the dam of animal
i,
respectively,
in scalar notationequation
[10]
isrewritten,
as followswhere v!
(1
= 1 or 2 and x =i,
s ord)
are thecorresponding
elements of vo andvp.
The coefficients
b!k
(k
=1,2,3
3 or4)
are the conditionalprobabilities
thatQ!,
is a copy of
QP
(m
= 1 or 2 and p = s ord),
given
the markerinformation,
whereQ!
stands for theMQTL
allele linked to the alleleM!
at theQTL
(Wang
etal,
1995).
Also,
ei
and
e?
are the segregation residual effects.Consequently,
inequation
[8]
the vectorcorresponding
to animal i of K can becomputed
aswhere
A
a
p, A
u
p
andA
v
p
areappropriate
submatrices ofA
a
,
A
u
andA
v
,
respec-tively,
t
i
is the vectorcorresponding
to animal i ofT,
qiis the matrixcorresponding
to animal i ofB,
1 is the vector(
11 )!
and 0 stands for the directproduct
op-erator.Using
equation
[7]
inequation
[6]
gives
and further
equation
!11!
can bearranged
asFor this model
(12!,
theassumptions
forexpectations
anddispersion
parameters
ofap
and 0 aregiven by
where the matrix R is
expressed
asIf we denote 4P +
Io0!
by
R
o
,
then the inverse matrix ofR
o
can beobtained,
as followswith s. =
Ro, i-lro, 21,
wherer
oj
is
the subvectorcorresponding
to animal i ofR
o
,
which contains elements for animals 1 to i - 1.Thus,
the mixed modelequations
forequation
[12]
are written asBacksolving
for animals with no progenyThe total additive
genetic
effects of animals with no progeny can bepredicted
from thefollowing
equations
The inverse of 0 can be obtained
according
toequation
!14!.
EXAMPLE
We use a small
example
data setincluding
sixanimals,
four animalshaving
progeny and two animals with no progeny, asgiven
in table I.and the matrix W in
equation
[12]
isThe inverse matrix of R is
given
asand
the vector ofright-hand
side isConsequently,
the vector of solutions forequation
[15]
isgiven
asand
also,
the vector of back-solutions inequation
[16]
isWhile the orders of the mixed model
equations
in the AMs of Fernando andGrossman
(1989)
and van Arendonk et al(1994)
and in the RAM of Cantet and Smith(1991)
are20,
8 and14, respectively,
that in the current RAMapproach
is6,
because animals 5 and 6 arenon-parents.
The solutions obtainedby
the currentapproach
are the same as thecorresponding
ones calculatedaccording
to AMs ofFernando and Grossman
(1989)
and van Arendonk et al(1994).
DISCUSSION
For marker-assisted selection
using BLUP,
the AMapproach
waspresented
firstby
Fernando and Grossman(1989),
and its RAM version was describedby
Cantet and Smith(1991).
These AM and RAMapproaches
permit
best linear unbiasedestimation of fixed effects and simultaneous BLUP of the additive
genetic
effectsdue to
QTL
unlinked to the ML and the additive effects due to theMQTL.
On the otherhand,
van Arendonk et al(1994)
discussed an AM method to reduce the number ofequations
per animal to oneby combining
information onMQTL
andQTL
unlinked to the ML into one numeratorrelationship
matrix. Their method allows theprediction
ofonly
the total additivegenetic
merit in addition to the estimation of fixed effects.Accordingly,
however,
the size of mixed modelequations
required
in their method can be smaller than those for theapproaches
by
Fernando andGrossman
(1989)
and Cantet and Smith(1991).
The current
approach
is a RAM version of the methodpresented by
vanArendonk et al
(1994),
and isgiven using
asystem
of recurrenceequations.
In thisRAM
approach,
the conditional covariance matrix for theMQTL
can becomputed
by
the method describedby Wang
et al(1995)
which does notrequire
assigning
theorigin
of the marker alleles and accounts for inbredparents.
With the currentapproach,
there is a reductionexpected
in the size of mixed modelequations
sincefor the random effects
only
theequations
forparental
animals arerequired
andthe number of
equations
perparental
animal isonly
one.However,
one feature of the current method is that the matrix R defined inequation
[13],
essentially
R
o
= 0 +IoQe,
is notdiagonal,
and needs to be inverted before introduction intoequation
(15!.
Thecomputing
algorithm
shown in this paper could be one of thestrategies
for thepractical
calculation. Another feature of ourapproach
is thatreduction of
computing
time. Furthercomparisons
between the current RAM andother
approaches,
for relativecomputational properties,
are needed.Hoeschele
(1993)
derived an AMapproach considering
equations
for total addi-tivegenetic
merits and additive effects due to theMQTL,
whereMQTL
equations
for animals not
typed
and certain other animals are absorbed. Themethod,
for realisticsituations,
would also lead to alarge
drop
in the number ofequations
re-quired.
A RAM consideration of Hoeschele’sapproach
has beengiven
by
Saito and Iwaisaki(1996).
The BLUP methods for
MAS, including
the current RAMapproach, require
theknowledge
of the recombination rate(r)
between the ML and theMQTL
andthe additive
genetic
varianceexplained
by
MQTL
(
Q
v).
Since true values of theseparameters
areusually
unknown inpractice,
it is necessary thatthey
are estimated.As
discussed,
eg,by
Weller and Fernando(1991),
van Arendonk et al(1993)
andGrignola
et al(1994),
with theassumption
of effects ofMQTL
allelesnormally
distributed,
theseparameters
can be estimatedby
the likelihood-based methods such as restricted maximum likelihood(Patterson
andThompson, 1971).
REFERENCES
Cantet
RJC,
Smith C(1991)
Reduced animal model for marker assisted selectionusing
best linear unbiasedprediction.
Genet SelEvol 23,
221-233Fernando
RL,
GrossmanM(1989)
Marker assisted selectionusing
best linear unbiasedprediction.
Genet Sel Evol21,
467-477Grignola FE,
HoescheleI, Meyer
K(1994)
Empirical
best linear unbiasedprediction
to mapQTL.
In:Proceedings of
the 5th WorldCongress
on GeneticsApplied
to LivestockProduction, University of Guelph,
Ontario 21, 245-248Henderson CR
(1973)
Sire evaluation andgenetic
trend. In: AnimalBreeding
and GeneticsSymposium
in Honorof
DrJay
LLush,
AmericanSociety
of Animal Science and AmericanDairy
ScienceAssociation,
Champaign,
IL, USA,
10-41Henderson CR
(1975)
Best linear unbiased estimation andprediction
under a selectionmodel. Biometrics 31, 423-447
Henderson CR
(1984)
Applications of
Linear Models in AnimalBreeding. University
ofGuelph,
OntarioHoeschele I
(1993)
Elimination ofquantitative
trait lociequations
in an animal modelincorporating genetic
marker data. JDairy
Sci76,
1693-1713Kashi
Y,
HallermanE,
Soller M(1990)
Marker-assisted selection of candidate bulls forprogeny
testing
programmes. Anim Prod51,
63-74Meuwissen
THE,
van Arendonk JAM(1992)
Potentialimprovements
in rate ofgenetic
gain
from marker assisted selection indairy
cattlebreeding
schemes. JDairy
Sci75,
1651-1659
Patterson
HD, Thompson
R(1971)
Recovery
of inter-block information when block sizesare
unequal.
Biometrika58,
545-554Ruane
J,
Colleau JJ(1995)
Marker assisted selection forgenetic improvement
of animalpopulations
when asingle QTL
is marked. Genet Res Camb66,
71-83Saito
S,
Iwaisaki H(1996)
A reduced animal model with elimination ofquantitative
trait lociequations
for marker assisted selection. Genet SelEvol 28,
465-477Smith
C, Simpson
SP(1986)
The use ofgenetic polymorphisms
in livestockimprovement.
Soller M
(1978)
The use of loci associated withquantitative
traits indairy
cattleimprovement.
Anim Prod27,
133-139Soller
M,
Beckmann JS(1983)
Geneticpolymorphism
in varietal identification andgenetic
improvement.
TheorAppl
Genet 67, 25-33van Arendonk
JAM,
TierB, Kinghorn
BP(1993)
Simultaneous estimation of effects ofunlinked markers and
polygenes
on a traitshowing quantitative genetic
variation. In:Proceedings of the
l7th InternationalCongress
onGenetics, Birmingham,
192van Arendonk
JAM,
TierB, Kinghorn
BP(1994)
Use ofmultiple genetic
markers inprediction
ofbreeding
values. Genetics137,
319-329Wang T,
FernandoRL,
van der BeekS,
GrossmanM,
van Arendonk JAM(1995)
Covariance between relatives for a marked