HAL Id: hal-00893941
https://hal.archives-ouvertes.fr/hal-00893941
Submitted on 1 Jan 1992
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Estimation of heritability in the base population when
only records from later generations are available
L Gomez-Raya, Lr Schaeffer, Eb Burnside
To cite this version:
Estimation of
heritability
in
the base
population
when
only
records
from
later
generations
areavailable
LGomez-Raya
LR
Schaeffer
EBBurnside
University of Guelph, Centre for Genetic Improvement of Livestock Animal and Poultry Science , Guelph, Ontario, Canada NI G 2W1
(Received
8 November 1990; accepted 28 November1991)
Summary - The genetic variance and
heritability
of a quantitative trait decrease underdirectional selection due to the generation of linkage
(gametic phase)
disequilibrium. Aftera few cycles of directional selection in a population of infinite sire a steady-state equilibrium
is approached. At this point there is no further reduction in these parameters since the
disequilibrium generated by selection is offset by free recombination. In many situations records available to estimate genetic parameters come from populations at the steady-state
equilibrium. A simple method to obtain estimates of genetic variance and heritability in the base population using estimates of these parameters at the equilibrium is described. The method makes use of knowledge of the effect of repeated cycles of selection on genetic
variance and heritability to infer the base population parameters.
genetic variance / heritability / estimation of genetic parameters / linkage disequi-librium
Résumé - Estimation de l’héritabilité dans la population initiale en utilisant
seule-ment les données des générations subséquentes. Lorsqu’il y a sélection directionnelle sur un caractère quantitatif, la variance génétique et l’héritabilité sont réduites à la suite de la formation d’un déséquilibre de liaison
(phase
gamétique).
Après quelques cycles desélection directionnelle dans une population de taille infinie, un équilibre stable est at-teint.
À
partir de ce moment, il n’y a plus aucune réduction de ces paramètres puisquele déséquilibre créé par la sélection est compensé par la recombinaison. Dans plusieurs
situations, les données disponibles pour estimer les paramètres génétiques proviennent de
populations en équilibre stable. Une méthode simple d’estimation de la variance génétique et de l’héritabilité dans la population initiale est présentée. Cette méthode tient compte de l’effet d’une succession de cycles de sélection sur la variance génétique et l’héritabilité pour inférer la valeur de ces paramètres dans la population initiale.
variance génétique / héritabilité / estimation des paramètres génétiques / déséquilibre de liaison
INTRODUCTION
The estimation of
genetic
variances and heritabilities ofquantitative
traits inpopulations
under artificial or natural selection is a commonobjective
in animalbreeding
andevolutionary
biology
of naturalpopulations.
Standard methods toestimate
genetic
variances and heritabilities when information is available on the parents andoffspring
are the correlation among sib and theregression
ofoffspring
on parents(Falconer, 1989).
Analysis
of variance of half-sibyields
biased estimatesof
heritability
if the parents are a selectedsample
from thepopulation
(Robertson,
1977;
Ponzoni andJames,
1978).
Unbiased estimates ofheritability
by
half-sib correlation can be obtained aftercorrecting
for the bias inducedby
selection ofsires
(Gomez-Raya
etal,
1991).
Regression
ofoffspring
on parents is not alteredby
selection of animals to be parents(Pearson, 1903)
and therefore estimates ofheritability
by
regression
are unbiased(Robertson,
1977).
Inboth,
half-sib andregression
analyses,
unbiased estimates ofheritability
are obtained after onecycle
ofselection.
Regression
estimates ofheritability
are not unbiased for the accumulated reduction ingenetic
variance afterrepeated cycles
of selection(Fimland,
1979).
Thechanges
ingenetic
variance under selection were describedby
Lush(1945)
using
genetical
arguments and a numericalexample.
Bulmer(1971)
formally
establishedthe
theory
toexplain
thechanges
in thegenetic
variance under continuedcycles
selection. Under theassumption
of an infinitesimal gene effect model thegenetic
variance and
heritability
are reduced due to thebuild-up
oflinkage
(gametic
phase)
disequilibrium
in apopulation
of infinite size and with discretegenerations.
After
only
a fewcycles
of directional orstabilizing
selection alimiting
orsteady-state
equilibrium
value for these parameters isapproached.
At thispoint
the newdisequilibrium generated
by
the selection of parents is offsetby
free recombination. Most animalpopulations
areprobably
in thesteady-state
or close to it since theequilibrium
isapproached
veryquickly.
The use of standard methods to estimategenetic
variance andheritability
yields
estimates of these parameters in the limitsituation.
However,
in many cases, interest is on the parameters in thenon-selected base
population.
Sorensen andKennedy
(1984)
have shown that mixed modelmethodology
may be used to estimate thegenetic
variance andheritability
in the base
population.
They
carried out a simulationexperiment
for severalcycles
of mass selection and thenproceeded
to estimategenetic
variancesusing
a minimum variance
quadratic
unbiased estimator(MIVQUE)
under the correctmodel.
They
found close agreement between observed and simulated parameters.The
requirement
ofusing
the correct modelimplies making
use of therelationship
matrix with
complete pedigree
information back to the basepopulation.
Naturalpopulations
arecurrently
under selection andpedigree
information is not known. Inlivestock
species,
such asdairy
cattle,
pedigree
information isonly
recorded from thelater years. In
general,
mixed modelmethodology
requires
thegenetic
variance of the basegeneration
as determinedby
the available data andcorresponding pedigree
information.
Therefore,
if the available data andpedigrees
areonly
for animals atthe
point
of selectionequilibrium,
then thegenetic
variance at selectionequilibrium
is needed to evaluate animals
by
mixed model methods.However,
knowledge
ofgenetic
variance in the basepopulation
(prior
tostarting
selection)
is necessaryand/or
accuracy of evaluation differ from those in the currentbreeding
programme.Any
changes
in those parameters alter the amount ofdisequilibrium
maintained inthe
population.
After a fewcycles
of selection a newequilibrium
will beapproached
which can be
predicted
withknowledge
of new selectionintensity,
new accuracy ofevaluation,
and thegenetic
variance in the basepopulation.
The
objective
of this paper is to describe a method to estimate basepopulation
genetic
variance andheritability
from data available at thesteady-state equilibrium.
Use is made of effect of
repeated cycles
of selection ongenetic
variance andheritability.
Assuming
thepopulation
is at theequilibrium,
the basepopulation
parameters are obtained
by
reversing
Bulmer’s arguments.THEORY
Consider an additive infinitesimal gene effect model. The trait under selection
is determined
by
a verylarge
number of loci with recombination rates of1/2.
Assume that selectionintensity
isconstant
across discretegenerations
and that each individualbelonging
to the same sex is evaluated with the same accuracy.Population
size is infinite. Selection isby
truncation. Assume that there are nodepartures
fromnormality
after selection(Bulmer, 1980).
The basic
theory
toexplain
thechanges
ingenetic
variance inpopulations
undergoing
selection was firstgiven by
Bulmer(1971).
Thebreeding
value of anindividual i in a
given
generation
is:where a8 and aD are the
breeding
values of the sire and damrespectively
and eiis the mendelian
sampling
effect in individual i. ei is distributednormally
with variance((1/2)
0’ A
.
) 2
in apopulation
of infinite size whereor2A
O
is thegenetic
variance in the basepopulation.
Thegenetic
variance in the selected group ofparents is reduced
by
kr
2
(Pearson,
1903),
where r is the accuracy of selection andk =
(Ø(x)/p)((Ø(x)/p) - x)
for selection of thetop
ranking
individuals(directional
selection)
and k =2x(Ø(x)/p)
for selection of the middleranking
individuals(stabilizing selection);
x = standard normaldeviate;
§(x)
= ordinate at cutoffpoints
for p =proportion
selected. Thegenetic
variance in theoffspring
can bepartitioned
into between and withinfamily
components. Thewithin-family
variance is not affectedby
selection of parents and has value((1/2)
QAo
).
This is true on theassumptions
of a verylarge
number of loci and infinitepopulation
size,
ie nochange
in the gene
frequencies
of thesegregating
loci for the trait. Thebetween-family
variance has a value of
(1-krLl)(1/2)0’!t-l’
where
QAt
_
1
is
thegenotypic
variance in theprevious generation.
Therefore,
thegenetic
variance in agiven generation,
t,assuming
different selection intensities and accuracy of selection in the 2 sexes is:where r,,_, =
accuracy of selection of sires in
generation
t -1; rDt-1 =
accuracy
of selection of dams in
generation
t —1;
k
9
andk
D
are the values of parameter kAt the limit there are no further
changes
in thegenetic
variance since the newdisequilibrium generated
in thatgeneration
iscompensated
forby
freerecombina-tion.
Then, genetic
variance becomes:After some
algebraic manipulation
this reduces to:Assuming
constant environmental variance acrossgenerations
andsubstituting
expression
[1]
in the standard formula ofheritability,
theheritability
at theequilibrium
limit is:If the
population
is at thesteady-state equilibrium
and records are availableto estimate
genetic
variance,
then estimates of basepopulation
parameters canbe found
by solving
expressions
[1]
and[2]
forar2A
andho,
respectively,
andby
substituting
true valuesby
their estimates.Thus,
genetic
variance andheritability
in the basepopulation
can be obtainedby:
where &dquo;&dquo;’&dquo;
denotes estimate.
!
If selection criterion is the individual
phenotype
thenri&dquo;,
=F2 D,
=!2
and
expressions
[3]
and[4]
reduce to:respectively.
In theseexpressions k
=0.5k
8
+0.5k
D
.
Therequired
estimates of thegenetic
variance andheritability
atequilibrium
can be obtainedby
eitherregression
or maximum likelihood methods. It isgenerally accepted
that maximum likelihoodinformation is included in the
analysis.
If REML(restricted
maximumlikelihood)
account for
selection,
say, ingenerations
0 to 6, then it will also account for selectionin
generations
6 to 10 whenonly
data from thesegenerations
are available. In theformer case, the component of variance estimates the
genetic
variance in the basepopulation,
and in the latter thegenetic
variance ingeneration
6 which it is assumedto be the
equilibrium
genetic
variance.The
approximate
sampling
variance of the estimate ofheritability
inthe
basepopulation
can be obtainedby differentiating
expression
[6]
with respect to!2 L7
Therefore,
thesampling
variance of the estimate ofheritability
in the basepopulation depends
on a factorf ,
which is a function ofh)
and k because underphenotypic
selectionh
depends only
onh)
andk,
and on thesampling
variance ofh
1
.
Values of thef
factor for different£)
arerepresented
infigure
1 forvarying
selected percentages
(p)
50%, 20%, 10%,
and 1%. The value ofhi
was obtainedby
solving
expression
[6]
as a function of knownh) :
as described
by
Gomez-Raya
and Burnside(1990).
For traits withheritability values
less than
0.70,
f
islarger
than 1 and therefore thesampling
variance of!2
will
be
increased
with respect to thesampling
variance of the estimates at the limit(Var
(hL)).
Selectionintensity
appears to have small effect onf.
In
practical
animalbreeding,
theperformance
of relatives can be used tomax-imize response
by
the use of selection indices. Forexample,
consider apopulation
where sires are selected on the average of records of d
daughters
each with one recordand dams are selected on the average of n records each. Estimates of
heritability
inthe base
population
can be obtainedby
substituting
inexpression [4]
theappropri-ate
equilibrium
values of accuracy for siresT
SL
= [dhLf(4+(d-l)hlW/2
and damsT
v
L
= [nhLf(l + (n-l)rêpL)]1/2,
where
rep
L
# [(8 fl! + 8 $! ) / (8 fl! + 8$! + 8$! )] ,
8$ !
= estimated permanent environmental variance andQT
E
= estimated
DISCUSSION
In this paper a method to estimate
heritability
in the basepopulation
from dataat the
steady-state equilibrium
ispresented.
The method to obtain estimates ofparameters at the
equilibrium
is assumed to be unbiasedby
selection of parents inthat
particular
generation.
Estimates ofheritability by regression
ofoffspring
on parents is unbiasedby
selection in agiven generation
(Robertson, 1977).
Estimationof
heritability by
half-sib correlation is biasedby
selection of sires(Robertson,
1977;
Ponzoni and
James,
1978),
but estimates can be corrected(Gomez-Raya
etal,
1991),
and then final estimates free of selection bias can be obtained. Another alternative
is to use the method
given
by
Sorensen andKennedy
(1984).
They proposed
theestimation of
genetic
variance in latergenerations
using
theMIVQUE algorithm
andpaper
they
carried out a simulationexperiment
to test thevalidity
of this method. Ingeneration
7 actualgenetic
variance had decreased from 10 to 8.41. The simulatedenvironmental variance was
10,
soheritability
at the limit was0.457, assuming
that environmental variance was known without error. The percentage selected in
males was 50%
(k
s
=0.637)
in eachgeneration.
Dams were not selected(k
D
=0).
Using
expression
[6]
aftersubstituting
estimated with true parameter values andcorresponding
values ofh
i
,
k
s
andk
D
theheritability
in the basepopulation
isexpected
to be0.491,
which is very close to the simulatedheritability
in the basepopulation
(0.50).
On the otherhand,
Van der Werf(1990)
carried out 2 differentsimulation
experiments
in which mass selection waspractised
on males at different selection intensitiescorresponding
to percentage selected p = 10% andp = 25%. He
proceeded
to estimate components of varianceusing
REML(restricted
maximumlikelihood)
and the data fromgenerations
4 and 5 withpedigree
information knownback to
generation
3.Treating
sires as random in the model he obtained biasedestimates
(8.58
for p = 10% and 8.71 forp =
25%)
of the basepopulation genetic
variance(10).
If we assume that thepopulation
is at thesteady-state
equilibrium
ingeneration
3 thengenetic
variance in the basepopulation
can be estimatedusing
[5]
after
substituting
appropriates
values ofk(k
s
= 0.830 forp = 10%
and k,
= 0.759 for p =25%;
k
D
=0)
and!2
(0.45
forp = 10% and 0.46 for
p =
25%).
The values of!2
can be obtained from the estimates ofgenetic
(8.58
for p = 10% and 8.71 for p =25%)
and residual variances(10.44
forp = 10% and 10.17 for
p =
25%)
given
by
Van der Werf(1990)
in table II.Proceeding
in this way, estimates of thegenetic
variance in the base
population
are 10.18(p
=10%)
and 10.23(p
=25%).
These values are very close to the simulatedgenetic
variance in the basepopulation
(10).
Theslight discrepancy,
in thesestudies,
occurs because the formulae derived in thispaper have not taken into account the effect of
inbreeding
in the reduction ofgenetic
variance.Throughout
this paper,population
size has been assumedinfinite,
andtherefore, inbreeding
effects ongenetic
variance were not considered. Both naturaland livestock
populations
are finite. The reduction ingenetic
variance due to thebuild-up
oflinkage disequilibrium
occursrapidly
in the firstgenerations
whereasinbreeding
effect is small but accumulatesgradually
in latergenerations.
After thesteady-state equilibrium
isachieved,
thegenetic
variance reducesgradually
dueto
inbreeding
and so does the amount oflinkage disequilibrium
maintained in thepopulation. Thus,
correction for selection at thispoint
would notyield
estimates of thegenetic
variance andheritability
in theoriginal
basepopulation. Rather,
theestimates of these parameters would be those obtained after
relaxing
selection forseveral
generations,
in otherwords,
thegenetic
variance due to the genefrequencies
segregating
in thepopulation
at thegeneration
inquestion.
In mostsituations,
theseare the parameters of interest because
they explain
how muchgenetic variability
could be used in selection programmes. Prediction ofthe
joint
effects ofinbreeding
and selection ongenetic
variance is rather difficult(Robertson,
1961;
Verrier etal,
1990;
Wray
andThompson,
1990).
The method described in this paper is able to correct for the bias
generated
after
repeated cycles
of selectionassuming
equal
information on each individualevaluated and constant selection
intensity
acrossgenerations.
Inpractice,
bothassumptions
may not hold. Methods to estimatebreeding
values such as best linearof livestock. Each individual
breeding
value has a different accuracy in BLUP evaluations. Ifpedigree
information is known back to the basepopulation
then mixed modelmethodology
could be used(Sorensen
andKennedy,
1984).
The effect of different accuracies among selection candidates ongenetic
variance isnot known. Further work is needed to
incorporate
this kind of selection in the methodpresented
in this paper.Changes
in selectionintensity
acrossgenerations
result in
changes
in thedisequilibrium
in thepopulation
parameters over time. In naturalpopulations,
selectionintensity
could oscillate due tochanges
in thepattern of interaction among
species
and/or
environmental fluctuations. In livestockpopulations,
selectionintensity
may fluctuate due tochanges
inproduction
systemor in market conditions.
Therefore,
theprocedure
described in this paper wouldgive
only
approximate
values of the basepopulation heritability. However,
oscillationin the selection
intensity
acrossgenerations
has small effect on the estimationof
heritability
in the basepopulation
because the parameter kchanges only
veryslightly
with selectionintensity.
Forexample,
if we use a wrong value of selectionintensity corresponding
to selection of the top 1%(k
9
=0.903; k
D
=0)
in the simulationexperiment
of Sorensen andKennedy
(1984),
thenheritability
in the basepopulation
afterusing expression
[6]
is 0.504. This value isagain
very closeto the simulated
heritability
(0.50).
Therefore,
eventhough
selectionintensity
isnot constant across
generations
the method described in this paper could be usedto
estimate,
in a veryapproximate
manner, the value ofheritability
in the basepopulation.
ACKNOWLEDGMENTS
We gratefully thank C Smith and B Villanueva for very useful comments. This research
was supported by Instituto Nacional de Investigaciones Agrarias
(Spain)
and the OntarioMinistry of
Agriculture
and Food(Canada).
REFERENCES
Bulmer MG
(1971)
The effect of selection ongenetic
variability.
Am Nat 105,201-211 ’
Bulmer MG
(1980)
The MathematicalTheory of Quantitative
Genetics. ClarendonPress,
OxfordFalconer DS
(1989)
Introduction toQuantitative
Genetics.Longman
Press, Essex,
3rd edn
Fimland E
(1979)
The effect of selection on additivegenetic
parameters. Z TierzZuchtungsbiol96,
120-134Gomez-Raya
L,
Burnside EB(1990)
The effect ofrepeated cycles
of selection ongenetic
variance,
heritability,
and response. TheorAppL
Genet79,
568-574Gomez-Raya
L,
SchaefferLR,
Burnside EB(1991)
Selection of sires to reducesampling
variance in the estimates ofheritability by
half-sib correlation. TheorAppl
Genet81,624-628
Lush JL
(1945)
AnimalBreeding
Plans. The Iowa StateUniversity
Press,
Ames,
Pearson K
(1903)
Mathematical contributions to thetheory
of evolution. XI. On the influence of natural selection on thevariability
and correlation of organs. PhilTrans R Soc Lond Ser A
200,
1-66Ponzoni
RW,
James JW(1978)
Possible biases inheritability
estimates from intraclass correlation. TheorAppl
Genet53,
25-27Robertson A
(1961)
Inbreeding
in artificial selection programmes. Genet Res 2,189-194
Robertson A
(1977)
The effect of selection on the estimation ofgenetic
parameters. Z TierzZuchtungsbiol
94,
131-135Sorensen
DA, Kennedy
BW(1984)
Estimation ofgenetic
variances from unselected and selectedpopulations.
J Anim Sci59,
1213-1223Van der Werf JHJ
(1990)
Models to estimategenetic
parameters in crossbreddairy
cattlepopulations
under selection. Doctoralthesis,
Dept
AnimBreeding,
Agric
Univ, Wageningen,
TheNetherlands,
ch 5Verrier