HAL Id: hal-00894220
https://hal.archives-ouvertes.fr/hal-00894220
Submitted on 1 Jan 1998
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Prediction of the response to a selection for canalisation
of a continuous trait in animal breeding
Magali San Cristobal, Jean-Michel Elsen, Loys Bodin, Claude Chevalet
To cite this version:
Magali San Cristobal, Jean-Michel Elsen, Loys Bodin, Claude Chevalet. Prediction of the response
to a selection for canalisation of a continuous trait in animal breeding. Genetics Selection Evolution,
BioMed Central, 1998, 30 (5), pp.423-451. �hal-00894220�
Original
article
Prediction of the
response to
a
selection
for canalisation of
a
continuous trait
in
animal
breeding
Magali
SanCristobal-Gaudy
Jean-Michel Elsen
b
Loys
Bodin
Claude Chevalet
a
a
Laboratoire de
génétique cellulaire,
Institut nationalde la recherche
agronomique, BP27,
31326 Castanet-Tolosancedex,
Franceb
Station
d’améliorationgénétique
des animaux, Institut nationalde la recherche
agronomique, BP27,
31326 Castanet-Tolosancedex,
France(Received
7 November1997; accepted
31August
1998)
Abstract -
Canalising
selection is handledby
a heteroscedastic modelinvolving
agenotypic
value for the mean and agenotypic
value for thelog variance,
associated with asingle phenotypic
value. A selectionobjective
isproposed
as theexpected
squared
deviation of thephenotype
from theoptimum,
of a progeny of any candidatefor selection. Indices and
approximate expressions
ofparent-offspring regression
are derived. Simulations are
performed
to check the accuracy of theanalytical
approximation.
Examples
of fat to protein ratio in goat milkyield
and musclepH
data inpig breeding
areprovided
in order toinvestigate
theability
of thesepopulations
to be canalised towards an economic optimum.
©
Inra/Elsevier,
Pariscanalising selection
/
heteroscedasticity/
selection index*
Correspondence
andreprints
E-mail: msc@toulouse.inra.fr
Résumé - Prédiction de la réponse à une sélection canalisante d’un caractère continu en génétique animale. Le
problème
de la sélection canalisante est traitégrâce
à un modèlehétéroscédastique
mettant enjeu
une valeurgénétique
pour la moyenne et une valeurgénétique
pour lelogarithme
de lavariance,
toutes deux associées à uneseule valeur
phénotypique.
Pour unobjectif
de sélection visant à minimiserl’espérance
des carrés des différences entre le
phénotype
etl’optimum,
pour un descendantd’un candidat à la
sélection,
des index sont estimés et desexpressions approchées
de larégression parent-descendant
sont calculées. Laprécision
de cesexpressions
ces
populations
à être canalisées vers unoptimum économique,
desexemples
sontdonnés : le rapport entre matière grasse et matière
protéique
du lait dechèvre,
et lepH
d’un muscle chez le porc.©
Inra/Elsevier,
Parissélection canalisante
/
hétéroscédasticité/
index de sélection1. INTRODUCTION
Production
homogeneity
is animportant
factor of economicefficiency
inanimal
breeding.
Forinstance,
optimal weights
and ages atslaughtering
exist forbroilers,
lambs andpigs,
and the breeder’sprofit depends
on hisability
to send
large
homogeneous
groups to theabattoir; optimal
characteristics ofmeat such as its
pH
24 h afterslaughtering
exist butdepend
on thetype
of
transformation;
eweslambing
twins have the maximumprofitability
whilesingle
litters are notsufficiently
productive
andtriplets
orlarger
litters aretoo difficult to
raise;
with extensive conditions where food is determinedby
climaticsituations, genotypes
able to maintain the level ofproduction
would be of interest.Hohenboken
[22]
listed differenttypes
ofmatings
(inbreeding,
outbreeding,
top
crossing
and assortativematings)
and selection(normalising,
directional andcanalising)
which can lead to a reduction in traitvariability.
Stabilisation of
phenotypes
towards a dominantexpression
has been knownfor a
long
time as amajor
determinant ofspecies
evolution,
similarly
tomuta-tions and
genetic
drift(e.g.
[4]
for areview).
Differenthypotheses explaining
these natural
stabilising
selection forces have beenproposed
(2,
3, 8, 15,
16, 19,
27, 38,
45-47,
49,
52!.
A number of models assume that trait stabilisation iscontrolled
by
fitness genes(e.g.
[9]
for areview),
whichkeeps
the meanphe-notype
at a fixed’optimal’ level,
without a necessary reduction of the traitvariability.
Alternativehypotheses
wereproposed
forcanalisation;
for instanceRendel et al.
[32, 33]
assumed that thedevelopment
of agiven
organ is underthe control of a set of genes, while a
major
gene controls the effects of thesegenes within bounds to
keep
thephenotype roughly
constant.Whatever its
origin
stabilisation is to be related to theenvironment(s)
inwhich it is
observed,
which makes it essential[48]
todistinguish
stabilisation of a trait in aprecise
environment(normalising selection)
from theaptitude
to maintain a constant
phenotype
influctuating
environments(canalising
selection).
Various artificial
stabilising
selectionexperiments
have been carried out withlaboratory
animals:drosophila
[17,
23, 29, 30, 34, 40, 41, 44,
48],
tribolium[5,
6, 24,
43]
and mice[32].
Mostoften,
selection was of anormalising
type
with a
culling
of extremeindividuals,
this selectionbeing applied
globally
[5,
29, 30,
41,
43,
44],
withinfamily
[24]
or betweenfamily
[6, 34].
Canalising
selection was
experimentally
applied
by
Waddington
[48]
andby
Sheiner andLyman
!40!,
their rulebeing
the selection of individuals less sensitive tobreeding
temperature
andby
Gibson andBradley
[17]
whoapplied
aculling
of extremesin a
population
bred in unstable environment(fluctuating
temperature).
Somegeneral
conclusions from theseexperiments
may beproposed:
1)
veryphenotypic
variance;
2)
heritability
estimationsduring
and at the end of the selectionexperiments
often showed that the selected traitgenetic
variancedecreased,
this conclusion notbeing
general;
3)
in many cases it waspossible
to prove that the environmental
variance,
or thesensitivity
of individuals toenvironmental
fluctuation,
was reducedby
selection.In this paper we
investigate
mathematical tools for the evaluation of thepos-sibility
andefficiency
oforganising
canalising
selection in animalpopulations.
Existence of agenetic
component
in varianceheterogeneity
between groups isa
prerequisite
for such a selectiongoal
to be feasible. Statisticalmodelling
andestimation
procedures
have beendeveloped
to take account of variancehetero-geneity
(e.g. [10,
11, 35,
36!),
inparticular using
alogarithmic
link betweenvariances and
predictive
parameters
[12,
13,
39!.
In the
following,
we extend such modelsby
introducing
agenetic
valueamong these
parameters,
consider thepossibility
ofestimating
this newgenetic
value,
then discuss theefficiency
of selection based on this model.Although
ourobjective
is toapply
suchmethodologies
to continuous and discretetraits,
wefirst concentrate here on continuous traits.
Applications
to artificialcanalising
selection towards an economicoptimum
ingoat
andpig
breeding
aregiven.
2. GENETIC MODEL
2.1.
Building
of a modelOur
approach
was motivatedby
the extensive literature mentioned in theIntroduction,
and inparticular
the paper of Rendel and Sheldon[34]
shows that artificialcanalising
selection doeswork,
in the sense that thepopulation
mean reaches the
optimum and,
moreimportantly,
the environmental variance is reduced. Some individuals are lesssusceptible
to environment thanothers,
thisparticularity
being genetically
controlled,
since itresponds
to selection. Some genes are now known to controlvariability,
e.g. theApolipoprotein
Elocus
[31]
inhumans,
the Ubx locus inDrosophila
[18],
the dwarfism locus in chickens(Tixier-Boichard,
pers.comm.),
and some(aTLs
with effects onvariance are
already
suspected
!1!.
Like
Wagner
et al.[50]
in theirequation
7,
the effect ofpolymorphism
at agiven
locus on the environmental variance may beexpressed
by
agenotype-dependent multiplicative
factor for this variance. The samehypotheses
(in
particular
no interactions betweengenes)
andreasoning
as in the Fisher model allow theprevious
one-locus model to be extended to apolygenic
orinfinitesimal
model,
in which each individual has agenetic
valuegoverning
amultiplicative
factor for the environmental variance.Since the
analysis
needs the evaluation ofphenotypic
variances associated withgenetic values,
it must be based onexperimental
designs
allowing
for therepeated
expression
of the same or ofclosely
relatedgenetic
values.Although
not
necessarily
efficient,
anypopulation
schememight
beconsidered,
butwe focus here on two
simple situations, repeated
measurements on asingle
individual,
and evaluation of one individual from theperformances
of its2.2. Animal model: basic model
A model
linking
aphenotype
yjof agiven
animal(from
repeated phenotypes
y =
(!1, ...,
yj , ... ,
yn ) )
with twogenetic
values u and v is considered.According
to the infinitesimal model of
quantitative genetics,
thesegenetic
values u andv,
possibly correlated,
are assumed to be continuousnormally
distributedvariables,
and contribute to the mean and to thelogarithm
of the environmentalvariance. The
simplest
version of the model can be written as:where p
is thepopulation
meanand
the
population log
variance mean, while:and the Ejs are
independent identically
distributedN(0, 1)
Gaussianvariables,
independent
of u and v. Additivegenetic
variances are denotedby
afl and a V ’ 2
and r is the correlation coefficient between u and v. The distribution of the
conditional random variable
Ylu,
v is Gaussian./1!(!
+u, exp(!
+v)),
but the unconditional distribution of Y is not. The unconditional mean and variance(the
phenotypic
varianceor y 2
of
the random variable Y areequal
toNote that the
v genetic
value and its varianceo, are
dimensionless;
exp(
77
)
has the same units as the
phenotypic variance,
andexp(w/2)
is the average(genetic)
scale factor of the environmental variance.2.3. Animal model: extensions
More
general
formulations of the model are needed to cope with realsituations.
First, introducing
permanent
environmental effects(denoted
by
pand
t)
common to severalperformances
of the same individual is necessary totake account of
non-genetically
controlledcorrelations,
both on the mean value- as it is usual to deal with
repeatability -
and on thelog
variance of the withinperformance
environmental effect.Thus,
thejth
performance
of an individualis modelled as:
where
(u, v),
(p, t)
and followindependent
Gaussian distributions: thebivari-ate normal
(2),
a similar bivariate distribution withcomponents
o, 2,
at and
When q individuals are measured in several
environments,
a moregeneral
heteroscedastic model can be stated as:
,-/
where yg is the
jth performance
of aparticular
animal in aparticular (animal
xenvironment)
combination i. This full model(6)
is ageneralisation
of model(1)
introducing
environmental andgenetic
parameters
to be estimated: locationparameters
({3, u, p)
anddispersion
parameters
(6,
v,t)
with incidencematrices
i
,
(x
zi,z
i
)
and, z
),
i
, z
i
i
(q
respectively.
Vectors u, p, v and t have thesame
length
q.!3
and 6 denote fixedeffects,
while u, v and p, t are randomgenetic
and randompermanent
environmental effects attached toindividuals,
respectively.
The vectors ofgenetic
values u and v have then ajoint
normal distribution:where ©
denotes the Kroneckerproduct
and A is therelationship
matrixbetween the animals
present
in theanalysis.
Permanent environmental effectsp and t are
similarly
distributed as:where I is the
identity
matrix,
independently
of(u, v).
This
general
way ofsetting
up the modelneeds,
however,
some caution whenapplied
to actualdata,
to assess whichparameters
areestimable,
taking
accountof the structure of the
experimental
design.
Specifically, analysing
apossible
genetic
determinism ofheteroscedasticity
needs a sufficient number ofrepeated
measures to be available for the same(or
related)
genotypes.
2.4. Sire model
In a progeny test
scheme,
thephenotypic
values attached to an individual are theperformances
of itsoffspring.
From theprevious
animalmodel,
theperformance
y2! of thejth offspring
of sire i can be written asfollows,
conditional on the
genetic
values ui and vi of the sire andassuming
unrelated dams:It is assumed here that the terms aZ! and
{
3
ij
include thegenetic
effects inoffspring
not accounted forby
thepart
transmittedby
the sire. Permanent environmental effects in theoffspring
(the
p and t variables of model 5 areThis can be rewritten as
with
E’(
Etj
)
=0,
Var(e!)
= 1. The distribution ofe! is
only
approximately
normalN
(0,1).
Models(9)
and(10)
are notstrictly equivalent, but,
since the first two moments of yj areequal
under bothmodels, they
areequivalent
inthe sense of Henderson
[21]
(see
e.g.[37]
for anapplication
of thisconcept).
Forexample,
forlarge
numbers ofoffspring
persire,
the mean sire’sperformances
and
sample
within sire variances haveasymptotically
the same structure ofvariances and covariances between relatives under both models. The
corresponding generalised
approximate
sire model is written aswith the
joint
densities(7)
for u and v, and(8)
for p and t.Methods needed to estimate
parameters
are outlined inAppendix
A. Inparticular,
they
allow thegenetic
values of individuals to beestimated,
asthe conditional
expectations
ofgenetic values, given
observedphenotypes
y:h =
E(u!y)
and v =E(vly),
if variancecomponents
are known. Estimationof variance
components
wassimilarly developed
to make the methodpossible
to
apply.
In the
following
we first focus ondevelopments
of the basicmodel,
which issimple
enough
to deriveapproximate analytical predictions
of the response toselection and to compare several selection criteria. In a second
step
we check thevalidity
of the theoreticalapproach
by
means of simulations and test theability
of the extended models andcorresponding
numericalprocedures
to tackle actual data and evaluate thepotential
forcanalising
selection.3. SELECTION OBJECTIVE AND CRITERION
3.1.
Objective
and criterionOne
objective
that summarises thebreeding goal
(progeny
performances
close to theoptimum
and with lowvariability
aroundit)
is the minimisation of theexpected squared
deviation ofoffspring performances
from theoptimum
y o
. This is the one we have chosen. For an individual characterised
by
aset y of
performances
(on
itself and on itsrelatives),
a selection criterionis defined as the
expectation
of thesquared
deviation E!(Yd - yo)2lyJ
ofoffspring performance Y
d
,
conditional on y, and selection willproceed by
keeping
individuals with minimal values of thisindex,
such that:In classical linear
theory,
it isequivalent
togiving
an individual a merit withrespect
to the selectionobjective,
defined as theexpectation
of itsoffspring
performance,
or to consider itsgenetic
value u, since the former isjust equal
to half the latter.
Breeding
animals are rankedaccording
to their estimatedgenetic
value.In the
present context,
due to thenon-linearity
of themodel,
wedefine,
for acandidate to selection with
given genetic
values u and v, its merit forcanalising
selection as the
expected squared
deviation of anoffspring performance:
Its conditional
expectation
E(M
*
ly)
isequal
to the indexWith
complications
due to the non-linearsetting
of ourmodel,
we derive inthe
following
the mean and variance of an individual’sphenotype distribution,
conditional on theperformances
of a relative.3.2. Conditional mean and variance
We need the distribution of a
phenotype
Y
d
of a progenyd, given
perfor-mances y of a relative F. Let ud, vd be thegenetic
values ofd,
y =fy
j
1,
j
=1, ...n,
u and v thephenotypic
andgenetic
values of animal F.Perfor-mances of animals F and d follow model
(1),
with:where a is the
relationship
coefficient between animals F and d(a
= 0.5 if d isthe progeny of
F).
The
density
f (yd!y)
describing
the distribution ofY
d
,
conditional on ycan-not be
explicitly
derived,
but its moments are calculable or can beapproxi-mated. We have:
This is first
integrated
over yd,owing
toand
finally
the distribution of u and v conditional on y isapproximated
as:where u =
E(u!Y)!
v =E(v!Y),
C
uu
=Var(!!Y)!
C
vv
=Var(vly),
C
w
=Cov (u, v ly),
are the estimated first and second moments of thegenetic
values(see
Appendix
A for the estimationmethod).
It follows that
and that
These
expressions
aregiven
numerical values after estimates ofgenetic
valuesand of variance
components
are available.General formulae can be derived that take into account all
performances
of the wholepedigree,
notonly performances
of asingle
relative. Theexplicit
forms of the extensions of
equations (18)
and(19)
aregiven
inAppendix
B. The combination ofequations
(18)
and(19)
gives
the indexI
*
(y)
inequation
(14),
equal
to the conditionalexpectation E(M*!y)
of thegenetic
meritM
*
,
as in Goffinet and Elsen!20!.
3.3.
Approximate
criteriaWhen the conditional variance terms
(C)
can beneglected,
for instance whenn is
large,
I* isapproximately
equal
to the maximum likelihood estimate of the merit M*:where hats
denote,
in this case, modes of thedensity
ofv,, v!y.
This is to be related to the work of Wilton et al.!51!,
whodeveloped
aquadratic
index fora
quadratic merit, by &dquo;minimising
theexpectation
of thesquared
differencebetween total merit and
index,
bothexpressed
as deviations from theirexpec-tations&dquo; . In their
setting,
normality
was assumed for the distributions ofgenetic
values and of
performances,
so that this criterion wasequal
to the maximumlikelihood estimate of the merit.
The
previous
calculations make itnumerically possible
to set up a selectionscheme,
but do not allowanalytical predictions
of theefficiency
of selectionaccording
to the values of variancecomponents
o’!, or2and
r. Someinsight
canIn the individual model
(1),
assuming
thatrepeated
measures are availablefor the candidates for
selection,
we consider thefollowing
selection index Iwhich is
equal
to thesample
mean squaredeviation,
y
denoting
thesample
mean and
S’y
thesample
variance of theperformance
set of anindividual,
1 n
6! ! -
!(!j -
y
)
2
.
Note, however,
that this index measures the value of an
j
=1
icandidate,
notdirectly
theexpected
value of its futureoffspring.
Truncation selection would beaccordingly
characterisedby
astep
fitness function wtdefined as:
Instead,
we consider a continuous fitness functionwhere s is a selection coefficient which can be
adjusted
to obtain the sameselection differential as
equation
(22).
Thepositivity
ofw(y)
inequation
(23)
necessitates a small s value. Hence we assume that selection is
weak, allowing
first-order
approximation
of the response to selection.For progeny test selection the model for y is
equation
(10),
but without pand
t,
andyields
a similar selectionindex,
y valuesbeing
made up of theperformances
of theoffspring
of the candidate for selection. The selectioncriterion
(21)
is then a true measure of the candidate’svalue,
and can beconsidered as an
approximation
of the criterion(12)
for thissimple population
structure.
4. RESPONSE TO CANALISING SELECTION
We seek the responses to selection for the
genotypic
values u and v, thegenetic merit,
and theperformance
(Y -
YO
)
2
.
Wequantify
the effects of selectionby
theregression
ofoffspring
on the selectedparent
(e.g. !9)),
in ageneral
way as:where X is any trait of
interest, E!(X)
itsexpectation
in the selectedpart
inthe candidate
population,
andEd (X )
theexpectation
ofphenotypes
among theoffspring
of the w-selectedparents.
The numerator is the responseR(w,
X)
to selection based on the fitness function w in the trait X of
interest,
measuredin the next
generation.
The denominator is the selection differentialS’(w, X),
measured amongparents.
As arule,
we restrict thefollowing
derivations to4.1.
Analytical
approximations
4.1.1. Animal modelWe first derive the distribution of u and v in the
parent
population
afterselection
according
to the fitness function w, then calculate thecorresponding
distribution in the
offspring
population.
Let
f (y)
be the unconditional distribution ofY,
andf (u, v)
thejoint density
of u and v. Thedensity
of Y in the selectedparental
population
isFollowing
Gavrilets andHastings
!14!,
we introduce the mean fitness of thegenotype
(u, v):
As with
M
*
(u,v)
inequation
(13),
this functionM(u,v)
=E(I(Y)!u,v)
can be considered as a
genetic
meritreferring
to a candidate’s own value andnot as in
equation
(13)
to that of a futureoffspring.
The mean fitness of thepopulation
is theproportion
of selected individuals:where
We obtain the distribution of
genetic
values among selectedparents:
4.1.1.1.
Genetic responseSince
genetic
values are transmittedlinearly
to theoffspring,
thegenetic
responses to
selection,
R(w,u)
andR(w,v),
are the differences ofexpected
genotypic
values u and v,respectively,
between candidates and selectedindi-viduals
(assuming
that selection occurs in asingle
sex,only
half of this progresswhere wg refers to
equation
(26).
The effects ofnon-linearity
are seen in the aboveequations.
Note that if
genotypes
are correlated(if
r is notzero),
theefficiency
of selection is reduced if r and(
M
-
y
o
)
are ofopposite signs.
4.1.1.!.
Parent-offspring regression
The
efficiency
of individualcanalising
selection towards yo is evaluatedby
theregression
coefficient(24)
calculated for the trait X =II(Y)
_
(Y -
y
o
)
2
.
Thefact that the
expectation
of the trait II of interest isequal
to theexpectation
of the index I involved in the fitness function w defined inequation
(23)
makes thefollowing
derivations feasible.Summarising
the detailed calculationsgiven
in
Appendix
C,
we state that the numerator ofequation
(24)
isequal
to the w-selection response in thegenetic
merit M:since M =
E(II!u, v).
The denominator ofequation
(24)
is the selection differential:This leads
to,
ifr = 0,
where V stands for
exp(?
l
+a!j2).
Ifgenotypes
arecorrelated,
an extra term2rau
av
V
(p, -
yo +4 ra
u
av)
is added to thenumerator,
and4(l
+ n)r
OUUV
V 2
1
-t -
yo+ ! 2 rauav )
is added to the denominator. The response to selection can be written as:i.e. as the
product
of selectionintensity
(1 = ! ) , of
a realizedheritability,
the B tf7ratio
b(w, II)
defined inequation
(34),
and of the standard deviation Qn of theselection index. 4.1.2. Sire model
As for the individual
model,
thegenetic
merit for the sire model is defined as:and the fitness
The
expectation E(M)
=E[I(Y)]
is the same asgiven
inequation
(27).
The response to selection in the trait
II(Y)
among maleparents
isand the selection differential is
The
regression
coefficient bgiving
the response tocanalising
selection in aprogeny test scheme is
equal
to the ratio of(36)
to(37).
Figure
1plots
the responsegiven
inequation (36)
in units of selectionintensity
andphenotypic
variance,
from anequation
similar toequation (35).
4.1.3. Extensions
The
previous
exactresults,
obtainedusing
the fitness function(23)
andanalogous
for the siremodel,
hold for weakselection,
and theirexpressions
as ratios of a covariance to a variance indicate that
they
can also be obtainedfrom a linear
approximation.
This comment makes itpossible
to extendeasily
the
approximate prediction
of response in cases when differentweights
aregiven
to the variance of
performances
and to their deviation from theoptimum.
Considering
the animal model withrepeated
measurements(5),
let us denoteII
1
(
Y
) =
(y -
YO
)
2
,
II
2
(
Y
) =
Sy,
the twocomponents
of II =(II
l
(y),II
2
(
Y
))&dquo;
s =
(
81
,
S2
)’
a vector of selectivevalues,
a =(cr
l
, cr
2
)’
a vector ofweights.
Weare interested in the response for the trait
a’II,
whenusing
the index s’ll asselection criterion. The
parent-offspring
regression
isequal
tointroducing
thefollowing
notationsh2
=or2 2 ,
c
2
-
(or2
+2 2
A =
(y -
y
o
)lay.
Fromequation
(38),
parent-offspring
regressions
for the meanand for the variance can be written
separately.
With si = 0 and a1 = 0 forinstance,
b tends toas n tends to
infinity
and ifat’
= 0. Thisparent-offspring
regression
is lowerthan a
half,
and tends to1/2
asafl
tends to zero.Note that the
parent-offspring regression for y
iswhich tends to
1/2
as n tends toinfinity
and ifQ
p
= 0.1 n
If the unbiased estimate of variance
H[
= ! 1
1 !(yj -
y)
2
is used in the n - I!’
j-i
index,
then the variance termP!2
=Var(II2
2
)
isproportional
toWhen
o, =
0,
the response inIIZ
is
null and the selection differential iscorresponds
to the variance of the trait(Y -
y)
2
(squared
deviation from themean),
up to amultiplicative
term. Whenafl
=0,
the response inY
is null and the selection differential isequal
toV/n,
More
generally,
this extension shows that a selection index(weights
s =(s
l
, s
2
)’)
can beadjusted
tooptimise
the response in agiven objective
specified
by weights
a =(a
l
, a
2
)’.
4.2. Simulations
Simulations were used to check the accuracy of
previous analytical
expres-sions of response asproposed
inequations
(34)
and(36)/(37),
in moregeneral
situations:
- intermediate selection
intensity,
since theanalysis
assumesonly
weakselection;
- behaviour of the
population
parameters
(mean, variance)
during
severalgenerations
ofselection;
-comparison
of the relative efficiencies of different selectioncriteria,
re-placing
in the simulation the theoretical continuous selection scheme(23)
by
truncationselection
according
to thesimplified
index(22)
andby
the likelihood based index1 (20).
Simulations were restricted to the case of the sire model with no
genetic
correlation
(r
=0).
4.2.1. Selection scheme
The selection scheme was as follows.
1)
Genetic values of sires and dams of the basepopulation
wereran-domly
drawn from thejoint
distribution(7)
with norelationships
(A
isthe
identity
matrix), giving
the sets{(ui, vi),
i =1, ... , ,S} for
thesires,
and(u
j
,
v
j
), j
=1, ... ,
D}
for
the dams.2)
Sires and dams were mated at random.3)
For eachcouple
(i,j),
theperformance
y2! of adaughter
wasgenerated
according
to:where Etj, aZ! and
{3
ij
were drawn from the Gaussian distributionsN(0,1),
jV(0,o’!/2)
andN(0,
a!/2),
respectively.
The terms a2! and{3
ij
represent
Mendelian
sampling.
4)
An index for each sire wascomputed
and elite sires were selected.5)
The elite siresproduced
S sons with the same female cohort used insteps
1-2.
Step
2(with
sons ofstep
5 anddaughters
ofstep
3)
tostep
5 wererepeated
until the 10th
generation.
The sire selection of
step
4 was a truncation selection based either on theof the merit
I(
Yi
)
=M(Û
i
, V
i
)
=3!!
+
exp(!7
+ v
Z
+3w
)
+(p, -
yo +ii 2
)2,
_ _ 428 8 2
with
u
i
andv
i
maximum likelihood estimates of ui and virespectively,
according
to model(10),
butallowing
for nopermanent
environmentaleffect,
and
assuming
that variancecomponents
were known.4.2.2. Simulation
experiments
For a constant
phenotypic
standard deviation for the basepopulation,
several values of variancecomponents
were tested:or =
0.033 and0.114, corresponding
to a ’low’
(hu =
0.10)
and a’high’ heritability
(h!
=0.3);
a ’low’variability
variance
Q
v
= 0.03 and a’high’
or2
= 0.15(corresponding
to ratios of maximumto minimum variance
equal
to 3 and10,
respectively).
Three basephenotypic
means were considered: pt=o
= 1,
1.8 and2,
for anoptimum equal
to yo = 2(giving
discrepancies
A
t=
o
=(
ut=
o -
yo)/
QY
,
c=
o
betweenpopulation
mean andoptimum,
expressed
inphenotypic
standarddeviations, equal
to1.75,
0.35 and0).
For
given
values of the seta!, a!, /1
and y, of the numbers of sires and dams and of selectionintensity,
100 selectionexperiments
wereperformed,
and statistics
averaged
over the runs. The evolution ofphenotypic
meanand
variance,
estimated merit over the tengenerations
andparent-offspring
regression
arehighlighted.
4.2.3. Results
Figure 2 displays
the curvesgiven by
theanalytical approximation
of theresponse, with
point
estimates and confidence intervals obtained with 100 simulated selectionexperiments,
showing good
agreement
of theapproximation
with truncation selectionon
thesimplified
index I(not shown),
but also withthe likelihood based index
I,
except
for intermediate values of A for which thetheory provides
underestimates.Figure
3plots
the evolution ofphenotypic
means and standard deviations overgenerations
ofcanalising
selection. Severalaspects
appear:-
with a
high heritability
h!,
thepopulation
mean tends in a linear mannertowards the
optimum
in a very efficient way;- the
convergence of the mean is
slightly
better ifw
islow;
- the decrease in
phenotypic
variance has a lineartendency, although
morefluctuating
than the evolution of the mean;-
this decrease is even more evident as
Q
v
is higher
andh2
is
lower.This
general
balance was encounteredthroughout
the simulationexperi-ments: a
particular
aspect
wasmaximally improved
when the otheraspects
were not under selection pressure. Variances are best reduced when the
popu-lation mean is at the
optimum.
Theoptimum
is morerapidly
reached when nogenetic variability
of the variances ispresent.
Figure
4 compares
theperformances
of the two indices I andT.
The likeli-hood based indexgives
more efficient results for the trait mean pt,probably
because
heterogeneous
variances were taken into account in the evaluation of the animalgenetic
values u,giving
less biased estimates. On thecontrary,
thephenotypic
varianceQY
is
best reduced with thesimplified
index,
presum-ably
due to the lackof
robustness of v estimationby
maximum likelihood. A fullBayesian
estimationprocedure
withmarginal
posterior expectation
ofparameters
might
be moreappropriate.
It was nevertheless notperformed
be-cause of the heaviness of the
algorithm,
since numericalintegrations
are then needed. The two indicesgive, however, equal
values of theglobal
criterion(p’t -
yo)
2
+0
,2 yl
t
at any time t.The
phenotypic
variance andsquared
difference between mean andoptimum
are lowered more and more as selectionintensity
isincreased,
while theparent-offspring
regression
remains constant in the simulations as in theapproximate
theory
(not
shown).
5. DISCUSSION
5.1. Model for the variance
The introduction of a
log
linear model is an easy way to handle amul-tiplicative
model on the variance. It is known that the distribution ofInS
2
,
thelogarithm
of thesample
varianceestimator,
isapproximately
normal(e.g.
!25!).
Similarly,
Bayesian
considerations onprior/posterior
densities show that the Gaussian distribution is agood
approximation
to alog
invertedchi-square
(see
[13]).
This led us to focus allanalytical
derivations on the first twomoments of
distributions, assimilating
when needed any distribution to the Gaussian distributionsharing
these same moments.Although
this may be acrude
approximation
if it is used forprediction
ofgenetic
response over severalgenerations,
it allows first order solutions to bederived,
and makes itpossible
to build statistical evaluation
procedures.
The model allows estimation of the
importance
ofgenetic
determinism inthe
heterogeneity
ofvariances,
and henceprediction
of how thepopulation
may
respond
to selectionagainst
variability.
Forexample,
theproportion
of the selection response due to thegenetic variability
in thev-component
isgiven by
the ratio
where the Gs are
given
inequation
(40).
It is all the moreimportant
as thepopulation
mean is closer to theoptimum,
theu-genetic
variance islower,
and thev-genetic
variance islarger.
Estimation of
genetic
parameters
(or u 2,
r,av 2)
may be somewhatimprecise,
especially
foru2
and
r. Hence it may be worthconsidering
the robustnessof
predictions
withrespect
tobadly
knownparameters.
As far as asimple
global
criterion isused,
thequestion
can be dealt witheasily,
considering
theexpected
responses as functions ofparameter
values. The situation would be more difficult to handle for selection schemes that wouldrely
on theknowledge
of
parameter
values,
forexample
if a balance between selection for the mean or for the variance wereadjusted
eachgeneration.
5.2. Data
The
generalised
version of the sire model(11),
including
fixed and randompermanent
environmentaleffects,
wasapplied
to actual data ingoats
(dairy
production)
and inpigs
(pH
of muscles afterslaughtering).
5.2.1. Milk data
Protein and fat contents were measured on milk from 2 383 first lactation
goats
between 1992 and 1995. Thegoats
weredaughters
of 54 artificialinsemination
sires,
with 20 observations at least in the data set. The trait of interest is the ratio of fat toprotein contents,
with a desiredoptimum equal
to1.3. This
objective
would becomplementary
toyield
traits such as milkyield
orprotein
yield.
Thephenotypic
mean and variance areequal
to 1.1 and0.0135,
respectively,
i.e. thepopulation
mean is 1.7phenotypic
standard deviationsaway from the
optimum.
Data arenormally
distributed. Forcomputational
ease, data were
pre-corrected
with the additive modelincluding
herd,
season,lactation
length
and age, on a muchlarger
data setincluding
all lactations of all herds where the 2 383kept
daughters
had beenproducing.
The variancecomponents
wereestimated, leading
to a nullcorrelation
coefficient(r - 0)
and zero
variability
variance(Qv -
0),
and aheritability
h!
= 0.44 of the sameorder as those for the
protein
and fat.A
canalising
selectionexperiment
isexpected
to drive thepopulation
meanrapidly
towards theoptimum,
but withoutchange
in environmental variance. Forexample,
assuming
selection of the best 10%
ofsires,
a reduction of 1.5phenotypic
standard deviations of thepopulation quadratic
deviation(!c-yo)2
2 would beexpected
in onegeneration.
5.2.2.
pH
datapH
values of semi-membranous muscle were measured on 947piglets
from 25Large
White sires. Data werenormally
distributed. Each sire had at least 20piglets.
Data werepre-corrected by
the usual linear modelaccounting
forsex,
line,
year andslaughtering
date effects on the trait mean, on a muchlarger
data
set,
in order tosimplify
furthercomputations.
Thereafter,
a sire modelfor the residuals of the
previous
model was fitted.Estimated values of variance
components
under model(11)
with’perma-nent
environmental effects’(non-genetic-sire effects)
wereequal
toa2
=0.15,
hu
= 0.26(with
a
2
y
=0.037),
r =0.79,
QP
=0.00045,
Qt
= 0.046 and p = 0.79.
With an
optimum
value yo = 5.7 not different from the overall mean p =5.75,
the estimated variance
components
should allow ahigh
response tocanalising
selection to be obtained
through
astrong
reduction of thegenetically
controlledpart
of environmental variance:assuming
that selection sorts out the best 10%
of male
parents,
a reduction of about 12%
of the initialphenotypic
varianceIt must be stressed that
predictions
derived from the aboveanalysis
of fatto
protein
ratio ingoats
and ofpig pH
muscle data areonly
indicative. Forexample,
the effect of awrongly
estimated correlationvalue
r remains to beassessed,
even if - in thegoat
example -
nosignificant
genetic
component
ofvariance was found for variances.
Also, although
precision
ofthe previous
early
estimates was not
evaluated,
larger
data sets areprobably
needed. A properprediction
ofexpected
response to selection cannot beproposed
until theseanalyses
are carried out.So far we do not have results from an actual selection
experiment,
based on our index selectionrules,
which would be necessary tocompletely
validate theapproach
through
thecomparison
of observed realised heritabilities with ourpredictions.
It is one of theperspectives
of the current work toorganise
such selectionexperiments.
5.3. Selection criteria .
We have considered a
single global
criterion that combines selection for themean and selection
against
the variance of the trait.Shnol and
Kondrashov [42]
considered the action of selection with fitnessw(y)
on aquantitative
trait y.They
concluded that truncation selectionmin-imises the
genetic
load and the variance of the trait after selection. Linearselec-tion
(corresponding
to our continuous fitness with lowselection)
gives
minimal variance of the relative fitness and is less efficient than truncation selection.However,
linear selection gave us theopportunity
for
robustanalytical
approx-imations of realised
heritability.
Calculations wereimpossible
for truncationselection,
even withthe
simpler
index. Within the limits of thepresent
com-parisons
withsimulations,
the fitnessapproximation
proved useful,
even in cases withstrong
departure
fromlinearity,
and with a ratherstrong
selectionintensity
(proportion
of selected individualsequal
to 20%).
More
sophisticated
selection criteria may bedefined,
allowing
selection to bedifferentially
directed towardschanging
the mean value of the trait orreducing
the environmental variance. In fact
using
aglobal genetic
merit to be maximised in the nextgeneration
is a way to distribute selectionintensity
between bothparameters.
It ispossible
that ahigher multi-generation
response could beob-tained if selection were controlled each
generation
in view of theobjective.
Forexample,
the index(y -
YO
)
2
+ S;
can begeneralised
into81(Y -
YO
)
2
+S2’S’!,
allowing
agreater
selection pressure either on the location near theoptimum,
or on thedispersion,
as illustrated in the above theoretical section. The sameremark is available for the index
(.E(y!y) —
y
o
)
z
+Var(Yd!y).
Moregenerally,
the selection criterionmight
be based on the economic worth ofoffspring.
Thecriterion would then be defined as the
expected
economic value ofoffspring,
a function
depending
on the distribution ofexpected phenotypes
and on theeconomic value of
phenotypic
values. But of course othertypes
of indices andmating
systems
arepotentially
interesting
toconsider,
for instance a linearindex when
Q
v
is
small,
mate selection or group selection.Managing
the balance between location and scale could beinteresting
in along-term
selection process,provided
someanalytical approximation
isavail-able in order to include