HAL Id: hal-00894040
https://hal.archives-ouvertes.fr/hal-00894040
Submitted on 1 Jan 1994
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
An approximate theory of selection assuming a finite number of quantitative trait loci
C Chevalet
To cite this version:
C Chevalet. An approximate theory of selection assuming a finite number of quantitative trait loci.
Genetics Selection Evolution, BioMed Central, 1994, 26 (5), pp.379-400. �hal-00894040�
Original article
An approximate theory of selection assuming a finite number
of quantitative trait loci
C Chevalet
Institut National de la Recherche
Agronomique,
Centre de Recherches de
Toulouse,
Laboratoire de
Génétique Cellulaire,
BP 27, 31326 Castanet-Tolosan
cedex,
France(Received
24August
1993;accepted
11May 1994)
Summary - An
approximate theory
of mid-term selection for aquantitative
trait isdeveloped
for the case when a finite number of unlinked loci contribute tophenotypes.
Assuming
Gaussian distributions ofphenotypic
andgenetic effects,
theanalysis
shows thatthe
dynamics
of the response to selection is definedby
onesingle
additional parameter, the effective number Le ofquantitative
trait loci(QTL).
This number isexpected
tobe rather small
(3-20)
ifQTLs
have variable contributions to thegenetic
variance. Asis confirmed
by simulation,
thechange
with time of thegenetic
variance and of thecumulative response to selection
depend
on this effective number ofQTLs
rather thanon the total number of
contributing
loci. The model extends theanalysis
ofBulmer,
andshows that an
equilibrium
structure arises after a few generations in which some amount ofgenetic variability
is hiddenby gametic disequilibria.
The additivegenetic
varianceV,q
and the
genic
variance Va remain linkedby: V A = % - t(l — / 1/L e )h 2 V A ,
where K isthe
proportion
of variance removedby selection,
andh 2
the currentheritability
of thetrait. From this property, a
complete
approximatetheory
of selection can bedeveloped,
and modifications of correlations between relatives can be
proposed. However,
the modelgenerally
overestimates the cumulative response to selection except inearly generations,
which defines the time scale for which the present
theory
is ofpotential practical
value.quantitative genetics
/
selection/
genetic varianceRésumé - Théorie approchée de la sélection pour un caractère dû à un nombre fini de locus. Une théorie
approchée
de la sélection estdéveloppée
dans le cas d’uncaractère
quantitatif
dont la variabilitégénétique
est due à un nombrefini
de locusgénétiquement indépendants.
Le calcul estdéveloppé analytiquement
en admettant que toutes les distributions statistiques peuvent êtreapprochées
par des lois normales.L’analyse
montre que le comportement
global
dusystème génétique dépend
essentiellement d’un«nombre
e,/!j&dquo;ccace
delocus», Le,
dont les valeurs vraisemblables sont sans doutefaibles
(!
à20).
Des simulationsconfirment
le rôle de ceparamètre
pour caractériser laréponse
cumulée à la sélection et la structure
génétique
de lapopulation.
Le modèlegénéralise l’analyse
de M Bulmer.Après quelques générations
d’unrégime
desélection,
unefraction
de la variance
génétique
reste « cachée» sous laforme
de covariancesnégatives,
de sorte que la variancegénétique
additiveVA
et la variancegénique
Va demeurent liées par la relation :VA -
Va -¡.¡,(1 - I/L,)h 2 V A ,
où est lafraction
de variance réduite par lasélection,
eth
2
est l’héritabilité actuelle du caractère. Cette structuration de la variancegénétique
soussélection permet de proposer des expressions
modifiées
des covariances entreapparentes
issus de parents
sélectionnés,
et dedévelopper
une théoriecomplète
de la sélection.Sauf
àcourt et moyen terme, les
prédictions quantitatives
sont surestimées par le modèle gaussien,ce
qui
délimite lechamp d’application pratique
de la théorie.génétique quantitative
/
sélection/
variance génétiqueINTRODUCTION
Models of
quantitative genetics
aregenerally developed
under theassumptions
of the infinitesimal
model,
which states that a verylarge
number ofgenetically
unlinked loci contribute to the
genetic
variance of a trait. Moreprecisely,
it isassumed that all contributions of individual loci are of the same order of
magnitude.
This
hypothesis
ensures that the distribution ofbreeding
values isGaussian,
andvalidates the whole statistical
apparatus
that made the statisticaldevelopments
ofapplied quantitative genetics possible
and itspractical
achievements. The scope of thepresent
paper is todevelop
anapproximate theory
that may cope with moregeneral genetic situations, owing
to the introduction of an additionalparameter characterizing
aquantitative
trait. The cases considered in thefollowing
involvevariable contributions to the
quantitative
trait of a finite number ofgenetically
unlinked loci. The derivations
rely
on thehypothesis
that all distributions can beapproximated by
aGaussian, following
the method illustratedby
Lande(1976)
andChevalet
(1988).
This makes itpossible
to define ananalytical theory
of selection with a model that seems less unrealistic than the usual infinitesimalhypothesis involving
very many unlinkedquantitative
loci.Two main
qualitative predictions
are derived from the Gaussian model:(i)
asingle parameter,
which can be called the effective number ofquantitative
trait loci(QTL),
is agood
summary of the distribution of the variable contributionsof QTLs
to thegenetic
variance of thetrait;
and(ii)
under continued selection the amount ofgenetic variability
that is hiddenby negative
correlations between contributions of different loci can be calculated as a function of selectionintensity,
the current additivegenetic variance,
and the effective number ofQTLs.
In addition toanalytical derivations,
simulations wereperformed
in order to evaluate thequalitative
andquantitative importance
ofdepartures
fromnormality.
GENETIC MODEL
We consider a
diploid
monoeciouspopulation
of Nreproducing
individuals pergeneration,
with L loci. Letbe the
genotypes
of a male and a femalegamete, respectively.
The numbersg(-) and g(
f
)
are defined as absolute effects of the genes carriedby
thecorresponding
loci e. These effects are distributed in the
population,
and theirjoint
distribution is assumed to be multivariate normal.Assuming symmetry
between male and femalecontributions,
any value is written in thefollowing
way:where
g
is the mean value of agamete
and y a residual.Matings
are assumed to berandom,
so that the variance covariance matrix of gene effects in newzygotes
takes the formwhere G =
Cov(g(-), 9 (-) , )
=( )’) f Cov(g( 9 ), f
is the variance covariance matrix between gene effects of agamete
drawn from thereproducing
individuals in thepreceding generation.
The value
of phenotype
P in azygote
with), m (g( g( f )) genotypic
value is assumedto
depend linearly
on gene effects:where B is a
(L
x1)
vector. Note thatconsidering
several vectors B allows severaltraits to be considered
simultaneously.
The
genotypic
distribution among thezygotes
isgiven by equation !1!,
so thatthe first 2 moments of a trait P are:
where
(2B’GB)
is the additivegenetic
varianceV A
of thetrait,
andV E
is thevariance of environmental effects on the trait.
Similarly,
thegenetic
covariance between P and a second traitQ
characterizedby
a vectorC,
is:Cov(P, Q) =
2C’GB.
Under the Gaussian
approximation,
thegenetic
modifications inducedby
selec-tion on the
phenotype
are calculated from theregression equations,
anddepend only
on the first 2 moments of thephenotypic changes. Thus,
the exact selection rule is notimportant.
Forexample,
truncation selection andstabilizing
selectionwith a Gaussian fitness function
yield
the samepredictions provided they
are char-acterized
by
the samechanges
in the mean and variance ofphenotypes.
The relevantparameters
are defined asfollows,
wheresubscript
s refers to values after selection:- the selection
intensity
i,relating
thechange
in meanphenotypic
value to thephenotypic
standard deviation- the relative
change
in the variance KAssuming
that selection occurs among alarge population
ofzygotes,
the values of covariances between gene effects in the selected individuals is:where
K e
is defined asThen, taking
accountof gametogenesis,
and rejbeing
the recombination fraction between loci andj,
recurrencerelationships
between 2 successivegenerations (t)
and
(t
+1)
can be derived for the mean and the variance covariance matrix of gene effects(Lande, 1976; Chevalet, 1988):
- mean effects
g’s:
- within
population
structure:- variance of the mean values
(drift effect):
SIMULATION MODEL
The simulated model shares the same
general hypotheses
as theanalytical
scheme(same
initial value ofheritability,
same distribution of the contributions of loci to thegenetic variance),
but is acompletely
discretegenetic
model.At each
locus,
a finite number of alleles areassigned
additive effects that sum up to thebreeding
value of azygote,’ to
which a Gaussian random variable is added to simulate the environmental effect. The additive effects of alleles are drawn in the initialgeneration
from a Gaussiandistribution,
andadjusted
toyield
thespecified heritability
and distribution of contributions among loci. Thepopulation
size isdescribed
by
2 numbers: the number ofzygotes;
and the number N of selected adults. Truncation selection on individualphenotypic
values isperformed,
and adults are mated at random(with selfing occurring
with aprobability
of1/N).
The
genetic make-up
ofgametes produced by
theparents
aregenerated using
apseudo-random-number generator
to simulate Mendeliansegregations.
Programs
allow for various initial distributions of allelic effects within and acrossloci,
several selection rules(truncation
selection is usedhere),
and variouslinkage relationships
between loci(fixed
at1/2
in thepresent work). Outputs
from theprogram
include,
at eachgeneration,
the mean values and standard deviations overreplicated
runs of thefollowing
criteria: meanbreeding value; genetic
andgenic variances,
effective numbers of loci(equations [17]
and[18] below)
and of allelesper
locus;
meanhomozygosity; proportion
of fixedloci;
and(for
modelsassuming independent loci)
the Tparameter
defined in thefollowing (equation !21!).
One-hundred runs were
performed
for each considered case.Programs
were written inFortran 77 and were run on a UNIX machine.
ANALYTICAL DERIVATIONS
The effective number of
QTLs
With
equal
contributions of unlinkedloci, equation [9]
leads toonly
2equations
describing
thechange
with time of 2macroscopic statistics,
the additivegenetic
variance
V,q ,
and thegenic
varianceV a (ie
the sum of the variances contributedby
the
loci). Removing
time indices(the
asteriskdenoting
the nextgeneration),
theequations
are(Chevalet, 1988):
where
h 2 is
the current value ofheritability, 2 h = yA .
Thegenic
varianceV a Var(P)
can be written as:
D
being
the sum of the contributions toVA
of the covariances between gene effectsat different loci.
In the case of unlinked
loci, equation [9]
has 2types,
fordiagonal
terms(r tj
=0)
and for
non-diagonal
terms(rt j = -). 2 I Multiplying equation [9] by Be
andsumming
over
yields:
-thus:
Multiplying equation [13] by B!
andsumming yields equation (11!,
as in the caseof uniform contributions. In
contrast, summing
thediagonal products B! G!! in
equation [9] gives:
Introducing
deviationsX j (resp Y j )
of the contributionsj (resp j K B G jj B ? )
ofI 1
locus j
from the meancontribution —V / t
2L(resp - V a )
2L of a locusEquation [14]
becomeswhich can be written in a form similar to
equation (12!:
defining
the effective numberL e
ofquantitative
trait loci as:this can also be written in the
following
forms:where CV is defined as the coefficient of variation of the contributions of the various loci to the total additive
genetic
variance&dquo;:_f
In addition to the 2 main
equations [11]
and[14],
thefollowing equations
for the deviationsX j
and1 j
can be derived:It can seen that these deviations would remain null if
they
are so at some time.However,
it would beinteresting
to check if this null state is stable withrespect
to
perturbations. Together
withequations (12!, [15]
and(16!,
theseequations
forma closed set of 2L
independent equations
which can be extracted from the set ofL(L + 1)/2 (equation (9!).
This exactresult,
which exhibits a hierarchical structure within thesystem (9J,
iscompleted by
theapproximate
result thatonly
2equations
are needed to
get
acomprehensive description
of thedynamics
of thesystem.
Infact,
the value ofL e ,
as definedabove, depends
on time unless initial conditions aresuch that
L e
= L. Various numerical calculationscomparing
thechange
with timeof
V A ,
either from fullequations [9]
or fromsimple equations [11]
and(12!,
with theproper initial value of
L e ,
show that for manygenerations
nosignificant discrepancy
can be found. As far as
only macroscopic parameters
are of interest(genetic
varianceor response to selection on the
phenotypic scale),
it seems valuable tosimplify
thecomplete system,
and reduce itsdescription
to bothequations [11]
and(14!,
whereL
e
is related to themicroscopic (unobservable) parameters by equations (16!-(18).
EQUILIBRIUM
STRUCTURE UNDER SELECTION(BULMER
EFFECT)
Directional selection for a trait due to the additive effects of several loci
develops negative
correlations between the contributions of distinct loci. In the statisticalsetting
of the infinitesimalmodel,
in which loci are notindividually considered,
this effect has been proven
by
Bulmer(1971) by considering
theregression
ofthe
genotypic
value onphenotypes
after selection. In a verylarge population,
andassuming
initiallinkage equilibrium,
he derived thefollowing
recursion(a special
case of
equations [11]
and(12!):
He also showed that after a few
generations,
anequilibrium
structurearises,
in which thegenic
varianceV a
remainsequal
to the initialgenetic
varianceviQ)
andthe
genetic
variance is fixed at a reduced valuedependent
on selectionstrength.
The limit values are such that
Equation [19] gives
the total amount contributed atequilibrium by negative
correlations(ie linkage disequilibria)
to thegenetic
variance.In the first
generation,
this result can be showndirectly by
agenetic analysis,
under the
hypothesis
of the infinitesimalmodel, starting
from a modelinvolving
multiallelic distributions if the initial
population
is assumed to be inHardy- Weinberg equilibrium
at allloci,
and inlinkage equilibrium
for allpairs
of loci.A more
general
treatment of theproblem
isproposed by Turelli
and Barton(1990),
based on the calculations of all the moments of distributions.
However,
unlessspecial hypotheses
arestated,
theirapproach
does notprovide explicit
recurrencerelationships
after the firstgeneration.
In the
present model,
thegenetic variance
decreases to zero as soon as L is finite when selection is active( K
ispositive),
and if N is finite selection accelerates the fixation process(Chevalet, 1988). However,
aqualitative property
similar to Bulmer’s result still holds: under continuous selection(constant
selectionstrength),
the
following approximate relationship
holds at anygeneration
t after 4 or 5generations
under the same selection rules:This shows
that,
whilegenetic
variances decrease to zero, the total contribution ofnegative
correlations remainsproportional
to the square of the availablegenetic
variance.
The result is obtained
by introducing (for K 54 0)
a new variableT( t ):
and
rewriting equations [11]
and[15],
with the 2 variablesV a
and T.Writing equation [21]
as:the recursion in T is obtained as follows
(discarding
time indices asbefore):
The numerator can be written as:
In the
denominator, VI
is written asthus:
The recursion in
(Va,T)
can then be derivedusing
function F(equation !22!)
andassuming
either that thephenotypic
variance is constant(Var(P)
=Vp),
or thatthe environmental variance
(V E )
is constant. In the latter caseVar(P)
andVar * (P)
are written as
F(V a , T)
+V E
andVI
+V E using expressions [22]
and[23!.
In the case of constant
phenotypic
varianceVp,
we obtain thesystem:
Written in this way, it can be seen that
V a
is aslowly varying expression,
for Nand
L e
not toosmall,
while T reaches theneighborhood
of a limitT
in a fewgenerations:
This
yields equation [20]
above. Infact,
as is done inAppendix,
we can showanalytically
that T reaches theneighborhood
of 1 within 4 to 5generations;
afterthis first
step,
the convergence toT
may be rather slow anddepends
on the relativevalues of K,
L e
and N(numerical calculations).
The same occurs for both models ofphenotypic
variance(constant phenotypic
or environmentalvariances),
with thesame limit
T
and the same kind of convergence.An
approximate complete
solutionThe
analysis
of the model can be furtherdeveloped, owing
to the reduction to 2equations,
and even to asingle equation. Indeed,
since T reaches its limit in a fewgenerations, replacing T!t! by T
inequation [21]
or[22]
allowsvi t )
to be writtenas an
algebraic
function ofv Y ).
Equation [15]
becomes:If N and
L e
are not toosmall,
and their ratio isfinite,
this differenceequation
can be transformed into a differential
equation. Assuming
a constantphenotypic
variance
(Var!t!(P)
=V P ),
we obtain a scaled timeand the
following
notations:and
then, substituting
Fby
itsdefinition, equation [28]
isapproximated by
Integration gives
the(scaled)
time u2 - uicorresponding
to a reduction in thegenic variance, from Va( t ’) to U!t2!.
The result iseasily
obtainedby changing
the variableV
a
to W =F(Va,T) (W
is used here instead ofV A
to avoid confusion between the true value ofgenetic
variance and itsapproximation).
The differentialequation
becomes
and the solution is
which
gives W 2
as animplicit
function ofl W
and of(u 2 - Ul ). Using
thisapproximation,
anequation
for the cumulated response to selection can be derived.The cumulative response from time
t,
to timet 2
isChanging V A
to itsapproximation W,
andreplacing
the sumby
anintegral,
wecan write:
where
This
integral
can bereadily
calculatedby rewriting equation [33]
asso
that, taking
account ofequation !34!, 1(u l ,u 2 )
isThe set of
equations [34]-[36] provides
anexplicit
solution to theproblem,
whichcan be
completed by
a furtherequation giving,
fromequations (10!,
the variance of the response due tosampling. Comparing
the numerical results obtained with thiscontinuous
approximation
with that derivedfrom
the iterative use ofequations (11!
and
[15]
shows a verygood agreement,
as far as thecomparison
does not involve the firstgenerations
if initial conditions assumelinkage equilibrium (V A
=E).
Inthe latter case, the continuous
approximation
underestimates the initial response to selection.Although
it is derived under thehypothesis
that N andL e
are ratherlarge,
theapproximation
is still correct for values ofL e
as small as 5.A similar
analysis
can be carried out for the modelassuming
a constant environmentalvariance,
rather than a constantphenotypic
variance.DISCUSSION
The
preceding
calculations show that thedynamics
of the multilocussystem
considered can be describedby introducing
asingle
additionalparameter (the
effective number of
C!TLs),
ascompared
to the standard statisticalsetting
ofquantitative genetics.
The result holds as far asonly macroscopic properties
of thesystem
areconsidered,
and for a limited number ofgenerations,
because the non-linear features of the
system (equation (9!)
do not allow the derivation of a uniform upperboundary
for the deviations of individual locus contributions from their meanvalues. This
parameter, L e ,
also allows the structure ofgenetic
variance to bepredicted, according
to thegeneralization
of Bulmer’s result to a finitepopulation
and to a finite number of
(aTLs (equation (20!).
The number of
(!TLs
Recent results of
QTL detection, mainly
inplants, suggest
a rather small number of locicontributing
asignificant part
to thegenetic
variance ofquantitative
traits.This does not mean that a few genes are involved in the
make-up
of thetrait,
but thatonly
locicontributing
a ratherlarge genetic
variance can be detectedby segregation analysis.
Asimple
way to describe the distribution of individualcontributions of loci may be to consider them as
pertaining
to ageometric
series ofratio x
(x
<.1). Considering
the case of an initialgeneration,
in which selection has notyet developed
correlations betweenloci,
the individual contribution of locus ican be written as:
so that the total
genetic
variance isFrom
equation [17]
thisyields:
giving quite
low values thathardly depend
on the actual number L. Forexample,
this
equation gives L e
about 3 for x =0.5, e L
about 10 for x =0.8,
andL e
about20 for x = 0.9. If an arithmetic series is
assumed,
the ratioL e/ L
remainslarger.
Writing:
yields
Simulation runs were
performed
with different actual numbers L of locisharing
the same value of
L e , according
toequations [38]
and[40] (uniform distribution, arithmetic,
and 2geometric distributions). They
result inparallel
evolutionsconcerning
thegenetic
variancesV A (fig 1),
as well as for thegenic
varianceV a
and the cumulative response to selection
(not shown).
The
validity
of thisparameter
as apredictor
of thedynamics
of thesystem
de-pends
on itbeing
constant overgenerations.
Numerical calculations do not indicate anysignificant departure
ofL e
from its initial value. Simulation results show howL e
,
as estimated eachgeneration
from the simulated data(from equation [17]), changes
with time(fig 2).
Unless the size of thepopulation
isquite
small and se- lectionintensity
ishigh,
it is seen thatL e
isslowly varying;
atleast, comparison
with
figure
1 indicates that thisparameter changes
with time moreslowly
than thegenetic
variance. Thechanges
ofL e
with time occur after a firstperiod
while it isalmost
constant,
in the cases when the initial distribution of contributions of loci to thegenetic
variance is either uniform(leading
to a decrease ofL e )
orhighly
vari-able
(geometric
series with ratio x <0.8)
in which caseL,
increases.Conversely,
slightly
variable contributions(arithmetic series,
orgeometric
series with x about0.9-0.95)
lead toquite
stableL e
values.Changes
are moresignificant
as selectionintensity
isgreater. Moreover,
it seems from simulations that thisparameter
may be very sensitive topopulation size, suggesting
that alarge
effective number of(aTL’s
cannot
segregate simultaneously
in a smallpopulation
understrong
selection.Thus,
theapproximate analytical derivations,
as well as thesimulations,
indicate thatL,
is asignificant parameter.
Even if the absolute values obtained for thesequantities
do notgenerally
fit theoreticalpredictions
verywell,
theprevious
results indicate thatL e ,
as defined inequation !17!,
controls thesensitivity
of thegenetic system
to the number ofQTLs. Moreover,
it is aparameter
that can be estimated.Firstly,
thepresent L e
definition is the same as that obtained with the classicaldesign using
a cross between inbredlines,
andcomparing
the variance inF 2
to the differences ofparental
lines(Wright, 1968; Lande, 1981). Secondly,
methods basedon
segregation analysis
of many linkedgenetic
markers willprovide
datagiving
the distribution of the mostimportant G!TLs segregating
inpopulations
and their’
contributions to the
genetic
variance. Even if these resultsprovide
new tools forselecting
individuals on a truegenetic basis, knowledge
ofL e
remains of interest foranalysing performances
inpopulations,
as soon as selection has beenpractised.
Genetic structure under selection
Equation [20]
shows that under continuousselection,
agenetic
structure arises that characterizes an internalequilibrium
between selection and recombination. Infact,
thisrelationship
holdsexactly
in theequilibrium
state consideredby
Lande(1976)
in the framework of a model
involving
selection andmutations,
when recombination fractions are set to1/2.
It can be also seen as a kind ofquasi-linkage equilibrium,
a situation encountered in more
general
models under weak selection(Barton
andTurelli, 1991).
In contrast with these exact butasymptotic results,
thepresent
result holds veryearly
in the process of selection(fig
3 andAppendix),
then holdsduring
the whole process. Moreover it does not
require
a weak selection. It isapproximate
butquite accurate,
andprobably
moremeaningful
in the context ofexperimental
orapplied quantitative genetics
thanasymptotic
results which may be more relevant toevolutionary problems.
This
relationship
is alsodependent
on the effective number ofC!TLs.
It isillustrated in
figure 3,
in which theoreticalpredictions
arecompared
to the results of simulations.Firstly,
it turns out that Gaussianpredictions
ofgenetic
variances aresatisfactory during
severalgenerations,
and more so aspopulation
size isgreater
and selectionintensity
lower. It seems however that thetheory
underestimates the amount of ’hidden’ variance in smallpopulations,
which isexpressed by
anestimated value of T
larger
than its theoretical value. Even with very fewQTLs, approximations
aregood
forlarge population
sizes and medium selectionintensity,
thegreatest departures
betweenanalytical predictions
and simulationsappearing
for
high
selection intensities.Secondly,
theprediction concerning
theorganization
of
genetic variability,
asgiven by equation !20!,
is more robust than that ofgenetic
variances themselves. The
agreement
betweentheory
and simulations is observedduring
moregenerations, although
anincreasing variability
of the estimated Tparameter
is observed whensignificant departures
between theoretical and observed variances arise(the
estimated variance of T betweenreplicates
then shows asharp
increase).
Such a structure of
genetic variability
under selectionimplies
somechanges
inthe
partition
ofgenetic
variance among groups of related individuals. Forexample,
within the framework of the
simple population
model considered until now, thepartition
ofgenetic
variance into full-sib covariance(C FS ),
andwithin-family
variance
(V w ,
which isequal
to half thegenic
variance for unlinked(aTLs),
canbe written as
follows,
in thepopulation
of new unselectedzygotes:
CONCLUSION
In this paper, the
genetic
model is restricted to a set of unlinked loci.Although
itmay be stated that any trait is due to many loci
spread along
thechromosomes, considering
a finite set of unlinked loci may not becompletely
unrealistic. The firstexperimental
results on the localization ofGZTLs
in tomatosuggest
that a fewmain
regions
areinvolved,
and that theseregions
may in fact include severalclosely
linked genes. The
system
made up of unlinked loci with many alleles(modelled by
a Gaussian distribution of
effects)
may thus be a correct first-orderapproximation
of a
genetic system involving
several clusters of genes on differentchromosomes, provided
nolong-term prediction
isrequired. Indeed,
simulation runsinvolving
either several unlinked loci with many alleles taken from a normal
distribution,
or several clusters of
tightly
linked loci withonly
2alleles,
lead to very similar responses to directional selection(not shown). However,
this is nolonger
true inlater
generations,
when many recombination events within clustersgenerate
’new’alleles,
and lead to responses that areunpredictable
in the framework of thepresent theory (Hospital, 1992; Hospital
andChevalet, 1993).
Investigating
the effects of the distribution ofquantitative
loci on the response to selection could beperformed
hereowing
to the use of the Gaussianapproximation.
It is well known that Gaussian distributions are not robust under Mendelian
segregation (Felsenstein, 1977),
andthat,
even if a Gaussian distribution is acorrect
approximation
at sometime,
selectionpromotes asymmetrical
distributionsneeding higher
moments to be included in the recurrencerelationships (Turelli
andBarton, 1990).
Whenlooking
at the simulationresults,
we can see thatequation
[20]
remains correct, but that the reduction of thegenic
varianceV a
under selection is underestimated. Thissuggests
that the Gaussianexpression
ofregressions
ofgenotypes
at individual loci onphenotypes
is the main incorrectstep
of theapproximation.
In fact it is clear that(in
a truegenetic model)
selection shifts allelefrequencies
anddevelops asymmetrical
distributions which cannot be handled in the framework of Gaussian distributions.Developments
of distributions in Gram- Charlier series could beinvestigated,
but suchdevelopments
would not allow thejoint
distributions at several loci to be madeexplicit.
The
interesting
result obtained here is that severalqualitative
features of thegenetic system
arecorrectly
taken into accountby
the Gaussianapproximation.
Mainly,
theanalysis
introduces a newmacroscopic
parameter(the
effective numberof
quantitative loci)
to characterize aquantitative trait,
in addition to the usualheritability
coefficient. Thisparameter
controls the amount of mid-term selection response and the structure ofgenetic
variance in thepopulation.
The otherimportant
featurepredicted by
the model and checkedby
simulations results is therelationship
between the amount ofgenetic
variance available to selection(V A )
andthe amount of ’hidden’
genetic
variance(equation !20!).
This resultgeneralizes
thoseobtained
by
Bulmer(1971)
and Verrier et al(1990)
for the infinitesimalmodel,
andyields
newexpressions
of covariances between relatives. Whether thesediscrepancies
are
important
for the methods of estimation ofbreeding
values remains to beinvestigated.
Other uses of the model may be
considered,
such as the search foroptimal
selection intensities in a selection program, or the
management
ofmatings
infinite
populations
submitted tostrong
selection.Indeed,
the Gaussian framework would make it easy to take account ofpopulation
structures(separate
sexes,overlapping generations, family
or indexselection)
and of assortativemating.
However,
the poorlong-term predictive
power of the method must be stressed.Indeed,
theequilibrium genetic
structuresarising
from a balance between mutation and selectiondepend strongly
on the chosen model(Gaussian
model vs the rareallele or ’house of cards’
model,
Barton andTurelli, 1987), indicating
that theGaussian
approximation
should not be used ifasymptotic
results are searched for. Nevertheless the’Gaussian’ modelling
of theoptimization
of short or mid-term selection schemes may be
worthwhile,
andprovide
animprovement
over usual theories(Robertson, 1970).
REFERENCES
Barton
NH,
Turelli M(1987) Adaptative landscapes, genetic
distances and the evolution ofquantitative
characters. Genet Res Camb 49, 157-173Barton
NH,
Turelli M(1991)
Natural and sexual selection on many loci. Genetics 127,229- 255Bulmer MG
(1971)
The effect of selection ongenetic variability. Anc
Nat 105,201-211 1 Chevalet C(1988)
Control ofgenetic
drift in selectedpopulations.
In: Proc 2nd IntConf
Quant
GenetRaleigh
NC(BS
Weir, EJ Eisen, MM Goodman, GNamkoong eds), May
31 - June 5 1987, Sinauer Associates
Inc,
SunderlandMA,
379-394Felsenstein J
(1977)
Multivariate normalgenetic
models with a finite number of loci.In: Proc Int
Conf Quant
Genet(E Pollack,
0Kempthorne,
TBJrBailey eds),
AmesIA, August 16-21,
1976, Iowa StateUniversity Press,
227-246Hospital
F(1992)
Effets de la liaisong6nique
et des effectifs finis sur la variabilité des caracteresquantitatifs
sous selection.These,
Université des Sciences etTechniques
duLanguedoc, Montpellier,
FranceHospital
F, Chevalet C(1993)
Effects ofpopulation
size andlinkage
onoptimal
selectionintensity.
TheorAppl
Genet 86, 775-780Lande R
(1976)
The maintenance ofgenetic variability by
mutation in apolygenic
character with linked loci. Genet Res
28,221-235
Lande R
(1981)
The minimum number of genescontributing
toquantitative
variationbetween and within
populations.
Genetics 99,541-553Robertson A
(1970)
Someoptimal problems
in individual selection. TheorPop Biol 1,
120-127
Turelli
M,
Barton NH(1990) Dynamics
ofpolygenic
characters under selection. TheorPop
Biol38,1-57
Verrier
E,
Colleau JJ,Foulley
JL(1990) Predicting
cumulated response to directional selection in finitepanmictic populations.
TheorAppl
Genet 79,833-840Wright
S(1968)
Evolution and the Genetics ofPopulations.
I. Genetic and Biometric Foundations. TheUniversity
ofChicago
Press,Chicago,
USAAPPENDIX
Proof of equation [20J
The
proof
consists ofderiving
from equations[25]
and[26]
that T reaches theneighborhood
of its limit
T (equation [27]),
within a fewgenerations.
Theproof
isgiven
here in the caseof a constant
phenotypic
varianceVp,
but could be extended to the case of a constant environmental variance.Firstly,
equation[26]
is rewritten with the new variableThen it is shown that the series
I U t is
boundedby
a positive series Ztconverging rapidly
to a small value.
The
following
notation is introduced for the current value ofheritability
Let Dt be the denominator in
equation (26!,
which can be written aswhere
and
Equation [26]
becomesThe
expression
in the second line[45]
can beuniformly
boundedby
a small number:In the first line
(44!,
the sum St of the first 2 termscan be shown to have an absolute value less than
1/2.
Theproof depends
on thesign
of(3,
and this distinction is also used to get a minoration of Dt.If
,0
isnegative,
ie if - - -then
Introducing
theseinequalities
inA,
we can write thefollowing inequalities
and
taking advantage
ofand the
hypothesis
on the sign of,3,
we derivev
Similarly,
we notethat,
in this case:In the other case,
/3
> 0, thenand the
preceding inequality [47]
still holds. We havefrom which we get