HAL Id: hal-00894268
https://hal.archives-ouvertes.fr/hal-00894268
Submitted on 1 Jan 1999
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Power for mapping quantitative trait loci in crosses
between outbred lines in pigs
Luis Gomez-Raya, Erling Sehested
To cite this version:
Original
article
Power for
mapping quantitative
trait
loci in
crosses
between
outbred lines in
pigs
Luis
Gomez-Raya* Erling
Sehested
Department
of AnimalScience, Agricultural University
ofNorway,
Box 5025, N-1432
As,
Norway
(Received
28May 1998; accepted
12May
1999)
Abstract - Power for
mapping quantitative
trait lociusing
crosses betweensegregat-ing populations
was studied inpigs.
Crossing
generatesgametic disequilibrium
andincreases
heterozygosity.
The condition for aheterozygosity
among Fl individuals tobe greater than in either line at crossing is that allele
frequency
should be lower than1/2
in one line andhigher
than1/2
in the other line. Maximumexpected
power andexpected
risk were used to compare hierarchical backcross(each
boar mated to several sows; contrast withinboars),
hierarchical intercross(each
boar mated to several sows; contrast within both boars andsows)
and traditional intercross(each
boar mated toone sow; contrast within both boars and
sows).
The use of hierarchicaldesigns
(back-cross andintercross)
increased power for traits with low or intermediate heritabilities.For small
QTL
effects and low heritabilities the hierarchical backcrossdesign
gave thehighest expected
power but also thehighest
risk. There is not ageneral design
whichallocates resources in an
optimum
fashion across situations(heritabilities,
QTL
ef-fect,
heterozygosity).
Acompromise
betweendesigns
withhigh
power and low riskis
suggested.
A hierarchical backcrossdesign
of 400piglets
andheterozygosity
0.68requires
between four(maximum
expected
power)
andeight
boars(minimum risk)
todetect a
QTL
of 0.5phenotypic
standard deviations. Selection of extreme individualsin
parental
lines increased power up to 21 %. Commercial crosses areproposed
as analternative to
experiments
forQTL
mapping. @
Inra/Elsevier,
Parisgene mapping / quantitative trait locus
/
statistical power/
hierarchical design /pig
Résumé - Puissance de détection des loci à effets quantitatifs dans les croisements entre lignées non consanguines chez le porc. On a étudié chez le porc la puissance
de détection des loci à effets
quantitatifs
à partir de croisements entrepopulations
enségrégation.
Le niveaud’hétérozygotie
chez les individus FI estsupérieur
à celui de*
Correspondence
and reprintsl’une ou l’autre des
lignées
en croisement si lesfréquences alléliques
sontsupérieures
à1/2
dans l’une deslignées
et inférieures à1/2
dans l’autrelignée.
Lapuissance
maximaleespérée
et le risqueespéré
ont été considérés pour comparer les croisementsen retour
hiérarchiques
(chaque
verrat estaccouplé
àplusieurs
truies : contrastesintra-verrat),
l’intercroisementhiérarchique
(chaque
verrat estaccouplé
à une truie ; contraste intra-verrat etintra-truie).
L’utilisation de ces deux types de schémashiérarchiques
aaugmenté
la puissance de détection pour les caractères à héritabilitésfaibles ou intermédiaires. Pour les
CaTLs
à petit effet et les héritabilités faibles, le schémahiérarchique
avec croisement en retour a donné la puissanceespérée
maximaleet aussi le
risque
leplus
élevé. Ce n’est pas le même schémaexpérimental qui
alloueles ressources d’une
façon optimale quand
on considère des situations différentes pourl’héritabilité, les effets de
QTL
et le niveaud’hétérozygotie.
Un schémahiérarchique
avec croisement en retour de 400
porcelets
et unehétérozygotie
de 0,68 requiertentre quatre verrats
(puissance
espérée
maximum)
et huit verrats(risque minimal)
pour détecter unQTL
avec un effet de 0,5écart-type phénotypique.
La sélectiond’individus extrêmes dans les
lignées parentales
aaugmenté
lapuissance jusqu’à
21 %. Les croisements entre
lignées
commerciales sontproposés
comme alternativesaux
expérimentations
de détection deQTL. ©
Inra/Elsevier,
Parisdétection de gènes / locus à effet quantitatif / puissance
statistique
/ schéma
hiérarchique / porc
1. INTRODUCTION
Crosses between inbred lines and
analyses
within outbred lines have beenproposed
forquantitative
trait loci(QTL)
detection. In the firstapproach,
inbred lines are assumed to befully
homozygous
for alternative alleles atboth marker and
QTL. Consequently,
F
l
individuals arefully heterozygous
andgametic
disequilibrium
is maximum.High heterozygosity
allowshigh
segregation and, therefore,
high
power. With maximumgametic
disequilibrium,
a
specific
allele at the marker is associated with aspecific
allele at theQTL
in all families.Consequently,
increased power is achieved because a lower number of contrasts is neededusing
acrossfamily
analyses
based on alarger
number of observations per contrast.In the second
approach, analyses
within outbredlines,
both marker andQTL
aresegregating.
Power in crosses between inbred lines is muchhigher
than inanalyses
within outbred lines. A combination of bothapproaches
has assumed that the lines aresegregating
at the marker but fixed for alternative alleles atthe
QTL
[1, 2!.
Anotheralternative,
notexplored
yet,
is to considerparental
lines with different allelefrequencies
at theQTL
because of different selectionhistory.
Conventional
types
of crosses between inbred lines are intercross and back-cross. Power of an intercross versus a backcrossdesign
is increased because the number ofsegregating
rneioses in an intercross is twice as many as in a back-cross.Experiments
conducted inpigs
have restrictions infamily
structure to amaximum of
approximately
tenpiglets
per litter. Asmall
number of full-sibs may result in low power!9!.
A hierarchical structure, such as a few boars matedto several sows
each,
allowslarger subgroups
of progenyinheriting
alternative alleles from eachboar,
which may increase power.It has been
proposed
to use selectivegenotyping
to increase power forgrowing
progeny is less than the cost ofgenotyping.
Another use ofselection,
not considered
yet,
would be to use extreme individuals for thequantitative
trait asparents
toproduce
theF
l
.
Thisapproach
increases thefrequency
ofheterozygotes
in theF
l
and, therefore,
statistical power.The
objectives
of this paper are:1)
toinvestigate
thedegree
ofdisequilib-rium
generated
whencrossing
twopopulations segregating
at twoloci;
2)
toinvestigate
under which conditions theheterozygosity
(and
consequently
power)
in the cross of two linessegregating
at aQTL
ishigher
than within eitherline;
3)
toinvestigate
powerusing
hierarchicaldesigns;
4)
toinvestigate optimum
allocation of resources for a
given experiment
size;
and5)
toinvestigate
the effect of selection of extreme individuals in theparental
lines onheterozygosity
and power.2. THEORY
2.1.
Underlying genetic
model andassumptions
The purpose of the
experimental crossing
is themapping
ofQTL by
the use ofgenetic
markers such as microsatellites. It was assumed that recombinationevents did not occur between marker and
QTL
and that alloffspring
wereinformative for the marker. These
simplifying
assumptions
were made to reduce the number ofparameters
to be shown in the results. Theimpact
of theseassumptions
inpractical
scenarios is addressed in the discussion.An
underlying
mixed inheritance additive model was assumed with anadditive
biallelic locussegregating
atfrequencies
of the favourable allele pB and pc for lines B andC, respectively.
TheQTL
was assumed to be inHardy-Weinberg equilibrium
and ingametic
phase equilibrium
with thepolygene
in eachparental
line. The variance attributable to theQTL
in units of the residualphenotypic
variance in line i was:vb(i)
=2p
i
(1 -
Pi)a2 /(u!
+ a
E
),
where pi is the
frequency
of the favourable allele in linei,
cr is theQTL
effect,
and(a A 2
+0
,2 E
is
the residualphenotypic
variance with0
,2 A
andOF2 E
being
the additive variance attributable to thepolygene
component
and the environmentalvariance,
respectively.
Theheritability
comprising QTL
andpolygene
variation inparental
line i was:where
h! is
the residualheritability
attributable to thepolygene
component,
and has valuea A 2 /(U
2
A
+0,2 E ). Parameters h! and
V
2 Q
areusually
not known andonly
estimates ofh
2
(i)
are available for some traits. Different heritabilitiesin the two
parental populations
were not assumed because of themagnitude
of thesampling
variance of the estimates ofheritability
and becauseparental
populations
may have been raised under different environmental conditions. Forsimplicity,
powercomputation
in this paper was carried out for agiven
QTL
effect and constant residual heritabilities amongF
l
individuals.There-fore,
results are valid for avariety
of situations withrespect
to theparental
with a different
heritability
(comprising
QTL
andpolygene).
Thehighest
pos-sibleheritability
(QTL
andpolygene)
in theparental populations
(denoted
ash
2
(max))
leading
to agiven heterozygosity
at theQTL
amongF
l
individuals wascomputed
as an indication of the values ofheritability
inpractical
situa-tions. Theimpact
of thisassumption
on power is addressed in the Discussion. Three levels ofheterozygosity
in theF
l
were considered:1,
0.68 and 0.32. Aheterozygosity
of 1 occurs when theQTL
is fixed for alternative alleles. Ahet-erozygosity
of 0.68 may occur for a range of situations inparental populations
(e.g.
PB =0.8;
pc = 0.2 andp
B =
0.68;
pc =
0).
Aheterozygosity
of 0.32represents
the situation where the two linessegregate
at the samefrequency
(e.g.
pB =0.8;
pc =
0.8)
or when one allele is absent from onepopulation
andsegregating
at afrequency
lower than 0.5 in the otherpopulation
(e.g.
pB =
0.32;
p c
=
0.0).
Theadvantages
ofcrossing experiments
aimed atQTL
mapping
are:1)
linkage disequilibrium
betweenQTL
and marker alleles in theF
l
;
and2)
increasedheterozygosity among F
l
individuals.2.2. Gametic
disequilibrium
in the cross of two outbred lines If the twopopulations
are fixed for alternative alleles at both marker andQTL,
thengametic disequilibrium
is maximum in theF
l
.
Therefore,
power isincreased with
respect
towithin-family analysis.
If the twoparental populations
are
segregating
at the samefrequency,
then theresulting
F
l
population
is inlinkage
equilibrium
and no benefits areexpected
fromcrossing.
If the genesare
segregating
at differentfrequencies
in theparental populations
then somedegree
ofdisequilibrium
will begenerated.
For a moregeneral
description
of thisproblem, gametic disequilibrium
will becomputed
at twoloci,
M andN,
that can be either two markers or one marker and oneQTL.
Gameticdisequilibrium
between markers can be estimated if estimates of allelefrequencies
in theparental
lines atcrossing
are available. Letf
6M
andfb,!
be vectors of allelefrequencies
at locus M(with
kalleles)
and at locus N(with
g alleles)
inparental population
B.Similarly,
f
eM
andf!,V
are thecorresponding
vectorsof allele
frequencies
at loci M and N inparental
line C.Assuming
that the twoloci are in
linkage equilibrium
within the line then the matrix withgametic
frequencies
at eachpair
of alleles for line B is:The matrix of
gametic frequencies
in line C isF
eMjV
=f
’M
f!!,.
The matrixthe cross between lines B and C is
given by: F
beM
N
=
1/2(F
bMN
+F
EMN
)-The allele
frequencies
for loci M and N in theF
l
cross arefbcm !
F
bcMN
1
1 andf6!N
= 1’F
beMN
,
respectively.
In theseequations
1represents
the unitvector. The
gametic disequilibrium
matrix is then obtainedsimply by
D
MN
=FB,MN - (fb,M
f6!NO
Elements of matrixD
MN
are thedisequilibrium
values for allpossible
combinations of alleles at loci M and N. Define thedisequilibrium
parameter
between two loci M andN, S2
MN
,
as the sum of absolute values of all elements of matrixD
MN
computed by 52,!,Ir,
= 1’abs(D
MN
)1,
where ’abs’denotes the absolute value at each element of the matrix between brackets.
Therefore, S2
MN
measures thegeneral degree
of association between alleles atloci M and N. The value of
52,,,1,!
ranges between 0 and 1. Forexample, Q
MN
is 1 when lines atcrossing
are fixed for alternative alleles anddisequilibrium
is maximum andQ
MN
is 0 when the lines atcrossing
aresegregating
at the samefrequency
for each allele anddisequilibrium
is null.2.3.
Heterozygosity
in the cross of two outbred linesConsider two outbred lines B and C
segregating
at a biallelicQTL
with allelesQ
and q.Assuming
randommating,
thefrequency
ofheterozygous
individuals at theQTL
among theF
l
individuals is:It is shown in
Appendix
1 that the necessary condition for increasedheterozygosity
withrespect
to either line atcrossing
is that one line issegregating
at afrequency
of the favourable allelegreater
than1/2
and the other line at an allelefrequency
lower than1/2.
2.4.
QTL
mapping
designs
for the cross between two outbred linesPower
computation
was carried outassuming
that allelefrequencies
atthe marker were not very different in the two
parental populations,
andconsequently, gametic disequilibrium
was small andignored.
However,
the increasedheterozygosity
in theF
1
boars can be used tostudy
segregation,
within
families,
of aQTL
associated to a markerby
recording performance
and inheritance of alternative marker alleles in the nextgeneration
(backcross
or intercross
depending
on themating
ofF
l
boars to one of theparental
lines orF
l
sows,respectively).
Therefore,
theapproach
that can be followedis the same as in
QTL
mapping
within outbred lines but with an increasedheterozygosity
amongF
l
individuals. Three alternativedesigns
are consideredin this section: hierarchical backcross
design,
traditional intercrossdesign
and hierarchical intercrossdesign.
Contrasts in hierarchicaldesigns
can uselarge
subgroups
of progeny of a mixture of half- and full-sibs within boar.2.4.1. Hierarchical backcross
design
In a hierarchical backcross
design,
b boars from theF
l
are mated to s sowswhere y2!kc is the lth observation of
phenotype
on apiglet
with marker allele k inherited from boar i when mated to sowj,
bo
i
is the fixed effect of boari,
so2! is the fixed effect ofsow j
mated
to boari,
mijk is the fixed effect of marker allele k inherited from boari,
and eijkl is the residual random error. Model(2)
is a three level hierarchicaldesign
in which marker alleles are nested withinsows, which in turn are nested within boar. For
simplicity,
sows and boars were assumed fixed. It is notstrictly
correct because the model does not account forrelationships
between animals.Power for model
(2)
wascomputed
following
thex
2
approach
of Gelder-mann[5]
and Weller et al.[14]
comparing
the square of the difference between the two progenysubgroups inheriting
alternative alleles from their boar(SDP)
for each boarfamily
to theexpected squared
difference under the nullhypothe-sis.
Therefore,
the alternativehypothesis
is aQTL
linked to the marker and the nullhypothesis
is the absence of aQTL
linked to the marker. For a more de-taileddescription
of the method and discussion of theassumptions,
see Welleret al.
!14!.
Briefly,
theassumptions
arecomplete linkage
between afully
infor-mative marker and aQTL,
and theanalyses
are carried out within families. Forsimplicity,
it is assumed that there is no common environmental variance and litter size is fixed at 10.The statistical tests to
reject
or toaccept
the nullhypothesis
are basedon the distribution of the SDP. The summed SDP values divided
by
thesquared
standard error of the contrast(SE
2
)
follow a centralk
2
distribution with bdegrees
of freedom under the nullhypothesis
for alarge sample
size or when
phenotypic
variance is known. For the alternativehypothesis
E(SDP/SE
2
)
N
x
2
(nc,
b),
where nc is thenon-centrality
parameter
of anon-central
x2
distribution
with bdegrees
of freedom. The value of thenon-centrality
parameter
is nc =b he
BC
0!/SE!,
where 0 = a +6(l - 2 p
B
)
and 0 = c! +
6(1 - 2p
c
),
for backcrosses with lines B andC, respectively
(Appendix
2),
a is half the difference between the two alternativehomozygotes,
6 is the dominancedeviation,
andSE! =
4 [(1
+(1/4)shr - l/2h!)/(sp)]
[equation
(A3)
inAppendix
3!.
Following
Weller et al.[14],
the power to detect asegregating QTL
wascomputed
as1 - !
= 1 !p[x
2
(nc,
b)
<T],
where1
3
is theprobability
ofcommitting
atype
2 error,[p(x
2
(nc,
b)
<T]
is theprobability
of a x
2
value under the alternativehypothesis
(non-central
X
2
with parameter
nc and bdegrees
offreedom)
less thanT,
with Tbeing
the value of the centralx
2
(b)
for agiven
significance
level ofcommitting
atype
1 error.2.4.2. Traditional intercross
design
In the intercross
design,
inbred lines B and C are crossed toproduce
theF
l
boars and sows that are intercrossed among themselves. In the traditional intercrossdesign
b boars are mated to one sow each. A linear modelallowing
testing
for themarker-(aTL
effects is:where Yijk is the hth observation on
piglet
k with markergenotype j
inherited
infamily
i,
f
z
is the fixed effect offamily
i,
mij is the fixed effect ofand
heterozygous)
and eijk is the residual error. Model(3)
would be the one touse in the
analysis
of real data.However,
to takeadvantage
of theX
2
approach
of Geldermann[5]
and Weller et al.[14]
for powercalculation,
thefollowing
linear model can be used:where !z!!l is the lth
phenotypic
observation ofpiglet
inheriting
marker allele k from the sow andallele j
from theboar,
mb
ij
is the fixed effect of marker allelej
inherited from theboar,
m8ik is the fixed effect of marker allele k inheritedfrom the sow, and ei!xl is the residual error.
Model
(4)
islinearly equivalent
to model(3)
under theassumption
of afully
additiveunderlying genetic
model and no sexualimprinting.
The distribution under the alternativehypothesis
ofE(SDP/SE
2
)
isapproximately
x
z
(nc,
b)
andX
2 (nc,
s)
for boars and sows,respectively.
Assuming
that thehypothesis
being
tested is the same either within boars or within sows, the distribution of the
sum of
’¿,(SDP /SE
2
)
for both boars and sows follows ax
z
(2nc,
b+s)
under the alternativehypothesis.
The sum of variableshaving
non-centralx
z
distributionsjointly independent
also follows a non-centralX2
distribution
withdegrees
of freedomequal
to the sum of thedegrees
of freedom of the former non-centralX2
2and
non-centrality
parameter
equal
to the sum of thenon-centrality
parameters
of the former variableshaving
non-centralx
z
.
Thenon-centrality
parameter
of thez
x
has a value 2nc =(b
+s)
he
BC
,
(p /SE
2
where 0 = a +6(l -
PB -p
C
),
andSE!
=4 !(1 -
(l/2)!)/p]
[equation
(A4)
inAppendix
3].
Similarly
to a hierarchical backcrossdesign,
power to detect asegregating QTL
wascomputed
as 1 -(3
= 1 -p[x2(2nc,
2b)
<T],
where/3
is theprobability
ofcommitting
atype
2 error,!p(2(2nc,
2b)
<T]
is theprobability
ofax
2
value under the alternativehypothesis
(non-central
X2
with parameter
2nc and 2bdegrees
offreedom)
less thanT,
with Tbeing
the value of the centralX2
(2b)
for agiven
significance
level ofcommitting
atype
1 error.2.4.3. Hierarchical intercross
design
In a hierarchical intercross
design,
b boars from theF
l
are mated to s sows each from theF
l
toproduce
ppiglets
per litter. Power calculation can be carried outusing
the linear model:where Yijklm is the mth observation of
phenotype
on apiglet
with marker allele k inherited from boar i mated to sowj,
bo
i
is the fixed effect of boari,
soij is the fixed effect ofsow j
mated to boari,
mb
ik
is the fixed effectof marker allele k inherited from boar
i,
msijl is the fixed effect of marker allele I inherited from sowij
and eijklm is the residual error. Power calculationusing
model(5)
requires
differentnon-centrality
parameters
within boars andsows. The
non-centrality
parameter
in boars is: ncb =b he
B
ce
2
/SE2,
where0 = a + b( I - p
B
-
P
c )
(Appendix 2),
withSE
2
=4[(1 + (1/4)ph; -1/2h;)/(sp)]
design.
Thenon-centrality
parameter
in sows is nc,s =bshe
BC
B
2
/SE
2
,
where0 =
a+6(I-
PB
-p
c
)
(Appendix
2),
andSE
2
= 4[(1 - ( 1 /2)h$) /p]
(Appendix 3).
Assuming
independence,
the distribution of the sum ofE(SDP/SE
2
)
for both boars and sows follows ax
2
(nc
b
+nc.&dquo;
b(1+s))
under the alternativehypothesis.
Under the nullhypothesis,
the distribution of the sum ofE(SDP/SE
2
)
for both boars and sows follows a centralX
2 (b(
+s)).
Power to detect asegregating
QTL
wascomputed
as1 - (3
= 1 !p!x2(ncb
+ , sncb(1
+s))
<T],
where/3
isthe
probability
ofcommitting
atype
2 error,h
(nc
2
(p(x
+ nc,,b(1
+s))
<T]
is theprobability
of a x
2
value under the alternativehypothesis
(non-central
x
2
with
parameter
nci, + ncs andb(1
+s)
degrees
offreedom)
less thanT,
with Tbeing
the value of the centralx
2
(b(1
+s))
for agiven significance
level ofcommitting
atype
1 error.2.5.
Design
ofexperiments
using
crosses between outbred linesMost
experimental
costs are inraising, genotyping
andrecording
of agiven
number ofslaughter pigs
in theF
2
or backcross.Therefore,
theparameters
to be chosenby
the researcher are the number ofboars,
the number of sows and the number ofpiglets
per sow. Inpractice,
it is convenient for thehandling
of theexperiment
to fix the number ofpiglets
per sow. It is also economical sincethe use of few
piglets
per sow would increase the cost inraising
alarge
number of sows. It will be assumed from now on that the number ofpiglets
per litterin the
experiment
is fixed at ten and thequestion
is how to makeoptimal
use of different numbers of boars and sows in theF
l
for agiven experiment
size(total
number ofslaughter
pigs).
Parameters of interest are the
expected
power(EP)
and its standarddeviation
(SD):
where
pr(i)
is theprobability
according
to the binomial distribution ofhaving
i
heterozygous
boars[assuming
afrequency
ofheterozygous
boarsgiven by
equation
(1)],
andP
i
is the power with i boarscomputed according
to theprevious
sections.Expected
power is utilized for eachdesign
to account for randomsampling
of the boars. SD can be used as a measurement of the variation in power.Comparison
of alternativecrossing
designs
can be doneby
comparing expected
power and its standard deviation at a fixedexperiment
size. Theapproach
used in this paper foroptimum
allocation of resources is based on therepeated computation
ofexpected
power for allpossible
combinations of the number of boars and sows at agiven experiment
sizeassuming
constant litter size of tenpiglets.
The power of themating
structurehaving
thehighest
expected
power will be called maximumexpected
power(MEP).
The functions CINV and PROBCHI of SAS[12]
were used tocompute
central and non-centralAnother
parameter
of interest inplanning experiments
isexpected
risk(ER)
which we define as theprobability
ofhaving
power lower than 0.5 due tosampling
amongF
l
individuals in thesegregating
population:
2.6. Selection in
parental
lines to increase power forQTL
mapping
Selection ofhigh ranking
individuals in oneparental
line and of lowranking
individuals in anotherparental
line can be used to increase statistical power for theexperiment
when the lines are not fixed for alternative alleles. Assume a biallelicQTL segregating
in the twoparental
lines. Consider a hierarchicalbackcross
design
in whichhigh
and lowranking
individuals selected in theparental
lines arerandomly
mated toproduce F
l
,
which areagain randomly
mated to one of the unselectedparental
lines. Two effects would occur in this scheme:1)
an increase in thefrequency
ofheterozygous
individuals in theF
r
which would increase the power to detectQTL;
and2)
an increase ingenetic
variance of the backcrossoffspring
due tolinkage disequilibrium
[3],
which would decrease power to detectQTL.
2.6.1.
Frequency of heterozygotes
amongoffspring of high
and lowranking
parents
Computation
of thefrequency
ofheterozygous
F
i
individuals was carriedout
by
using
integrals simultaneously
of the three normal distributionscor-responding
to the threegenotypes
at theQTL.
The selectedproportion
(p,
5
)
for thehigh
line is related to theproportion
selected among individuals withgenotypes
QQ, Qq and qq
(p
,
poq andpqq)
by
theequation:
with
f(xIG
i
)
being
the normaldensity
given
genotype
(Gi
=QQ,
qq).
Values pQQ, PQq, and Pqq can be found for aunique
truncationpoint,
t,
analytically.
In theexamples
considered in this paperSimpson’s
rule[13]
was used for
increasing
values of the abscissa until a value of t was foundsatisfying
the aboveequation
for agiven
ps.Computation
ofexpected
genotype
The same
procedure
was followed to obtainexpected
genotype
frequencies
among low
ranking
individuals in the low line(pQ
Q
,
L
,
L’
PQ
q
p
9
9
,
L
)
using
thesame
proportion
selected,
ps, as for thehigh
line.Computing
theexpected
frequency
ofheterozygous
F
l
offspring
resulting
from the cross of selectedparents
from each line was carried out withequation
(3)
ofGomez-Raya
and Gibson[6].
It assumes randommating
and utilizes thefrequencies
amongextreme individuals from each of the two
parental
lines.2.6.2. Increased variance among the
offspring
of F
l
individuals The effect of selection ofhigh
and lowranking
individuals ongenetic
variance and on the estimation ofheritability
is discussedby
Bulmer[4]
andGomez-Raya
et al.
!7!.
Briefly,
if selection ofhigh
and lowranking
individuals is carried outin the same
population
thengenetic
variance increases in the selected group ofparents
QAS
=(1 + !s!)o’!;
whereQA
is thegenetic
variance for thepolygenic
component
in the unselectedpopulation, k,
5
=[z 4>(z)]/ps,
z is the absolutevalue of standard normal deviates at cutoff
points,
4>( z)
is the ordinate and ris accuracy of evaluation. The above
equation
can also be used whencrossing
high
ranking
individuals from oneparental
line with lowranking
individuals from the otherparental
line aslong
as theparental populations
have the samegenetic
variance and the samepolygenic heritability
and allowance is made forthe different selection intensities for each
genotype
at theQTL
and for each line. Forsimplicity,
anapproximation
to account for different selection pressuresacross
genotypes
was madeby
weighting
the values of z and4
>(z)
according
tothe
proportion
selected from eachgenotype
and line in thecomputation
ofk
8
by
where
zi&dquo;
and4>(z;&dquo;)
are the absolute values of standard normal deviate and ordinate at truncationpoint
forgenotype
i and line m(i
had values1,
2 and 3 forgenotypes
QQ, Qq
and qq,respectively).
An exact
computation
would be feasibleby using
allpossible
groupsresulting
in theF
l
offspring
andby including
the variation of the meanscorresponding
tothe different groups. This variation is
small,
particularly
forQTL
of smalleffect,
and the
approach
used accounts for the increased variance amongoffspring
from selectedparents.
Following
Bulmer(3!,
thegenetic
variance in theF
l
generation
is0’ A
(Fl)
2
(1
+(1/2)ksr2)u!.
Thedisequilibrium
is halved in the nextgeneration
sinceparents
in theF
l
are chosenrandomly.
Thegenotypic
variance among backcrossoffspring
fromrandomly
mating
F
l
individuals to one of theparental
lines(unselected)
is:The
heritability
among backcrossoffspring
is also increasedby
selection ofhigh
Power calculation was carried out as described in the
corresponding
section butusing
the increasedfrequency
ofheterozygotes
andheritability
ascomputed
above. The estimates of the gene effect are biased if selection ofparents
is used. Correction for selection bias in the estimates of the gene effect can be achieved afteraccounting
for the increased variance among backcrossoffspring by
where a* is the estimate of the gene effect from the
experiment
using selection,
and a is the estimate of the gene effect after correction for selection bias. Both
&* and a are in
phenotypic
standard deviations units.3. RESULTS
Table I shows the
gametic
disequilibrium
parameter
for alternative number of allelessegregating
in theparental populations.
Thedisequilibrium
ishigh
when alleles at each of the two loci are fixed in onepopulation
butthey
arerare in the other
population.
If the alleles at the two loci aresegregating
at a similarfrequency
in the twopopulations
thendisequilibrium
is small. In thissituation,
theanalysis
could beperformed
within familiesallowing
for different alleles at the marker to be associated with the same allele at theQTL
in different families.Maximum
expected
power and its standard deviation for three differentcrossing designs
in avariety
of situations(QTL
effect,
residualheritability,
experiment
size)
forheterozygosity
1,
0.68 and0.32,
aregiven
in tablesII,
111 andIV,
respectively.
Theheritability including QTL
andpolygenic
variation(h
2
(max))
of theparental population having
thehighest possible
value isgiven
in these tables. It can be observed that residualheritability
has a small effect on power for the range of heritabilities in theparental populations.
Therefore,
powerfigures
can be considered as agood
approximation
whenaccurate estimates of
heritability
in theparental population
are not available. On the otherhand,
a reduction in thefrequency
ofheterozygotes
amongF
r
individuals diminished the power for any situation considered. Forexample,
for residualheritability
0.2 andQTL
effect 0.5 in anexperiment
with 200piglets
in a hierarchical backcrossdesign,
the maximumexpected
power with asignificance
level of 0.05 fell from 0.85 to 0.31 forheterozygosity
1 and0.32,
respectively.
Powerusing
hierarchicaldesigns
(backcross
andintercross)
increases in all cases with theexception
of traits withhigh heritability
whencompared
to traditional intercrossdesigns
(tables
77-7 V).
Maximumexpected
powerusing
hierarchical backcrossdesigns
islarger
thanusing
hierarchical intercrossdesigns
when theQTL
effect is small. Theopposite
occurs forlarge
QTL
effects.The use of a
larger
experiment
size increases power in all cases. Forexample,
the average maximum power acrossQTL
size and residualheritability
(table
II)
intercross
designs,
average maximum power is0.58,
0.71 and 0.81 forexperiment
sizes of200,
400 and800,
respectively.
Variation in power due to the
sampling
in theF
l
should also be consid-ered. Hierarchical backcrossdesigns
show a muchlarger
variation than eitherintercross
designs
(tables
III andII!.
Anexample
of how power, standard devi-ation of power andexpected
riskchange
for alternativemating
designs
isgiven
in table V. In thisexample,
heterozygosity
is0.68,
residualheritability
of the trait is 0.2 andQTL
effect is 0.5phenotypic
standard deviations withcorresponding
standard deviation of power andexpected
risk are 0.27 and 10%,
respectively.
Acompromise
betweenexpected
power and risk can be taken. Forexample,
the use of five boars has a lowexpected
risk of 4%
with anexpected
power of 0.74. It is more obvious with a hierarchical intercross
design
whereexpected
powerusing
between one and ten boars is very similar(0.81
to0.83).
However,
theexpected
risk isonly
low whenusing
four or more boars. Hierarchical and traditional intercrossdesigns
gave identical results ather-itability
0.5(tables 11-I!.
This is because traditional intercross is an extremecase of hierarchical intercross
(one
sow perboar)
which is the best allocationof resources at
high
heritabilities.Table VI shows the number of
boars, expected
power and risk fordesigns
having
maximumexpected
power and minimum risk in hierarchical intercross and backcrossdesigns.
The residual heritabilities are0.10,
0.20 and 0.50 andthe
QTL
size ranges from 0.3 to 0.6phenotypic
standard deviations in anexperiment
of 400piglets
withF
z
heterozygosity
of 0.68. It can be observed that the numbers of boars for maximumexpected
power and minimum risk areHeterozygosity
and powerusing
phenotypic
selection of extreme individualsin a hierarchical backcross
design
with 400piglets,
residualheritability
0.20 andvarying
percentage
selected aredepicted
infigures
1 and2,
respectively.
Parental lines weresegregating
atfrequencies
of 0.8(selected
toincrease)
and 0.2(selected
todecrease).
Apercentage
selected of 100 isgiven
forcomparison
with the unselected case.Percentage
selected of 0.5%
was the lowestattempted.
Heterozygosity
increases withincreasing
selection pressure in each line and with the size of theQTL. Following
the samepattern,
power increases withdecreasing
percentage
selectedbeing
up to 21%
more(figure 2).
The increaseis small if the power in the unselected situation is close to 0 or 1.
4. DISCUSSION
Most
previous
research inQTL
mapping
has assumed that lines atcrossing
are fixed for alternative alleles at the
QTL
[1, 2, 8].
Theavailability
ofdivergent
Domesticated breeds of
pigs
have beenundergoing
artificial selection to increasegrowth
rate and littersize,
forexample.
Only
for research purposes, selection criterion has been lowgrowth
rate or reduced litter size inexperimental
populations. Therefore,
it seems more reasonable to assume that the lines atcrossing
differ in allelefrequency
at theQTL
rather than thatthey
are fixedfor alternative alleles.
The benefits of
crossing
lines are to increasegametic
disequilibrium
between marker andQTL
alleles and to increaseheterozygosity
amongF
l
individuals. Gameticdisequilibrium generated
atcrossing
ishigh
when the allelefrequency
at the two loci
(either
two markers or one marker and oneQTL)
is very differentin the two
parental
lines. Gameticdisequilibrium
between two linked markers could be estimated in chromosomalfragments by
the use of thedisequilibrium
parameter
proposed
in this paper. It wouldrequire
that markers are nottightly
linked so
parental
lines could be assumed to be inlinkage
equilibrium.
If theout within families as considered in this paper with
only
a small loss in power. If thegametic
disequilibrium
ishigh
(e.g.
S2 >0.8)
then a maximum likelihoodapproach
could bedeveloped
allowing
the same marker allele to be associated with the sameQTL
alleleonly
in some families.Consequently,
powerfigures
asgiven
in thisstudy
represent a lower bound of the achievable power in thosecases. More work is needed to assess the
gain
in power in situations wheredisequilibrium
parameter
has intermediate values.Power for
QTL
detection inexperiments involving
crosses between twooutbred lines is
higher
thanusing
within-lineexperiments
when thefrequency
of the favourable allele ishigher
than1/2
in oneparental
line and lower than1/2
in another
parental
line. The increased power can be attributed to thehigher
frequency
ofheterozygous
F
l
individuals than in eitherparental
breed. Thelarger
the difference in allelefrequency
between the twoparental populations,
the
larger
is the increase in power.It was assumed in the
computation
of power that residualheritability
wasconstant for
varying QTL
effects and for agiven heterozygosity
in theF
l
.
Thisrepresents
avariety
of situations in whichparental populations
can besegregating
for aQTL
at a differentfrequency
andhaving
a differentheri-tability.
Theapproach
taken in this paper was tocompute
also theheritability
(comprising
QTL
andpolygene
variation)
of theparental
line with thehighest
possible
value for agiven heterozygosity
in theF
l
.
At low residualheritabil-ity,
the contribution oflarge QTL
to theheritability
ishigh.
Forexample,
for residualheritability
0.1 andQTL
effect of 0.6phenotypic
standarddeviations,
theheritability
is 0.22 when the allelefrequency
in theparental population
is 0.68(table 111).
Inspite
of thelarge
contribution of theQTL
variation to theheritability,
changes
in power for differentheritability
values are small.Discrep-ancies in power
comparing
residualheritability
0.1 and 0.2 for the sameQTL
size range between 0.00 and 0.05
(table 111).
Therefore,
powerfigures,
asgiven
in this paper, are well
approximated
whenprecise
estimates of heritabilities fromparental
population
are not available. Theapproximation
isparticularly
good
when theQTL
has a small effect.On the other
hand,
it was assumed whencomputing
power thatrecombina-tion between
QTL
and marker did not occur. In mostinstances,
recombination would occur. If the recombination fraction between the marker and theQTL
is c then the number ofoffspring
should be increased inproportion
to1/( 1- 2c?
to
obtain the same power as with
complete linkage
!14!.
Power could be increasedby using
intervalmapping
instead of the use of asingle
marker!10!.
Most of the relevant information in intervalmapping
comes from the non-recombinantindividuals. The number of
offspring required
to obtain power asgiven
in this paper should be increased in aproportion
1/(1-c’),
where c’ is therecombina-tion fraction between the two markers. That
is,
around 25%
moreoffspring
are needed to achieve power asgiven
in this paper for intervalmapping
betweentwo markers with a recombination fraction of 0.20. A
larger
experiment
size will also be needed to achieve the same powerfigures
asgiven
in this paperwhen the marker is not
fully
informative. The use of intervalmapping
canmit-igate
the lack of informativeoffspring
by
utilizing
informationcorresponding
to
nearby
informativeflanking
markers.The models used in this paper
ignore
thepossibility
of a common full-sibpigs.
Power formapping
QTL affecting
those traits would be reduced!9!.
Theapproach
used in thisstudy
tocompute
power could be modified to accountfor this
problem
by
incorporating
a termcorresponding
to the commonfamily
component
in the standard error of the contrasts.It has also been assumed that the heritabilities in
parental populations
as well as within the
offspring
ofF
l
remainunchanged.
Thefrequency
ofheterozygotes
at other lociaffecting
the trait could increasegenetic
variance amongF
l
individualsand, therefore, heritability.
Onegeneration
of randommating
isenough
to restoreHardy-Weinberg equilibrium
as would occur afterintercrossing
F
l
individuals.However,
changes
in thegenetic
variance amongoffspring resulting
from the hierarchical backcrossdesign
may occur. If there are otherQTL affecting
the traitshowing
dominance orepistasis,
thenchanges
in thephenotypic
variance andheritability
could also occur.Therefore, departures
from the power ascomputed
in this paper would beexpected
for traits that show heterosis such asreproductive
traits.For a
given
power, a backcrossrequires
about twice as many progeny as an intercrossdesign
because the number ofsegregating
meioses is doubledin the intercross
[10].
Experimental design
formapping QTL
inpigs
has the limitation that the number ofpiglets
per litter has a maximum around ten.This limits the size of the
subgroups
of progeny where the contrasts are carriedout. Hierarchical backcross and intercross
designs
can,however,
be used inpigs
and allow contrasts within boars withlarger subgroups
of progeny. Power of intercross versus backcrossdepends
on bothheritability
andQTL
effect whenusing
hierarchicaldesigns.
Athigh heritability levels,
the hierarchical intercrossdesign
is morepowerful
than the hierarchical backcrossdesign.
Theopposite
occurs at low
heritability
values.An
interesting
result was that power for smallQTL
effects washigher
ina hierarchical backcross than in a hierarchical intercross. The
only
difference between the twodesigns
is that the hierarchical backcrossignores
meioses from sows. In a hierarchicalintercross,
contrasts between alternative alleles inherited from sows are small with alarge
variationgiven
the smallQTL
effect and the small progeny group(ten piglets).
However,
the total number ofdegrees
of freedom may behigh
because of thelarge
number of sows. On thecontrary,
in a hierarchical backcross each of the few boars haslarge subgroups
of progeny which means that contrasts have low variation andhypothesis
testing
is carriedout with a low number of
degrees
of freedom. As a consequence of theabove,
power may be reduced in the intercross.
There is not a
general
experimental design
which can allocate resources in anoptimum
way if theexperiment
size is fixed. Power inexperiments
witha low number of boars is
higher
for small size ofQTL
andhigh
heritability
(table V7).
Theopposite
occurs forQTL
withlarge
size and lowheritability.
Risk measured as the
probability
of power lower than 0.5 due tosampling
ofF,
boars ishigh
with a low number of boars. If thefrequency
ofheterozygous
F
l
individuals is low then the risk increases and the power decreases(results
not