HAL Id: hal-00894186
https://hal.archives-ouvertes.fr/hal-00894186
Submitted on 1 Jan 1997
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Mapping linked quantitative trait loci via residual
maximum likelihood
Fe Grignola, Q Zhang, I Hoeschele
To cite this version:
Original
article
Mapping
linked
quantitative
trait loci via
residual
maximum
likelihood
FE
Grignola Q Zhang
I
Hoeschele
Department of Dairy Science, Virginia Polytechnic
Institute and StateUniversity,
Blacksburg,
VA2l!061-OS15,
USA(Received
2September 1996; accepted
19August
1997)
Summary - A residual maximum likelihood method is
presented
for estimation of thepositions
and variance contributions of two linkedQTLs.
The method alsoprovides
tests for zero versus oneQTL
linked to a group of markers and for one versus two(aTLs
linked. Adeterministic,
derivative-freealgorithm
isemployed.
The variance-covariance matrix of the allelic effects at eachQTL
and its inverse iscomputed
conditional onincomplete
information frommultiple
linked markers. Covariances between effects at different(aTLs
and between
CaTLs
andpolygenic
effects are assumed to be zero. A simulationstudy
was
performed
toinvestigate
parameter estimation and likelihood ratio tests. Thedesign
was a
granddaughter design
with 2 000 sons, 20 sires of sons and nine ancestors of sires. Data were simulated under a normal-effects and a biallelic model for variation at eachQTL. Genotypes
at five or nineequally spaced
markers weregenerated
for all sons andtheir ancestors. Two linked
(aTLs
accountedjointly
for 50 or 25% of the additivegenetic
variance,
and distance betweenQTLs
varied from 20 to 40 cM. Power ofdetecting
asecond
QTL
exceeded 0.5 all the time for the 50%QTLs
and when the distance was(at
least 30 cM for the 25%QTLs.
An intersection-union test ispreferred
over a likelihood ratio test, which was found to be rather conservative. Parameters were estimatedquite
accurately
except for aslight
overestimation of the distance between two closeQTLs.
quantitative trait loci
/
multipoint mapping/
residual maximum likelihood/
outcrosspopulation
Résumé - Détection de gènes liés à effets quantitatifs
(QTL)
grâce au maximumde vraisemblance résiduelle. On
présente
une méthode de maximum de vraisemblance résiduelle pour estimer lespositions
et les contributions à la variabilitégénétique
de deuxQTLs
liés. La méthodefournit également
des tests de l’existence d’un seulQTL
lié à un groupe de marqueurs(par
rapport àzéro)
ou de deuxQTLs
(par
rapport à unseul).
Un
algorithme
déterministe sans calcul de dérivées est utilisé. La matrice de variance-covariance deseffets alléliques
àchaque QTL
et son inverse est calculée conditionnellementà
l’information incomplète
sur les marqueursmultiples
liés. Les covariances entre leseffets
auxdifférents QTLs
et entre leseffets
auxQTLs
et leseffets polygéniques
sont*
supposées
nulles. Une étude de simulation a étéeffectuée
pouranalyser
lesparamètres
estimés et les tests de rapports de vraisemblance. Le schémaexpérimental
a été un schéma«
petites-filles
» avec 2 000fils,
20pères
desfils
et 9 ancêtres de cespères.
Des donnéesont été simulées avec un modèle de variation au
QTL
de type Gaussien oubiallélique.
Les
génotypes
pour cinq ouneuf
marqueurségalement espacés
ont étégénérés
pour tous les,fils
et leurs anchêtres. DeuxQTLs
liésexpliquaient
conjointement 50 % ou 25 % de la variancegénétique
additive et la distance entre lesQTLs
variait de 20 CM à 40 CM. Lapuissance de détection d’un second
QTL
adépassé 0,5,
dans tous les cas pour la situation 50%,
etquand
la distance entreQTLs
étaitsupérieure
ouégale
à 30 CM pour la situation 25 %. Le test d’unQTL
par rapport à deuxQTLs correspond
à la réunion de deux tests.On l’a trouvé
plutôt conservatif.
Lesparamètres
ont été estimés avec unegrande précision
excepté
la distance entre deuxQTLs proches
qui a étélégèrement
surestimée.locus de caractère quantitatif
/
cartographie multipoint/
maximum de vraisemblance résiduelle/
population
consanguineINTRODUCTION
A
variety
of methods for the statisticalmapping
ofquantitative
trait loci(QTL)
exist. While some methods
analyze squared phenotypic
differences of relativepairs
(eg,
Haseman andElston, 1972;
Gotz andOllivier,
1994),
most methodsanalyze
the individualphenotypes
ofpedigree
members. Main methodsapplied
to livestockpopulations
are maximum likelihood(ML) (eg,
Weller,
1986;
Lander andBotstein,
1989;
Knott andHaley,
1992),
least-squares
(LS)
as anapproximation
to ML(eg,
Weller etal,
1990;
Haley
etal, 1994;
Zeng,
1994),
and a combination of MLand LS referred to as
composite
intervalmapping
(Zeng,
1994)
ormultiple QTL
mapping
(Jansen, 1993).
These methods weredeveloped mainly
for linecrossing
and, hence,
cannotfully
account for the morecomplex
data structures of outcrosspopulations,
such as data on several families withrelationships
acrossfamilies,
incomplete
markerinformation,
unknown number ofQTL
alleles in thepopulation
andvarying
amounts of data on different(aTLs
or in different families.Recently,
Thaller and Hoeschele(1996a, b)
and Uimari et al(1996)
implemented
a
Bayesian
method forQTL
mapping using
single
markers or all markers on achromosome,
respectively,
via Markov chain Monte Carloalgorithms,
andapplied
theanalyses
to simulatedgranddaughter designs
identical to those in thepresent
study.
Hoeschele et al(1997)
showed that theBayesian
analysis
can accommodateeither a biallelic or a normal-effects
QTL
model. While theBayesian analysis
wasable to account for
pedigree relationships
both at theQTL
and for thepolygenic
component,
and gavegood
parameter estimates,
it was verydemanding
in termsof
computing time,
inparticular
whenfitting
two(aTLs
(Uimari
andHoeschele,
1997).
Therefore,
Grignola
et al(1996a)
developed
a residual maximum likelihoodmethod,
using
adeterministic,
derivative-freealgorithm,
to map asingle QTL.
Hoeschele et al
(1997)
showed that this method can be considered as anapproxi-mation to the
Bayesian analysis fitting
a normal-effectsQTL
model. In thenormal-effects
QTL
modelpostulated
by
the REMLanalysis,
the vector ofQTL
allelic effects is random with aprior
normal distribution. The REMLanalysis
builds onit to the estimation of
QTL, polygenic,
and residual variancecomponents
and ofQTL location, using incomplete
information frommultiple
linked markers.Xu and
Atchley (1995)
performed
intervalmapping using
maximum likelihood based on a mixed model with randomQTL
effects,
but these authors fitted additivegenotypic
effects rather than allelic effects at theQTL,
with variance-covariancematrix
proportional
to a matrix ofproportions
of allelesidentical-by-descent,
and assumed that this matrix was known. Theiranalysis
wasapplied
to unrelated full-sibpairs.
In order to account for severalQTLs
on the samechromosome,
Xu andAtchley
(1995)
used the idea behindcomposite
intervalmapping
and fitted variancesat the two markers
flanking
the marker bracket for aQTL.
Thisapproach, however,
is not
appropriate
formulti-generational
pedigrees,
as effects associated with markeralleles erode across
generations owing
to recombination. It is alsoproblematic
foroutbred
populations,
whereincomplete
marker information causes theflanking
andnext-to-flanking
markers to differ among families.In this paper, we extend the REML method of
Grignola
et al(1996a)
to thefitting
ofmultiple
linkedQTLs.
While the extension isgeneral
for any number of linkedQTLs,
weapply
the method to simulatedgranddaughter designs by
fitting
either one or two
QTLs.
METHODOLOGY Mixed linear model
The model is identical to that of
Grignola
et al(1996a),
except
that it includes effects at several(t)
QTLs,
and it can be written as:where y is a vector of
phenotypes,
X is adesign-covariate matrix, j3
is a vector of fixedeffects,
Z is an incidence matrixrelating
records toindividuals,
u is a vectorof residual additive
(polygenic)
effects,
T is an incidence matrixrelating
individualsto
alleles,
vi is a vector ofQTL
allelic effects atQTL
i,
e is a vector ofresiduals,
A is the additive
genetic relationship matrix,
c
7
’
is
thepolygenic
variance,
Q,
0
,2 V(
i)
is the variance-covariance matrix of the allelic effects at
QTL
i conditional onmarker
information,
Qv!2!
is the allelic variance atQTL
i (or
half of the additive variance atQTL
i),
R is a knowndiagonal
matrix,
andQ
e
is
residual variance. Eachmatrix
Q
i
depends
on one unknownparameter,
the mapposition
ofQTL
i (d
i
).
Parameters related to the marker map
(marker
positions
and allelefrequencies)
are assumed to be known. The model is
parameterized
in terms of the unknownparameter’s heritability
(h
2
=0&dquo;!/0&dquo;2),
withaj_
being
phenotypic
andU2
additiveeffects at
QTL
i(v?
=o,’ v(i) /’a 2;
i =l, ... ,
t),
the residual variance0,2, e
andQTL
map locations
d
l
, ... ,
di, ... , dt.
A model
equivalent
to the animal model in[1]
is(Grignola
etal,
1996a):
where W has at most two non-zero elements
equal
to 0.5 in each row in columnspertaining
to the knownparents
of anindividual,
F
i
is a matrix with up to fournon-zero elements per row
pertaining
to theQTL
effects of an individual’sparents
(Wang
etal,
1995;
Grignola
etal,
1996a), Ap
andQp(
j
)
are sub-matrices of Aand
Q, respectively, pertaining
to all animals that areparents,
and m and eiare Mendelian
sampling
terms forpolygenic
andQTL
effects, respectively,
with covariance matrices asspecified
inequation
!2!.
WhileVar(m)
isdiagonal,
Var(e
i
)
can have some
off-diagonal
elements in inbredpopulations
(Hoeschele,
1993; Wang
et
al,
1995).
Note that models
[1]
and[2]
are conditional on a set ofQTL
mappositions
(and
on markerpositions
which are assumed to beknown).
Dependent
on themap
positions
are the matricesQ
i
in model[1]
and the matricesF
i
andQp(
j
)
inmodel
!2!.
Note furthermore that models
[1]
and[2]
assume zero covariances between effectsat different
QTLs,
and betweenpolygenic
andQTL
effects.However,
selection tendsto introduce
negative
covariances between(aTLs
(Bulmer,
1985).
A reduced animal model
(RAM)
can be obtained from model[2]
by combining
m,
the e
i
(i
=1, ... ,
t) and
e into the residual. Mixed modelequations
(MME)
can be formed
directly
for theRAM,
orby
setting
up the MME for model[2]
andabsorbing
theequations
in m and the ei(i
=with the A matrices defined in
equation
!2!.
MatrixD,
which results from successiveabsorption
of the Mendeliansampling
terms for thepolygenic
component
and theQTLs,
can be shown to bealways diagonal
and verysimple
tocompute,
even whenseveral
(aTLs (t
>2)
are fitted. Let6v!i!!!
represent
the Mendeliansampling
termpertaining
to v effect k(k
=1, 2)
ofindividual j
at
QTL
i,
and6,,(
j
)
the Mendeliansampling
term for thepolygenic
effect ofj. Then,
the element of Dpertaining
toindividual j
(d
jj
)
iscomputed
as follows:where
r
jj
is thejth
diagonal
element ofR-
1
.
REML
analysis
The REML
analysis
wasperformed using
intervalmapping
and a derivative-freealgorithm
to maximize the likelihood for anygiven
set ofQTL positions,
asdescribed
by Grignola
et al(1996a)
for asingle
QTL
model. Thelog
residuallikelihood for the animal model was obtained
by adding
correction terms to the residual likelihood formeddirectly
from the RAM MME(Grignola
etal,
1996a).
The RAM residual likelihood is:
where N is the number of
phenotypic
observations,
NF the number of estimable fixed effects(rank
ofX),
NR
RA
,yI
the number of randomgenetic
effects of theparents
((1
+2t)
times the number ofparents),
CRAM
is the coefficient matrix inthe left-hand-side of
[3],
P =V-’ -
1
,
1
X’V-
1
X)-
X(X’V-
1
V-
V =Var(Y)/(7!,
and
GRA,!,I
is ablock-diagonal
with blocksAp(7! and
Qp(i)(7!(i)
for i =1, ... ,
t (see
The RAM residual likelihood is modified to obtain the residual likelihood for the animal model as follows
(Grignola
etal 1996a):
where A is the
block-diagonal
with blocksA
u
and!v(i)
(i
=1, ... ,
t)
from!2!,
Czz
is the
part
of the MME for model[2]
pertaining
to mand e
i
(i
= 1, ... ,
t
),
and
NRis total number of random
genetic
effects!(1
!-2t)
times the number ofanimals]
in the animal model.The
analysis
is conducted in the form of intervalmapping
as inGrignola
et al(1996a),
except
that now a t-dimensional search on agrid
of combinations ofpositions
of thet CaTLs
must beperformed.
Moreprecisely,
weperformed cyclic
maximization
by
optimizing
theposition
of the firstQTL
whileholding
theposition
of the secondQTL
constant andsubsequently fixing
theposition
of the firstQTL
whileoptimizing
theposition
of the secondQTL,
etc. A minimum distancewas allowed between the
QTLs,
which was determined such that the(aTLs
werealways separated by
two markers. Whittaker et al(1996)
showed that forregression
analysis
and F2 or backcrossdesigns,
the two locations and effects of two(aTLs
inadjacent
marker intervals are notjointly
estimable. With other methods anddesigns,
locations and variances of two(aTLs
inadjacent
intervals should be eithernot estimable or
poorly
estimated. At each combination ofd
1
andd
2
values,
the residual likelihood is maximized withrespect
to theparameters
h
z
,
v2
(i
=1,...,
t)
and
er!.
Matrices
Qp(
j
),
F
i
andOv!2!
were calculated for eachQTL
as described inGrignola
et al(1996a).
Hypothesis testing
The presence of at least one
QTL
on the chromosomeharboring
the markerlinkage
group can be tested
by
maximizing
the likelihood under theone-QTL
model and under apolygenic
model with noQTL
fitted(Grignola
etal,
1996a).
The distribution of the likelihood ratio statistic for these two models can be obtainedvia simulation or data
permutation
(Churchill
andDoerge,
1994;
Grignola
etal,
1996a, b;
Uimari etal,
1996).
Here,
we considertesting
theone-QTL
modelagainst
thetwo-QTL
model. This test isperformed by comparing
the maximized residual likelihood under thetwo-QTL
model with(i)
the maximized residual likelihood under theone-QTL model,
(ii)
the residual likelihood maximized under theone-QTL
model withQTL position
fixed at the REML estimate ofd
1
obtained under thetwo-QTL
model,
and(iii)
the residual likelihood maximized under theone-QTL
model withQTL
position
fixed at the REML estimate ofd
2
obtained under thetwo-QTL
model. The distribution of these likelihood ratio statistics is notknown,
and
obtaining
it via datapermutation
would be difficultcomputationally,
as manypermutations
would need to beanalyzed,
and as the two-dimensional search took1-2 h of run-time for the
design
described below. The likelihood ratioscorresponding
to
(i) (LR
d
),
(ii)
(LR
dl
),
and(iii)
(LR
d2
)
should have anasymptotic
chi-square
distribution within 1 and 3
degrees
of freedom. Whenusing
LR
dl
andLR
dz
,
both ratios have to besignificant
in order toreject
the nullhypothesis
of oneQTL.
Thisfor the first likelihood ratio the
hypotheses
are:H
o
:
Œ!(l) -I-
0 andŒ!(2)
= 0 versusH
1
:
Œ!(l) -I-
0 andŒ!(2) -I-
0,
and for the second likelihood ratio thehypotheses
are:Ho:
Œ!(1) =
0 andŒ!(2) -I-
0 versusH
1
:
Œ!(1) -I-
0 andŒ!(2) -I-
0. The intersection-union test constructed in this way can bequite conservative,
as its size may bemuch less than its
specified
value ce. Forgenome-wide testing,
thesignificance
levelshould also be
adjusted
for the number ofindependent
testsperformed
(the
number of chromosomesanalyzed
times the number ofindependent
traits).
SIMULATION
Design
The
design
simulated was agranddaughter
design
(GDD)
as in thesingle QTL
study
ofGrignola
et al(1996b),
where markergenotypes
are available on sons andphenotypes
ondaughters
of the sons. The structure resembled the real GDD of the USpublic
genemapping project
fordairy
cattle based on thedairy
bull DNArepository
(Da
etal,
1994).
The simulated GDD consisted of 2 000 sons, 20sires,
and nine ancestors of the sires
(fig 1)..
The
phenotype
simulated wasdaughter
yield
deviation(DYD)
of sons(Van-Raden and
Wiggans,
1991).
DYD is an average of thephenotypes
of thedaughters
adjusted
forsystematic
environmental effects andgenetic
values of thedaughters’
dams. For details about theanalysis
ofDYDs,
seeGrignola
et al(1996b).
Marker and
QTL
genotypes
were simulatedaccording
toHardy-Weinberg
fre-quencies
and the mappositions
of all loci. All loci were in the samelinkage
group.Each marker locus had five alleles at
equal frequencies.
Severaldesigns
werecon-sidered which differed in the map
positions
of the twoQTLs,
in the number ofmarkers,
and in theproportion
of the additivegenetic
varianceexplained
by
thetwo
QTLs.
Thesedesigns
are defined in table II. Also simulated was asingle QTL
at 45 cM to test the
two-QTL analysis
with datagenerated
under thesingle QTL
model.Polygenic
andQTL
effects were simulatedaccording
to thepedigree
infigure
1.Data were
analyzed by
using
thepedigree
information on the sires. Note thatin the
simulation,
nolinkage
disequilibrium
(across
families)
wasgenerated, ie,
polygenic
effects were zero.Therefore,
an additionaldesign
was simulated wherelinkage disequilibrium
wasgenerated
by
simulating
DYDs also forsires, creating
a
larger
number of sires andculling
those sires with DYD lower than the 90thpercentile
of the DYD distribution.QTL positions
for thisdesign
were 30 cM(interval 2)
and 70 cM(interval 3)
with fivemarkers,
and theQTL
model wasthe normal-effects model
(see below).
Estimates of the simulated correlations(SE
in
parentheses),
across 30replicates,
were -0.20(0.05),
-0.33(0.04),
and -0.32(0.04),
betweenpairs
of v effects atQTL
1 andQTL 2,
betweenpairs
of v effectsat
QTL
1 andpolygenic
effects,
and betweenpairs
of v effects atQTL
2 andpolygenic effects,
respectively.
The effects of one or severalgenerations
of phenotypic
truncation selection on additive
genetic
variance in a finite locus model has beenQTL
modelsTwo different
QTL
models were used to simulate data. Under bothmodels,
phenotypes
were simulated aswhere n
j
was the number ofdaughters
of sonj,
gi!k was the sum of the v effectsin
daughter
k ofson j at
QTL
i,
uj was anormally
distributedpolygenic effect,
e
j was a
normally
distributedresidual, polygenic
variance(
0
&dquo;)
wasequal
to the difference between additivegenetic
variance(afl )
and the varianceexplained
by
theQTLs,
andafl was
environmental variance. Number ofdaughters
per son was set to50,
corresponding
to areliability
(Van
Raden andWiggans,
1991)
near 0.8. Narrowsense
heritability
of individualphenotypes
wash
2
=0.3,
andphenotypic
SD wasQ P = 100.
Note that the
QTL
contribution to the DYDs of sons wasgenerated by sampling
individual
QTL
allelic effects ofdaughters
under each of the twogenetic
models described below. Thissampling
ofQTL
effects ensures that DYD of aheterozygous
son, or of a son with substantial difference in the additive effects of the alleles at
a
QTL,
haslarger
variance amongdaughters
due to theQTL
than ahomozygous
son or a son with similar
QTL
allelic effects.Two different models were used to describe variation at the
QTL,
which areidentical to two of the models considered
by Grignola
et al(1996b).
Normal-effects modelFor each individual with both or one
parent(s)
unknown,
both or oneeffect(s)
atQTL
k(k
=1, 2)
were drawn fromN(O,
a v 2(k)).
For thepedigree
infigure
1,
therewere 32 base
alleles,
and eachQTL
was treated as a locus with 32 distinct allelesin
passing
on alleles to descendants. Theparameter
a
v
2(k)
was set to0.125or or
0.625(J&dquo;!,
ie,
QTL
k accounted for25%
(2
V2
=0.25)
or12.5%
(2v!
=0.125)
of thetotal
additive
genetic variance,
respectively.
k k
Consequently,
the two(aTLs
accountedjointly
for between 25 and50%
of the additivegenetic
variance.Biallelic model
Each
QTL
was biallelic with allelefrequency
pi =p 2 =
P= 0.5. The variance at
QTL
k waswhere for p = 0.5 and
2v!
= 0.25 or2v!
=0.125,
half thehomozygote
difference atRESULTS
The
designs
studied are described in table II and differ in theQTL positions,
in the number of
markers,
and in theproportion
of the additivegenetic
varianceexplained jointly by
two linkedQTLs.
Overall,
theQTL
parameters
were estimatedquite accurately
as in thesingle-CaTL
analysis
ofGrignola
et al(1996b),
except
that there was atendency
to overestimate the distance between theCaTLs
withdecreasing
true distance.Parameter estimates for all
designs
in table II and for the normal-effectsQTL
model used in the data simulations arepresented
in table III. Thereappeared
tobe a
slight tendency
to overestimate theQTL
variance contributions(v
2
),
but,
in most cases notsignificantly.
TheQTL
mappositions
and the distance betweenthe
QTLs
were estimatedaccurately
when the true map distance between the(aTLs
was 30 or 40 cM. When the true map distance wasonly
20cM,
there was atendency
to overestimate theQTL
distance. This overestimation wassignificantly
morepronounced
when the number of markers was reduced from nine(every
10cM,
designs IIIA,
B)
to five(every
20cM, designs IVA,
B).
Toinvestigate
whether the overestimation of theQTL
distance was related to the searchstrategy requiring
a minimum distance between the
QTLs
such that these werealways
separated
by
two markers(with
theexception
ofdesigns IVA, B),
the minimum distancewas reduced to 10 and 2 cM.
However,
parameter
estimates and likelihood ratios remainedunchanged.
When the
(aTLs
accountedjointly
foronly
25%
of the additivegenetic
varianceas
compared
to50%,
there was littlechange
in theprecision
of the estimates of theQTL
variance contributions. Standard errors of theQTL positions
werehigher,
andoverestimation of the distance between
(aTLs only
20 cMapart
wasslightly
morepronounced.
Parameter estimates for
designs
simulated under the biallelicQTL
model areshown in table V.
Except
for theQTL model,
thesedesigns
are identical todesigns IA,
B andIIIA,
B in table II. Parameters were estimated with an accuracynot
noticeably
lower than for the normal-effectsQTL model,
an observation inagreement
with thesingle-(aTL
study
ofGrignola
et al(1996b).
When
analyzing
thedesigns
in table II with thesingle-(aTL
model,
the mostlikely QTL
position
(d
in tables III andV)
wasalways
somewhere in between thethe estimated
QTL
position
was very near the mean of the truepositions.
This result wasexpected,
as both(aTLs
hadequal
variance contributions and on averageequally
informativeflanking
markers.Likelihood ratio statistics for all
designs
in table II and for the normal-effectsQTL
model arepresented
in table IV. The average values of the likelihood ratio statistics fortesting
between thesingle-
andtwo-QTL
models declined asexpected
with
decreasing
distance between the twoQTLs
(designs
IIA,
B versusdesigns IIIA,
B),
withdecreasing
number of markers(designs
IA,
B versusIIA,
B,
anddesigns
IIIA,
B versusdesigns IVA,
B),
and withdecreasing joint
variance contribution of theQTLs
(A
versusB).
The average value of the likelihood ratio statisticLR
d
wasalways considerably
lower than those ofLR
dl
andLR
d2
.
The power
figures
in table IV were calculatedassuming
thatLR
dl
,
LR
d2
andLR
d
follow either achi-square
distribution with 1 df or 3 df andusing
an cx value of0.05/29
= 0.0017. To allow forany
interpretation
of these powerfigures,
weestimated the
type-I
errorby simulating
data with asingle
QTL
explaining
either25,
12.5 or6.25%
of the additivegenetic
variance. For thetype-I
errorestimation,
a was set to0.05,
and the number ofreplicates
was 200. Estimates oftype-I
errors are in tableVI,
for the two tests(LR
d
andLR
d
,
andLR
d2
)
and for thresholds fromchi-square
distributions with1,
2 and 3 df.Type-I
errors tended to increaseslightly
with size of the
QTL
variance,
and wereconsistently
lower for the testusing
LR
d
.
Based on these
results,
theempirical type-I
error was close to thepre-specified
value of 0.05 for theLR
dl
andLR
d2
tests whenusing
thexi-threshold,
while it wasWith this
background,
the power ofrejecting
thesingle-(aTL
model based onrequiring
bothLR
dl
andLR
d2
to exceed thesignificance
threshold was asexpected
always
higher
than orequal
to the power of theLR
d
statistic. For the test based onLR
dI
andLR
d2
,
power declined asexpected
withdecreasing
distance betweenQTLs
and withdecreasing
trueQTL
variance contribution. For thejoint QTL
variance contribution of50%
and the test based onLR
d
,
andLR
d2
,
power wasequal
to orhigher
than 0.5always
for thexi
threshold
andalways
except
fordesign
IVA for theX2
threshold.
For thejoint
QTL
variance contribution of25%,
a power of atleast 0.5 was achieved
only
for the 30 cM distance betweenQTLs
and theX2
andX3
thresholds.
Thisfinding
must beinterpreted by keeping
in mind the choice ofa = 0.0017 and the fact that the
test,
as contructedhere,
is rather conservative.For the data simulated with a
single QTL explaining
either 25 or12.5%
of theadditive
genetic variance,
the mappositions
estimated under thetwo-QTL
modelwere near the true
position
and aghost
position
to either side of the trueposition
in mostreplicates.
This behavior of the REMLanalysis
seems tosupport
the useof
LR
dl
andLR
d2
instead ofLR
d
.
Likelihood ratio statistics for the biallelic
QTL
model and some of thedesigns
intable II are
presented
in table V.Overall,
likelihood ratios and powerfigures
weresimilar to those for the normal-effects
QTL
model,
with somewhat lower power fordesign
IA butslightly higher
power for otherdesigns.
These differences are mostlikely
due to the limited number ofreplications
(30).
The
cyclic
maximizationstrategy
for thetwo-QTL
model took about 20 minof serial wall-clock time on a
21-processor
IBM SP2system
for a chromosome of80 cM
length, compared
with 1.5 h for a two-dimensional search. Run-time for thesingle-QTL
analysis
was at most 8 min.For the
designs
in tableII,
the twoQTLs
hadequal
variance contributions.and 25 and 45 cM
(nine
markers), respectively,
were simulatedusing
thenormal-effect
QTL
model,
withQTL
1explaining
25%
andQTL
212.5%
of the additivegenetic
variance. The average estimates(with
SE inparentheses)
ofQTL
position
from the
single
QTL
analysis
were 0.396 M(0.023)
and 0.298 M(0.010)
for the30 and 70 cM and 25 and 45 cM
designs, respectively, being
closer to the first locus with thelarger
variance contribution. EstimatedQTL positions
from thetwo-QTL
analysis
were 0.285 M(0.008)
and 0.720 M(0.010)
for the 30 and 70 cMdesign,
and 0.077
(0.012)
for the seconddesign, respectively.
For the 30 and 70 cMdesign,
power was 0.47
(0.40)
for theXi
2 (X’)
threshold and the test based onLR
dl
andLR
d2
.
For the 25 and 45 cMdesign,
power wasonly
0.33(0.10)
for the same tests.When
linkage
disequilibrium
wasgenerated
by phenotypic
truncation selectionof sires for the
design
withQTL
positions
of 30 and 70 cM andjoint QTL genetic
variance contribution of50%,
QTL
parameters
and their estimates wereclearly
affected.
Heritability
was estimated low(0.213 ! 0.020),
and thevfl (k
=1, 2)
wereestimated
high (0.184 ±0.014,
0.211 ±0.014),
butQTL positions
were estimatedaccurately
(0.292 t
0.010,
0.716 ±0.007).
Powerappeared
to be somewhat reducedcompared
to the samedesign
withoutselection,
and was estimated at 0.86 and 0.73 for theX
i
1 and X thresholds,
respectively.
Reduction in power wasprobably
due tothe
high
estimate of error variance(1737.9 t 55.3).
CONCLUSIONS
The REML
analysis of Grignola
et al(1996a, b),
based on a mixed linear model withrandom and
normally
distributedQTL
allelic effects and conditional onincomplete
information from
multiple
linkedmarkers,
has been extended here to fitmultiple
linkedQTLs.
This extension is necessary to eliminate biases in the estimates of theQTL
parameters
position
andvariance,
which occur whenfitting
asingle
QTL
and other linked
QTLs
arepresent.
For thepresent
study,
theanalysis
had beenimplemented
for twoQTLs
on the same chromosomeusing
a two-dimensionalsearch. When
fitting
more than two linkedQTLs
or additional unlinkedQTLs,
a more efficient searchstrategy
may berequired,
or alternativealgorithms
(eg,
Meyer
andSmith,
1996).
In themeantime,
acyclic optimization approach
wasimplemented,
with oneQTL position
held constant whileoptimizing
theother,
and vice versa, which reduced the number of likelihood evaluations and hence CPU time
considerably
relative to a two-dimensional search. As likelihood maximizationsat different
position
combinations areindependent
of eachother,
use ofmultiple
processors, if
available,
would reduce run-timessubstantially.
For the
one-QTL
model,
relationships
between the REMLanalysis,
theequiv-alent method of Xu and
Atchley (1995),
the method of Schork(1993),
and theBayesian analysis
of Uimari et al(1996)
were discussed inGrignola
et al(1996a).
The method of Xu andAtchley
(1995)
fitting
variances associated with thenext-to-flanking
markers to account for additional linkedQTLs
would not have worked well for thedesigns
studied here. A first reason is the inclusion of ancestors of thesires in the
analysis,
as their method fits random effects associated with the markeralleles in founders which erode across
generations
due to recombination. Anotherreason is the small number of families
differing
in theflanking
andnext-to-flanking
markers,
resulting
in too little information for estimation of variances associated with thenext-to-flanking
markers in the method of Xu andAtchley
(1995).
For similar reasons,composite
intervalmapping
(Zeng, 1994)
andmultiple QTL
map-ping
(Jansen, 1993),
which are based on the inclusion of markers ascofactors,
arenot suitable for the
analysis
ofmulti-generational pedigrees.
REML
analysis
under thetwo-QTL
modelyielded
fairly
accurateparameter
estimates.Map
distance between theQTLs
wasoverestimated,
withdecreasing
study,
the REMLanalysis
was robust to the number of alleles at theQTLs,
as there wasrelatively
little difference inparameter
estimates and likelihood ratio statisticsbetween
designs generated
with the normal-effects and biallelicQTL
models, given
the number ofreplicates performed.
Previouslinkage analyses
(eg,
Knott andHaley,
1992;
Haley
etal,
1994)
lead to the conclusion that a minimum distance of 20 cMwas
required
between linkedQTLs
for theirseparate
detection. This result wasconfirmed in the
present
study. However,
discrimination among different numbersof
QTL
(eg,
one versustwo)
requires
additionalresearch,
including
aninvestigation
of alternative
approaches
such as anadaptation
ofcomposite
intervalmapping
topedigree analysis.
Gains in power fromfitting
additional unlinkedQTL
and selection ofQTL
to be included in theanalysis
are relatedtopics warranting
further research.If there is
linkage disequilibirum
due toselection, QTL positions
will still be estimatedaccurately,
while variance estimates and power may be affected.Accounting
fordisequilibrium
in theanalysis
should beinvestigated.
ACKNOWLEDGMENTS
This research was
supported by
Award No 92-01732 of the National Research InitiativeCompetitive
GrantsProgram
of the USDepartment
ofAgriculture, by
the Holstein AssociationUSA,
andby
ABS Global Inc.REFERENCES
Berger
RL(1996)
Likelihood ratio tests and intersection-union tests. In: Advances in Statistical DecisionTheory
(N
Balakreshnan and SPanchapahesan,
eds),
Birkhauser,
Boston(in press)
Bulmer MG
(1985)
The MathematicalTheory of Quantitative
Genetics. ClarendonPress,
OxfordCantet
RJC,
Smith C(1991)
Reduced animal model for marker assisted selectionusing
best linear unbiasedprediction.
Genet SelEvol 23,
221-233Casella
G, Berger
RL(1990)
StatisticalInference.
Wadsworth &Brooks/Cole,
Advanced Books &Software,
PacificGrove,
CAChurchill
G, Doerge
R(1994)
Empirical
threshold values for quantitative trait mapping.Genetics 138,
963-971Da
Y,
RonM,
YanaiA,
BandM,
EvertsRE,
Heyen DW,
WellerJI,
Wiggans GR,
Lewin HA
(1994)
TheDairy
Bull DNARepository:
A resource formapping
quanti-tative trait loci. Proc 5th World
Congr
GeneticsAppl
Livest Prod 21, 229-232 FernandoRL,
Grossman M(1989)
Marker-assisted selectionusing
best linear unbiasedprediction.
Genet Sel Evol 21, 467-477Goddard M
(1992)
A mixed model foranalyses
of data onmultiple genetic
markers. Theort
l
pPl
Genet83,
878-886Gotz
KU,
Ollivier L(1994)
Theoretical aspects ofapplying
sib-pair linkage
tests tolivestock species. Genet Sed Evol 24, 29-42
Grignola
FE,
HoescheleI,
Tier B(1996a)
Mapping quantitative
trait loci via residual maximum likelihood: 1.Methodology.
Genet SelEvol 28,
479-490Grignola FE,
HoescheleI, Zhang Q,
Thaller G(1996b)
Mapping quantitative
trait loci via residual maximum likelihood: 1. A simulationstudy.
Genet Sel Evol 28, 491-504Haley CS,
KnottSA,
Elsen J-M(1994)
Mapping
quantitative trait loci in crosses betweenHaseman
JK,
Elston RC(1972)
Theinvestigation
oflinkage
between a quantitative trait and a marker locus. Behav Genet 2, 3-19Hoeschele I
(1993)
Elimination of quantitative trait lociequations
in an animal modelincorporating genetic
marker data. JDairy
Sci76,
1693-1713Hoeschele
I,
UimariP, Grignola FE, Zhang Q, Gage
KM(1996)
Statisticalmapping
ofpolygene
loci in livestock. Proc Int Biometrics Soc(in
press)
Hospital F,
Chevalet C(1996)
Interactions ofselection, linkage
and drift in thedynamics
ofpolygenic
characters. Genet Res Camb 67, 77-87Jansen RC
(1993)
Intervalmapping
ofmultiple quantitative
trait loci. Genetics135,
252-324Knott
SA, Haley
CS(1992)
Maximum likelihoodmapping
of quantitative trait lociusing
full-sib families. Genetics 132, 1211-1222Lander
ES,
Botstein D(1989)
Mapping
Mendelian factorsunderlying
quantitative traitsusing
RFLPlinkage
maps. Genetics 121, 185-199Meyer
K(1989)
Restricted Maximum Likelihood to estimate variance components for animal models with several random effectsusing
a derivative-freealgorithm.
Genet Sel Evol 21, 317-340Meyer K,
Smith SP(1996)
Restricted Maximum Likelihood estimation for animal modelsusing
derivatives. Genet Sel Evol 28, 23-50Schork NJ
(1993)
Extendedmulti-point identity-by-descent analysis
of humanquanti-tative traits:
Efficiency,
power, andmodeling
considerations. Am J Hum Genet 53,1306-1319
Thaller
G,
Hoeschele I(1996a)
A Monte Carlo method forBayesian analysis
oflinkage
between
single
markers andquantitative
trait loci: I.Methodology.
TheorAppl
Genet93, 1161-1166
Thaller
G,
Hoeschele I(1996b)
A Monte Carlo method forBayesian analysis
oflinkage
betweensingle
markers and quantitative trait loci: I. A simulationstudy.
TheorAppl
Genet93, 1167-1174
Uimari
P,
ThallerG,
Hoeschele I(1996)
The use ofmultiple
linked markers in aBayesian
method formapping quantitative
trait loci. Genetics143,
1831-1842Uimari
P,
Hoeschele I(1996)
Mapping
linked quantitative trait loci withBayesian analysis
and Markov chain Monte Carloalgorithms.
Genetics 146, 735-743VanRaden
PM, Wiggans
GR(1991)
Derivation, calculation,
and use of national animal model information. JDairy
Sci74,
2737-2746Wang
T, FernandoRL,
van der BeekS,
Grossman M(1995)
Covariance between relatives for a markedquantitative
trait locus. Genet SelEvol 27,
251-274Weller JI
(1986)
Maximum likelihoodtechniques
for themapping
andanalysis
ofquanti-tative trait loci with the aid of
genetic
markers. Biometrics 42, 627-640Weller
JI,
KashiY,
Soller M(1990)
Power ofdaughter
andgranddaughter designs
fordetermining linkage
between marker loci andquantitative
trait loci indairy
cattle. JDairy
Sci73,
2525-2537Whittaker
JC, Thompson R,
Visscher PM(1996)
On themapping
ofQTL by regression
ofphenotype
onmarker-type. Heredity
77, 23-32Xu