HAL Id: hal-00894167
https://hal.archives-ouvertes.fr/hal-00894167
Submitted on 1 Jan 1997
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Detection of cytoplasmic effects on production: the
influence of number of years of data
A Salehi, Jw James
To cite this version:
Original
article
Detection
of
cytoplasmic
effects
on
production:
the influence of number
of
years
of data
A
Salehi,
JW James
Department of
Wool and AnimalScience,
TheUniversity of
New SouthWales,
Sydney
2052, Australia(Received
17January 1995;
revised 6January 1997, accepted
3 March1997)
Summary - Data were simulated for a
single
trait determinedby
additivegenetic,
cytoplasmic
and environmental effects in asheep
flock. Theproportions
of variance dueto genotype and
cytoplasm
wereh
2
= 0.30 or 0.60 andc
2
= 0.05 or0.10,
with an extra set in whichh
2
= 0.30 andc
2
= 0. For all five parameter sets, twoperiods
(10
or 20years)
were
considered,
with the same number of records in each case. Tenreplicates
of each were run so that 100independent
data sets were obtained. A ten-year data set with half the number of animals was also run,giving
150analyses
in all. Data sets wereanalysed by
derivative-free restricted maximumlikelihood, fitting
fourmodels,
which included additivegenotype alone or in combination with either or both an additive
genetic
maternal effectand a
cytoplasmic
effect. If thecytoplasmic
effect were omitted from theanalysis model,
heritability
and maternal variance were overestimated. Ifcytoplasmic
effects wereincluded,
parameter estimates were close to true values.Heritability
did not affect the power of tests forcytoplasmic effects,
which washigher
forc
2
= 0.10 than forc
2
= 0.05. Detection ofcytoplasmic
variance was morelikely
with dataspread
over 20 rather than over 10 years, and power was reduced with fewer data.cytoplasmic effect
/
restricted maximum likelihood/
animalmodel / sheep
Résumé - Détection d’effets cytoplasmiques sur des caractères de production : inci-dence du nombre des années avec données. On a simulé des données pour un
carac-tère déterminé par les
effets génétiques additifs, cytoplasmiques
et demilieu,
dans untroupeau de moutons. Les parts de variance
phénotypique
dues auxeffets génétiques
etcytoplasmiques
sont h
2
=0,30
ou0,60,
et c2
=0,05
ou0,10 respectivement,
avec unesérie
supplémentaire
oùh
2
= 0, 30 et2
c
= 0. Pour les cinq ensembles deparamètres,
onconsidère deux
périodes
(10
ou 20ans)
avec un nombreégal
des données dans les deux cas. On afait
dixrépétitions
pourchaque
situation, et donc 100fichiers
indépendants
sont obtenus.
Un fichier
couvrant 10 années avec un nombre d’animazix réduit de moitiéest
analysé
aussi, soit 150analyses
en tout. Lesfichiers
sontanalysés
par la méthode*
du maximum de vraisemblance restreinte
(REML)
avec unprocédé
sans dérivées.Quatre
modèles ont étéajustés, incluant,
soit legénotype additif seul,
soit legénotype additif
plus
l’un deseffets cytoplasmiques
ougénotype
additivematernel,
ou les deux.Quand
le modèle
d’analyse
excl!tl’effet cytoplasmique,
l’héritabilité et la variance maternelle sont surestimées.Quand
le modèle inclutl’effet cytoplasmique,
lesparamètres
estimésapprochent
leurs valeurs vraies. La puissance des tests deseffets cytoplasmiques
estplus
élevée pourc
2
=0,10
que pour c2
=0, 05,
mais elle nedépend
pas de l’héritabilité. La détectiond’effets cytoplasmiques
estplus probable
avec des donnéesréparties
sur 20 quesur 10 ans, et la puissance du test est diminuée
quand
le nombre de données diminue.effet cytoplasmique
/
maximum de vraisemblance restreinte/
modèle animal/
moutonINTRODUCTION
Although
dam and sire contributeequally
(with
theexception
of sex-linked genes in theheterogametic offspring
sex)
to thegenotype
of theiroffspring, it
isaccepted
thatprogeny
performance
is affected moreby
the dam thanby
the sire(Hohenboken,
1985).
A maternal effect may be defined as any influence on a progenyphenotype
attributable to the
dam,
other than nuclear genes. Inmammals,
milkproduction,
intrauterine environment and
parental
care are commoncomponents
of maternaleffects,
which may be bothgenetically
andenvironmentally
determined.Another
possible
source of maternal effect is thecytoplasm,
especially
mitochon-drial DNA
(mtDNA),
which ismaternally
transmitted(Wagner,
1972).
In recentyears several studies
following
that of Bell et al(1985)
havepresented
evidence for the influence ofcytoplasm
onproduction
in cattle.However,
Kennedy
(1986)
pointed
out that carefulanalysis
of data was needed to demonstrate the occurrenceof a
cytoplasmic effect,
and that inparticular
it was necessary to account for the influence of nuclear genes. An assessment of theimportance
offitting
anappro-priate
model was madeby
Southwood et al(1989).
They
simulated data with aconventional maternal
effect,
with acytoplasmic
effect,
or withboth,
in addition toan additive
genetic
effect.They
thenanalysed
the simulated datausing
models that includedonly
a maternaleffect, only
acytoplasmic effect,
orboth,
as well as anadditive
genetic
effect. When the correct model wasfitted,
theparameter
estimates matched the truevalues,
but estimates were biased when an incorrect model wasfitted.
They
simulated adairy
cowpopulation
over 60 years, andanalysed
datafrom the last 30 years, with about 4 500 records
being
used.In Australian Merino
sheep
it would be unusual to have such a dataset,
andmost data sets would extend over 10-20 years. Over such a
period,
aseparation
of
cytoplasm
and maternalgenotype
might
be more difficult to achieve. We havetherefore undertaken studies of the effectiveness of
estimating cytoplasmic
effects in suchpopulations.
Most suchpopulations
would beundergoing
selection,
unlike the random choice ofparents
simulatedby
Southwood et al(1989),
and so we have simulatedpopulations
under selection. Boettcher et al(1996)
also used simulationof a
dairy
cowpopulation
withcytoplasmic
effects but their interest was inMATERIALS AND METHODS
Data were
generated by
computer
simulation of asheep
flock with five age groups ofbreeding
ewes and two age groups of rams. It was assumed thatinitially
allbreeding
animals had beenrandomly
chosen from the basepopulation,
while thejuvenile
animals were also a randomsample
of the basepopulation. Thereafter,
afifth of the
breeding
ewes and a half of the rams were culled each year andreplaced
by
the best availablehoggets.
The basic simulation used 10 rams and 200 ewes,with data
being
simulated over 20 or 10 years. Another set of simulations used 20rams and 400 ewes for
mating
each year, with data simulated over 10 years. Each animal had an additivegenetic
value and acytoplasmic
value. These wereadded, together
with a random environmentaldeviation,
togive
thephenotypic
value.
Except
for the basepopulation,
an animal’sbreeding
value was the averageof its
parents’
breeding
values,
plus
asegregation
effect in which allowance was madefor
parental
inbreeding.
An animal’scytoplasmic
value was that of its dam. The seed for the random numbergenerator
was different for every run of the simulation so that allreplicates
wereindependent.
The10-years
data sets were simulatedseparately
from the20-years
data sets.The trait under selection was assumed to have a
heritability
(h
2
)
of 0.3 or0.6,
with
cytoplasmic
variance(c
2
)
0.05 or 0.10 as a fraction ofphenotypic
variance.Cytoplasmic
and additivegenetic
effects were uncorrelated. For aheritability
of0.3,
another series of runs withcytoplasmic
variance set to zero was carriedout,
making
a total of five models. Tenreplicates
were simulated for each model. Withthree data structures described above
(population
size,
number ofyears)
this gave150 data sets for
analysis.
The data were
analysed
using
an animal model restricted maximum likelihoodprocedure
and a derivative-freealgorithm
(Graser
etal,
1987).
The DFREMLcomputer
package
(Meyer,
1989,
1991)
wasemployed
to fit four different models tothe data. These were models
1, 2,
3 and 7 in DFREMLterminology.
The modelsare described in table I.
The
complete
model fitted to the data(model 7)
was:where y is a N x 1 vector of
observations,
b is a vector of fixed effects andX,
Z
a
,
Z
m
andZ,
are incidence matricesrelating
fixed effects(b),
random additivegenetic
effects(a),
maternalgenetic
effects(m),
andcytoplasmic
origin
effects(c)
to y,respectively,
and e is a vector of random residual effects. The variance-covariancestructure among the random effects for the full fitted model was:
with all covariances zero. Here A is the additive
genetic
relationship
matrix betweenanimals,
afl is
additivegenetic variance,
0
,2 m
is maternalgenetic variance,
a
2
is
cytoplasmic
variance,
!e is
environmentalvariance, I
c
is anidentity
matrix of orderequal
to the number ofcytoplasm origins,
andI
e
is anidentity
matrix of orderequal
to the number of records. For model 3 it was assumed that c =
0,
for model 2 that m = 0 and for model 1 that both c and m were 0. Fixed effects included in themodel were year of record and sex. A
single
record athogget
age wasanalysed.
For the
fitting
of thecytoplasmic
effect the female in the basepopulation
from whom thecytoplasm
descended was identified. The convergence criterion chosen forstopping
the estimationprocedure
was a variance of10-
7
in the likelihood values for thesimplex.
Standard errors ofparameter
estimates were obtained from variationbetween
replicates.
Theprobability
offinding significant cytoplasmic
effects wasassessed
by
use of likelihood ratio tests of model 7 versus model3, assuming
that -2 2times the difference in
log
likelihoods hadapproximately
achi-squared
distributionwith one
degree
of freedom. This test is anappropriate
way to testsignificance
of the
changes
in the likelihood function when additional terms are included asexplained by
Rao(1973).
RESULTS
Parameter estimates with their standard errors
(calculated
from variation amongreplicates)
arepresented
in tablesII,
III and IV for different data structures. The results of thelog
likelihood tests arepresented
in tables V and VI.When there was no
cytoplasmic
effect in the simulated data the estimatedheri-tability
wasalways
close to the true value. In the presence of acytoplasmic effect,
heritability
wasalways
overestimated whenc
2
was omitted from the fitted model.Estimates of
h!
and
C
2
with the modelfitting
additivegenetic
andcytoplasmic
effects
(model 2)
were all close to the true values for all data sets across all setsof
parameters,
although
a few values differed from the true valueby
more thantwo standard errors. With a model that included the maternal
genetic
effect asthe
only
additional random effect(model 3),
estimates ofh
2
were
higher
than for model 2.According
to thelog
likelihood ratio testpresented
in table V acomplex
model
(model
7)
fitting
the additivegenetic effect,
maternalgenetic
effect andthe
ability
to detect acytoplasmic
effect. The averageX
2
values
werehigher
forthe
larger
datasets,
but among thelarger
datasets,
they
werehigher
for 20 yearsrather than 10 years.
Furthermore,
tableVI,
in which the numbers ofsignificant
x
z
values forreplicates
for each data set arepresented,
also indicates that the20-year period
wasslightly
morelikely
to showsignificance
than a10-year period,
even with the same amount of data. In table V it can be seen that the averageX2
2value for 20 years is about twice that for 10 years with an
equal
amount ofdata,
and thus with smallercytoplasmic
contributions the difference in power would bemore noticeable.
DISCUSSION
some of these effects are then attributed to the additive
genetic
effect. WhenC
2
is accounted forby fitting
model2, C
2
estimates are not different from thetrue values.
Higher
estimates ofheritability
with model 3compared
to model2,
showed that with model 3 some of thecytoplasmic
effects are then attributed tothe additive
genetic
effect and most attributed to a maternal effect. Southwoodet al
(1989)
stated that if anappropriate
animal model isapplied
to thedata,
variance
components
cancorrectly
bepartitioned
among additivedirect,
additive maternal andcytoplasmic
effects. Model7,
including
the additivegenetic effect,
maternal
genetic
effect andcytoplasmic
effect was used to demonstratecytoplasmic
variation,
distinct from maternalgenetic effects,
since in real data as distinct from our simulated data we can not know that there is no maternal effect other than thatof the
cytoplasm.
Theability
to detect acytoplasmic
variancecomponent
does notof 10 years, power was lower when the amount of data was halved.
However,
for agiven
amount ofdata,
the power ishigher
if the data isspread
over 20 rather than10 years. Thus data
extending
over a number of years ispreferable,
but ifenough
records are
available,
it is worthanalysing
data from a shortperiod.
The
major
difference between 10 years and 20 years of data for the detectionof
cytoplasmic
effects is the difference in number ofgenerations
in which thecytoplasm
may becomeindependent
from nuclear genes of the founder females. Thedistributions,
averaged
over allsimulations,
ofcytoplasmic
generation
numbers areshown in table VII. Distributions were checked for different simulation models and were
virtually
identical in all casesexcept
for the difference between the numberof years. With
10-year data,
less than20%
ofcytoplasms
areseparated
from theIn this
study
there were nonon-cytoplasmic
maternal effects tocomplicate
the simulation. Further work is in progress toinvestigate
cases where agenetic
maternalACKNOWLEDGMENT
The senior author is
grateful
for ascholarship
from theMinistry
of Culture andHigher
Education of Iran for the PhDstudy during
which thisstudy
was conducted. Gratefulacknowledgment
is also made to Dr KMeyer
forproviding
the DFREML program foranalysing
the data.REFERENCES
Bell
BR,
McDanielBT,
Robison OW(1985)
Effects ofcytoplasmic
inheritance onproduction
traits indairy
cattle. JDairy
Sci68,
2038-2051Boettcher
PJ,
KuhnMT,
Freeman AE(1996)
Impact
ofcytoplasmic
inheritance ongenetic
evaluations. JDairy
Sci79,
663-675Graser
HU,
SmithSP,
Tier B(1987)
A derivative-freeapproach
forestimating
variancecomponents in animal models
by
restricted maximum likelihood. J Anim Sci64,
1362-1370Hohenboken WD
(1985)
Maternal effect. In: World Animal Science. General andquanti-tative
genetics
(AB
Chapman,
ed),
Elsevier, Amsterdam,
135-149Kennedy
BW(1986)
A further look at evidence forcytoplasmic
inheritance forproduction
traits in
dairy
cattle. JDairy
Sci 69, 3100-3105Meyer
K(1989)
Restricted maximum likelihood to estimate variance components for animal models with several random effectsusing
a derivative-freealgorithm.
Genet SelEvol 21,
317-340Meyer
K(1991)
DFREML program to estimate variance componentsby
restricted max-imum likelihoodusing
a derivativefree-algorithm.
UserNotes,
Version2.0, University
of NewEngland
Rao CR
(1973)
Linear StatisticalInference
and itsApplications.
JWiley
andSons,
NewYork, 417-420
Southwood