HAL Id: hal-00893808
https://hal.archives-ouvertes.fr/hal-00893808
Submitted on 1 Jan 1989
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Probability statements about the transmitting ability of
progeny-tested sires for an all-or-none trait with an
application to twinning in cattle
J.L. Foulley, S. Im
To cite this version:
Original
article
Probability
statements
about the
transmitting
ability
of
progeny-tested
sires
for
anall-or-none
trait with
anapplication
to
twinning
in
cattle
J.L.
Foulley
S.
Im2Institut National de la Recherche
Agronomique,
station de génétique quantitative etappliquée, centre de recherches de Jouy, 78350 Jouy-en-Josas;
Institut
National de la Recherche Agronomique, laboratoire de biométrie, centre de recherches de Toudouse BP 27, 31326 Castanet-Tolosan Cedex, France(received 20 September 1988, accepted 17 April
1989)
Summary - This paper compares three statistical
procedures
formaking probability
statements about the true transmitting ability of
progeny-tested
sires for an all-or-nonepolygenic trait. Method I is based on the beta binomial model whereas methods II and III result from Bayesian approaches to the threshold-liability model of Sewall Wright.
An application to lower bounds of the transmitting ability of superior sires with a high
twinning
rate in theirdaughter
progeny is presented. Results of different methods are ingood
agreement. Theflexibility
of these different methods with respect to more complexstructures of data is discussed.
genetic evaluation - all-or-none traits - beta binomial model - threshold model
-Bayesian methods
Résumé - Enoncés probabilistes relatifs à la valeur génétique transmise de pères testés sur descendance pour un caractère tout-ou-rien avec une application à la
gémellité chez les bovins. Cet article compare 3 procédures statistiques en vue de la
formulation d’énoncés probabilistes
relatifs
à la valeur génétique transmise de pères testéssur descendance pour un caractère polygénique tout-ou-rien. La méthode I repose sur le modèle bêta binômial alors que les méthodes II et III découlent d’approches bayésiennes du modèle à seuils de S. Wright. Une application concernant la borne
inférieure
de la valeur génétique transmise de pères d’élite présentant un taux degémellité
élevé chez leursfilles est présentée. Une bonne concordance des résultats entre méthodes est observée. La
flexibilité
de cesdifférentes
méthodes vis-à-vis de structures de données plus complexes estabordée en discussion.
INTRODUCTION
Genetic evaluation for all-or-none traits is
usually
carried out via Henderson’s mixed modelprocedures
(Henderson, 1973)
having
optimum properties
for the Gaussian linear mixed model. Eventhough
a linearapproach taking
into account somespecific
features of binomial or multinomialsampling procedures
can be worked out inmulti-population analysis
(Schaeffer
&Wilton, 1976;
Berger
&Freeman, 1976;
Beitler &Landis, 1985;
Im etad.,
1987),
these methods suffer from severe statisticaldrawbacks
(Gianola,
1982;
Meijering
&Gianola,
1985;
Foulley,
1987).
Especially
asdistribution
properties
ofpredictors
and ofprediction
errors are unknown forregular
orimproved
Blup
procedures applied
to all-or-nonetraits,
it would therefore bedangerous
to baseprobability
statements on the property of a normalspread
ofgenetic
evaluations or of truebreeding
valuesgiven
the estimatedbreeding
value.The aim of this paper is to
investigate
alternative statistical methods for that purpose.Emphasis
will beplaced
onmaking
probability
statements abouttrue
transmitting
ability
(TA)
ofsuperior
siresprogeny-tested
for somebinary
characteristic
having
a multifactorial mode of inheritance. Numericalapplications
will be devoted to sires with a
high twinning
rate in theirdaughter
progeny.METHODS
The methods
presented
here are derived from statistical sire evaluationprocedures
which are based on
specific
features of the distribution involved in thesampling
processes of such
binary
data. Three methods(referred
to asI,
II andIII)
will bedescribed in relation to recent works in this area. The first method is based on the
beta binomial model
(Im, 1982)
and the two other ones onBayesian approaches
(Foulley
etal.,
1988)
to thethreshold-liability
model due toWright
(1934
a andb).
All three methods assume a conditional binomial distribution B
(n,
T
r)
ofbinary
outcomes(i.e.
n progenyperformance
of asire)
given
the true value 7r of aprobability
parameter(here
the sire’s truebreeding
value ortransmitting
ability).
The three methods differ in
regard
to themodelling
of 7ritself,
eitherdirectly
(beta
binomial)
orindirectly
(threshold-liability model),
andconsequently
indescribing
theprior
distributions of parametersinvolved,
v.i.z. !r itself or location parameters on anunderlying
scale.Method I ’
Let yij = 0 or 1 be the
performance
of thejth
progeny( j
=1, 2, ...,
n
i
)
out of the ith sire(i
=1, 2,
..., q)
andn
i unrelated dams.
Let
J
r
i
designate
the truetransmitting
ability
(
7
r
i
)
of sire i.A
priori,
the7r
i
’s
are assumed to beindependently
andidentically
distributedConditional on true value -7ri, the distribution of
binary
responses among progenyof a
given
sire is taken as binomial B,
i
(n
7
r
i
).
These distributions areconditionally
independent
among sires so that the likelihood is theproduct
binomialwhere the circle stands for a summation over the
corresponding
subscript
(here
y2o =
Ej
Y
ij)
andcapital
letters indicate random variables.As the
prior
and the likelihood areconjugate
(Cox
&Hinkley,
1974,
p.308),
theposterior
distribution remains in the betafamily
and can be written as:! , ¡’ , ... -- I
with
normalizing
constantThe
density
in(3)
is aproduct
of qindependent
beta densities.Ignoring
subscripts,
theposterior
density
for agiven
sire is:This Beta distribution B
(a, b)
can beconveniently expressed
with areparame-terization in terms of a
prior
mean 7r,, =a/(a
+!3),
an intra class correlationp
b =
(a +,3 +
1)-
1
and the observedfrequency
p =y
o/
n.
The conditional distribu-tion of 7rgiven
n, p, 1r and pb isthen
with
expectation
which will be noted 7r so as to reflect both its
interpretation
as aBayesian
estimatorof 1
f as well as its
equivalence
with the best linearpredictor
or selection index(Henderson, 1973).
Using
7r andletting
a
b
=pb
1
-
1 withÀ
b
interpretable
as aratio of within to between sire components of
variances,
the distribution in(4b)
can be viewed as a function of n, 7r and
A
b
,
thatis,
conditional on n,!6
and1
?
,
thedistribution of 7r is:
with
expectation
and variance
Probability
statements about true values of TAgiven
the data(n
and p or7i’)
expressions
(4b)
or(5)
of theposterior
density
of -7r. Notice that formula(6a)
also represents theprobability
of responsePr(Y
ik
=1
ni, p
i
)
for a future progeny(k)
out of sire(i)
with an observedfrequency
of response pi in nioffspring.
These
probability
statements can be made forspecific
siresgiven
their progenytest data
(n,
p or11’)
and the characteristics of thecorresponding population,
suchas the mean incidence 1fand the intraclass coefficient pb as a parameter of
genetic
diversity.
To allow forcomparisons
amongmethods,
this pb, orequivalently
theratio
A
b
,
will beexpressed
according
to Im’s(1987)
results which relate intraclasscoefficients
on thebinary
(p
b
)
andunderlying
(p)
scales in apopulation
in whichthe incidence of the trait is 7r,,
(see
nextparagraph).
In the case of
twinning,
interest isusually
insuperior
sireshaving
estimatedtransmitting ability
(ETA)
values above the mean 7ro. Attention will then bedevoted to the lower TA bound 1fm which is exceeded with a
probability
ai.e,
to 1fm, such that:This involves
computing
x E[0, 1]
values of the so-calledincomplete
betafunction defined as, in classical notations
Details about numerical
procedures
used to that respect aregiven
inappendix
A.In
addition,
moregeneral
results can beproduced
for instance in terms of(n, 1
?
)
values such that formula
(7)
holds forgiven
values of a&dquo;t(TA
lowerbound)
and a(probability level):
seeappendix
A.Method 11
This method is derived from
genetic
evaluationprocedures
for discrete traitsintroduced
recently by
several authors. All theseprocedures postulate
theWright
threshold
liability
concept. We restrict our attention here toBayesian
inferenceapproaches proposed independently by
Gianola &Foulley
(1983),
Harville & Mee(1984),
Stiratelli et al.(1984)
and Zellner & Rossi(1984).
Although
themethodology
is verygeneral
vis-a-vis data structures, for the sakeof
simplicity
only
itsunipopulation
version(p
model)
will be considered in thispaper.
Let
l
ij
be aconceptual
underlying
variable associated with thebinary
responsey2! of the
jth
progeny of the ith sire. The variable12!
is modelled as: .where 1/i is the location parameter associated with the
population
of progeny outof sire i and the
e
ij
’s
are NID(0,
o,’)
within sire deviations.Conditional on qj, the
probability
that a progenyresponds
in one of the twoexclusive
categories
coded[0]
and[1]
respectively
is written as:where T is the value of the
threshold,
a the within sire standard deviation andIt is convenient to put the
origin
at the threshold and set ue tounity,
i.e.&dquo;standardize&dquo; the threshold model
(Harville
&Mee,
1984)
the
expression
for 7ri[o] can be written asand that for 7ri[l] as:
In what
follows,
and tosimplify
notation,
jpjrJ will be referred to as 7ri .Letting
tt=fail
be an(q
x1)
vector, a natural choice for theprior
distributionof
IL
underpolygenic inheritance,
is:where A is
equal
to twice Malecot’sgenetic relationship
matrix for the q sires(A
= I in methodI),
U2
is
the sire component of variance andp
o the
general
phenotypic
mean in theunderlying
scale.These parameters po and
u2
are linked to the overall incidence 7r via:- . - .. ,
or,
equivalently,
defining
Q2 =0
-; +
U2
with
Q
e
=
1,
Similarly,
the sire varianceQ!6
in thebinary
scale can be related to theunderlying
distribution via
where
!2(x, y; r)
is the standardized bivariate cumulativedensity
function withmean 0 and correlation
r, jl
p
o
j
+Q
u)i/2
and p in(13b)
is the intraclasscorrelation coefficient p =
a;/a2.
The variance in(13b)
can be obtaineddirectly by
a
probability
argument or as the limit of a formulagiven by Foulley
et al.(1988)
forthe variance of the observed
frequency
pi when the progeny group size n tends toinfinity.
Notice also that it differs from the classicalexpression
<p2(jí,)a;
proposed
by
Dempster
& Lerner(1950), 0(.)
designating
the standardized normaldensity
function,
which is a first orderTaylor expansion
of(13b)
about p = 0.where
z!
is a(1
xm)
row vectorhaving
1 in the ith column and 0 elsewhere.The
logposterior
density
L(p.;
y, !Co,a2)
can be minimized with respect to p.by
a
scoring
algorithm
of thegeneral
formThe value
of t
L
in the t-th iteration can becomputed by solving
the non-linearsystem
where W and v are an
(n
xn)
diagonal
matrix and an(n x 1)
vectorrespectively
having
elementsDefine A =
Œ; / Œ;
=1/ Œ;
and u= (L -p
o
l
as an(m x 1)
vector of sire deviations. Then the system to be solvedA
IL
= A u becomesAn
interesting
feature of theposterior
distribution in(14)
is itsasymptotic
normality
where
(.1.
*
is the mode of theposterior density
of
IL
in(14)
and I(IL)
is defined aslim
[I (
E
t) /
no!-1
can bereplaced
for test statisticsby
a consistent estimator such as-no[I(!*)!-1
where I(t
L
)
is evaluatedat t
L
_t
L
*
.
The variance of the
limiting
normal distribution isusually
taken to be(Berger,
1985,
p.224)
where
wi
stands
for wi evaluated at the mode/-Li.
*Letting
pm _<I>-l (
1I
&dquo;
m
)
be the parameter value in theunderlying
scalecor-responding
to 1I&dquo;m in(7),
theprobability
that the true sireTA,
pz of sire i(ETA
=tt*)
exceeds tt
(or
equivalently
!r >1I
&dquo;
m
)
can beexpressed
asFor
given
values of n, p(or 7
?
)
andÀb( 7f 0,
011
),
thisprobability
can becomputed,
given
7fm, andcompared
to thecorresponding probability
level obtained with thebeta binomial model.
Alternatively,
one can determine the lower bound 7f m such that Pr(
7
r
i
>7fm)
= afixed,
by
taking
pm =f
-Li -
Î
;/
2
ép-l(0:).
Notice that
computing
theprobability
in(20),
based
on theposterior
distributionof the true
TA,
f
(p
n,
p, po,A)
isequivalent
tocomputing
Pr(
7
r
>7fm
)
overthe distribution of the
probability
of response 7r =4)(,U)
for a future progeny of sireshaving
an ETAequal
tott
*
and a true TA distributedaccording
to(19).
This distribution in the observed scale wouldprobably
be moreappealing
forpractitioners.
This isespecially
clear as far as ETA’s are concerned and one mayalternatively
tott
*
,
consider as a sireevaluation,
theexpectation
of !r =!(!)
withrespect to the
density of
A
in(19),
say!!2.
Thisexpectation
is:However,
the whole distribution of 7r =4b(/,t)
remains less tractablenumerically
than thatof p
in(19)
due to itsfollowing
form:Method III
This method is also derived from the threshold
liability
model butemploys
asymptotic properties
at an earlierstage.
Let us
consider,
aspreviously,
the observedfrequency
of response pi in niprogeny of sire i.
Conditionally
to the trueTA,
),
i
r
7
(
pi has anasymptotic
normaldistribution,
i.e.:The normit transformed
of p
i
,
mi = 14D-(p
i
)
has a conditional distribution whichis also
asymptotically
normal.Following
a classical theorem inasymptotic
theory
(see
forinstance,
formula6a.23,
page 386 inRao,
1973)
andknowing
that:one
has, given
aiAssuming
as in section II Bthat,
apriori,
thef-Li’S
are i.i.d !N(p.!, o
D
leadsThe
expectation
(jiz)
and variance(ci)
of the distribution in(25)
can beeasily
expressed analytically
as(see
for instance Cox &Hinkley, 1974,
formula22,
page373).
with
Alternative formulae can be derived so as to mimic usual selection index
expressions,
which are,ignoring subscripts
... ,- - !
where CD is
given by
the usual formula for the coefficient ofdetermination,
i.e. CD=
n/n
+k)
where the scalar k is defined as:In method II and with a definition of CD restricted to var
(f-Lly,a-;) = (1- C D)a;,
this coefficient is:
Notice that k can be
interpreted
as in selection indextheory
as the ratio of awithin sire to a sire
variance,
sincep(l -
p)/(p2(.)
is theasymptotic
variance of the normit transformation of thefrequency
of response inprogenies
of agiven sire,
conditionally
on the sire’s trueunderlying
TA.Substituting
jiz
and ci for(26a
andb)
or(27
a andb)
forp*
and Iirespectively
in
(20)
enables us either to determine 1rm for agiven
aprobability
levelknowing
po
,
ufl
(or
equivalently
7r,, and!),
p and n, or to compute theprobability
level asuch that p.i exceeds a
given
thresholdAgain,
the distribution of 7r =iI?(f-L)
inthe observed scale
corresponding
to thedensity of p,
in(25)
can be obtainedusing
the formula
(22).
Itsexpectation
has the sameexpression
as in(21) with Îii
and cireplacing f
-
L
i
and
7zrespectively.
NUMERICAL APPLICATION
Procedures described in this paper are
applied
to theproblem
ofscreening
superior
sires with a
high
twinning
rate in their female progeny.There are
relatively large
differences among cattlepopulations
with respect to theprevalence
-7r,, oftwinning
(see
for instanceMaijala
&Syvajarvi,
1977).
Twovalues of 7r,,, say 2.5 and 4% are considered in this
application.
These values of
twinning
rate are those observed in 2nd and mature(3rd
to7th)
calvings respectively
in such breeds as Charolais andMaine-Anjou
(Frebling
etal.,
1987).
The first value of or1(0.025)
can be viewed as a progeny test of youngof its
extremely
low rate(0.007).
The second value(0.04)
is an illustration of anevaluation system of service sires based on mature
calvings.
Genetic variation for occurrence of
multiple
births in cattle was assumed tohave an
heritability
coefficient in theunderlying
scaleequal
to0.25,
according
toestimates
published
by Syrstad
(1974).
In thisstudy,
h2 values of theunderlying
continuous variable arehigher
than those in the observed scaleand, especially
morestable over
parities
astheoretically expected
for suchbinary
traits.Using
this valuein
(13a
andb)
leads to !oequal
to -2.0242 and -1.8081, and h2 in thebinary
scaleequal
to 0.0394 and 0.053 for 7!-oequal
to 0.025 and0.04,
respectively.
Notice,
asalready
pointed
outby
Im(1987)
that h2 valuesreported
here areslightly higher
than those which would be obtained
(0.0350
and0.0483)
from the classical formula ofDempster
& Lerner(1950).
First
application
The first
application
deals with 5specific
sires known for theirhigh twinning
ratein
daughters.
Table I shows lower bounds(
TT
m)
of the TA intwinning
of these 5 siresknowing
their progeny testperformance
(!,, p)
in maturecalvings
at 2 differentprobability levels,
a = 0.90 and 0.95. In both cases, results are ingood
agreement
across methods
with,
asexpected
due toasymptotic approximations,
higher
values of 7r mbeing
obtained with method IIand,
to alarger
extent with III. Differences between I andII,
however,
are of littleimportance
given
thelarge
values of n.Differences among methods are also reflected in ETA values on the observed scale
with values for methods II and III very
slightly
lessregressed
towards the meanincidence 7r = 0.04 than in method I. On the other
hand,
this is agood
example
of a
change
ofranking
according
to criteria used. B and C have close ETA and !r,&dquo;,,values
although they
differlargely
infrequency
(p).
E is lower than D in p, close toD in ETA but
larger
in Tfm due to a greater number of progeny.Second
application
For the two 7r
frequencies,
tables II and III show the values of progeny group size(n)
whichprovides
an a = 0.9probability
level for different combinations of !r&dquo;t,the TA taken as a minimum and
7i’,
the ETA valueaccording
to formulae(5b)
and(6).
Minimum TA and ETA values wereexpressed
in percent as well as, forpractical
purposes, deviations from 7r in Qub units(uu
b
= 1 standard deviation in true TA in the 0 —1 scale).
Results are shown in tables II and III for !&dquo;,,varying
from +0.25 to +2.75 Qub and ETA from +1.00 to 3.00 oub with an
elementary
increment of 0.25.
The
higher
theETA’s,
the lower are progeny numbers for agiven
value ofTfm-For
instance,
for 7ro = 0.025 and !r&dquo;,, = 0.048(or
equivalently
+1.50o-it
t
,)
progenynumbers
providing
a 0.9probability
level are5199, 1291, 546, 279,
152 and 81when the ETA goes from + 1.75 to 3.00 uub.
Corresponding
figures
are3631, 895,
375, 188,
100 and 51respectively
when 7r
o
=0.04,
i. e. about 60-70% ofprevious
quantities.
Thisspecial
case is illustrated for 7r = 0.025 infigure
1 with agraph
of the beta
prior
density
andposterior
distributionscorresponding
to 7rm = 0.048For a
given
ETAvalue,
the closer this value to 7r,n, thehigher
the progeny group size needed to reach aprobability
level of 0.9. For such a difference(ETA
minus7
r
m
)
taken as fixed in uubunits,
variations in the progeny number n are rather lesspronounced.
These variations(An)
in n areproportional
to variations(A£)
in ETA within the range of values considered. Forinstance,
for 7? - 7r m = 0.75 uub, An = 504 - 330 = 174 when 7? varies from + 1.00 to + 2.00uu b
at 7r, = 0.025 and
An = 671 - 504 = 167 with 7?
going
from + 2.00 to + 3.00( TUb
.
Clearly,
thesevariations result from the
dependency
of theposterior
variance on ETA as reflectedin formula
(6b)
contrarily
to whathappened
in the Gaussian linear model. For
?!-! = 0.025 and h2= 0.25 and a =
0.9,
73 to167,
157 to325, 330
to671,
904 to 1663 and 4041 to 7062 progeny arerequired
to exceed a minimum TA valueequal
toETA minus
1.25, 1.00, 0.75, 0.50
and 0.25 (TUb,respectively. Corresponding figures
can be found in table III for 7r = 0.04.Coming
back to the case oftwinning,
practical
interest will be for 7rmvalues around +1.50 Qub. Thiscorresponds
to sireshaving
an ETAequal
to 2.25 to 2.50 (TUb(A
difference ETA -7rm of 1.00 to 1.25 Qu
b seems reasonable in
practice).
A progeny number of 279 to 546 and 188 to 375is needed
depending
on 7? - 7r! for 7r... = 0.025 and0.04,
respectively.
For each n, 7rnt,
ETA,
)
o
r
7
(
b
À
combination,
thecorresponding
probability
thatof
fact,
methodsI,
II III can becompared only
whenusing
the same amount ofinformation,
v.i.z. the same n and p values on the onehand,
and the sameprior
expectations
and variances(formulae
13a andb)
on the other hand.Probability
values for methods II and III are shown in Tables II and III.
The agreement between the 3 methods is
generally
good, especially
between the beta, binomial model and theBayesian approach
of the threshold model. For 71&dquo;m < +2.00 and
ETA <
+2.25 uub , the difference between the 2probabilities
neverexceeds 0.01. As
pointed
outpreviously
(first application),
distributionsemployed
to make
probability
statements in II and III are bothasymptotically
normal onthe
underlying scale,
andconsequently
underestimate the realposterior
varianceand overestimate the
probability
that the true TA ishigher
than 71&dquo;m.Clearly,
thisdrawback is more severe for method III than method
II,
especially
forhigh
values ofthe ETA as shown in Tables II and III and in
figure
2 with thegraph
of distributionscorresponding
to Jro=0.025,
hz=0.25, n
= 38 andp =
5/38.
DISCUSSION-CONCLUSION
Adaptation
for otherpractical
situationsThe situation adressed in this paper was to make
probability
statements about thetrue TA of
&dquo;superior&dquo;
sires vis-a-vis a minimum(
71
&dquo;
m
)
value for a rareinteresting
cha,racteristic. The
procedure
described can also be viewed asmaking probability
statements about TA’s of &dquo;inferior&dquo; sires vis-h-vis an upper TA limit for a
frequent
unfavourable trait. This is
likely
to occur inpractical
animalbreeding, especially
for the so-called
secondary
traits(e.g.
fertility
ordystocia
incattle)
where selectionusually
operatesagainst
the poorest sires and dams. In some instances for suchdetrimental traits at a low
prevalence,
interestmight
be towards the occurrenceof TA’s in the lower tail of the distribution. The
procedure reported
here can beeasily
accomodated for this case inchanging
theinequality sign
in formula(7)
andin
taking
anopposite
argument for«1>(.)
in(20).
Threshold
liability
concept
and beta binomialapproaches
In the
example
studied and within the range of parametersconsidered, probability
statements madeby
methods I and II were very similar asclearly
illustratedby
figure
2. This mayhelp
reconcile the beta binomialapproach praised by
statisticians(see,
among othersIm,
1982)
with the thresholdliability
concept put aheadby
quantitative geneticists
especially
thoseworking
in humangenetics
(see
for instance Curnow &Smith,
1975;
Falconer,
1965;
Fraser,
1980),
even undercomplex
segregation
patterns(Lalouel
etal.,
1983).
However,
the thresholdliability
model offers moreflexibility
than the beta binomial. Inparticular,
it can beeasily adapted
to mixed model structuresinvolving
amultipopulation analysis
as well as severalrandom sources of variation.
Foulley
et al.(1988) highlighted
how inferential issues in such structures can be handled from a unifiedperspective using
theBayesian
paradigm.
Thus,
taking
into account several records per animal(e.g.
multiple
vssingle
births at differentparities)
can be achieved with thismethodology using
with
possible
missing
data patterns(Foulley
&Gianola,
198G)
as well as correlatedinformation on normal continuous traits
(Foulley
etal.,
1983).
Unfortunately,
the beta binomial model in its present state ofdevelopment
remains confined to asingle
population
analysis
with one random factor(Williams, 1988).
Gilmour’s
approach
Although
theapproach
of Gilmour etal.,
(1985)
to the threshold model has itsown
rationality
via its connection to themethodology
ofgeneralized
linear modelsand
quasi-likelihood,
itsjustification
forpredicting breeding
values ascompared
toBayesian
methods is stillquestionable
(Foulley
etal., 1988;
Knuiman &Laird,
1988).
This isespecially
true with respect to distributionproperties
of TApredictors
or those of
prediction
errors for which no formal statisticalproperties
can be claimedfor Gilmour’s method.
Distribution
properties
Distribution
properties
of the true TAgiven
the data were derived in thisstudy
under theimplicit assumption
that the beta binomial model is true in method Iand the threshold concept is also true in methods II and III. Another issue on
comparing
these methods is toinvestigate
which is the best.Ways
ofchallenging
models is a difficult
topic
which isbeyond
the scope of this paper(see
for instanceSmith,
1986).
Forcategorical
data and in an animalbreeding
context,suggestions
on how to compare non-nested models were madeby Foulley
(1987),
which arebased on the
predictive
distribution of a future data setgiven
the data observed atpresent.
Genetic
charges
The
reasoning
followedthroughout
this paper is a conditional viewknowing
theprogeny test data information on sires. This leads to
probability
statements about the true TA ofspecific
sires.Discussing
progeny group numbers for aplanned
progeny test programme can involve a different
approach
(e.g.
Curnow,
1984)
especially
whenlooking
apriori
at this issue with nospecific
progeny-test results.One could make some statements about a distributional form for ETA’s and values
of the selection differential and
point
of truncation. Aspointed
outby
Hill(1977),
genetic
change
due toselecting
superior
sires on their ETA’s under extra constraintswith respect to such factors as
inbreeding levels, testing
facilities and other costs appears to be a naturalapproach
(e.g.
Curnow,
1984).
Response
(R)
to onegeneration
ofupward
truncation selection on1
?
(1
?
>1
?s
)
is a random variable 1rhaving
in the beta binomial model thefollowing
conditionaldensity given
7T > 1rs.account, not
only
theposterior
density
of 7rgiven
progeny test results but also themarginal density
of the ETA’s andintegrate
7r out in the latter and theirproduct
over the selection space.
These are not easy
manipulations
from ananalytical
point
ofview,
especially
with the threshold model. More research is therefore needed to derive
original
results in that area.ACKNOWLEDGEMENTS
The authors are
grateful
to Mr Dan Waldron(University
of Illinois atUrbana)
and Mrs Annick Bouroche(Unite
dedocumentation,
Jouy-en-Josas)
for theirEnglish
revision of the
manuscript.
Thanks are also extented to the three anonymousreferees for their
helpful
comments.REFERENCES
Abramowitz M. &
Stegun
I.A.(1972)
Handbookof
mathematicalfunctions.
9th ed. Doverpublications,
New YorkBeitler P. & Landis J.R.
(1985)
A mixed-effects model forcategorical
data. Biometrics41,
991-1000Berger
J.O.(1985)
Statistical decisiontheory
andbayesian analysis.
2nd edition.Springer-Verlag,
New YorkBerger
J.P.,
Freeman A.E.(1978)
Prediction of sire merit forcalving
difficulty.
J.Dairy
Sci.61,
1146-1150Cox D.R. &
Hinkley
D.V.(1974)
Theoretical statisticsChapman
&Hall,
London Curnow R.N.(1984)
Progeny-testing
for all-or-none traits when a multifactorial modelapplies.
Biometrics40,
375-382Curnow R.N. & Smith C.
(1975)
Multifactorial models for familial diseases in man.J.R. Statist. Soc.
A,
138,
131-169Dempster
E.R. & Lerner I.M.(1950)
Heritability
of threshold characters. Genetics35, 212-236
Falconer D.S.
(1965)
The inheritance ofliability
to certain diseases estimated fromthe incidence among relatives. Ann. Hum. Genet.
29,
51-76Foulley
J.L.(1987)
Methodes d’evaluation desreproducteurs
pour des caractères discrets a d6terminismepolyg6nique
en s6lection animale. These d’Etat. Universitede
Paris-Orsay
Foulley
J.L. & Gianola D.(1986)
Sire evaluation formultiple binary
responses when information ismissing
on some traits. J.Dairy
Sci.69,
2681-2695Foulley
J.L.,
Gianola D. &Thompson
R.(1983)
Prediction ofgenetic
merit fromdata on
binary
andquantitative
variates with anapplication
tocalving
difficulty,
birth
weight
andpelvic opening.
Genet. Sel. Evod.15,
401-424Foulley
J.L.,
Gianola D. & Im S.(1988)
Genetic evaluation for discretepolygenic
traits inanimal
breeding.
In: Advances in Statistical Methodsfor
GeneticIm-prove!rcent
of
Livestock.February
16-20, 1987,
Armidale. Australia.Springer
Verlag
Fraser F.C.
(1980)
The William Allan Memorial Award Adress: Evolution of apalatable
multifactorial threshold model. Am. J. Hum. Genet.32,
796-813Frebling
J.,
Gillard P. & Menissier F.(1987)
Resultatspr6liminaires
surl’aptitude
naturelle a la
gemellite
dans un troupeau de vaches Charodaises etMaine-Anjou
etde leur
filles. INRA,
mimeoGianola D.
(1982)
Theory
andanalysis
of threshold characters. J. Anim. Sci.54,
1079-1096Gianola D. &
Foulley
J.L.(1983)
Sire evaluation for orderedcategorical
data witha threshold model. Genet. Sel. Evol.
15,
201-224Gilmour
A.R.,
Anderson R.D. & Rae A.L.(1985)
Theanalysis
of binomial databy
a
generalized
linear mixed model. Biometrika 72, 593-599Harville D.A. & Mee R.W.
(1984)
A mixed modelprocedure
foranalyzing
orderedcategorical
data. Biometrics40,
393-408Henderson C.R.
(1973)
Sire evaluation andgenetic
trends.Proceedings of the
animalbreeding
andgenetics symposium
in honorof
Dr. J.L. Lush. Americansociety
of animal science and americandairy
scienceassociation,
Champaign,
Illinois,
10-41 Hill W.G.(1977)
Comments on statisticalefficiency
in bull progenytesting
forcalving difficulty.
Livest. Prod. Sci.4,
203-207H6schele
I.,
Foulley
J.L. & Gianola D.(1986)
Genetic evaluation formultiple binary
responses. Genet. Sel. Evol.
18,
299-321Im S.
(1982)
Contribution d 1’etude des tables decontingence
dparam!tres
aleatoires: utilisation en biom6trie. These 3e
cycle,
UniversitePaul-Sabatier,
ToulouseIm S.
(1986)
Mutualindependence
in a mixed effects model letter to the editor. Biometrics42,
997Im S.
(1987)
A note onheritability
of a dichotomous trait. Biom. J.4,
407-411Im
S., Foulley J.L.,
& Gianola D.(1987)
A linear model forgenetic
evaluation oncategorical
traits. 82nd AnnualMeeting
of the AmericanDairy
Science Association ColumbiaMissouri,
June 21-24. J.Dairy
Sci. 70,Suppl.
1,
124 abstractJohnson N.L. & Kotz S.
(1972)
Distributions in Statistics Continuous UnivarianceDistributions,
vol. 3. JohnWiley
&Sons,
New YorkKnuiman M.W. & Laird N.M.
(1988)
Parameter estimation in variance componentmodels for
binary
response data. In: Advances in Statistical Methodsfor
GeneticImprovement
of
Livestock.February
16-20, 1987,
Armidale. Australia.Springer
Verlag
(forthcoming)
Lalouel
M.,
RaoD.C.,
Morton N.E. & Elston R.C.(1983)
A unified model. J. Hum. Genet.35,
816-826Maijala
K. &Syvajarvi
(1977)
On thepossibility
ofdeveloping
multiparous
cattleby
selection. Z. Tierz.Zuechtungsbiol.
94,
136-150Meijering
A. & Gianola D.(1985)
Observations on sire evaluation withcategorical
data
using
heteroscedastic mixed linear models. J.Dairy
Sci,
68,
1226-1232 Pearson K.(1968)
Tablesof
theincomplete
betafunction.
2nd ed.Cambridge
University Press, Cambridge
Rao C.R.
(1973)
Linear statisticalinferences
and itsapplications.
2nd ed. JohnWiley
&Sons,
New YorkSchaeffer L.R. & Wilton J.W.
(1976)
Methods of sire evaluation forcalving
ease.Smith A.F.M.
(1986)
SomeBayesian
thoughts
onmodelling
and model choice. The Statistician35,
97-102Stiratelli
R.,
Laird N. & Ware J.H.(1984)
Random effects models for serial observations withbinary
response. Biometrics40,
961-971Syrstad
O.(1974)
Inheritance ofmultiple
births in cattle. Livest. Prod. Sci.11,
373-380
Williams D.A.
(1988)
Extra binomial variation intoxicology.
XIVth International biometricconference, Namur, July
18-23, 1988,
invited papers, 301-316Wright
S.(1934a)
Ananalysis
ofvariability
in number ofdigits
in an inbred strainof
guinea pig.
Genetics19,
506-536Wright
S.(1934b)
The results of crosses between inbred strains ofguinea pigs
differing
in number ofdigits.
Genetics19,
537-551Zellner A. & Rossi P.E.
(1984)
Bayesian analysis
of dichotomousquantal
responsemodels. J. Econ.
25,
365-393APPENDIX A: ALGORITHM FOR SOLVING
EQUATION
(7)
.Analytical
approximations
to the inverse function of theincomplete
data functionare
proposed
in calculus books: see, forinstance,
formula26.5.22,
page 945 inAbramowitz &
Stegun (1972).
Toimprove
the accuracy of theseapproximations,
weemployed
analgorithm
based on finite differencetechniques.
It consists initerating
from
(t)
to(t
+1)
withStarting
values are xo =0;
(a, b)
o
I
= 0 and xl = value of the inverseprobability
functiongiven by
e.g. Abramowitz& Stegun.
In this
equation,
theincomplete
beta function is calculated from Peizer andPratt’s
approximation
as recommendedby
Pearson(1968,
formulae47, 48,
49 pageXXX)
and Johnson & Kotz(1972,
vol.III,
pp.48,
49).
For the values of a andb encountered in this
study,
thisapproximation
is very accurate; the differenceItrue-approximated values)
is less than10-
3
according
to theprevious
authors.Computationally, solving
equation
(7)
in terms of(n,1?)
requires
obtaining
theparameter a =
(n
+’x
b
)1?
of theincomplete
beta functionI
z
(a, b)
= 1 - a,given
x values of a, x and the ratio
a/b
=!r/(1 -
7r).
This can be worked out with analgorithm
similar to[Al]
in which a is substituted for x.For the sake of