THE PARTIAL REGRESSION COEFFICIENTS IN PEDIGREE EVALUATION
Ahmed MOSTAGEER
Institut Iiiy Tierzucht und
Haustievgenetik
Tierd y
ztliche Hochschule, Linke Bahngasse zr, Wien
(Ostevveich)
SUMMARY
The method described by MOSTAGEER(1970) for assessing the efficiency of ancestorrecords
was used to derive the partial regression coefficients for traits expressed on both sexes and for sex-limited characters. The regular cases in which all the measurable animals in the T available
generations of ancestors provide information of the same size were first discussed. In sex-
limited characters where the number of partial regression coefficients using T generations of
ancestors is equal to 2T-1, the method presented requires only the calculation of T-1 factors from which all the exact regression coefficients could be computed by simple arithmetic.
A solution for the irregular cases in which the ancestors provide information of different heritabilities was also attempted. Included were also the cases in which the animal in question provides information, and those in which some of the ancestors are missing or having a number
of half sib progeny. A simple and exact solution was derived that requires the calculation of a
few factors from which all the regsession coefficients and the accuracy could be computed. The
number of these easily calculated factors does not usually exceed half the number of sources
providing information. Several worked out examples are presented.
INTRODUCTION
When the
pedigree
records are to be used inassessing
thebreeding
value of ananimal the number of
regression
coefficients to be calculated withrespect
to the characters measured on both sexes isequal
to the number T of ancestorgenerations
to be included in the
index,
thatis, assuming
that each animalgives
the same sizeof information. When the character is measured
only
on one sex, say thefemale,
the number ofregression
coefficients will beequal
to the number of female ancestors.In a recent paper
(M OSTAGE E R , 1970 )
the author described a method forassessing
theefficiency
of ancestorrecords,
based on aconcept suggested by
(
*
) On leave of absence from the Faculty of Agriculture, Cairo University, 2gypte.
R OB E
RTSON
(i 95 g).
Theobjective
of this work is to use this method in the calcula- tion of suchregression
coefficients. We shall first discuss theregular
cases inwhich each measurable animal in the
pedigree gives
the same size of information and then turn to theirregular
cases.Assumptions
made are that the animals arenot
inbred,
that the covariance between relatives isonly
additivegenetic
and thatobservations are taken as deviations from the
appropriate populations
means.In sex-limited characters it will be assumed that the trait is
expressed
on females.I. - THE REGULAR CASES
A. -
Equat Intormation
From Both SexesThe method described
by
MOSTAGEER(r 97 o),
forevaluating
theefficiency
ofancestor records involves the conversion of evidence on the T available
generations
of ancestors into the
equivalent nT half
sib progeny where :and h2 is the
heritability.
That is to say that!cT
is the number ofhalf
sib progeny thatgives
the same accuracy(r!)
of the estimate of thebreeding
value of the animal inquestion
as does the information on the T availablegenerations
of ancestors.To arrive at the
partial regression
coefficients to beapplied
to theperformance
ofeach animal we have first to evaluate its contribution to
nT.
The relation from which MosTAGEER
( 1970 ) developed
thegeneral equation
for
nT
was such that- I II 1 11
the contribution to
n’ of
theparents
relative to that of thepreceding generations being
ze):n ’ T -1
where z! _(q.-h 2 ) I(,-h 2 ).
From relation iwhere mT_i =
(>i§_ i + k) /(nT-y-!’-E-k).
In thisEquation ( 2 )
theparents
contribute(it,
’
,+k) (r-m T _ i ) / 2
half siboffspring
and the rest ofn§ represent
the contribution of all thepreceding generations
of ancestors. Nown T _ 1
could beput
asApplying
this toEquation ( 2 )
the contribution of the
grandparents
is then(n,+k) (m T _ 1/ 2) [(i-m T _ 2 )/2].
In
general
theparents’
contribution tonT
will be(nT-f-k) (i-m T _ 1 ) / 2
and thecontribution
T C t
ofgeneration
t of ancestors(t >i)
will be- , I
Table I contains the values of
Mj
up to 5generations
of ancestors and thelimiting
values ofnT
for 18 different heritabilitiesranging
from 0.05 to 0.90,together
with thecorresponding
accuracies. Table 2 shows the actual contribu- tions of the 5generations
of ancestors toit’ and
their values relative to5 C 5
forthe same 18 heritabilities. It is needless to stress the fact
already
shownby
SEARLE
( 19 6 3 )
that the inclusion of moregenerations
of ancestors adds less and less as h2gets higher.
Twogenerations
of ancestorsgive
almost the same accuracy as the infinitepedigree
chart when h2 --- o.go. The natm e ofEquation
3 says also that the ratio between the contribution(to nT)
of theT-t l generation
to that ofthe
T-t 2
isalways
the same for the same h2regardless
of the number T of the ancestorsgenerations (t l and t 2 having
values from o toT-r). Thus, given
thecontributions
5 C t
one can calculate the Tcomponents
ofM j when T is iess thant 5.
For
example,
the 3components
ofn’ for
h2 = 0.5o will have the ratio ro:3:i,i.e.,
out of the 2.882 half sib
daughters (= n’)
theparents
contribute10 / 14 ,
thegrand- parents 3 /rq
and thegreat grandparents r /rq.
The value
T C I
of the contribution toizT
of the t&dquo;’generation
of ancestors isprovided equally by
the 2’ ancestors of thatgeneration, i.e.,
each member contri- butes!/2’
of theT C,
half siboffspring.
Now we have to note that thegenetic
covariance between the individual’s
breeding
value and his half siboffspring
isG2/2 (where a9
is the additivegenetic variance)
while that between hisbreeding
value and any of the ancestors of
generation
i is(1/!)‘ a9.
Thus we have tomultiply
the
performance
of each of the ancestors ofgeneration
tby
2’-’ toget
it on thesame scale as the half sib progeny of the individual in
question.
So theweight given
to theperformance
of each ancestor ingeneration
t will be!/2TC, (where
1 /
2
is 2’-’ X%t).
This is then furthermultiplied by
the factor 2/ (> 1 .§ + k)
toproduce
the contribution of the ancestor’s
performance
to the estimate of thebreeding
value of the animal in
question.
Thus theregression (bt!) applied
to theperfor-
mance of the
ancestor j at generation
t will beThe values of m!
(up
tom 5 )
and thelimiting
m values for 18 different heritabi- lities are shown in Table 3. It is clear in fact that the values of m2 are almost the same as thecorresponding limiting
values of m, and this indicates that we needonly
to calculate the two values of ml and m2(beside
mo which isr-h 2 )
and to use m2 for all values of m with
subscripts greater
than 2. It has to be noted inpassing
that if we use the mean of the ancestors at eachgeneration
tthen the
regression
coefficient(b t ) applied
will be2b j
orIn this case it is clear that the
weight
in the final indexgiven
to the mean ofthe z’ ancestors of
generation
t may becomparable
to theweight given
to theparents’
mean. This is true for the lowheritability
values. Forinstance,
the ratio between theweight given
to theparents’
mean to the mean of the fifthgeneration
of ancestors(when
T =5)
isr. 9 8 :
1.70 when h2 --- 0,05, and1 . 9 6:0.8 5
when h
2 = 0.25. This ratio increases to 2.30:0.04 when h2 = 0.75.
The method of
calculating
theregression
coefficients may better be discribedon the ancestors’ map. This will be useful for further discussion. In
Figure
zawe wish to estimate the
breeding
value ofQ using
the information about the threegenerations
of ancestors. Theregression
coefficientsapplied
to each ancestorwill be the
product
of thepaths connecting
it withQ, starting
with a solid line andtracing
the broken ones afterwards towardsQ.
Thus theregression
coefficientapplied
to A(or
any member of the thirdgeneration
ofancestors)
will bethat
applied to J (or
any member of the secondgeneration)
will beand that
applied
to 0 or P will beB. - Sex Limited Characters
In sex limited characters the situation is different since each member of the t
tll
generation
of ancestors contributesdifferently. Converting
information onT-f-z generations
of ancestors into theequivalent N T + 1
half sibdaughters
MOSTA-GEER
(1970)
showed thatThis recurrent relation could be used to calculate the values of N for any number of ancestor
generations starting
withNo =
o.o. The values ofN T (up
toN
5
)
and itslimiting
values for the 18 heritabilities are shown in Table 4together
with the
corresponding
accuracies.We
proceed
to fractionN T
into itscomponent parts.
The accuracy of the estimate ofbreeding
value of the individual inquestion
isN TI (N T + k)
and thisis the sum of the two accuracies arrived at
using
theN T _ 1
half sibdaughters
of thefather
equivalent
to his ancestors’information,
andNT_1-+-r!
half sibdaughters
ofthe mother
equivalent
to her ownperformance plus
those of her ancestors. These two accuracies arerespectively.
On the maternal side the ratio between the contribution of the mother’sperformance
to that of her ancestors isw:N T _ l .
Thus of theN T /(N,,-)-!)
themother comtributes
and the rest of the
pedigree (maternal
andpaternal)
contributesNote that the contribution of the
paternal pedigree
relative to the maternalpedigree
is l:mT-l’ Now the value ofNT-l
could in turn be subdivided in a
NT_1-!-k
kmanner similar to that used for
N T j(N T+ k).
Thisoperation repeated
willgive
the
general
form for the contributionT C t
of the tt&dquo;generation (where
t >i):
Thus if we have for instance 3
generations
of ancestors the mother’s contribu- tion toN 3
will be a( l -m 2 ) / 4 ,
that of the twograndmothers
will bea(i-m l )
( l+
m 2
) /r6,
and the share of thegreat grandmothers
will bea(i-m o ) ( I + J n i ) (
l +m 2
) /6 4 ,
where a =N, + k.
Table 5 shows the contributions
(,C,)
of the fivegenerations
of ancestors toN
5
for the 18heritabilities,
and its values relativeto, C 5 .
This table as well asTable q show that the
approach
to thelimiting
value of N or r2 is slow in sexlimited characters and that the inclusion of more ancestor
generations
will be moreeffective. It may be worth
noting
that the values ofN!
are less than the compa- rablen’
in characters measured in both sexes.The value of
T C t (where
t >I )
is not dividedequally
between the 2’-’ ancestors ofgeneration
t,since,
as shownabove,
the contribution of the ancestors of the father of any individual ishigher
than that of his mother’s ancestors.In fact the value
T C t
is divided between the 2’-’ contributorsby
the factorsT-i
resulting
fromexpanding
the term II(i + m in
a way such that of each term T-rm I
of
(i + m i )
I is used if the ancestor’s descendant atgeneration
T—i is amale,
and mi if it is a female.
The contribution of each ancestor thus
defined, again
we have tomultiply
each
performance by
the factor 2tm toput
it on the same scale of the individual’s half sibdaughters.
We then canapply
the factor2 /(NT!-k)
toget
thepartial
regression
coefficient for theperformance
of each ancestor. Thus thepartial regression
coefficientsapplied
to the 2’-’ ancestors ofgeneration
t could be obtainedby expanding
the termby
the manner described fordividing T C t .
The
simplicity
of this method incalculating
thepartial regression
coefficients may better be visualised ondrawing
thepedigree
chart.Again
m2(or m 3 )
may beused for m values of
higher subscripts (see
Table6).
In
Figure z b,
thepartial regression
coefficientsapplied
to theperformances
of each ancestor will be the
product
of thepaths connecting
it with the animal inquestion (Q) starting
with a &dquo;solid &dquo; line and
tracing
the broken lines afterwards.Thus,
forexample,
theregression
coefficientsapplied
to a, c, I and P areIt is obvious that we need
only
to calculate one m-value for each ancestorgeneration
included in the index over the firstgeneration ( l -m o
=h 2 ).
The method could be used of course if all the ancestors in the
pedigree
have acertain fixed number of records. In such case, values of m relevant
to hd
are to dh2be used instead of those for h2
(where h§ = 1+(d-l)P , d = the number of re-
I + (d-I)fi
cords, and fi
= therepeatability
of thecharacter).
Since the values of thepartial regression
coefficients of ancestors on thepaternal
side could be calculated inde-pendently
from those on the maternalside,
the method described does notrequire
the same number of ancestor
generations
in bothparental
sides.Suppose,
forinstance,
that we know theperformances
of P and Monly
on the motherside,
and all the ancestors shown in
graph
zb for thepaternal
side. Theregression
coefficients
applied
to the female ancestors of the father will be the same as calcu- latedbefore,
but on the mother side we have thefollowing regression
coefficients :In other
words,
the earliestgeneration
in each of theparental
sides should start with thepath (1 I 2!°l. B
!/
It may be noted also that ancestorsof
the mater- nal side need nothave
thesame
number of records as those on thepaternal
sidefor this method to
work,
solong
as each side has a fixed number. Of course all these modificationsapply
also to the charactersexpressed
in both sexes.II. - IRREGULAR CASES
The
regular
cases nowdescribed,
we can turn to the cases in which the ances-tors
provide
information of different heritabilities. Included will also be the casesof
missing
ancestors, of ancestorshaving
some recorded half sib progeny or fullsib
progeny from individuals outside thepedigree,
and of the individual inquestion providing
information on himself or his half- or full sib progeny.(By heritability
here is meant the
squared correlation).
The
previous
discussion has shown that the ratio between the factorby
whichan individual releases its information to that
by
which it releases its ancestors’is
equal
toI m .
This of course will hold true when animals in thepedigree
m
provide
information of different heritabilities. It is also clear that the m valuestemming
from an individualprovides
a summary of his information and of that of all his recorded ancestors. Theproblem
then is to relate a new m value(m j ) stemming
from an individual(j)
to the two m values(m a
andm b ) connecting
itwith its
parents
a and b. We shall start with the situation in which the individualj as
well as hisparents
a and bprovide
information.A
simplifying
device for thepedigree diagram
may besuggested:
When the individual inquestion provides
newinformation,
we can add to thediagram
apair
ofpaths connecting
this information with the&dquo; breeding
value &dquo; in whichwe are
interested,
onepath
of the value of m! for &dquo; old &dquo; information and the other(i-m j )
for the new information.The accuracy of the estimate of
breeding
value of the individual inquestion
could of course be obtained
by multiplying by 1 / 2
all thepaths
in thepedigree diagram except ( l -m J
and m!, andsumming
all the factorsconnecting
the bree-ding
value with all the ancestors, those factors to be calculated the same way as described inobtaining
theregression
coefficients.However,
sincewhere njis the number of half sib progeny
equivalent
to the ancestors’information, /!.
and w;are the k and w values calculatedusing
theheritability
of the individual’s information(hj),
thenwhere
When j does
notprovide information,
the accuracy of the estimate of hisbreeding
value(R!)
will beequal
to 1/ 4
the sum of the two accuracies of the bree-ding
values of his twoparents,
orx
. and xb
being
calculatedusing
the heritabilities of a and b information.The value of
R!
is of courseequal
tosince n j
represents
the number of half siboffspring equivalent
to the informationprovided by j’s
ancestors.Equating
these two values ofR!
weget
, . <M; . ,
M;+!,
and sinceMj
isequal
to!!,
then1-m
j ’1i!j
When an individual receives information from a
parent (say, a)
whose ances-tors are not included in the
index,
the value of(r--m a ) Ix .
will beequal
toi-hu
2and when the &dquo; a &dquo; branch of ancestors does not
provide
anyinformation, ha
isthen taken as o.o, and the value of
(z-nap) I Xa
is thenequal
to I. If aparent
doesnot
provide
any information over what isprovided by
his ancestors,i.e.,
if he passes his ancestors’ informationby
a factor of1 / 2 ,
then it could be shown that theappropriate
value of( I - Jn) /x
inEquation
10 isequal
to(r-H),
where His the sum of the values of
(i-m)
in this branch of ancestors, connectedto i (the
individual inquestion) only by paths
of!,
each value of whichmultiplied by (V 4 ) 0 ,
where g is the number ofgenerations separating
theparticular (r-m)
value from
j.
This method of course willyield
the m values for theregular
casesdiscussed above.
This method of
calculating
theregression
coefficients will cover all the relevantcases described
by
OSBORNE(ig 57 ), J ARDINE (rg 5 8),
LERov(rg58),
YouNG( 19 61),
FE W SO
N
( 19 6 4 ),
LE Roy( 19 6 4 ),
RASCH and HLRREND6RFER( 19 68)
and FLOCKand HAUSSMANN
(rg 7 o).
In fact many of the cases describedby
these authors could be solvedby
the rathersimpler
method shown above for theregular
cases.The
following
are a fewexamples
to illustrate theapplication
of this method.Examples (
I
)
Thebreeding
valueof j to
be estimated(Fig.
2,a).
h2 is taken asr j 4
andthe
repeatability p as % 2 .
Theheritability
of d records will bed / ( 2 + 2 d). Clearly
we need
only
to calculate one m value(ni j ). Applying Equation
10:where -
=h! /(i-h!)
=5!5, ( l -m b ) /x b
is setequal
to I since the b branch does3 5 5
not
provide information,
andThus the
regression
coefficients are:The accuracy of the estimate
(applying Equation 7 )
will be(z)
Thebreeding
value ofJ
to be estimated(Fig.
2,b). J
does notprovide
information. We need to calculate two m values
(m A
andm B ) (h 2
= .25 andp = .50):
Thus:
The accuracy is to be estimated
using Equation
8:( 3
)
Thisexample
is taken from LE Roy( 195 8).
h2 = .3oand P =
.40. Thebreeding
value of E to be estimated(Fig.
2,c).
Thisexample requires
thecalculation of 2 m values:
Thus:
( 4
)
The estimate of thebreeding
value of A isrequired (Fig.
2,d).
The father
(C)
has 60 half sibdaughters
and thegrandfather
D has 15. h2 is taken as .25 and therepeatability
as .50. Theheritability
to be attached toill
C or D is
!!+k,
Mbeing
the number of half siboffspring.
The meanperformance n+k
of the
daughters
has to be doubled toget
it on the same scale of their father’s bree-ding
value. Thus the information on C may be taken as if a character of the h2 of4 B- 60 i5 was
measured onhimself,
and hisperformance
was double the mean5
B 6o+i5/
’of his 60
daughters.
The informationsupplied by the 7 full
sibdaughters
of Ais
equivalent
to 5 half sibdaughters (M OSTAGEER , ig6g).
Thisprovides
an h2of
1 1 4
and aperformance
of double the mean of thedaughters. (This
method ofconverting
the full sibdaughters
into half sibdaughters
is useful when the indivi- dual has several groups of full sibdaughters.)
Thus:
Reçu pour publication en avril z9!z.
ACKNOWLEDGEMENT
The writer wishes to express his thanks to Professor Dr. Franz PIRCHNER for his comments
on the manuscript and for providing access to the electronic computer.
RÉSUMÉ
LES COEFFICIENTS DE RÉGRESSION PARTIELLE DANS LES ESTIMATIONS SUR ORIGINE
La méthode décrite par NIOSTAUFER (1970) pour estimer l’efficacité des performances des
ascendantes est utilisée pour calculer les coefficients de régression partielle correspondant aux
caractères extériorisés par les deux sexes et à ceux qui n’apparaissent que chez un sexe. Le premier
cas étudié est celui où tous les ancêtres jusqu’à la Tme génération, fournissent une information de même valeur. Lorsque les caractères ne sont extériorisés que par l’un des sexes, les coefficients de régression partielle, au nombre de 2T-1, peuvent être simplement déduits de T—1, facteur seulement.
Une solution est donnée également lorsque les informations apportées par les ascendantes ont des héritabilités différentes. Sont aussi traités les cas où l’animal étudié apporte lui-même
une information, celui où certains ascendants sont inconnus et celui où on en connaît une descen- dance de 1/2frère 1/! sceur. La solution exacte et simple fournie exige le calcul d’un petit nombre seulement de facteurs dont on peut déduire les coefficients de régression et la précision. Le nombre
de ces facteurs facilement calculables n’excède jamais la moitié du nombre des sources d’informa- tion.
Plusieurs exemples d’application sont proposés au lecteur.
REFERENCES
P L
wsorr D. P., 1964. Untersuchungen iiber die Effektivitat verschiedener Selektionsmassnahmen unter besonderer Beriicksichtigung der ziichterischen Verbesserung der Milchmenge und des Fettgehaltes beim Rind. Z. Tierz. Ziicht Biol., 79, ioi-i6i.
FLOCK D. K., HAUSSMANNH., 1970. Tabellen fur die Zuchtwertschdtzung bei landwirtschaftlichen Nutz- tieren. Ausschluss genetisch-statistischer Methoden in der Tierzucht der deutschen Gesellschaft fur Ziichtungskunde pp. 105.
JnanmrE R., r958. Animal breeding and the estimation of genetic value. Heredity, 12, ¢99-5rr·
I,
E Roy H. I,., rg58. Die Abstammungsbewertung. Z. Tier.. Ziicht Biol., 71, 328-378.
L
ERoyH. I,., 1964. Die verallgemeinerte Abstammungsbewerhmg. Z. 7’ierz. ZiichtBiol., 79, 303-309. M
OSTA G
EERA., 1969. The use of information on mates in estimating breeding values. Z. Tierz. Zucht Biol., 86, 42-51.
M O S TA G
EERA., 1970. The usefulness of pedigree information in animal selection with special reference
to dairy cattle. Z. 7’ierz. Ziicht Biol., 87, 56-61. OS
BORNER., i9g7. The use of sire and dam averages in increasing the efficiency of selective breeding in hierarchical mating systems. Heredity, 11, 93-m6.
I 2
escx D., HERRENDORFER G., x968. Statistische Untersuchungen zur Selektion auf quantitative Merk-
male. Biom. Z., 10, 173-181. Ro
BE
aTSOrr A., 1959. A simple method of pedigree evaluation in dairy cattle. Anim. 1’rod., 1, r67-r7¢.
S EARLE
S. R., 1963. The efficiency of ancestor records in animal selection. Heredity 18, 351-360.
YOUNG S. S. Y., 1961. The use of sire’s and dam’s records in animal selection. Heredity, 16, 91-102.