HAL Id: hal-00893872
https://hal.archives-ouvertes.fr/hal-00893872
Submitted on 1 Jan 1991
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Genetic evaluation of horses based on ranks in
competitions
A Tavernier
To cite this version:
Original
article
Genetic evaluation of horses based
on
ranks in
competitions
A Tavernier
Institut National de la Recherche
Agronomique,
Station deGénétique Quantitative
etAppliquée,
Centre de Recherches deJouy-en-Josas,
7835!
Jouy-en-Josas Cedex,
France(Received
14 October1988; accepted
9January
1991)
Summary - A method is presented for
analysing
horseperformance
recorded as a seriesof ranks obtained in races or competitions. The model is based on the assumption of the existence of an
underlying
normal variable. Then the rank of an animal ismerely
thephenotypic expression
of the value of thisunderlying
variable relative to that of the other horsesentering
the samecompetition.
Thebreeding
values of the animals are estimatedas the mode of the a
posteriori density
of the data in aBayesian
context. Calculation of this mode entailssolving
a non-linear systemby
iteration. Anexample involving
the results of races of2 .yr-old
French trotters in 1986 isgiven.
Practicalcomputing
methodsare
presented
and discussed.horse
/
ranking/
order statistics/
Bayesian methodsRésumé - Évaluation génétique des chevaux à partir de leurs classements en
compéti-tion. Cet article
présente
une méthoded’analyse
deperformances enregistrées
sous laforme
de classements obtenus dans desconfrontations
restreintes et variables(courses
ouconcours).
Le modèlepostule
d’existence d’une variable normalesous-jacente.
Leclasse-ment d’un cheval est alors
simplement d’expression phénotypique
de la valeur de cetteva-riable
sous-jacente
relativement à celles des autres animauxparticipant
à la mêmeépreuve.
Les valeurs
génétiques
des animaux sont estimées àpartir
du mode de la densité aposte-riori des données dans un contexte
bayésien.
Le calcul de ce mode amène ic la résolutiond’un
système
non linéaire par itérations. Unexemple
d’application
est réalisé sur lesrésul-tats des courses des chevaux Trotteurs
Français
de 2 ans en 1986. Des méthodes de calculspratiques
sontproposées
et discutées.cheval
/
classement/
statistiques d’ordre/
méthodes bayésiennes INTRODUCTIONChoosing
agood
selection criterion is one of themajor
problems
ingenetic
evaluation of horses. The
breeding
objective
is theability
to succeed inriding
competitions
(jumping,
dressage,
3-day-event)
or in races(trot
andgallop).
Buthow should success be measured?
The "career" of a horse is made up of a series of ranks obtained in races or
competitions.
A"physical"
measure ofperformance
is notalways
available. Such aThese data are not
always
collectedand, furthermore,
they
maygive
a poorindication of the real level of the
performance:
aracing
horse must be fast but itmust,
aboveall, adapt
toparticular
conditionsprevailing
in each event. This mayexplain
therelatively
lowheritability
of timeperformance
ofthoroughbreds
(Hintz,
1980;
Langlois,
1980a).
In the case ofriding horses,
it is difficult to assessthe technical level of a
jumping
event. Itdepends
notonly
on theheight
of theobstacles
but,
to agreater
extent, on the difficulties encountered whenapproaching
the obstacles and on the distance between obstacles. None of these variables can be
easily
quantified.
Therefore,
informationprovided by
theranking
of horses in each event deserves attention.Ranking
allows horsesentering
the same event to becompared
to the others.However,
the level of the event has to be determined too. The mostfrequently
used criterion related toranking
is transformedearnings.
Each horse that is&dquo;placed&dquo;
in an event,ie,
ranked among the first ones, receives a certainamount of money.
Prize-money
in a race is allocated in anexponential
way: forinstance,
the second horse earns half the amountgiven
to thefirst,
the third half of thatgiven
to the second and so on... If the rate of decrease is not50%,
it oftenequals
a fixedpercentage,
for instance75%
in horse shows. Theearnings
of a horse in a race can then beexpressed
as G =ax( k-
l
) D
with abeing
theproportion
ofthe total endowment
given
to the winner(constant),
xbeing
the rate of decrease ofearning
with rank(constant),
k the rank of the horse in the race and D the totalendowment of the race. The constants a and x must
satisfy
(axK-1-!+(1-a)
=0)
with K the total number of horses
&dquo;placed&dquo;. So,
alogarithmic
transformationgives
Log(G)
=Log(a)
+Log(D)
+(k - 1) Log(x).
This is a linear function of the rankof the horse. To use it as a function of the
ability
of thehorse,
Log(D)
should beassumed to be a linear function of the level of the race. The total amount of money
given
in a race or acompetition
shoulddepend
on the technicaldifficulty
or thelevel of the
competitors. Hence,
withadequate competition
programmes(Langlois,
1983),
the
logarithm
ofearnings
of a horse may be agood
scale formeasuring
horse
performance
and it has beenwidely
used(Langlois,
1980b, 1989;
Meinardus andBruns, 1987; Tavernier, 1988, 1989;
Arnason etal, 1989; Klemetsdal, 1989;
Minkema,
1989).
However,
this criterionstrongly
depends
on the way money is distributed. The choice of the amount of moneygiven
injumping
competitions
does not follow strict technical rules in France and does not
directly depend
onthe scale of technical difficulties but on the choice of the
organizing
committee.Therefore,
it appears that ranks should be taken into account without reference toearnings.
The purpose of this article is to
present
a method forestimating
thebreeding
value of an animal
using
a series of ranks obtained in events where itcompeted
against
asample
of thepopulation.
In order tointerpret
thesedata,
the notion ofunderlying
variable will be used as in Gianola andFoulley
(1983)
for estimationof
breeding
value withcategorical data,
and inHenery
(1981)
forconstructing
the likelihood of outcomes of a race. The horse’s &dquo;real&dquo;performance,
which cannot bemeasured,
is viewed as a normalvariable;
this is a reasonableassumption
for traitswith
polygenic
determination.Only
the location orranking
of thisperformance
is recorded instead of a
performance.
Practicalcomputational
aspects
as well as anapplication
to trotters arepresented.
METHOD
Data
The data
(Y)
consist of the ranks of all the animals in all the events. The total number of observations is thereforeequal
to the sum of the number of animalsper event. It is assumed that the ranks are related to an
underlying
unobservedcontinuous variable. The rank
depends
on the realized value of thisunderlying
unobserved variable
(&dquo;real&dquo;
animalperformance)
relative to that of the other animalsentering
the same event. Thegenetic
model is the same as for usual traitswith
polygenic
determinism. Theunderlying performance
yjk follows a normaldistribution with residual standard deviation (Fe and
expected
value !,2!. The model is:where:
- y2!! _ &dquo;real&dquo;
performance
ofhorse j under
environmental conditions i in the kth race ofj;
-
b
i
= environmental effecti (eg
age, sex,rider...);
- u
j = additive
breeding
value of horsej;
-p
j = environmental effect common to the different
performances
of horsej,
asit may
participate
in severalevents;
- eij
k = residual effect in kth race.
The vector of
parameters
to be estimated is 0 =(b’,
u’,
p’)
where b ={b
i
},
u =
(uj )
and
p =
{
Pj
}.
Inference is based onBayes
theorem. Since themarginal
density
of Y does not vary with 0:where
pee)
is theprior
density
of0,
g(Y/6)
is the likelihood function andf (9/Y)
is theposterior
density
of theparameters.
Prior
density
The vectors
b,
u, p and e are assumed to bemutually independent
and to followthe normal distributions:
N(13,
V), N(O, G),
N(O, H), N(O, R),
respectively.
Prior information about b is assumed to be vague, whichimplies
that thediagonals
of V tend to +00.Then,
theprior
density
of b is uniform and theposterior
density
of e does notdepend
on!3 !
G =Ao,’
where
A is therelationship
matrix and0
-; is
the additive
genetic
variance. H is adiagonal
matrix withdiagonal
elementsequal
to the variance of
p
(u p 2).
The variances0
-;
and
a
P2
are assumed to beknown,
0
-; is
Likelihood function
Given ai, the
performances
y2!! areconditionally independent.
Let y(l),!(2),...,
Yen)
be the orderedunderlying performances
of the n horses whichcompeted
in an event(for
notation,
see forexample
David, 1981,
p4).
Then,
the likelihoodof
obtaining
the observedranking
in that event can be written as(Henery,
1981;
Dansie,
1986):
where:
-
yis the standard normal
density.
-
J1(t)
is the locationparameter
of the horse ranked &dquo;t&dquo; in that event.This
probability
can beinterpreted
in thefollowing
way: theperformance
of thelast animal may
vary
between -oo and +00, theperformance
of the next to last varies from that of the last to +oo and so on.Thus,
theperformance
of a horse variesfrom that of the horse ranked
just
behind it to +00, henceleading
to the bounds of eachintegral
inP
k
.
Eachintegration
variable(t)
follows a normal distributionwith mean
J1(
t
)
and standard deviation ue = 1.Given 1L
(
t
),
these distributions areindependent
for all animals in the samecompetition.
This
probability
may beexpressed
in terms of a multivariate normalintegral
with thresholdsindependent
ofintegration
variables(Godwin,
1949; David,
1981):
where the distribution of
(xl, ... , !t, ... , !n-1 )
is normal with mean(
/1
(
1
)
-!(2!, ... , ,!(t) - /1(t+1) , ... ,/1(n-1) -
/t(
n
))
and variance V ={v
ml
}
with
Vmm =
2,
Vm,m-1 = v m , m+1= -1 and all other Vml = 0. Then:
Results of races are
likely
to be correlated.However,
if the model isappropriate,
this correlation woulddepend only
ongenetic
or environmental effects iegiven
theJ
.
L
ij’
S
,
the races areindependent.
The likelihood function isequal
tothe
product
of theprobabilities
of each event:Estimation of
parameters
The
posterior
density
of theparameters
is:The best selection criterion is known to be the mean of the
posterior
distribution(Fernando
andGianola,
1984;
Gof&net andElsen,
1984).
Asexpressing
itanalyti-cally
is notpossible
for the model usedhere,
we will take as estimator of 0 themode of the
posterior distribution,
which can be viewed as anapproximation
to theoptimum
selection criterion.Finding
this mode iscomputationaly equivalent
to the maximisation of ajoint probability
massdensity
function as calculatedby
Harville and Mee(1984)
forcategorical
data(Foulley,
1987).
It is more convenient to usethe
logarithm
of theposterior density:
/C=1
where m is the number of events.
The
system
which satisfies the first-order condition is not linear and mustbe solved
iteratively,
forexample using
aNewton-Raphson
type
algorithm.
Thisalgorithm
iterates with:where
9
is the solution for 0 at theqth
round of iteration andAM
= 9!q!-e!q 1!.
Iterations arestopped
when a convergencecriterion,
a function of0,
is less than anarbitrarily
small number.The first and second derivatives of
L(O)
withrespect
tob,
u, p arereported
inAppendix
1.The
system
can be written in thefollowing
way:m
where
A, B, C,
D are sub-matrices of minus the second derivatives ofL
Log(P
k
)
k=l
m
with
respect
to 0 and w, z are the vectors of first derivatives ofE
Log(P,!)
withk=l
The numerical solution of
system
(I)
raises theproblem
of the calculation of thecorresponding integrals.
Multivariate normalintegrals
may be calculated with numerical methods such as’that of Dutt(1973),
described andprogrammed by
Ducrocq
and Colleau(1986).
A second method consists ofusing
aTaylor’s
seriesexpansion
about zero which seems togive good
results(Henery,
1981; Dansie,
1986; Pettitt,
1982).
Thisrequires
that animalsparticipating
in agiven
event haverelatively
close means Itij, which is a reasonableassumption
in thepresent
contextof horse
competitions.
Thisexpansion
involves moments of normal orderstatistics,
as
explained
inAppendix
2.Example
In order to illustrate these
computations,
asimple
example
was constructed. Thisexample
involves 5 unrelated horses. There are no fixedeffects,
hence a
=(u
+p)
is estimated. The variance-covariance matrix
of p
isdiagonal
with each termbeing
9/11.
Two races with 4 runners are considered. The first gave thefollowing ranking:
No1,
No2,
No3,
No 4 and the second: No3,
No2,
No5,
No 4. Thestarting
value for allA
’s
was 0. Thesystem
to be solved at the first iteration of theNewton-Raphson
algorithms
as well as thecorresponding
solution are thefollowing:
The
algorithm converged
at the 5th iteration:(A’ A )°.
5
= 6 x10-
17
.
Thecorrespon-ding
values as well as the solutions and the coefficient of determination(CD)
withCD
=(1 —
ciilo, u 2)
where c2i is thediagonal
element of the inverse of the matrix ofsecond derivatives of the
logarithm
ofposterior
density
are:---solution:
[
Al
p2 P3 !4P5]
=
[0.621
0.237 0.271 - 0.902 -0.226]
accuracy:[0.242
0.434 0.404 0.3480.293]
It should be
noted that
the value of the first derivative for a horse in agiven
race isequal
to theexpectation
of the normal order statistic(normal score)
corresponding
to its rank.
Similarly,
second derivatives for agiven
race are functions of the varianceof,
and covariancesbetween,
normal order statistics. This is thelogical
consequence of the choice of 0 for JL asstarting
value: all distributions ofperformances
are the same with a mean of 0 and allintegrals correspond
toexpectations
of normal order statistics. The accumulated values for all races are the sum of these.At convergence, these values have
changed
and the final solution differs from the estimates obtained from theexpectation
of normal order statistics. Theinterpreta-tion of a rank
depends
notonly
on the number ofcompetitors,
which is taken intoaccount
through
the normal orderstatistics,
but also on the level of thecompeti-tion.
At
convergence, the first derivative of thelog
of aposteriori
density
is set to0.
So,
estimates of horses areequal
to the first derivatives of thelog
of likelihood function dividedby
the variance term. These derivatives are different for the samerank in different races.
They depend
on the level of the race estimated aposteriori
by
the estimates of the horsesparticipating
thisparticular
race,taking
into accountall races. In the
example,
for the winners of the 2 races, the first derivatives of the likelihood function were much lower than theexpected
values of order statistics.This is because the
competitors
of these races have much lower estimates than thewinners:
0.237, 0.271,
-0.902 for horses No2,
No 3 and No 4against
0.621 for horse No 1 winner of the first race and0.237, 0.226,
-0.902 for horses No2,
No 5 and No 4against
0.271 for horse No 3 winner of the second race.Therefore,
the first racefor No 1 and the second race for No 3 was easier than if
they
hadcompeted against
3 horses of
equal ability
tothemselves,
ie with the same ui, asimplied
with the normal order statistics. The values of the first derivatives were 0.7589 and0.8475,
respectively,
compared
to 1.0294 for theexpectation
of the normal order statistics of the first out of 4. In the same way, in the first race, horse No 3(0.27)
was beatenby
a horse of lesserability
(No
2(0.24)),
and,
therefore was morepenalized
than ifit had been defeated
by
a horse ofequal
ability.
The first derivative was-0.5165,
compared
to -0.2970 for theexpectation
of the normal order statistics of the thirdout of 4.
APPLICATION Data
horses in these races were recorded in the file. Ten races
(38
horses)
were discardedbecause
they
involvedonly
horses that did notcompete
more than once, andwhich,
therefore,
weretotally
disconnected from the rest of the file. We had to limit theanalysis
to&dquo;placed&dquo;
horses in each race,ie,
horses ranked among the best 4 or5,
because the
ranking
of otherparticipants
were not available. This does notprevent
us from
testing
andcomparing
our method to usualearning
criteriaassuming
that these races involvedonly
4 or 5 horses.Indeed,
this is neccessary for a faircomparison
sinceearnings
also involveonly &dquo;placed&dquo;
horses. With ourapproach,
&dquo;non
placed&dquo;
horsescould,
of course, be treated as the othersprovided
thatthey
are filed.The data set was made up of 251 races
(211
with 4 horses ranked and 40 with5 horses
ranked),
involving
490 different horses. The total number ofperformances
was 1044
places,
ie 2.1 per horse on average, with a maximum of 9 and a minimumof
1. A
horsecompeted
against
3.3 horses on average. The model used was:where: -
y!! _ &dquo;real&dquo;
performance
ofhorse j
in the kth race ofj;
- u j
= additive
breeding
value of horsej;
-p! = environmental effect common to the different
performances
of horsej;
-e jk
= residual effect in kth race about
&dquo;expected&dquo; performance
lLj.No fixed effect was considered because
particular
conditions of each race(dis-tance, type
ofground, season...)
are the same for all horses in the race and so have no effect on the result and because trainer and driver effects cannot be used on asmall data set
(only
one horse for themajority
of trainers ordrivers).
The
expectations
and variance-covariance matrices are:where h
2
=0
,2/
U2
is theheritability
and r =(
U2
+a;)/a;
is therepeatability
of the trait. Values ofh
2
= 0.25 and r = 0.45 were chosen asthey correspond
to usualestimates of these
parameters
obtained fromcompetitions.
RESULTS
The elements of
system
(I)
were recalculated at eachNewton-Raphson
iteration with Dutt’s!1973)
method forintegrals. Convergence
was reached after 5 iterations(with
(ð.’ ð.) .
5
/490
= 2 x10-
15
).
The accuracies of these solutions were measuredby
coefficient of determination(CD).
If cii is adiagonal
element of the matrix of secondderivatives,
CD =(1 -
c
ii/
o
u
).
Breeding
value estimates had a mean of0,
a standard deviation of0.30,
with amaximum of 0.94 and a minimum of -0.82. The mean accuracy was
0.23,
with astandard deviation of
0.08,
a maximum of 0.43 and a minimum of 0.12.These values were
compared
to criteriausually employed
in trotters(Thery,
0.73 with
Log(yearly earning),
0.88 withLog(yearly
earning
per&dquo;place&dquo;),
0.79with
Log(yearly
earning
perstart).
The correlation with a selection indexusing
asperformance
the mean of thelogarithm
ofearnings
in each race(with
parameter
values
h
2
= 0.25 and r =0.45)
was 0.94. Correlations with criteria related toracing
time werelower,
as were correlations betweenearnings
andracing
time. The correlation was -0.43 between our estimate and the best time per kilometer and - 0.47 between our criterion and a selection indexusing
asperformance
the averageracing
time(with
parameter
valuesh
2
= 0.25 and r =0.45).
Thesefigures
alsosuggest
that the bestracing
time is not agood
measure of success in a race for2-yr-old
horses.This
application
suggests
somepeculiarities
of our method. The first one relatesto the
spread
of accuracy values. Thesedepend
notonly
on the number of&dquo;places&dquo;
but also on the
&dquo;place&dquo;
of the horse in the race. Accuraciesranged
from 0.25 to 0.33 and from 0.20 to 0.28 for horseshaving
3 and 2&dquo;places&dquo;, respectively.
The minimalaccuracy
corresponding
to asingle
&dquo;place&dquo;
(0.12)
was smaller than theheritability
(0.25).
This is the result of the loss of information because ranks are used instead of continuousperformances.
The average &dquo;loss&dquo; of accuracyranged
from 0.10points
for horses ranked once to 0.05 for those ranked more than 7 times.The second
point
of interest is the relativeimportance
of the number of horsesper event and the level of the horses
participating
in the event. At convergence, the first derivative of thelogarithm
ofposterior density
isequal
to0,
so estimatesare
equal
to thepart
of the first derivative without variance terms dividedby
these variance terms(see
Appendix
I).
When all horsesparticipating
in an event are ofthe same level
(ie,
have the same realracing
ability)
this derivative isequal
toexpectations
of normal statistics. Theseexpectations
depend
only
on the number of animals per event. In our method the first derivative alsodepends
on the realracing
abilities of thecompetitors.
So the same rank in different events does notgive
the same derivative.Figure
1 shows the distribution of the derivatives in allthe races with 5 horses
&dquo;placed&dquo;
for the different ranks. For agiven rank,
thesederivatives are different in each race and so,
being
first in a race sometimesgives
alower estimate than
being
second in a race of ahigher
level.Our method can be used as a tool to
improve
thecorrespondence
between the level of the race and theprize
money to be distributed. The averagecompetitive
&dquo;level&dquo; of the race can be
approximated
as the mean of the estimates of realproducing ability
(
Jij
)
of each horse. Inpractice,
the correlation between such a measure and thelogarithm
of total endowment of the race was 0.30 for races with 4horses
&dquo;place&dquo;,
and 0.65 for races with 5&dquo;placed&dquo;.
Races with 5 horses&dquo;placed&dquo;
havethe
greatest
prize-money,
and endowment seemed to be agood
indicator of the valueof
participating
horses. It is alsopossible
to calculate aposteriori
theprobabilities
of
obtaining
the observedranking
in each race - or even of fictitious races -using
the estimates for each horse. These
probabilities
weredirectly
calculated from theformula for
P
k
and do not take into account the accuracy of the estimates. The averageprobability
ofobtaining
the observed ranks was11%
and3%
in races with4 and 5
horses,
respectively.
If all horses had the same realproducing
ability,
thisprobability
would be4%
in races with 4 horses(24 possibilities)
and0.8%
in racesDISCUSSION
In the
light
of the results obtained with2-yr-old
trotters,
theproposed
method seemedsatisfactory:
the estimated values are consistent with other criteria.In
practice,
solving
a muchlarger
system
ofequations
presents
difficulties. Twonumerical
problems
arise,
namely
the calculation of theintegrals
P!
and theirderivatives
and the dimensions of the wholesystem.
Two methods forcomputing
the necessaryintegrals
have beensuggested,
the firstbeing
a numerical calculationof multivariate normal
integrals
and the second anapproximation
by Taylor’s
series.Beyond
certaindimensions,
it takes a verylong
time tocompute
multiple integrals
of the normal distribution. For each iteration of
Newton-Raphson
and for each raceof n
horses,
it is necessary to calculate oneintegral
of order(n - 1),
nintegrals
of order
(n - 2)
and[n(n
+1)/2!
integrals
of order(n -
3).
Therefore,
the time needed toaccomplish
this becomesprohibitive
for a number of horses per race> 5 or 6. On the other
hand,
our purpose is to be able toapply
thistechnique
to all
types
of horsecompetitions
(for
example
showjumping)
that sometimes involve more than 100participants. Then,
it is necessary to turn toapproximations
like those
proposed by Henery
(1981)
using
Taylor’s
series. The accuracy of theseapproximations
is difficult to test. Inparticular, approximate
formulae for themoments of order statistics
superior
to 2(Pearson
andHartley,
1972;
David andJohnson,
1954)
need to be tested andcompared
tointegral
calculations ofhigh
order.Such
anapproximation
reduces calculation timesconsiderably.
The momentsof order statistics not
given
in tables can be calculated once and for all.Then,
eachderivative
only
consists of a linear combination of theproducing
abilities of theThe overall dimension of the
system
constitutes a secondproblem. Using
ananimal
model withrepeated records,
this dimension isequal
to the number of horses to be evaluatedplus
the number ofperforming
horses and fixed effects. At thepresent time,
inFrance,
100 000 horses are evaluated injumping
with ananimal model
(BLUP
method)
based onyearly earnings
and 70 000 are evaluatedin
trotting-races
(Tavernier,
198(,)b,
1990).
For eachNewton-Raphson
iteration of theproposed
method,
an iterative solution such as Gauss-Seidel will be needed.This method has been
developed
to include all horses in every raceincluding
&dquo;non-placed&dquo;
horses.However,
they
will have to be treated in aslightly
different manner: the purpose is to consider the horses&dquo;placed&dquo;
as better than the&dquo;non-placed&dquo;,
but detailedranking
of&dquo;non-placed&dquo;
horses is of little interest. Thecompetitor
which nolonger
has a chance offinishing &dquo;placed&dquo;
is notgoing
totry
toimprove
its rankand, therefore,
its rank relative to the other&dquo;non-placed&dquo;
horses does notaccurately
reflect its realability.
Therefore the&dquo;non-placed&dquo;
should be treated ashaving
aperformance
below that of the last&dquo;placed&dquo;. Then,
the likelihoodof the outcome of a race can be written as:
where there are n’ horses in the race and n horses
&dquo;placed&dquo;.
Thisintegral
can beused in this form or
equivalently
as the sum of all theintegrals
over allpossible
rank combinations between&dquo;non-placed&dquo;
horses,
which allows asimplified
application
ofthe calculation
by Taylor’s
approximation.
Another
difficulty
is the estimation of thegenetic
parameters.
The estimation of variance components couldprobably
be madeusing
amarginal
maximum likelihoodapproach
whichrequires
the inversion of the matrix of secondderivatives,
asdiscussed
by
Gianola et al(1986)
andapplied
by
Foulley
et al(1987a, b).
Inpractice,
this method can beapplied only
on a reduced data file or with a &dquo;sire&dquo; model.The
heritability
of asingle
performance
is lower than that ofyearly
earning
criteria.
Yearly
criteria arecompound
functions of the number of events and of success in each event. Forinstance,
forsingle performance,
Meinardus and Bruns(1987)
reported h
2
= 0.18 and r = 0.48 for thelogarithm
ofearnings
injumping
shows,
Klemetsdal(1989)
reported
h
2
= 0.18 and r = 0.65 for time introtting-races,
Thery
(1981)
foundh
2
= 0.23 and r = 0.52 for the same criterion andh
z
= 0.07and r = 0.13 for the
logarithm
ofearnings. However,
the number ofelementary
performances
during
the lifetime of a horse is sufficient toexpect
good
accuracies of estimations.Taking
theprevious
examples
and a number ofyearly
startsequal
to 12(the
average number ofyearly
starts for an adult horse is 12 introtting-races
and 14 in
riding competitions),
the accuracies ofbreeding
value estimationranged
from 0.27 to 0.41 after one year ofperformance.
With a loss of 0.10point
due tothe use of
ranks,
accuracies of evaluations based on ranks would range from 0.17to
0.31,
which is reasonable.This model
requires
asufficiently large
amount ofcomparisons
between horsesto allow a proper classification. The presence of isolated events which do not
the
necessity
ofgood
connections between races, which is theonly
guarantee
of areliable result.
CONCLUSION
This article describes a method of evaluation of the
breeding
value of an animalfrom its rank relative to those of other
competitors
in agiven
event, withoutusing
adirect measure of
performance.
It isinteresting
that the methodsuggests
a solution based on a conventionalgenetic
model. It can beapplied
to an &dquo;individual animal&dquo;model as well as to a &dquo;sire&dquo; model. It takes into account the level of the
competition
which is the main factorinfluencing
a rank’svalue,
together
with the number ofparticipants
in the event.Although
use of ranks may seem to lead to a loss of informationcompared
to aphysical
measure, it is sometimes more reliable. In the case of horse races,ranking
isabsolutely
necessary as a realphysical
measure isnot identifiable. It may also be useful in the case of a distorted scale of measure or
when the usual
physical
measure isnothing
but thetranscription
of a rank.REFERENCES
Arnason
T,
BendrothM, Phillipsson J,
HenrikssonK,
Darenius A(1989)
Genetic evaluation of Swedish trotters. In: Stateof Breeding
Evaluation in Trotters. EAAP Publ No42, Pudoc, Wageningen,
106-130Dansie BR
(1986)
Normal order statistics aspermutation
probability models. Appl
Statist
35,
269-275David
FN,
Johnson NL(1954)
Statistical treatment of censored data. I.Fundamen-tal formulae. Biometrika
41,
228-240David HA
(1981)
Order Statistics.Wiley,
NY,
2ndedn,
p 360Ducrocq V,
Colleau JJ(1986)
Interest inquantitative
genetics
of Dutt’s and Deak’s methods for numericalcomputation
of multivariate normalprobability
integrals.
Genet Sel
Evol 18,
447-474Dutt JE
(1973)
Arepresentation
of multivariateprobability integrals by integral
transforms. Biometrika60,
637-645Fernando
RL,
Gianola D(1984)
Optimal
properties
of the conditional mean as aselection criterion. J Anim Sci 59
(suppl),
177(abstr)
Foulley
JL(1987)
Methodesd’Evaluation
desReproducteurs
pour des Caractères Discrets a DeterminismePodygenique
en Selection Animale. Thesed’6tat,
Univer-site de
Paris-Sud,
Centred’Orsay,
pp 320Foulley
JL,
GianolaD,
Planchenault D(1987a)
Sire evaluation with uncertainpaternity.
Genet SelEvod 19,
83-102Foulley
JL,
ImS,
GianolaD,
Hoschele I(1987b)
Empirical
Bayes
estimation ofparameters
for npolygenic
binary
traits. Genet SelEvod 19,
197-204Gianola
D,
Foulley
JL(1983)
Sire evaluation for orderedcategorical
data with athreshold model. Genet Sel
Evol 15,
201-224Godwin HJ
(1949)
Some low moments of order statistics. Ann Math Statist20,
279-285Goffinet
B,
Elsen JM(1984)
Critbreoptimal
de selection:quelques
r6sultatsg6n6raux.
G6n6t SelEvol 16,
307-318Harville
DA,
Mee RW(1984)
A mixed modelprocedure
foranalyzing
orderedcategorical
data. Biometrics40,
393-408Henery
RJ(1981)
Permutationprobabilities
as models for horse races. JR StatistSoc
43,
86-91Hintz RL
(1980)
Genetics ofperformance
in the horse. J Anirrc Sci51,
582-594 Klemetsdal G(1989)
Norwegian
trotterbreeding
and estimation ofbreeding
values. In: Stateof Breeding
Evaluation in Trotters. EAAP Publication No42, Pudoc,
Wageningen,
95-105Langlois
B(1980a)
Heritability
ofracing ability
inthoroughbreds.
A review. Livest Prod Sci7,
591-605Langlois
B(1980b)
Estimation de la valeurg6n6tique
des chevaux desport
d’apr6s
les sommesgagn6es
dans lescompetitions 6questres frangaises.
Ann Ggn!t Sel Anim12,
15-31Langlois
B(1983)
Quelques
reflexions ausujet
de l’utilisation desgains
pourappr6cier
lesperformances
des chevaux trotteurs. 34th Ann MeetEAAP, Madrid,
Spain,
October3-6, 1983,
Study
Commission on Horse ProductionLanglois
B(1984)
H6ritabilit6 et correlationsg6n6tiques
destemps
records et desgains
établis par les TrotteursFran!ais
de 2 a 6 ans. 35th Ann MeetEAAP,
TheHague,
TheNetherlands,
August
6-9, 1984,
Study
Commission on Horse ProductionLanglois
B(1989)
Breeding
evaluation of French trottersaccording
to their raceearnings.
I. Present situation. In: Stateof
Breeding
Evaluation in Trotters. EAAP. Publication No42, Pudoc,
Wageningen,
27-40Meinardus
H,
Bruns E(1987)
BLUPprocedure
inriding
horses based oncompe-tition results. 38th Ann Meet
EAAP, Lisbon,
Portugal, September
28-October1,
1987,
Study
Commission on Horse ProductionMinkema D
(1989)
Breeding
value estimation of trotters in the Netherlands. In: Stateof Breeding
Evaluation in Trotters. EAAP Publication No42,
Pudoc,
Wageningen,
82-94.Pearson
ES,
Hartley
HO(1972)
Biometrika Tablesfor
Statisticians.Cambridge
University
Press,
vol2,
27-35Pettitt AN
(1982)
Inference for the linear modelusing
a likelihood based on ranks.JR Statist Soc
44,
234-243Tavernier A
(1988) Advantages
of BLUP animal model forbreeding
value estima-tion in horses. Livest Prod Sci20,
149-160Tavernier A
(1989)
Breeding
evaluation of French trottersaccording
to theirrace
earnings.
II.Prospects.
In: Stateof Breeding
Evaluation in Trotters. EAAP Publication No42,
Pudoc, Wageningen,
41-54Tavernier A
(1989b)
Caract6risation de lapopulation
TrotteurFrangais d’apr6s
leur estimationg6n6tique
par un BLUP modèle animal. Ann Zootech38,
145-155 Tavernier A(1990)
Caract6risation des chevaux de concourshippique
fran!ais
d’apr6s
leur estimationg6n6tique
par un BLUP modèle animal. Ann Zootech39,
Thery
C(1981)
Analyse g6n6tique
etstatistique
desperformances
des TrotteursFran!ais
en courses en 1979 et 1980. M6moire deDipl6me
d’EtudesApprofondies
deG6n6tique Quantitative
etAppliqu6e,
Universite ParisXI,
pp 80APPENDIX 1
Calculation of the first and second derivatives of the
logarithm
of the aposteriori density
Let
y(
l
), Y(2),...,Y(n)
be the orderedunderlying performances
of the n horseswhich have
participated
in race k(see,
forexample, David, 1981,
p4).
Further,
letQ(t),k, R(t)(t),k, R(t)(z),k
be:We have:
where: -
(k, (t))
E j
indicates the set of events k in which theanimal j competed
and obtained the rankt;
-
where: -
(t)
E i indicates the horses ranked at theplace
t in the event k and with associated fixed effect iand,
if thehorses j and
haveparticipated
in the same event:I I _ --
!-The second derivatives with
respect
to u and p, or p and p are built in the sameway. The
only
value thatchanges
is the covariance which isequal
to 0 between up and is
equal
to1/(J!
on thediagonal
of the second derivatives withrespect
to pand p. p
where: -
!i(t)
= 0 if fixed effect i does not influence the horse ranked t-
!i(t) =
1 if fixed effect i influences the horse ranked tAPPENDIX 2
Approximation
of first and second derivatives oflog
of the aposteriori
density
using
Taylor’s
seriesexpansion
These
expansions
are drawn from those usedby
Henery
(1981)
and Dansie(1986)
whoapproximate
theprobability
P
k
.
Anexample
of thesedecompositions
isgiven
for
(Q!t!,K/P!;):
where,
for nindependent
normal distributions:-
e t:n
:
expectation
of the tth order statistic -o-tt:
, : variance of the tth order statistic -
<!tp:n : covariance between the tth order statistic and the
pth
order statistic -Pt p z:n