HAL Id: hal-00894134
https://hal.archives-ouvertes.fr/hal-00894134
Submitted on 1 Jan 1996
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
Statistical analysis of ordered categorical data via a
structural heteroskedastic threshold model
Jl Foulley, D Gianola
To cite this version:
Original
article
Statistical
analysis
of
ordered
categorical
data via
a
structural heteroskedastic
threshold model
JL
Foulley
D
Gianola
2
1
Station de
génétique
quantitative etappliquée,
Institut national de la rechercheagronomique,
centre de recherches deJouy,
78352Jouy-en-Josas cedex, France;
2
Department of
Meat and AnimalScience, University of Wisconsin-Madison,
Madison, WI 53706,
USA(Received
2 October1995; accepted
25 March1996)
Summary - In the standard threshold
model,
differences among statisticalsubpopulations
in the distribution of ordered
polychotomous
responses are modeled via differences inlocation parameters of an
underlying
normal scale. A new model isproposed whereby
subpopulations
can also differ indispersion
(scaling)
parameters.Heterogeneity
in such parameters is describedusing
a structural linear model and aloglink
functioninvolving
continuous or discrete covariates. Inference
(estimation,
testing procedures, goodness
offit)
about parameters in fixed-effects models is based on likelihoodprocedures. Bayesian
techniques
are also described to deal with mixed-effects model structures. Anapplication
tocalving
ease scores in the US Simmental breed ispresented;
the heteroskedastic threshold model had a bettergoodness
of fit than the standard one.threshold character
/
heteroskedasticity/
maximumlikelihood/
mixed linear model/
calving difficulty
Résumé - Analyse statistique de variables discrètes ordonnées par un modèle à seuils hétéroscédastique. Dans le modèle à seuils
classique,
lesdifférences
deréponses
entre
sous-populations
selon descatégories
discrètes ordonnées sont modélisées par desdifférences
entreparamètres
de position mesurés sur une variable normalesous-jacente. L’approche présentée
ici suppose que cessous-populations diffèrent
aussi par leursparamètres
dedispersion
(ou
paramètres
d’échelle).
L’hétérogénéité
de cesparamètres
estdécrite par un modèle linéaire structurel et une
fonction
de lienlogarithmique impliquant
des covariables discrètes ou continues.L’inférence
(estimation,
qualité d’ajustement,
testd’hypothèse)
sur lesparamètres
dans les modèles àeffets fixes
est basée sur les méthodes dumaximum de vraisemblance. Des
techniques bayésiennes
sontégalement proposées
pour letraitement des modèles linéaires mixtes. Une
application
aux notes dedifficultés
devêlage
en race Simmental américaine estprésentée.
Le modèle à seuilshétéroscédastiqué
améliore dans ce cas laqualité
del’ajustement
des données par rapport au modèle standard.caractères à seuils
/
hétéroscédasticité/
maximum de vraisemblance/
modèle linéaireINTRODUCTION
An
appealing
model for theanalysis
of orderedcategorical
data is the so-calledthreshold model.
Although
introduced inpopulation
andquantitative
genetics
by Wright
(1934a,b)
and discussed laterby
Dempster
and Lerner(1950)
and Robertson(1950),
it dates back to Pearson(1900),
Galton(1889)
and Fechner(1860).
This model has received attention in various areas such as humangenetics
and
susceptibility
to disease(Falconer,
1965;
Curnow andSmith, 1975), population
biology
(Bulmer
andBull, 1982; Roff,
1994), neurophysiology
(Brillinger, 1985),
animalbreeding
(Gianola, 1982),
surveyanalysis
(Grosbas,
1987),
psychological
and social sciences
(Edwards
andThurstone, 1952;
McKelvey
andZavoina,
1975),
and econometrics(Kaplan
andUrwitz, 1979;
Levy,
1980;
Bryant
andGerner, 1982;
Maddala,
1983).
The threshold model
postulates
anunderlying
(liability)
normaldistribution
rendered discrete via threshold values. The
probability
of response in agiven
category
can beexpressed
as the difference between normal cumulative distributionfunctions
having
asarguments
the upper and lower thresholds minus the meanliability
forsubpopulation
dividedby
thecorresponding
standard deviation.Usually
the locationparameter
(
?7i
)
for asubpopulation
isexpressed
as a linearfunction
77i
=t’O
of someexplanatory
variables(row
incidence vectorti)
(see
theory
ofgeneralized
linearmodels,
McCullagh
andNelder, 1989;
Fahrmeir andTutz, 1994).
The vector of unknowns(e)
may include both fixed and randomeffects and statistical
procedures
have beendeveloped
to make inferences aboutsuch a mixed-model structure
(Gianola
andFoulley,
1983;
Harville andMee, 1984;
Gilmour etal,
1987).
In all thesestudies,
the standard deviations(also
called thescaling
parameters)
are assumed to be known andequal,
orproportional
to knownquantities
(Foulley,
1987;
Misztal etal,
1988).
The purpose of this paper is to extend the standard threshold model
(S-TM)
to a model
allowing
forheteroskedasticity
(H-TM)
withmodeling
of the unknownscaling
parameters.
Forsimplicity,
thetheory
will bepresented using
a fixed-effectsmodel and likelihood
procedures
for inference. Mixed-model extensions based onBayesian
techniques
will also be outlined. Thetheory
will be illustrated with anexample
oncalving difficulty
scores in Simmental cattle from the USA.THEORY
Statistical model
The overall
population
is assumed to be stratified into severalsubpopulations (eg,
subclasses of sex,parity,
age,genotypes, etc)
indexedby
i =1, 2, ... ,
Irepresenting
potential
sources of variation. Let J be the number of ordered responsecategories
indexed
by
j,
and yi+ ={
Yij+
}
be the(J
x1)
vector whose element y2!+ is thetotal number of responses in
category j
for subpopulation
i. The vector y2+ can beand
(3)
I - 1 contrasts amonglog-scaling
parameters
(eg,
ln(Q
i
) -
ln(<7i)
fori =
2, ... , I)
or,equivalently,
I - 1 standard deviation ratios(eg, O
&dquo;d
O
&dquo;d,
withone of these
arbitrarily
set to a fixed value(eg,
<7i =1).
This makes a total of2I + J - 3 identifiable
parameters,
so that the full H-TM reduces to the saturatedmodel for J = 3
categories,
seeexamples
in Falconer(1960),
chapter
18.More
parsimonious
models can also be envisioned. Forinstance,
in a two-way crossclassifiedlayout
with A rows and B columns(I
=AB),
there are 16additive models that can be used to describe the location
(
?7i
)
and thescaling (o
j
)
parameters.
Thesimplest
one would have a common mean and standard deviationfor the I = AB
populations.
The mostcomplete
one would include the main effectsof A and B factors for both the location and
dispersion
parameters.
Here there are2(A+B)+J-5
estimableparameters,
ie,
J-1 thresholdsplus
twice(A-1)+(B-1).
Under an additive model for the locationparameters
71i, it ispossible
to fit the H-TM tobinary
data. For the crossclassifiedlayout
with A rows and Bcolumns,
there are AB -
2(A
+B)
+ 3degrees
of freedom available which means that weneed A
(or B) !
4 to fit an additive modelusing
all the levels of A and B at both the location anddispersion
levels.Finally,
it must be noted that in thisparticular
case,dispersion
parameters
act as substitutes of interaction effects for locationparameters.
Estimation
Let T =
{7!}
for j
=1, 2, ... ,
J - 1 and a =(
T
’, (3’,
b’)’.
In fixed-effects modelswith multinomial
data,
inferences about a can be based on likelihoodprocedures.
Here,
thelog
likelihoodL(a; y)
can beexpressed,
apart
from an additiveconstant,
as: T T
with, given
!4!,
[5]
and!6!,
The maximum likelihood
(ML)
estimator of a can becomputed using
asecond-order
algorithm.
A convenient choice for multinomial data is thescoring
algorithm,
because Fisher’s information measure issimple
here. Thesystem
ofequations
tosolve
iteratively
can be written as:-
-where
U(a; y)
=<9L((x;y)/<9<x
andJ(a)
_
-E[å2L(a;y)/åaåa’]
are the scorefunction and Fisher’s information matrix
respectively;
k is iterate number.Analyt-ical
expressions
for the elements ofU(a; y)
andJ(a)
aregiven
inAppendix
1.These are
generalizations
of formulaegiven by
Gianola andFoulley (1983)
andMisztal et al
(1988).
In someinstances,
one may consider abacktracking procedure
(Denis
andSchnabel,
1983)
to reach convergence,ie,
at thebeginning
of the iterativeprocess,
compute
a!k+1!
asa[k+1]
=ark]
+,cJ[k+1]!a[k+1]
with 0 <w[
k+1
] ::::;
1.A constant value of w = 0.8 has been
satisfactory
in all theexamples
run so far(over
the ni observations made insubpopulation i)
of indicator vectors yir =(Yilr i Yi2r i... i Yijri ... i YiJr)l
such that!r=l
1 or 0depending
on whether aresponse for observation
(r)
inpopulation
(i)
is incategory
(j)
or not.Given ni
independent
repetitions
of
Yin
the sum yi+ ismultinomially
distributedj
with
parameters n
i
= !
yij+ andprobability
vectorIi
i
={lIi
j
}.
j = lIn the threshold
model,
theprobabilities
1I
jj
are connected to theunderlying
normal variables
X
ir
with threshold values Tj via the statementwith To = -oo and Tj =
+
00
,
so that there are J - 1 finite thresholds.With
Xi
r
rv
N(!2,Q2),
this becomes:where
!(.)
is the CDF of the standardized normal distribution.The mean
liability
(
?
7i)
for the ithsubpopulation
is modeled as in Gianola andFoulley (1983)
and Harville and Mee(1984),
and as ingeneralized
linear models(McCullagh
andNelder,
1989)
in terms of the linearpredictor
Here,
the vector(p
x1)
of unknowns(0)
involves fixed effectsonly
and xiis thecorresponding
(p x 1)
vector ofqualitative
orquantitative
covariates.In the
H-TM,
a structure isimposed
on thescaling
parameters.
As inFoulley
et al(1990, 1992)
andFoulley
andQuaas
(1995),
the naturallogarithm
of Qi is written as a linear combination of some unknown(r
x1)
real-valued vector ofparameters
(6),
1p’
being
thecorresponding
row incidence vector ofqualitative
or continuouscovariates.
Identifiability
ofparameters
In the case of I
subpopulations
and Jcategories,
there is a maximum ofI(J - 1)
identifiable(or estimable) parameters
if themargins
ni are fixedby sampling.
Theseare the
parameters
of the so-called saturated model.What is the most
complete
H-TM(or
’full’model)
that can be fitted to the datausing
theapproach
described here? One can estimate:(1)
J - 1 finite thresholdGoodness of fit
The two usual
statistics,
Pearson’sX
Z
and the(scaled)
deviance D* can be usedto check the overall
adequacy
of a model. These arewhere
fig
=77
tj
((x)
is the ML estimate of1I
j
,
andAbove,
D* is based on the likelihood ratio statistic forfitting
the entertainedmodel
against
a saturated modelhaving
as manyparameters
as there arealge-braically
independent
variables in the datavector,
ie,
1(J -
1)
here. Data shouldbe
grouped
as much aspossible
for theasymptotic
chi-square
distribution to hold in[9]
and[10]
(McCullagh
andNelder, 1989;
Fahrmeir andTutz,
1994).
Thedegrees
of freedom to consider here areI(J - 1) (saturated model)
minus((J - 1)
+rank(X)
+rank(P)]
(model
understudy),
where X and P are theinci-dence matrices for
(3
and brespectively.
It should be notedthat
[9]
and[10]
can becomputed
asparticular
cases of the powerdivergent
statistics introducedby
Readand Cressie
(1984).
Hypothesis testing
Tests of
hypotheses
about y =6’)’
can be carried out via either Wald’s test orthe likelihood
ratio
(or
deviance)
test. The firstprocedure
relies on theproperties
of
consistency
andasymptotic
normality
of the ML estimator.For linear
hypotheses
of the formH
o
:
K’y
= magainst
its alternativeH
l
=H
o
,
the
Waldstatistic
is:which under
H
o
has anasymptotic
chi-square
distribution withrank(K) degrees
offreedom.
Above,
r(y)
is anappropriate
block of the inverse of Fisher’s informationmatrix evaluated at y=
y
, where
y
is the ML estimator.The likelihood ratio statistic
(LRS)
allowstesting
nestedhypotheses
of the formHo :
y Eno
against
H
1
:
y - E nno
whereno
and ,f2 are the restricted andunrestricted
parameter
spacesrespectively,
pertaining
toH
o
andH
o
UH
l
.
TheLRS is:
where
y
andy are
the ML estimators of y under the restricted and unrestrictedmodels
respectively.
The criterion!#
also can becomputed
as the difference inThis is
equivalent
to what isusually
done in ANOVAexcept
that residual sums of squares arereplaced
by
deviances.Under
H
o
,
A#
has anasymptotic
chi-square
distribution with r =dim(D)
-
dim(
Do
)
degrees
of freedom. Under the same nullhypothesis,
the Wald and LRstatistics
have the sameasymptotic
distribution.However,
Wald’sstatistic
is based on aquadratic approximation
of theloglikelihood
around its maximum.Including
random effectsIn many
applications,
they
i
r ’s
cannot be assumed to beindependent repetitions
due to some cluster structure in the data. This is the case inquantitative
genetics
andanimal
breeding
withgenetically
relatedanimals,
common environmental effectsand
repeated
measurements on the same individuals.Correlations can be accounted for
conveniently
via a mixed model structure onthe
’T7!S,
written now aswhere the fixed
component
x!13
is asbefore,
and u is a(q
x1)
vector of Gaussianrandom effects with
corresponding
incidence row vectorzi.
For
simplicity,
we will consider a one-way randommodel, ie,
u !N(O,
Ao,
u
2
)
(A
is apositive
definite matrix of known elements such askinship
coefficients),
but the extension to several
u-components
isstraightforward.
The randompart
of the location is rewritten as inFoulley
andQuaas
(1995)
asZ!O&dquo;Ui u*
where u* is avector
of standard normaldeviation,
and au, is the square root of theu-component
ofvariance,
the value of which may bespecific
tosubpopulation
i. Forinstance,
the sire variance may varyaccording
to the environment in which the progeny of thesires is raised.
Furthermore,
it will be assumed that the ratio0’
.,,, /a
i
,
where a
i
is nowthe residual
variance,
is constant acrosssubpopulations.
In a sireby
environmentlayout,
this is tantamount toassuming
homogeneous
intraclass correlations(or
heritability)
acrossenvironments,
which seems to be a reasonableassumption
inpractice
(Visscher, 1992).
Thus,
theargument
h2!
of the normal CDF in[4]
and[7]
becomeswhere p =
<
7
u,
/
<T:.
In the fixed
model,
parameters
T,(3
and b were estimatedby
maximumlikelihood. Given p, a natural extension would be to estimate these and u*
by
the mode of theirjoint posterior
distribution(MAP).
To mimic a mixed-modelstructure,
one can take flatpriors
on T,13
and b. Theonly
informativeprior
is thenon u
*
,
ie,
u* rvN(O, A).
ThusMAP solutions can be
computed
with minor modifications from[8].
Theonly
changes
toimplement
are toreplace:
(i)
a =’, (3’,
T
(
6’)’
by
0 =(
T
’,
0, 6’,
u#’)’
with
u#
=pu
*
;
(ii)
X=J;xi,X2,...,x!,...,x//
by
S =(S1, · .. , Si, ... , SI)’
withS
’
=to the random effects on the left hand side
and _p-
2
A -
lU[k]
to theu#-part
of theright
hand side(see
Appendix
1).
A testexample
is shown inAppendix
2.A further
step
would be to estimate pusing
an EMmarginal
maximum likelihoodprocedure
based onThis may involve either an
approximate
calculation of the conditionalexpecta-tion of the
quadratic
inu#
as in Harville and Mee(1984),
Hoeschele et al(1987)
andFoulley
et al(1990),
or a Monte-Carlo calculation of this conditionalexpectation
using,
forexample,
a Gibbssampling
scheme(Natarajan, 1995).
Alternative pro-cedures forestimating
pmight
be alsoenvisioned,
such asthe
iteratedre-weighted
REML of
Engel
et al(1995).
NUMERICAL APPLICATION
Material
The data set
analyzed
was acontingency
table ofcalving
difficulty
scores(from
1 to4)
recorded onpurebred
US-Simmental cows distributedaccording
to
sex of calf(males, females)
and age of dam atcalving
in years. Scores 3 and 4 werepooled
on account of the lowfrequency
of score 4. Nine levels were considered for ageof dam: < 2 years,
2.0-2.5, 2.5-3.0, 3.0-3.5, 3.5-4.0, 4.0-4.5, 4.5-5.0, 5.0-8.0,
and> 8.0 years. In the
analysis
of thescaling parameters,
six levels were consideredfor this factor: < 2 years,
2.0-2.5, 2.5-3.0, 3.0-4.0, 4.0-8.0,
and > 8.0 years. The distribution of the 363 859 recordsby
sex-age of dam combinations isdisplayed
in tableI,
as well as thefrequencies
of the threecategories
ofcalving
scores. The raw data reveal the usualpattern
ofhighest calving difficulty
in male calves out ofyounger dams.
However,
more can be said about thephenomenon.
Method
Data were
analyzed
with standard(S-TM)
and heteroskedastic(H-TM)
thresholdmodels. Location and
scaling
parameters
were describedusing
fixed modelsinvolv-ing
sex(S)
and age of dam(A)
as factors of variation. In both cases, inference wasbased on maximum
likelihood
procedures.
Alog-link
function was used forscaling
parameters.
With J = 3
categories,
the mosthighly parameterized
S-TMthat
can befitted
for the location structure includes J - 1 = 2 threshold values
(or, equivalently,
thedifference between thresholds
(
72
-
71
)
and a baselinepopulation
effect/
-
l
),
plus
sex(one contrast),
age of dam(eight contrasts)
and their interaction(eight
contrasts)
as elements of(3;
thisgives r(X)
= 17 whichyields
19 as the total number ofThe H-TM to start with was as in the S-TM for location
parameters
(3.
Withrespect
todispersion
parameters 6,
the model was an additive one, with sex(S
*
:
onecontrast)
and age of dam(A
*
:
fivecontrasts)
so thatr(P)
= 6(
Q
= 1 inmale calves and < 2.0 year old
dams).
Thus,
the total number ofparameters
was19 + 6 = 25
and,
thedf
wereequal
to 36 - 25 = 11.RESULTS
All factors considered in the S-TM were
significant
(P
<0.01),
especially
the sexby
age of dam interactions(except
the first one, as shown in tableII).
Hence,
themodel cannot be
simplified
further. This means that differences between sexes werenot constant across age of dam
subclasses,
contrary
to results of aprevious
study
in Simmental also obtained with a fixed S-TM(Quaas
etal,
1988).
Differencesin
liability
between male and female calves decreased with age of dam. However the S-TM did not fit well to thedata,
as the Pearson statistic(or
deviance)
wasX
2
= 419 on 17degrees
offreedom,
resulting
in a nil P-value. An examination ofthe Pearson residuals indicated that the S-TM leads to an underestimation of the
As shown in table
II,
fitting
the H-TM decreased theX
2
and devianceby
afactor of 20 and led to a
satisfactory
fit. Thesignificance
of many interactionsvanished,
and this was reflected in the LRS(P
<0.088)
for thehypothesis
of noS x A interaction in the most
parameterized
model. Several models were tried andtested as shown in table III. The
scaling
parameters
depended
onthe
age of thedam,
theeffect
of sexbeing
notsignificant
(P
<0.163).
Relative to the baselinepopulation,
the standard
deviation increasedby
a factor of about1.05, 1.15, 1.25,
1.40 and 1.50 for cows of
2.0-2.5, 2.5-3.0, 3.0-4.0, 4.0-8.0,
and > 8.0 years of ageat
calving respectively (table IV).
The H-TM made differences between sex liabilities across ages of dam
practically
constant as theinteraction
effects werenegligible
relative to the main effects. The difference between male and female calves was about 0.5.Eventually,
a modelLogistic
heteroskedastic models have been consideredby McCullagh (1980)
andDerquenne
(1995).
Formulae aregiven
inAppendix
1 to deal with this distribution.When the Simmental data are
analyzed
with thelogistic
(table VI),
thedata. Wald’s and deviance statistics were in very
good
agreement
in thatrespect,
with P values of 0.08 and 0.16
respectively,
for the SA interaction. It should be observed that this heteroskedastic model has even fewerparameters
(16)
than thetwo-way
S-TM consideredinitially
(19 parameters).
Inspite
ofthis,
the Pearson’schi-square
(and
also thedeviance)
was reduced from about 419(table
II)
to 32(table V)
with a P-value of 0.04. This fit is remarkable for thislarge
data set(N
=363859),
where one wouldexpect
many models to berejected.
Although
the H-TM may havecaptured
some extra hidden variation due toignoring
randomeffects,
it isunlikely
that the poor fit of the S-TM can be attributedsolely
to theoverdispersion phenomenon resulting
fromignoring genetic
and otherclustering
effects. Thelarge
value of the ratio of the observedX
2
to itsexpected
value
(419/17
=25)
suggests that
thedependency
of theprobabilities
77,!
withrespect
to sex of calf and age of dam is not describedproperly by
a modelwith
constant variance. Whether the poor fit of the S-TM is the result ofignoring
randomeffects,
heterogeneous variance,
orboth, require
furtherstudy, perhaps
simulation.These
results
suggest
that in beef cattlebreeding
thegoodness
of fit of a constantvariance threshold model for
calving
ease can beimproved
by
incorporating
scaleeffects for age of dam either as discrete
classes,
as in thisstudy,
oralternatively
Qias a
polynomial
regression
oflog
ai on age.DISCUSSION
Other distributional
assumptions
The
underlying
distribution wassupposed
to be normal which is a standardassumption
of threshold models in agenetic
context(Gianola, 1982).
However,
other
distributions
might
have been considered formodeling
77!
in!3!.
A classicalchoice,
especially
inepidemiology,
would be thelogistic
distribution with mean 77iprobit.
Interestingly,
there is not much difference between thecomplete
(S+A+SA)
and the additive model(S
+A),
the interaction(SA)
being non-significant
(P
=0.30). Taking
into account the variation in variance in addition to thatexplained
by
the additive model on locationparameters
does notimprove
the fitgreatly.
In thatrespect,
the main source of variation turned out to be sex rather than age ofdam.
Other
options
include the t-distribution(Albert
andChib,
1993),
theEdgeworth
series distribution(Prentice, 1976)
and other non-normal classes of distribution functions(Singh, 1987).
Infact,
the t distributiontv(!2,
s
2
)
withspread
parameter
8
2
and vdegrees
of freedom is themarginal
distribution of a mixture of a normaldistribution
!V(!,<7?),
withQ2
randomly
varying
according
to a scaled invertedX
2
(v,s
2
)
distribution (Zellner,
1976).
Therefore,
a threshold model based on such a t distribution is embedded in ourprocedure by taking
a one-way random model forIncr?,
ie,
ln<r?
=In
8
2
+ ai with thedensity
functionp(a
i
lv)
of the randomvariable ai as
presented
inFoulley
and(auaas
(1995,
formulae 21 and22);
see alsoGianola and Sorensen
(1996)
for aspecific study
of the threshold model based onthe t-distribution in animal
breeding.
Relationships
with variable thresholdsConceptually, heterogeneity
of thea§s
is viewed here in the same way as inGaussian linear models since it
applies
to anunderlying
random variable that isnormally
distributed.
However,
theunderlying
variables are notobservable,
andthe
corresponding
real line includes cutoffpoints,
thethresholds,
that make theoutcomes discrete. It is of interest to address the
question
of howchanges
indispersion
can beinterpreted
withrespect
to the thresholdconcept.
Let us illustrate this
by
asimple example involving
J = 3categories,
and aone-way classification model
(i
=1, 2, ... , I)
as, for
example,
sex of calf in the Simmentalbreed. We will assume that the
origin
is at the firstthreshold,
and that the unit of measurement is the standard deviation within males(
QM
= 1).
The differencewhere
II
M1
,
lI
M2
are theprobabilities of response
in the first and secondcategories,
respectively,
for male calves. A similarexpression
isobtained
for female calves(F),
so
This is
precisely
the ratio of the difference between thresholds 1 and 2 that would be obtained when evaluatedseparately
in each sex. ThusFormulae
[19]
and[20]
areanalogous
toexpressions given by
Wright
(1934b) (the
reciprocal
of the distance between the thresholds on this scalegives
the standarddeviation on the
postulated
scale on which thethresholds
areseparated by
a unitdistance,
p545)
and Falconer(1989,
formula18.5,
p307)
except
that these authorsset to
unity
the difference between thresholds in the baselinepopulation,
rather than the standarddeviation,
which we find moreappealing
conceptually.
In the case of the Simmental data shown in
table
I,
applying
formulae[19]
and[20]
using
observedfrequencies
of responsesgives:
If more than three
categories
areobserved,
this formula also holds for thedifferences T3 - 72, T4 - 73,..., - -17JTJ-2, so that the ratio between standard deviations in
subpopulation
(i)
and a referencepopulation
(R)
can beexpressed
as:which involves
(J - 2)
algebraically
independent
equalities.
In the case of three
categories
and asingle
classification,
the saturated modelhas 21
parameters
(
T2
-
T
i ,
, 1J.L U2-. - ;/!7)<!2/<7’i) - - -
ai!a1,
... ,arla1).
Numericalvalues
of a
i!
a
1
computed
from[20]
are also ML estimates(eg, â
2/
¡
h
=0.973).
Formula
[21]
indicates that there is a link between H-TM and models withvariable thresholds
(Terza, 1985).
Ascompared
tothese,
the main features of the H-TM are:i)
amultiplicative
model on ratios of standard deviations or differences betweenthresholds,
rather than a linear model on suchdifferences;
ii)
a lower dimensionalparameterization
due to theproportionality
assumption
made in
[18]
rather
than acategory-specific
parameterization,
ie:where
6 j
is the vector of unknownspertaining
to the difference(T! -
Ti
).
For J =
3,
the two modelsgenerate
the same number ofparameters
butthey
areFurther extensions
The H-TM opens new
perspectives
for theanalysis
of ordinal responses.Interesting
extensions may include:
i) implementing
other inferenceapproaches
for mixed models such as Gilmour’sprocedure
based onquasi-likelihood,
or afully Bayesian analysis
ofparameters
using
Monte-Carlo Markov-Chain methodsalong
the lines of Sorensen et al(1995);
ii)
assessing
thepotential
increase in response to selectionby selecting
onestimated
breeding
values calculated from an H-TM versus anS-TM;
iii)
incorporating
a mixed linear model onlog-variances
as described in San Cristobal et al(1993)
andFoulley
andQuaas
(1995)
for Gaussianobservations;
iv)
carrying
out ajoint
analysis
of continuous and orderedpolychotomous
traits
asalready proposed
for the S-TMby Foulley
et al(1983),
Janss andFoulley (1993)
and Hoeschele et al(1995).
Further research is also needed at the theoretical level to look at the
sampling
properties
of estimators based onmis-specified
models. Forinstance,
one may beinterested
in theasymptotic
properties
of the ML estimator of(
T
’, 0’,
6’l’
derived under theassumption
ofindependence
of they
i
,’s
when thishypothesis
does not hold. Thisproblem
has been discussed ingeneral by
White(1982),
and it may beconjectured
from the results ofLiang
et al(1992)
that the ML estimators ofthese
parameters
remain consistent. Itmight
also be worthwhile to assess the effect ofdepartures
fromindependence
ontesting
procedures.
Thegeneralized
chi-square
procedure
forgoodness
of fit derivedby
McLaren et al(1994)
might
be useful inthat
respect
foranalyses
based onlarge samples.
ACKNOWLEDGMENTS
Part of this research was conducted while JL
Foulley
was aVisiting
Professor at theUniversity
of Wisconsin-Madison. Hegreatly acknowledges
the support of this institution and of the Direction desproductions
animales and Direction des relationsinternationales,
INRA. Themanuscript
islargely
drawn from invited lecturesgiven by
the senior authorat the Midwestern Animal Science
Meeting
(Des
Moines, Iowa,
April
121995),
theUniversity
of Wisconsin-Madison(12
April
1995),
CornellUniversity
(15
May
1995)
and theUniversity
of Illinois(28
August
1995).
Special
thanks areexpressed
to all those who contributedsubstantially
to the discussionsgenerated thereby,
and also to B Klei and BCunningham
from the US Simmental Association forproviding
the data set oncalving
ease scoresanalyzed
in thisstudy.
The H-TM has beenapplied
tocalving performance
of other cattle breeds(US
Holstein,
French Blonded’Aquitaine,
Charolais, Limousin,
MaineAnjou).
It has also beenapplied
toprolificacy
records insheep.
We aregrateful
to JBerger
(Iowa
StateUniversity,
Ames),
FMénissier,
JSapa
(INRA
Genetics,
Jouy)
and JPPoivey
(INRA
Genetics,
Toulouse)
forproviding
thecorresponding
data sets.REFERENCES
Albert
JH,
Chib S(1993)
Bayesian analysis
ofbinary
andpolychotomous
response data.J Am Stat Assoc
88,
669-679Brillinger
DR(1985)
What doseismology
andneurophysiology
have in common ?Bryant WK,
Gerner(1982)
The demand for service contracts. J Business55,
345-366 BulmerMG,
Bull JJ(1982)
Models ofpolygenic
sex determination and sex ratio control.Evolution
36,
13-26Collet
(1991)
Modelling Binary
Data.Chapman
andHall,
LondonCurnow
R,
Smith C(1975)
Multifactorial models for familial diseases in man. J R StatSoc A
138,
131-169Dempster ER,
Lerner IM(1950)
Heritability
of threshold characters. Genetics35,
212-236 Denis JE Jr, Schnabel RB(1983)
NumericalMethods for
Unconstrained optimization andNon-Linear
Equations.
Prentice-HallInc, Englewood
CliffsDerquenne
(1995)
Heteroskedasticlogit
models. 50th Session of the International Statis-tical Institute(ISI), Bejing, China,
21-29August
Edwards
AL,
Thurstone LL(1952)
An internalconsistency
check for scale values deter-minedby
the method of successive intervals.Psychometrika 17,
169-180Engel B,
BuistW,
Visscher A(1995)
Inference for the threshold models with variancecomponents from the
generalized
linear mixed modelperspective.
Genet Sel Evol 27,15-22
Falconer DS
(1960)
An, Introduction toQuantitative
Genetics. 1stedition,
Oliver andBoyd, London
Falconer DS
(1965)
The inheritance ofliability
to certain diseases estimated from the incidence among relatives. Ann Hum Genet29, 51-76
Falconer DS
(1989)
An Introduction toQuantitative
Genetics. 3rdedition, Longman
Scientific and
Technical,
HarlowFahrmeir
L,
Thtz G(1994)
Multivariate StatisticalModelling
Based on Generalized LinearModels.
Springer-Verlag,
New YorkFechner GT
(1860)
Elemente derPsychophysik,
2 vols: Vol 1 translated(1966).
In:Elements
of Psychophysics
(DH
Howes,
EGBoring,
eds),
Holt,
Rinehart andWinston,
New York
Foulley
JL(1987)
M6thodes d’evaluation desreproducteurs
pour des caract6res discrets a d6terminismepolygénique
en selection animale. PhDthesis, University
ofParis-Sud-Orsay
Foulley JL,
GianolaD,
Thompson
R(1983)
Prediction ofgenetic
merit from data oncategorical
andquantitative
variates with anapplication
tocalving
difficulty,
birthweight
andpelvic opening.
Genet SelEvol 15,
401-424Foulley JL,
GianolaD,
San CristobalM,
Im S(1990)
A method forassessing
extent andsources of
heterogeneity
of residual variances in mixed linear models. JDairy
Sci73,
1612-1624Foulley JL,
San CristobalM,
GianolaD,
Im S(1992)
Marginal
likelihood andBayesian
approaches
to theanalysis
ofheterogeneous
residual variances in mixed linear Gaussian models.Comput
Stat Data Anal13,
291-305Foulley JL, Quaas
RL(1995)
Heterogeneous
variances in Gaussian linear mixed models. Genet SelEvol 27,
211-228Galton F
(1889)
Natural Inheritance.Macmillan,
LondonGianola D
(1982)
Theory
andanalysis
of threshold characters. J Anim Sci 54, 1079-1096 GianolaD,
Foulley
JL(1983)
Sire evaluation for orderedcategorical
data with a thresholdmodel. Genet Sel
Evol 15,
201-224Gianola
D,
Sorensen D(1996)
Abstract to theEAAP, Lillehammer, Norway,
25-29August
Gilmour
A,
AndersonRD,
Rae A(1987)
Variance components on anunderlying
scale forordered
multiple
thresholdcategorical
datausing
ageneralized
linear model. J Anim Breed Genet104,
149-155Grosbras JM
(1987)
Les données manquantes. In: Lesso!edages
(JJ
Droesbeke,
BFichet,
Harville
DA,
Mee RW(1984)
A mixed modelprocedure
foranalyzing
orderedcategorical
data. Biometrics 40, 393-408Hoschele
I,
GianolaD, Foulley
JL(1987)
Estimation of variance components withquasi-continuous data
using Bayesian
methods. J Anim Breed Genet104,
334-349H6schele
I,
TierB,
Graser HU(1995)
Multiple-trait genetic
evaluation for onepolychoto-mous trait and several continuous traits with
missing
data andunequal
models. J Anim Sci73,
1609-1627Janss
L,
Foulley
JL(1993)
Bivariateanalysis
for one continuous and one thresholddichotomous trait with
unequal design
matrices and anapplication
to birthweight
andcalving difficulty.
Livest Prod Sci 33, 183-198Kaplan RS,
Urwitz G(1979)
Statistical models of bondratings:
amethodological inquiry.
J Business
52,
231-261Levy
F(1980)
Changes
inemployment
prospects for black males.Brooking Papers 2,
513-538
Liang KY, Zeger SL,
Qaqish
B(1992)
Multivariateregression analysis
forcategorical
data. J R Stat Soc B54,
3-40Maddala GS
(1983)
LimitedDependent
andQualitative
Variables in Econometrics.Cam-bridge University Press,
New YorkMcCullagh
P(1980) Regression
models for ordinal data. J R Stat SocB42,
109-142McCullagh
P,
Nelder J(1989)
Generalized Linear Models. 2nded, Chapman
andHall,
London
McKelvey R,
Zavoina W(1975)
A statistical model for theanalysis
of ordinal leveldependent
variables. J Math Soc4,
103-120McLaren
CE,
LeclerJM,
Brittenham GM(1994)
Thegeneralized
chi squaregoodness-of-fit
test. Statistician
43,
247-258Misztal
I,
GianolaD,
H6schele I(1988)
Threshold model withheterogeneous
residual variance due tomissing
information. Genet Sel Evol20,
511-516Natarajan
R(1995)
Estimation in Models for MultinomialResponse
Data:Bayesian
andFrequentist Approaches.
PhDthesis,
CornellUniversity,
IthacaPearson K
(1900)
Mathematical contributions to thetheory
of evolution. VIII. On the inheritance of characters notcapable
of exactquantitative
measurement. Phil TransRoy
Soc London A195,79
Prentice RL
(1976)
Generalisation ofprobit
andlogit
models. Biometrika32,
761-768Quaas RL,
ZhaoY,
Pollak EJ(1988)
Describing
interactions indystocia
scores with athreshold model. J Anim Sci
66,
396-399Read
I,
Cressie N(1988)
Goodness-of-Fit
Statisticsfor
Discrete Multivariate Data.Springer-Verlag,
New YorkRobertson A
(1950)
Proof that the additiveheritability
on the p scale isgiven by
theexpression
z2 h!/fj(¡.
Genetics35,
234-236Roff DA
(1994)
The evolution ofdimorphic
traits:predicting
thegenetic
correlation between environments. Genetics136,
395-401San Cristobal
M, Foulley JL,
Manfredi E(1993)
Inference aboutmultiplicative
het-eroskedastic components of variance in a mixed linear Gaussian model with anap-plication
to beef cattlebreeding.
Genet Sel Evol 25, 3-30Singh
M(1987)
A non-normal class of distribution for dosebinary
response curve. JA
PP
I
Stat14,
91-97Sorensen
DA,
AndersenS,
GianolaD, Korsgaard
I(1995)
Bayesian
inference in threshold modelsusing
Gibbssampling.
Genet SelEvol 27,
229-249Terza JV
(1985)
Ordinalprobit:
ageneralization.
Commun Stat Theor Meth14,
1-11 1White H
(1982)
Maximum likelihood estimationof misspecified
models. Econometrica50,
1-25Wright
S(1934a)
Ananalysis
ofvariability
in number ofdigits
in an inbred strain ofguinea pigs.
Genetics 19, 506-536Wright
S(1934b)
The results of crosses between inbred strains ofguinea pigs differing
innumber of
digits.
Genetics19,
537-551Zellner A
(1976)
Bayesian
andnon-Bayesian analysis
of theregression
model with multivariate Student-t error terms. J Am Stat Assoc71,
400-405APPENDIX 1
Expressions
for the score function U and the information matrix JConcerning
derivatives withrespect
to the vector(3
of locationparameters,
onehas:
Finally,
U can beexpressed
as:with
expressions
for£&dquo;
vp and v8given
in(A.l!, [A.2]
and(A.3!.
Elements of the information matrix
J(a)
include theexpectations
of minus the second derivatives. Thefollowing
derivatives will be considered:threshold-threshold; (3-threshold-threshold;
6-threshold; 13 - 13; (3-6;
and S - b.(3-Threshold
derivativesThe
expectation
of the first term vanishes becauseE(yz!)
=nj
1
Ijj.
where T =
{t!!} is
a(J - 1)
x(J -
1) symmetric
band matrixhaving
as elements:t
ij
=E(-å
2
L/årJ),
andt
j
,
j+1
=’
)
j+1
j
år
L/år
2
E(-å
given
in[A.4]
and(A.5!.
These
expressions
can be extended toobtain the
MAP ofparameters
in a mixed-model structureby replacing
(i)
13 by
e =((3!,
u#’)’
withu# r-
N(0,
P
Z
A)
(p
2
=U
2
i
/
0
,?
=constant);
(ii)
X by
S =(
Sl
S2 .... , Si,...,)’
S
I
withs!
=(x!, (Jiz
D
;
and
making
theappropriate
adjustments
forprior
information as shown below:APPENDIX 2