G. DE S OETE
On the relation between two generalized cases of Thurstone’s law of comparative judgment
Mathématiques et sciences humaines, tome 81 (1983), p. 47-57
<http://www.numdam.org/item?id=MSH_1983__81__47_0>
© Centre d’analyse et de mathématiques sociales de l’EHESS, 1983, tous droits réservés.
L’accès aux archives de la revue « Mathématiques et sciences humaines » (http://
msh.revues.org/) implique l’accord avec les conditions générales d’utilisation (http://www.
numdam.org/conditions). Toute utilisation commerciale ou impression systématique est consti- tutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
ON THE RELATION BETWEEN TWO GENERALIZED CASES OF
THURSTONE’S
LAW OF COMPARATIVE JUDGMENT*
G. DE SOETE
Choice behavior
is of centralinterest
to many socialscientists.
Theobservation
thatchoice behavior
is ofteninconsistent
has ledpsychologists
to
develop probabilistic
models torepresent pairwise choice
data. These models allow one toderive
frompaired comparison
data numerical estimates of theutilities
of thechoice objects involved.
It should be noticed thatthese
psychological
models may be very usefulin
many other socialscience
disciplines, ranging
frommarketing
topolitical science.
I.
THURSTONE’S
LAW OF COMPARATIVE JUDGMENTOne of the most
popular probabilistic
models forrepresenting pairwise
choice data is
Thurstone’s (1927)
Law ofComparative Judgment (LCJ).
Accor-ding
toThurstone,
theutility
of a choiceobject
is not constant butfluctuates from time to time.
Therefore,
in his model theutility
of astimulus i
(i = 1, n)
isrepresented
as a random variableU..
When presen-1
ted a
pair
ofstimuli,
thesubject
issupposed
to choose the stimuluswith
thehighest
momentaryutility. Consequently,
theprobability
ofpreferring
*"Aspirant"
of theBelgian "Nationaal
Fonds voorWetenschappelijk Onderzoek"
at the
Department
ofPsychology, University
ofGhent, Belgium.
The author is
indebted
toYoshio
Takane for his critical comments on anearlier
draft ofthis
paper.i
to j (i ~ j ) , written
asp ii,
isAssuming
that has amultivariate
normaldistributioni
equation
1 becomeswhere
denotes the standard normal distribution function and6..
the so-ij called
comparatal dispersion (Gulliksen, 1958), i.e.,
where
e(i.) is
a n-element column vector whosek’th component
isdefined by
J
6 ik - ajk with u
Kroneckerdelta).
Equation
2 constitutesThurstone’s complète
LCJ. The modelincorpora-
tes n +
n(n+l)/2 parameters. However,
any transformation of the formwhere a is a
positive scalar, c
anarbitrary n-component
vector, and 1 avector of n ones, leaves the choice
probabilities
invariant.Therefore,
theeffective
number ofparameters (or
thedegrees
of freedom for themodel)
reduces to
n(n+l)/2 -
2. Since this number still exceeds the number of ob- served choiceproportions En(n-l)/21,
restrictions need to beimposed
on theparameters
in order for the model to be of anypractical
use. The most fre-quently
used restriction inempirical
researchrequires
to be constant1J
for all
(i,j).
This is for instance the case when E is of the formwhere a is a
positive scalar, c
anarbitrary n-component
vector, and I an1Strictly speaking,
all that is needed to arrive atequation
2 ismarginal
univariate
normality
ofU. However, assuming multivariate normality
istheoretically
more attractive.identity
matrix of order n. When this constraint isimposed
on(2),
the mo-del is referred to as the LCJ Case V. Without loss of
generality,
Case Vcan be written as
The effective number of
parameters
in this model amounts to n - 1.Equation
3 states that the choiceprobabilities
are astrictly
in-creasing
function of the difference between the mean utilities of the twoalternatives.
Experimental
research(Becker, DeGroot, and Marschak, 1963;
Krantz, 1967;
Rumelhart andGreeno, 1971; Tversky
andRusso, 1969; Tversky
and
Sattath, 1979) suggests, however,
thatempirical
choiceproportions
areinfluenced not
only by
differences inutility,
but also - to some extent -by
thesimilarity
orcomparability
of the stimuli.Subjects
tend to berather indifferent between very dissimilar
stimuli,
even if the stimuli dif- fer a lot inutility.
Similarstimuli,
on the otherhand,
tend togive
riseto extreme choice
proportions,
even whenthey
do not differ that much inutility.
In a series of studies conductedby Sjdberg (1975, 1977, 1980;
Sjbberg
andCapozza, 1975),
it was found that the discriminaldispersion
in(2)
is related to thedissimilarity
between thé alternatives.Consequently, constraining
the discriminaldispersions
to beequal
to each other - as inCase V - is from a
psychological point
of view not very realistic. There-fore,
researchers became interested indeveloping
Thurstonian models which do allow thecomparatal dispersions
to reflect thedissimilarity
betweenthe stimuli.
Recently,
two such models have beenproposed.
The first one iscalled
by
Takane(1980)
the factorial model ofcomparative judgment,
whilethe second one is known as the
wandering
vector model(Carroll, 1980).
Be-cause these cases of the LCJ differ from the five cases discussed
by
Thur-stone
(1927), they
are calledgeneralized
cases. Thesegeneralized
casesattempt
to extract from asingle paired comparisons
matrix information notonly
about the mean utilities of thealternatives,
but also about the simi-larities
between the choiceobjects.
In the next sections we discuss thetwo models as well as how
they
relate to each other. In the finalsection,
both models are
applied
forcomparative
purposes on a data setobtained by
Rumelhart and Greeno
(1971).
II. THE FACTORIAL MODEL OF COMPARATIVE JUDGMENT
The factorial model of
comparative judgment
has beenproposed by
Takane(1980)
andby
Heiser and de Leeuw(1981).
In the factorialmodel,
the co-variance
matrix
Z isrequired
to have aprescribed
rank r(r n).
Since Eis
positive semi-definite,
it can bedecornposed
as followswhere X is a n x r matrix.
Equation
4 is aspecial
case of ageneral
modelfor
constraining
E discussedby
Arbuckle andNugent (1973,
eq.23).
Given thisrestriction,
thecomparative
varianceij
1J becomeswhere the column vector
x.
stands for thei’th
row of X.Consequently,
the-i
comparatal dispersions
may beinterpreted
as Euclidean distances in a r-di- mensional space. In this space each choiceobject
i isrepresented
as apoint x..
This space can bethought
of as ageometric representation
of the-i
subject’s cognitive
mapwhich presumably
influenced his or herpreference judgments.
The
parameters
to be estimated in the factorial model are u and X.The choice
probabilities
p..
are invariant under anytransformation
of the 1Jform
where a is a
positive scalar,
T anyorthogonal
matrix of order r,and c
anarbitrary r-component
vector. The effective number ofparameters
in the factorial model is n + nr -r(r+l)/2 -
2(and
not nr -r(r+l)/2 -
1 as Ta-kane
(1980,
p.194) incorrectly states2).
An
algorithm
forobtaining
maximum likelihood estimatesof y
and Xhas been
presented by
Takane(1980),
while Heiser and de Leeuw(1981)
dis-cuss how to fit the factorial model
by
least squares methodsusing
theSMACOF
approach (cf.
de Leeuw andHeiser, 1980).
III. THE WANDERING VECTOR MODEL
The
wandering
vectormodel, originally
introducedby
Carroll(1980),
attempts
ageometric representation
of both thesubject
and the choice ob-jects
in ajoint
r-dimensional space. Eachobject
i isrepresented
as apoint x.,
-i while thesubject
isrepresented
as a vectororiginating
from theorigin. According
to themodel,
each time thesubject
ispresented
apair
ofstimuli he or she chooses the stimulus with the
(algebraically) largest
or-thogonal projection
on thesubject
vector. Thewandering
vector model does not assume that theposition
of thesubject
vector is constant, but rather that itchanges ("wanders")
somewhat from time to time.Formally,
it is as-sumed that
y,
3 the terminus of thesubject
vector, follows a multivariate normal distribution. Without loss ofgenerality
the variance-covariance ma-trix of this distribution can be set
equal
to theidentity matrix, i.e.,
When
presented
apair
ofstimuli (i,j),
thesubject
issupposed
tosample
a
y
from this distribution. Thesubject
then chooses i whenever the ortho-gonal projection
ofx.
-1on y
L exceeds theorthogonal projection
ofx.
-J on y, L orwhenever
2The degrees
of freedomgiven
in Table 2 of Takane(1980)
are however cor- rect.Consequently,
theprobability
ofpreferring
ito j
isSince it follows from
(5)
that isnormally
distributed with mean(x. -
-ix.)’v
-1 - and varianced2,(X), if equation (6)
becomes1J
These choice
probabilities
are not affectedby
any transformation of X of the formwhere a is a
positive scalar,
T anyorthogonal
matrix of order r, and c anarbitrary r-component
vector.Consequently,
the number ofdegrees
of freedom for thewandering
vector model is nr -r(r-l)/2 -
1. For severalinteresting
extensions of the model as well as for a maximum likelihood estimation pro-
cedure,
the reader is referred to De Soete and Carroll(1982).
IV. THE RELATION BETWEEN THE TWO MODELS
Heiser and de Leeuw
(1981,
p.50)
assert that the factorial model of compa- rativejudgment
and thewandering
vector model areessentially equivalent.
This
is, however,
notcompletely
true. From thedescription
of bothmodels,
it should be clear that the
wandering
vector model is aspecial
case of thefactorial
model. Morespecifically,
if we constrain the meanutilities p
in the factorial model to be a linear function of the stimulus coordinates
X,
we obtain thewandering
vector model. Or in otherwords,
thewandering
vector model is
only equivalent
to the factorial model if weimpose
on thelatter the
constraint
If this constraint is consistent with the
data,
thewandering
vectormodel should be
preferred
to the factorial model since it involves n - r - 1 lessparameters.
Animportant advantage
of thewandering
vector model isthat the stimulus
configuration
conveys information about the mean utilities of the choiceobjects.
V. ILLUSTRATIVE APPLICATION
Rumelhart and Greeno
(1971)
obtainedpreference judgments
from 234undergra-
duates about nine
celebrities.
These celebrities consisted of threepoliti-
cians
(L.B.
Johnson(LJ),
Harold Wilson(HW),
Charles De Gaulle(CD)),
threeathletes
(Johnny
Unitas(JU),
Carl Yastrzemski(CY),
A.J.Foyt (AF)),
andthree*movie stars
(Brigitte
Bardot(BB),
ElizabethTaylor (ET), Sophia
Loren(SL)).
Thesubjects
were treated asreplications
of each other. Takane(1980)
fitted the LCJ Case V as well as the two-dimensional factorial model ofcomparative judgment
to these datausing
maximumlikelihood estimation.
In order to compare the two
generalized
cases of the LCJ with eachother,
the same data were
analyzed according
to thewandering
vector model in twodimensions
using
the maximum likelihood estimationprocedure
describedby
De Soete and Carroll
(1982).
Table 1.
Summar
of theanalyses
on the Rumelhart and Greeno(1971)
data---
Obtained by
Takane. These values are notexactly
identical to those repor- ted in Table 2 of Takane(1980)
due to an error in the data on which the latter table is based.*
p
0.001Table 1
summarizes
the results of theanalyses.
The null model refersto a
completely
saturated model in which no structural constraints are im-posed
on thep...
The three other models arespecial
cases of this null mo-1J del.
Whenever two models are
hierarchically
related to eachother,
it ispossible
to devise a likelihood ratio test. Morespecifically,
suppose that model w is subsumed under model Q and that the maximum of the likelihoodfunction obtained under both models is written
respectively
asL~
andL~,
then under the null
hypothesis
that w fits the dataequally
well asQ,
thestatistic
is
asymptotically
distributed as achi-square
withm - mw degrees
of free-dom,
wherem~
and~
denote thedegrees
of freedom forrespectively
modeland Another statistic that is useful for
comparing
thegoodness-of-fit
ofdifferent
models;
isAkaike’s (1977)
AIC which is defined for model H asThis statistic has the
advantage
ofexplicitly correcting
for thegain
ingoodness-of-fit
due to an increased number ofparameters.
The smaller theAIC,
the better the fit between the data and the model.As can be inferred from the tests
against
the null modelgiven
inTable
1,
both the factorial model and thewandering
vector modelgive
agood
account of thedata,
while the LCJ Case V should berejected.
Sincethe
wandering
vector model is subsumed under the factorialmodel,
the twocan be
compared
with each otherby
means of a likelihood ratio test. The re-levant
chi-square
statistic has 6degrees
of freedom and amounts to 8.6 which is notsignificant.
Thissuggests
that thewandering
vector modelfits the data
equally
well as the factorial model which has however six pa-rameters more.
Consequently,
we can conclude that(of
the models consideredhere)
thewandering
vector modelgives
the mostparsimonious representation
of the Rumelhart and Greeno
(1971)
data. The fact that the smallest AIC is obtained for thewandering
vector model further confirms this conclusion.A
plot
of theconfiguration
obtained with thewandering
vector modelis
presented
inFigure
1. In thisplot
the arrowpoints
in the direction ofthe
subject
vector. The three stimulus clusters canclearly
bedistinguis-
hed. _
Figure
1.Configuration
obtained for the Rumelhart and Greeno(1971)
data with the
wandering
vector model in twodimensions
VI. RECAPITULATION
In this paper we discussed the relation between two
generalized
cases ofThurstone’s
Law ofComparative Judgment, namely
the factorial model of com-parative judgment proposed by
Takane(1980)
and Heiser and de Leeuw(1981)
and the
wandering
vector modelsuggested by
Carroll(1980).
It was shownthat the
wandering
vector model is aspecial
case of the factorial model.Furthermore we
argued
that whenever the two models fit the dataequally
well(as
is the case with the Rumelhart and Greenodata)
thewandering
vectormodel should be
preferred
because itgives
a moreparsimonious
account ofthe data.
BIBLIOGRAPHY
(1)
AKAIKEH., "On entropy
maximizationprinciple",
In: Krishnaiah P.R.(ed.), Applications
ofstatistics, Amsterdam,
North-Holland Pu-blishing Company,
1977.(2)
ARBUCKLE J. and NUGENTJ.H., "A general procedure
forparameter
esti- mation for the law ofcomparative judgment",
Brit. J. of Math.and Stat.
Psych.,
26(1973), 240-260.
(3)
BECKERG.M.,
DEGROOT M.H. and MARSCHAKJ., "Probabilities
of choice among very similarobjects",
Behav.Sci.,
8(1963), 306-311.
(4)
CARROLLJ.D., "Models
and methods for multidimensionalanalysis
ofpreferential
choice(or
otherdominance) data",
In: Lantermann E.D. andFeger
H.(eds.), Similarity
andchoice, Bern,
HuberVerlag,
1980.(5)
DE LEEUW J. and HEISERW., "Multidimensional scaling
with restrictionson the
configuration",
In: Krishnaiah P.R.(ed.),
Multivariateanalysis V, Amsterdam,
North-HollandPublishing Company,
1980.(6)
DE SOETE G. and CARROLLJ.D., "A
maximum likelihood estimation method forfitting
thewandering
vectormodel", Paper presented
at theannual
meeting
of thePsychometric Society, Montréal, Canada,
1982.(7)
GULLIKSENH., "Comparatal dispersion,
a measure of accuracy ofjudg-
ment", Psychometrika,
23(1958),
137-150.(8)
HEISER W.J. and DELEEUW, J., "Multidimensional mapping
ofpreference data",
Math. Sci.hum.,
19(1981),
39-96.(9)
KRANTZD.H., "Rational
distance function for multidimensionalscaling"
J. of Math.
Psych.,
4(1967),
226-245.(10)
RUMELHART D.L. and GREENOJ.G., "Similarity
between stimuli: an expe-rimental test of the Luce-and Restle choice
models",
J. of Math.Psych.,
8(1971),
370-381.(11)
SJOBERGL., "Uncertainty
ofcomparative judgment
and multidimensionalstructure",
Multiv. Behav.Res.,
11(1975),
207-218.(12)
SJOBERGL., "Choice frequency
andsimilarity",
Scandinavian J. ofPsych.,
18(1977),
103-115.(13)
SJOBERGL., "Similarity
andcorrelation",
In: Lantermann E.D. andFeger
H.(eds.), Similarity and choice, Bern,
HuberVerlag,
1980.(14)
SJOBERG L. and CAPOZZAD., "Preference
andcognitive
structure ofItalian
political parties", Italian J.
ofPsych.,
2(1975),
391-402.
(15)
TAKANEY., "Maximum
likelihood estimation in thegeneralized
case ofThurstone’s
model ofcomparative judgment", Japanese Psych. Res.,
22
(1980),
188-196.(16)
THURSTONEL.L., "A
law ofcomparative judgment", Psych. Rev.,
34(1927),
273-286.(17)
TVERSKY A. and RUSSOJ.E., "Substitutability
andsimilarity
inbinary choice",
J. of Math.Psych.,
6(1969),
1-12.(18)
TVERSKY A. and SATTATHS., "Preference trees", Psych. Rev.,
86(1979),
542-573.