• Aucun résultat trouvé

An overview of the Normal Ogive Harmonic Analysis Robust Method (NOHARM) approach to item response

N/A
N/A
Protected

Academic year: 2022

Partager "An overview of the Normal Ogive Harmonic Analysis Robust Method (NOHARM) approach to item response"

Copied!
8
0
0

Texte intégral

(1)

An overview of the Normal Ogive Harmonic Analysis Robust Method (NOHARM) approach to item response

theory

James J. Lee

a,

and Minji K. Lee

b

aUniversity of Minnesota Twin Cities bMayo Clinic

Abstract Here we provide a description of the IRT estimation method known as Normal Ogive Harmonic Analysis Ro- bust Method (NOHARM). Although in some ways this method has been superseded by new computer programs that also adopt a specifically factor-analytic approach, its fundamental principles remain useful in certain applications, which in- clude calculating the residual covariance matrix and rescaling the distribution of the common factor (latent trait). These principles can be applied to parameter estimates obtained by any method.

Keywords factor analysis; item response theory; psychometrics.

[email protected]

Introduction

Here we provide a description of the item response the- ory (IRT) estimation method known as Normal Ogive Har- monic Analysis Robust Method (NOHARM), which was im- plemented in a pioneering computer program developed by Fraser and McDonald (1988).

For some time NOHARM was the tool of choice for fit- ting multidimensional IRT models. To our knowledge, no other program until recently permitted the specification of a confirmatory multidimensional model with correlated factors or provided the residual covariance matrix in its output. Programs released within just the last few years—

including IRTPRO (Cai, Du Toit, & Thissen,2011) and the mirtpackage for the Rcomputing platform (Chalmers, 2012)—now supply at least the first of these two desider- ata, but they tend to be more computationally burdensome than NOHARM. Therefore it is important to weigh the rela- tive advantages of the tools that are now available.

First, NOHARM does not allow the pseudo-guessing parameter—the probability of passing an item by a very low-ability examinee—to be estimated. In practice, a poor choice of a fixed value can lead to what factor analysts call a Heywood case (an estimated nonpositive residual variance). In most applications this theoretical problem is of little consequence, but it can be somewhat trouble- some in analyses of differential item functioning. Second, whereas NOHARM only uses factor-analytic limited infor- mation (item pass rates and covariances) to estimate IRT parameters, many other programs use full information (the

frequencies with which the possible zero-one strings of item responses occur in the data) (Bolt,2005). In principle, this raises the possibility that NOHARM’s estimates are sta- tistically inefficient. NOHARM has performed well, how- ever, in simulation studies of cases where it can be com- pared with other approaches. Moreover, there are useful objects in certain empirical analyses that continue to be most easily calculated using the principles of NOHARM, even if the parameter estimates have been obtained by some other method. In summary, whereas the latest gen- eration of IRT programs is to be recommended, NOHARM still appears to have a place in the analysis of datasets con- taining hundreds of items (e.g., Flanagan et al.,1962). Such analyses promise to contribute much to differential psy- chology, and we anticipate that a greater recognition of the link between factor analysis and IRT will motivate such re- search.

Many critical pieces of information regarding NO- HARM are not easy to obtain, as they are spread across arti- cles, book chapters, and manuals published over a span of decades (McDonald,1967,1982,1985,1997,1999; Maydeu- Olivares,2001; McDonald & Fraser,2003). Because the el- egant principles of NOHARM continue to be enlightening and useful, we will try to produce a largely self-contained account of these principles that corrects the typographical errors that have unfortunately accrued in the relevant liter- ature.

TheQuantitativeMethods forPsychology

1

(2)

Function Approximation with Orthonormal Polynomials The basic principle of NOHARM is the approximation of theitem characteristic curve(ICC)—the function giving the probability of the correct response as a function of the com- mon factor(s)θ—with a polynomial that is convenient for computational purposes. It turns out that a linear combi- nation of orthogonal polynomials serves this purpose; for an elementary account of such polynomials, including reg- ularity conditions that we omit, the reader may consult the short monograph by Jackson (1941).

The sequence (Pk)k=0ofkth-degreeorthonormal poly- nomials, with respect to the interval (a,b) and the density functiong(x), is defined by

Zb a

Pm(x)Pn(x)g(x)d x=δm,n, (1) whereδm,nis the Kronecker delta.

The polynomialsPk(x) can be used for the formal ex- pansion of an arbitrary function

f(x)= X k=0

ckPk(x). (2)

Multiplication of the expansion byg(x)Pk(x) and integra- tion fromatobgives

ck= Z b

a

f(x)Pk(x)g(x)d x, (3) and in this way each coefficient can be obtained.

We now show that the best polynomial approximation of an arbitrary function (“best” in the sense of least squares) is the one given by Equation (2). Define the partial sum sn(x) as the termination of (2) afternterms,Pn

k=0ckPk(x).

Also define the remainderrn(x)=f(x)−sn(x). As a conse- quence of the definitions,

Z b

a

sn(x)Pk(x)g(x)d x=ck, Zb

a

rn(x)Pk(x)g(x)d x=0, k=0, 1, . . . ,n. (4) Letπn(x) be an arbitrary polynomial of thenth degree at most. Then define

πn(x)−sn(x)=δn(x)=

n

X

k=0

dkPk(x), so that

f(x)−πn(x)=rn(x)−δn(x).

The integral of the squared error inπn(x) as an approxima- tion off(x), weighted from point to point by the factorg(x),

is Zb

a

g(x)£

f(x)−πn(x)¤2

d x

= Z b

a

g(x) [rn(x)−δn(x)]2d x

= Z b

a

g(x) [rn(x)]2d x−2 Z b

a

g(x)rn(x)δn(x)d x +

Zb a

g(x) [δn(x)]2d x.

But by virtue of Equation (4) and the expression ofδn(x) in term of thePk,

Zb a

g(x)rn(x)δn(x)d x=0, Zb

a

g(x) [δn(x)]2d x=

n

X

k=0

dk2. As a result,

Z b a

g(x)£

f(x)−πn(x)¤2

d x≥ Z b

a

g(x) [rn(x)]2d x.

Evidently, regardless of whethersn(x) ultimately converges tof(x), no other polynomial of degreenor lower produces a smaller weighted integral of squared errors. The rele- vant implication for our purposes is that the appropriate sequence of orthonormal polynomials constitutes the op- timal basis for approximation of the ICC.

The Hermite Polynomials

The interval (a,b) and density function g(x) determine the polynomials that satisfy the orthonormality condition given by Equation (1). In the case where the interval is the real line and the density function is associated with the standard normal distribution, the polynomials are known as theHermite polynomials, which can be defined as

Pk(x)=(−1)kp 2π pk! exp¡

x2/2¢ dk

d xkφ(x), (5) whereφ(x) is the standard normal density. The first ten Hermite polynomials are

P0(x)=1, P1(x)=x, P2(x)=¡

x2−1¢ /p

2!, P3(x)=¡

x3−3x¢ /p

3!, P4(x)=¡

x4−6x2+3¢ /p

4!, P5(x)=¡

x5−10x3+15x¢ /p

5!, P6(x)=¡

x6−15x4+45x2−15¢ /p

6!, P7(x)=¡

x7−21x5+105x3−105x¢ /p

7!, P8(x)=¡

x8−28x6+210x4−420x2+105¢ /p

8!, P9(x)=¡

x9−36x7+378x5−1260x3+945x¢ /p

9!.

TheQuantitativeMethods forPsychology

2

(3)

Figure 1 The densities of 1,000 random draws from the bivariate normal distribution with zero means, unit variances, and correlation equal to 0.70 were calculated with both the true density function and the ninth order of Mehler’s expansion.

0.00 0.05 0.10 0.15 0.20

0.000.050.100.150.20

bivariate normal density

Mehler's expansion

Other accounts may omit the normalizing constants 1/p k!

or give the polynomials in a slightly different form follow- ing from the non-normalized density function exp¡

x2¢ . Many texts provide an elementary proof that the Her- mite polynomials satisfy the orthonormality property with respect to the standard normal density over the real line (e.g., Jackson,1941). In any event a particular specific case is readily verified upon recognizing that upon integration Equation (1) becomes a linear combination of normal mo- ments. Here we will set out a generalization of (1) that bears specifically on parameter estimation with NOHARM and the assessment of model fit: ifxandyfollow the bivariate normal distribution with zero means, unit variances, and correlationρ, then

Ï

Pm(x)Pn(y)φ(x,y)d x d y=δm,nρm. (6) whereφ(x,y) is the joint density. This result is a conse- quence of a remarkable identity known asMehler’s expan- sion,

φ(x,y)=φ(x)φ(y) X k=0

ρkPk(x)Pk(y). (7)

For small correlations Mehler’s expansion is highly accu- rate even when truncated at a few terms. Figure1shows that even a density associated with a correlation as high as 0.70 can be approximated quite closely by taking the ex- pansion to ninth order.

We close this section by proving Equation (6). Multi- ply both sides of Equation (7) byPm(x)Pn(y) to bring the left-hand side into conformity with the integrand of (6). In- tegrating both sides overx eliminates all terms from the right-hand side except for

Pn(y)φ(y)ρmPm(y), which when integrated overyleads to (6).

The Expansion of the Normal-Ogive Item Response Func- tion

Recall that the normal-ogive ICC is given by ϕj(θ)=E(Yj|θ)=

Z αjjθ

−∞

p1 2πexp

µ

−1 2t2

d t. (8) When the upper limit of the integrand is expressed in this way—with an intercept and slope parameter—we call Equation (8) the IRT parameterization of the ICC. The

TheQuantitativeMethods forPsychology

3

(4)

factor-analyticparameterization can be obtained by apply- ing the transformation

τj= −αj

q1+β2j, λj= βj

q1+β2j, (9)

whereupon the upper limit of integration in Equation (8) becomes

λjθ−τj

q1−λ2j. (10)

The IRT parameterization derives its name from its sim- ilarity to the well-known formulation of Lord and Novick (1968). The factor-analytic parameterization, on the other hand, has the following interpretation. Suppose that the jth item is associated with an underlying quantitative re- sponse tendencyYjand a thresholdτj such that the ob- served item score

yj=

(1 ifyj >τj, 0 ifyjτj.

Now suppose thaty∈Rp, the vector of response tenden- cies, fits the factor model

y=Λθ+², (11) whereΛ∈Rp×q is a matrix of factor loadings. Then the threshold and factor loading of itemj in this model are in fact given by Equation (9).

We wish to find the expansion of the normal-ogive ICC in terms of the Hermite polynomials. For this purpose it is convenient to use the transformation

αj=−µj

σj

, βj= 1

σj

. (12)

We now drop the item subscript j until it is needed again.

Case ofc0

To obtain the first coefficient in the expansion, we must evaluate the integral

c0= Z

ϕ(x)P0(x)φ(x)d x

= Z

−∞

p1 2π

Z (x−µ)/σ

−∞

exp µ

z2 2

¶ 1 p2πexp

µ

x2 2

d z d x.

Recall that the change-of-variables formula, Z

g()

f(x)dx= Z

f(g(u))¯

¯det(Dg)u¯

¯du,

often simplifies the evaluation of an integral. In this in- stance the appropriate change of variables is

(u,v)=

µ σx+z

p1+σ2,−xz p1+σ2

¶ , which has the inverse

(x,z)=

µ σuv

p1+σ2, uv p1+σ2

¶ .

Note that the Jacobian matrix of this transformation,

∂u

∂x

∂u

∂z

∂v

∂x

∂v

∂z

=

 p σ

1+σ2 p 1

1+σ2

−1 p1+σ2

p σ 1+σ2

 ,

has a determinant equal to unity.

In our case, the transformation converts the line z = (x−µ)/σinto the linev= −µ/p

1+σ2, and we have

c0= Z

u=−∞

p1 2π

Z −µ/p 1+σ2 v=−∞

exp

·

−1 2

(u+σv)2 1+σ2

¸ 1 p2πexp

·

−1 2

(σuv)2 1+σ2

¸ d u d v

= Z

u=−∞

Z −µ/p 12 v=−∞

p1 2πexp

µ

u2 2

d u 1

p2πexp µ

v2 2

d v

= 1 p2π

Z −µ/p 1+σ2 v=−∞

exp µ

v2 2

d v

=Φ µ

µ p1+σ2

¶ ,

TheQuantitativeMethods forPsychology

4

(5)

where we now useΦto denote the cumulative probability function of the standard normal distribution. Using Equa- tions (9) and (12), we see that

c0=Φ(−τ) . (13)

Case ofc1

Now we must evaluate the integral c1=

Z

ϕ(x)P1(x)φ(x)d x

= Z

−∞

p1 2π

Z(x−µ)/σ

−∞

exp µ

z2 2

¶ 1 p2πxexp

µ

x2 2

d z d x.

Employing the same change of variables used to obtainc0, we get

c1= Z

u=−∞

Z −µ/p 12 v=−∞

σuv p1+σ2

p1 2πexp

µ

u2 2

d u 1

p2πexp µ

v2 2

d v

= Z

u=−∞

Z −µ/p 1+σ2

v=−∞v

p1+σ2 p1

2πexp µ

u2 2

d u 1

p2πexp µ

v2 2

d v

= 1 p1+σ2

Z−µ/p 12

v=−∞ − 1

p2πvexp µ

v2 2

d v.

Since the integrand has been reduced to the derivative of the standard normal density function, we can now write

c1= 1 p1+σ2φ

µ

µ p1+σ2

¶ ,

whereφrepresents the standard normal density function.

By Equations (9) and (12), this can be rewritten as

c1=λφ(τ). (14)

Case ofc2 We have c2=

Z

ϕ(x)P2(x)φ(x)d x

= Z

−∞

p1 2π

Z(x−µ)/σ

−∞

exp µ

z2 2

¶ 1 p2π

x2−1 p2 exp

µ

x2 2

d z d x. The same change of variables yields

c2= 1 p2¡

1+σ2¢

× Z

u=−∞

Z−µ/p 1+σ2 v=−∞

£(σuv)2−1−σ2¤

φ(u)d uφ(v)d v.

With the help ofMathematica, we can evaluate the inte- gral to get

c2= 1 p2¡

1+σ2¢ p µ

1+σ2φ µ

µ p1+σ2

= 1

p2λ2τφ(τ).

(15)

Case ofc3

Because the fit to the sample covariance matrix often can- not be improved more than negligibly by taking the expan- sion of each ICC beyond the third power, this is the last co- efficient used by the NOHARM program to compute its pa- rameter estimates. We have

c3= Z

ϕ(x)P2(x)φ(x)d x

= Z

−∞

p1 2π

Z (x−µ)/σ

−∞

exp µ

z2 2

¶ 1 p2π

x3−3x p6 exp

µ

x2 2

d z d x. With our change of variables and the help of Mathematica, we get

c3= 1 p6

1

¡1+σ2¢3/2

µ µ2 1+σ2−1

φ

µ

µ p1+σ2

= 1

p3λ3τ2−1 p2 φ(τ).

(16)

The General Expression forck,k≥1

Upon studying Equations (14), (15), and (16), we might conjecture that the general expression is

ck= 1

pkPk−1(τ)φ(τ), k≥1. (17) McDonald (1967) gave a proof of Equation (17) using Fourier transforms.

TheQuantitativeMethods forPsychology

5

(6)

Figure 2 Increasingly higher-order expansions in Hermite polynomials of a normal-ogive item characteristic curve cor- responding toτ= −0.4,λ=0.8, andc=0.2.

-3 -2 -1 0 1 2 3

0.00.20.40.60.81.0

common factor (θ)

probability of correct response

linear third order fifth order seventh order ninth order true ICC

Figure2shows the quality of the ICC approximation for odd values ofkup to 9. As one should expect in the case of a easy item, the linear approximation fails at high levels ofθ. The NOHARM program takes the expansion in Her- mite polynomials to third order, which still does not closely match the ICC at the extremes of theθdistribution. Nev- ertheless third order seems sufficient for accurate parame- ter estimation in the case of a pseudo-guessing parameter equal to zero.

Generalization to Multiple Dimensions

We now treat multiple common factors (latent traits) col- lected in the vectorθ ∈Rq. In the IRT parameterization of the ICC, the discrimination parameters of a given item are collected in the vectorβ∈Rq. It is convenient to find a transformation ofθsuch that the multidimensional equiv- alent of Equation (8)—where the upper limit of integration becomesα+β0θ—is in fact a function of only one element

(say the first). Define

T=1 δ

β0 β02 ... β0q

 ,

where

δ=β0β

and the rows ofTafter the first are chosen so that the matrix is orthonormal. Now define

θ=δ

dTθ, (18)

where

d2=β0Ψβ, Ψ=Cov(θ).

In particular, the first element ofθis θ=1

dβ0θ. The inverse of Equation (18) is

θ=d δT0θ,

TheQuantitativeMethods forPsychology

6

(7)

which implies that

Φ(α0θ)=Φ µ

α+d δβ0T0θ

¶ . Since, by construction,

β0T0

δ 0 · · · 0¤ , we thus have

Φ(α0θ)=Φ¡

α+d×θ¢ .

This is a function of the single variableθand thus has the same form as Equation (8). We can therefore approximate the ICC with

ϕ(θ)≈c0+X

k=1

ckPk à β0θ

pβ0Ψβ

! ,

replacingβ1in our previous expressions withd. Consulting Equation (9), we see that the replacement leads to

c0

à α

p1+β0Ψβ

! ,

c1=

s β0Ψβ 1+β0Ψβ×φ

à α

p1+β0Ψβ

! , and so forth.

The Expected Covariance Between Two Items We can write the response to itemjas

Yj=cj+(1−cj)h cj0P0³

θj´

+cj1P1³ θj´

+cj2P2³ θj´

+ · · · i

where we have introduced the pseudo-guessing parameter with the symbolcj.

The response can then be rewritten as Yj=bj0P0³

θj´

+bj1P1³ θj´

+bj2P2³ θj´

+ · · ·, (19) where

bj0=cj+(1−cj)cj0, bj1=(1−cj)cj1, bj2=(1−cj)cj2, and so on.

We collect the linear combinationsθj into a single vec- torθ?∈Rpand assume that the transformationθθ?is one-to-one. Now consider the two itemsjandj0. We have

E(YjYj0)=Eθ?£

E(YjYj0|θ?

=Eθ?£

E(Yj|θ?)E(Yj0|θ?

=Eθ?n h bj0P0

³θj´ +bj1P1

³θj´ + · · ·i

× h

bj00P0³ θj0

´

+bj01P1³ θj0

´ + · · ·

io .

The argument in brackets resolves into terms of two types:

products of the form

bj kbj0kPk³ θj´

Pk³ θj0

´

and cross-products where the indices of the Hermite poly- nomials do not agree. The multivariate normality ofθ is a sufficient condition for the use of Equation (6), which upon integration leads the cross-products to vanish and the products to become

bj kbj0k

h Corr³

θj,θj0

´ik

,

and since the variance of eachθj remains unity if all ele- ments ofθare standardized, we then have

E(YjYj0)=X

k=0

bj kbj0k

Ãβ0jΨβj0

djdj0

!k

, (20)

from which the covariance is obtained by subtracting the product of the marginal pass rates. With estimates of the item parameters in hand, however obtained, one can use Equation (20) to compute the residual covariance matrix as part of assessing model-data fit.

Equation (20) is in fact used by NOHARM to obtain esti- mates of the item parameters. Eachτjis estimated in close form by equating itemj’s pass rate toΦ(−τ), and the item parameters can in principle be estimated by choosing the βbj andcbj to minimize

p

X

j=1

X

j>j0

£yjyj0−En

¡YjYj0¢¤2

,

whereEn

¡YjYj0¢

is the truncation of (20) atnth order. In the current implementation of NOHARM,nis set equal to 3. Thecj must be fixed by the user because the triplets (αj,βj,cj) are nearly unidentified when this approach is employed, but Figure2suggests that taking the expansion to higher order might overcome this difficulty.

Authors’ note

Correspondence concerning this article should be ad- dressed to James J. Lee, Department of Psychology, Uni- versity of Minnesota Twin Cities, Minneapolis, MN 55455.

Email:[email protected]

TheQuantitativeMethods forPsychology

7

(8)

References

Bolt, D. M. (2005). Limited- and full-information estima- tion of item response models. In A. Maydeu-Olivares

& J. J. McArdle (Eds.),Contemporary psychometrics: A festschrift for Roderick P. McDonald(pp. 27–71). Mah- wah, NJ: Erlbaum.

Cai, L., Du Toit, S. H. C., & Thissen, D. (2011).IRTPRO: Flex- ible, multidimensional, multiple categorial IRT mod- eling. Chicago, IL: Scientific Software International.

Chalmers, R. P. (2012).mirt: A multidimensional item re- ponse theory package for theRenvironment.Journal of Statistical Software,48, 1–29.

Flanagan, J. C., Dailey, J. T., Shaycroft, M. F., Gorham, W. A., Orr, D. B., & Goldberg, I. (1962).Design for a study of American youth. Boston, MA: Houghton Mifflin.

Fraser, C. & McDonald, R. P. (1988). NOHARM: Least squares item factor analysis. Multivariate Be- havioral Research, 23, 267–269. doi:10 . 1207 / s15327906mbr2302_9

Jackson, D. (1941).Fourier series and orthogonal polynomi- als. Oberlin, OH: Mathematical Association of Amer- ica.

Lord, F. M. & Novick, M. R. (1968).Statistical theories of mental test scores. Reading, MA: Addison-Wesley.

Maydeu-Olivares, A. (2001). Multidimensional item re- sponse theory modeling of binary data: Large sam- ple properties of NOHARM estimates.Journal of Ed-

ucational and Behavioral Statistics,26, 51–71. doi:10.

3102/10769986026001051

McDonald, R. P. (1967). Nonlinear factor analysis. Rich- mond, VA: Psychometric Corporation.

McDonald, R. P. (1982). Some alternative approaches to the improvement of measurement in education and psychology: Latent trait models. In D. Spearritt (Ed.), The improvement of measurement in education and psychology(pp. 213–237). Hawthorn, Australia: Aus- tralian Council for Educational Research.

McDonald, R. P. (1985). Unidimensional and multidimen- sional models for item response theory. In D. J. Weiss (Ed.), Proceedings of the 1982 Item Response The- ory and Computerized Adaptive Testing Conference (pp. 127–148). Minneapolis, MN: University of Min- nesota Press.

McDonald, R. P. (1997). Normal-ogive multidimensional model. In W. J. van der Linden & R. K. Hamble- ton (Eds.),Handbook of modern item response theory (pp. 257–269). New York, NY: Springer. doi:10 . 1007 / 978-1-4757-2691-6_14

McDonald, R. P. (1999).Test theory: A unified treatment.

Mahwah, NJ: Erlbaum.

McDonald, R. P. & Fraser, C. (2003).NOHARM: A Windows program for fitting both unidimensional and multi- dimensional normal ogive models of latent trait the- ory. Niagara College. Welland, Ontario. Retrieved from http://noharm.niagararesearch.ca/nh4man/nhman.

html

Citation

Lee, J. J., Lee, M. K. (2016) An overview of the Normal Ogive Harmonic Analysis Robust Method (NOHARM) approach to item response theory.The Quantitative Methods for Psychology,12(1), 1-8.

Copyright© 2016 Lee, & Lee. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

Received: 07/03/2015∼Accepted: 05/08/2015

TheQuantitativeMethods forPsychology

8

Références

Documents relatifs

(a) From top to bottom, amplitude spectrum of the SG data, amplitude spectrum of the reconstructed signal and amplitude spectrum of the residuals (original signal minus

Bordas would like to acknowledge partial support for his time from European Research Council Starting Independent Research Grant (ERC Stg grant agreement No. 279578)

The HFCO system has been described by the global po- tential energy surface of Yamamoto and Kato. 26 The formu- lation, as presented above, is basically designed to be used within

Une précision de cette formule a été faite par le même auteur en 1974 où la sortie est constatée dès que les inflorescences sont dégagées du bourgeonnement, exprimée par

Any algebraic variety endowed with a pair of commut- ing rational parallelisms of type g is isogenous to an algebraic group endowed with its two canonical parallelisms of left and

The decomposition of the displacements in thin structures has been introduced in [13] and in [14] and then used in [3], [4] and [5] for the homogenization of the junction of rods and

[6] Rare-earth attack is controlled by the chemical state of the rare-earth fission products, which is determined by the oxygen partial pressure in the fuel.. The second

The same conditions as previous model were considered in order to determine the lateral load carrying capacity of the hemp concrete subjected to lateral loads and also