Digitized
by
the
Internet
Archive
in
2011
with
funding
from
Boston
Library
Consortium
Member
Libraries
I.(>
HB31
.M415
iworking
paper
department
of
economics
CONDITIONAL
INDEPENDENCE
INSAMPLE
SELECTION
MODELS
JoshuaD. Angrist 96-27 October,1996
massachusetts
institute
of
technology
50 memorial
drive
Cambridge, mass.
02139
CONDITIONAL
INDEPENDENCE
INSAMPLE
SELECTION
MODELS
JoshuaD. Angrist
Conditional Independence in
Sample
SelectionModels
by,
Joshua D. Angrist
MIT,
Hebrew
University, andNBER*
October 1996
'Thanks go to
Guido
Imbens,Whitney
Newey,
PaulRosenbaum,
Tom
Stoker, and especially toGary
Chamberlain for helpful discussions andcomments.
Any
errors are, ofcourse, solely thework
Conditional Independence in
Sample
SelectionModels
Econometricsampleselection modelstypicallyuse a linearlatent-indexwith constantcoefficients
to
model
the selection process and the conditionalmean
ofthe regression error in the selected sample.A
featurecommon
tomost
ofthese models is that the conditionalmean
function forregression errors isan invertible function ofthe selection propensity score, i.e., the probability ofselection conditional on
covariates. Consequently, conditioning on the selection propensity score controls selection bias, a fact
which
underliesmuch
ofthe recent literature on non-parametric and semi-parametric selection models. This literature has not addressed the question ofwhether the propensity-score conditioning property isnecessarily a feature ofsample selection models. In this note, I describe the conditional independence
properties that
make
it possibletouse the selection propensity scoreto control selection bias in a generalsample selection model.
The
resulting characterization does not rely on a latent index selectionmechanism
and imposes no structure other than an assumption ofindependence between the regressionerror term andthe regressors in
random
samples. This approach leads to a simplerule that can be usedto determine ifconditioning on the selection propensity score is sufficient to control selection bias.
Joshua D. Angrist
MIT
Department
ofEconomics
50Memorial
DriveCambridge,
MA
02139-4307
ANGRIST@MIT.EDU
1. Introduction
Sample
selectionproblems occurinboth experimentalandobservationalstudies.Econometricians'interest in sample selection
models
originated with research in laboreconomics
using the distribution ofwages
forworkerstoinferthedistributionofofferedwages
foreveryone (Gronau, 1974;Heckman,
1974).A
similar problem occurswhen
instrumental variables that are being used to correct for endogeneity inwage
equations also affectemployment
status (see, e.g., Angrist, 1995). Inexperiments, the bias induced by controlling for "post-treatmentvariables" that are correlatedwith theoutcome
ofinterest can also beinterpreted as a form ofselection bias
(Ham
and Lalonde, 1996;Rosenbaum,
1984).The
traditional econometric solution to problems ofthis type invokes an assumption ofjointnormalityfortheregressionerrorandthe errorterminalatent-index selection
mechanism.
This approachleads to a
maximum
likelihood ortwo-step selection correction in the form ofinverse Mills-ratio termsadded tothe regression function ofinterest(see
Heckman,
1976and 1979).A
number
ofless restrictivesemi-parametricvariations on this normal selection
model
have also beenintroduced (see, e.g.,Ahn
andPowell, 1993, and references in
Newey,
Powell, and Walker, 1990 andHeckman,
1990).Both
theparametricand semi-parametricformulations have
two
common
threads. First, a latent indexframework
is used to characterize the
mean
of unobserved regression error terms conditional on the regression covariates and the sample selection rule. Second, the problem ofselection bias is either implicitly orexplicitly handled by conditioning on an invertible function ofthe selection propensity score, i.e., the
probability ofselection given covariates.1
This paper describes the conditional independence properties that
make
it possible to use theselection propensity score to control selection bias in regression estimates.
The
problem studied hereincludes applications involving experimental data and instrumental variables as special cases.
My
firstobjectiveis anonparametricformulationofthesampleselection problem
where
theonlystructureconsists'The propensity-score terminology originates with
Rosenbaum
andRubin
(1983, 1984),who
introduced the propensity-scoremethod
ofcontrolling for confounding in evaluation research.of the regression equation of interest and an independence restriction on the regression error.
The
elementary properties of conditional independence, as discussed by
Dawid
(1979) and Florens andMouchart
(1982), are then applied to this formulation. I believe this approach provides a very general description ofthe identificationproblem in sample selectionmodels. Italso leads toan easily interpretedrule thatcanbeusedtocheck
when
nonparametricconditioningonthe selectionpropensity score isindeedsufficient to control selection bias.
2. Motivation
The
equation ofinterest isy,
=
X,'P+
e„ (1)where
y, is the dependent variableand X, isthe k x 1 vectorofregressors,assumed
to be independent ofe, in the population.
We
observe arandom
sample ofn observations onX„
but y, is not observed foreverybody.
The
coefficient vector isassumed
to be such thatX/P
gives the population conditionalexpectation function, so that if
we
did observe y, for everyone in the sample, P could be consistentlyestimated by ordinary least squares (OLS). Although independence of regressors and errors is not necessary to justify this definition of P, the full independence assumption is customary in papers on
selection bias (e.g., Chamberlain, 1986).
At
the end ofthe paper Ishow
that full independence appearsto be offundamental importance for identification strategies that use the selection propensity score to
control selection bias.
Let w, indicate selection status (i.e.,
w
=
l indicates workers inan applicationwhere
y- is the logofofferedwages).
The
selection problem is oftenmotivatedusing alatentindextocapturethe possibilitythat X, affects w, as well as yj. In particular, suppose that
w,= irX/6-T^X)]
(2)we
haveE[e,|
X„
w,=l]=
E[e,|X„
X,'5>
tjj*
0.Selection bias arises because the term E[e,I X,,
w=l]
is generally a function ofX, even thoughE[e,| X,]
=
inrandom
samples.Parametricsolutions to selection problems ofthistype typically beginby
assuming
thatthe errorterms (£„ ti,) are jointly normally distributed, so that
E[e,|
X„w
=
l]=
pE[v,|x„X,'8>TU
(3)where
p is the coefficient from a regression ofe, onv
;.
Because
ofnormality andjoint independence of(Ej, T|,) with X,,
we
can simplify further:E[tliIX,'S
>
TiJ=
-^(X,'5)/a)(X,'5)=
XCXi'8),where
<j) and are the normal density and distribution functions forx\{
.
Adding A^X/8)
as a regressorto (1)provides afeasiblesolutiontothe selection
problem
inthiscontextbecausetherequired parameterscan be estimated from a Probit regression of
w
(on
X
(.
Critics oftraditional econometricselection
models
(e.g., Hartigan and Tukey, 1986; Little, 1985)point outthatthese
models
combine
awide
variety ofassumptions that are hard to assess and interpret.In particular,the validity ofparametric corrections forsample selection bias clearlyturns toalargeextent
ondistributionalassumptions, functional formrestrictions,andexclusionrestrictions (i.e.,zerorestrictions
on (3 and/or8) for
which
thereisrarely, ifever,anyempiricaljustification.2These
features ofthe normalselection
model
have ledeconometricians to develop less restrictive models forselection problems. Forexample,
Newey,
Powell, andWalker
(1990),Lee
(1982), and others haveshown
that the distributionalassumptions in these
model
can be relaxed in a semi-parametric framework.One
approach to semi-parametric selection corrections begins with the observation that even in"Mroz
(1987)documents
the fact that empirical results can be sensitive to these assumptions. Seethe absence ofparametricdistributional assumptions, selection-correctionterms typically consistof
non-linear but invertible functions ofthe selection propensity score, P[W;=1 | ZJ. This feature ofselection
models
has beennotedbyHeckman
andRobb
(1986), anditmotivatesthesemi-parametric estimators forselectionmodelsdiscussedby
Newey
(1988),Robinson
(1988),Choi
(1992),andAhn
and Powell(1993).3On
the other hand, earlierwork
on semi-parametric selectionmodels
has not clarified whether thepropensity-scoreconditioning property is anecessary featureofthe selectionproblem. Inthenextsection,
I use simple conditional independence notationto provide a general characterizationofmodels for
which
this propensity-score conditioning property holds.3. Conditional independence and sample selection
Conditional independence is defined as follows:
Definition 1 (Florens and Mouchart, 1982). Let A,,
A
2,A
3 denoterandom
variables defined on acommon
probability space withjoint probability measure P[A,,A
2,A
3], and let P[A, |A
2,A
3 ] denote aconditional probability statement for A, given
some
values ofA
2 andA
3.Then
we
write A,JJ
A
21A
3 ifand only ifP[A, |
A
2,
A
3]
=
P[A, |A
3], a.s.
Note
that conditional independence, like independence, is transitive.The
conditional independence concept, although basic andperhaps evenobvious, has proven widely useful.Dawid
(1979) popularizedthe notation, A, JJ
A
21A
3 , and argued that
many
important ideas in statistics can best be understood intheseterms. Chamberlain (1982) and Florens and
Mouchart
(1982) used conditional independence ideasto explorethe relationship between alternate definitions of"causality" in time-series econometrics.
And
Educational researchers and statisticians have also discussed conditioning on the selection
Rosenbaum
andRubin
(1983) usedconditional independence argumentsto establishthepropensity-scoreresult for evaluation problems with selection on observables.4
The
properties ofconditional independence can be characterized as follows:Lemma
1. Let R,, R,,R
3, and
R
4 be
random
variables defined on acommon
probability space withjointprobability measure.
Then
the following are equivalent:(i) R,
U
R
2 IR
3 and R, II R-4 I (R
2> R3) (ii) R,n
(R*
RJ
I R3 (iii) R, II R-4 IR
3 and R, IIR
2 I (R 3,R
4 )This is
Theorem
A.l in Florens andMouchart
(1982), presented here insomewhat
simpler notation, andLemma
4.3 inDawid
(1979).Dawid
states the equivalenceof(ii) and(iii) only, butthis also impliestheequivalence of(i) and (iii) by using
symmetry
ofR
2 andR
4 in part (ii).3.1 Selection bias
The
problem of selection bias can be described in conditional independence terms with nostructure other than the regression ofinterest and the independence restriction on £,. This restriction is
stated formally below:
Assumption 1. Let {(y„ X,')'; i=l, . . ., n} denote i.i.d. observations on a (k+l)xl
random
vector.Then
e,U
X,where
8,=
(y,-X,'(3) andE&J
X,]=X,'p\4
The
propensity-score result for evaluation research says that in an experimentwhere
treatment israndomly assigned conditional on covariates. it is
enough
to condition on the probability ofselectiongiven covariates to eliminate confounding from any relationship between the covariates and the
We
would
first like toknow when
asample selection process causesAssumption
1 to fail. Letw, indicate selection status as before. Instead ofobserving {(y„ X,')'; i=l, . . ., n},
we
observe{(w,y„ Wj, X/)'; i=l, . . ., n}. Important consequences ofthe selection process are
summarized
below:Proposition 1.
Suppose
that X, has at least 2 points of support and that P[w,=l | X,] is a non-trivial function ofX,.Then
givenAssumption
1, e, JJ X, | w, iff e, JJ w, | X,.Proof. Let Q. denotethe sample space. Setting R,=e„
R
2=
X;,R
3=£2,and
R
4=w„
parts (i)and (iii) ofthelemma
imply {£, JJ w, |X„
£, JJ X,} iff {£, JJ X, |w„
e,U
w
,}- Since e, JJ X, is maintained, e, JJw, | X, implies E, JJ X, I w, and e, JJ w, , while e, JJ w, | X, implies either e, JJ X, | w, or e,
JJ w, or both.
To
complete the proposition,we
need to rule out e, JJ w, with e, JJ X, | w,when
e,JJ w, | X,.
Then
£, JJ w, | X, always implies s, JJ X, | w,.Note
that=
E[e, |XJ
=
E[E(e, IX„
w,)IXJ
If e, JJ w, with e, JJ X, | w,, this implies
=
E[E(e,| w,)|XJ
=
E(e,|w=0)
+
[E(e, I w,=l)-E(e| | w,=0)]P[w,=l | XJ.But this cannot be true
when
P[w =
l |XJ
takes on 2 ormore
distinct values since E(e, Iw
t)
is not zero
and E(e,I
w
=
l) is not equal to £(£, |w^O).
The
property £, JJ X, | w,means
that estimatorswhich
are based onAssumption
1 will beconsistent or unbiased in theselected sample as well as in a
random
sample.The
property w, JJ£, I X,means
that selection status is independent ofthe regression errorconditional on X,. Thus, any selectionmechanism
thatis related tothe dependentvariableconditional on the regressors causesAssumption
1 tofail in the selected sample ifselection status is also a function ofX,. Conversely, if
Assumption
1 failsinthe selectedsample,selection status
must
berelatedtothedependentvariable.Of
course, thefirstpointthat Ej and the latent-index errorri; are uncorrelated).
The
propositionshows
that this and the converse are general features ofsample selection problems. In particular, these properties hold regardless oftheunderlying selection
mechanism
or model.Proposition 1 applies equally to experimental data. For illustration,
assume
constant treatmenteffects
where
y^y.o+P
and yi0=u+e, denote the counterfactualoutcomes
for treated and nontreatedindividuals. Let X,be anindicatorof
randomized
treatmentassignment.By
virtueofrandom
assignment,asimplecomparison of
means
provides anunbiasedestimatorof(3inrandom
samplesfrom
thepopulationatrisk oftreatment. Butthe independence
between randomized
treatment assignmentand counterfactualoutcomes
is lostwhenever
theexperimentis analyzedconditionalon a variablethatisbothcorrelatedwithoutcomes
and affected by thetreatmentassignment.Suppose
thatw
;
is asecond
outcome
variable intheexperiment, correlated for withyj for
some
reason other thanX,.Examples
of such correlatedoutcome
pairs include death and health status, or
employment
status and wages. Correlationbetween
w
; and e,
given X,
means
that e,^
w
; I Xj. If, as
seems
likely, v/{
is also affected by
X„
then Proposition 1implies that estimation of (3 conditional on
w
s
is necessarily subject to selection bias.5
Finally, note that Proposition 1 is also relevant for
IV
estimation. For applications involvinginstrumental variables, the key identifying assumption isthatthe instruments (playing the role ofX,) are
independentoftheregressionerrorterm(definedaround an
endogenous
regressor)inthe targetpopulation.The
implications ofProposition 1 forIV
estimation are that in caseswhere
the sample selection ruleinvolves the instruments, the instruments will be independent of£, in the selected sample ifand only if
selection status is independent ofe, conditional on the instruments.
5
Rosenbaum
(1984)makes
a similar point regarding the consequences ofconditioning on3.2
The
selection propensity scoreThe
selectionpropensityscore is the conditional probability ofselection given X,.The
factthatthe selection propensity score is a function ofX, is emphasized by the notation
e(X,)
=
P(w,| X,),which
is used in the definition below:Definition 2.
The
selection propensity score is said to control selection bias ife, JJ X,Iw„
e(X,).Ifconditioning on the selection propensity score controls selection bias, it
may
be possibleto estimate (3in the selected sample because then
we
haveE[y,I
X„
w,=l]=
X/p
+
E[e,| w,=l, e(X,)]. (4)Of
course, forthis tobe ofanypractical use, thatmust
beenough
variability in X,to identify (3once e(X,)is fixed. In the literature on semi-parametric selection models, this variation is obtained by imposing a
priori exclusion restrictionson (3 and/oron e(X,). SeeChamberlain (1986) foraprecise statement ofthe
restrictions required for identification in this context. Here I focus on Definition 2 as a likely starting
point for any possible strategy to identify p.
The
relationshipbetweenDefinition2andothertheconditional independencepropertiesofsampleselection models is outlined in the following proposition:
Proposition 2.
The
following are equivalent:(i) w,
U
X,| e„ e(X,)(ii) (e„ w,) II X,| e(X,)
(iii) e,
U
X,|w„
e(X.)Proof. First observe that w,
U
X,| e(X,) becauseP[w =
l |X„
e(X,)]=
e(X,) and that £, TJ X,| e(X,)because e, ]J
X
s and e(X,)
is a function ofX,. Moreover,
lemma
1 implies:{(i), e,
H
X,| e(X,)}»
(ii) <=> {(iii), w,U
X,| e(X,)}. (6) Therefore, since w,U
X,| e(X,) and e:
U
X,I e(X,) hold automatically in this case, the result is
established.6
Line (iii) ofProposition2 isthe
same
as Definition2, i.e., this part ofthe propositionconsists ofthe statementthatconditioningonthe selection propensity score controls selectionbias. Lines (i)and(ii)
oftheproposition therefore provideequivalentnecessaryand sufficientconditionsforDefinition2.
Note
that while the statements w, JJ
X
s| e(X,) and e
; JJ
X,| e(X,) automatically hold, line (ii)
makes
thestronger statement that (e
;,
w
;) are jointly independent of X, given e(X,).Of
course, this jointindependence is notimpliedby marginal independence. Therefore,one consequence ofthe if-and-only-if
relationship
between
(ii)and(iii) isthatconditioningonthe selectionpropensity scoredoesnot necessarilycontrol selection bias.
The
equivalence oflines (i) and (iii) also has practical consequences because line (i) is easy toverify in the contextofspecific
models
for themechanism
determining selection status. In particular, aweak
sufficient condition for(i) is given in Proposition 3:Proposition 3.
Suppose
the selectionmechanism
is monotonic, i.e., for anytwo
values x', x° in thesupport of
X„
eitherP[w =
l | x 1, £,]
> P[w=l
| x°, £,], a.s. orP[w,=l j x', e,]
<
P[w,=l | x°, e,], a.s., (7)where
conditioningon e, iswhat
makes
these inequalitiesrandom.Then
conditioningon the probabilityofselection controls selection bias.
6
Choi (1992)
shows
that part (ii) ofProposition 2 [joint independence],imposed
on a latent-indexProof.
Note
that ife(x')=e(x°),=
J {P[w,=l | x 1, e(x'), e,]-P[w,=l | x°, e(x'), e,]}h(e,)de,
Given
amonotonicselectionmechanism,
thequantity inbrackets isalways non-negative. Forthe integraltoequal zero,
we
must
therefore haveP[w
=
l | x', e(x'),£,]=P[w=
l | x°, e(x'), £,]. This argumentshows
that for a monotonic selection
mechanism,
P[w
=
l | x1
, £,]=P[w,=l | x°, e,]
whenever
e(x')=e(x°). (8)The
proposition can then be established by observing that (8) implies line (i) ofProposition 2, i.e., thatthe probability ofselection is conditionally independent ofX, given e(X,) and £,.
To
get an idea ofhow
general this result is,notethat any latent-index selectionmechanism
withconstant coefficients and errors independent ofX, is monotonic. This fact does not depend on other
assumptions about the error distribution. Monotonicity is satisfied by a
wide
range ofless restrictivemodels as well. Consider, for example, a latent index
model
withrandom
coefficients:w^irx/s.-Ti.x)],
which
can be re-written,w,
=
lrXi'S'-Tii'x)],where
8*=
E[5,] and r\*=
r\l
+
X,(8*-8,)is an errorterm that is not independent ofX, even ifT), is
independent ofX,.
As
long as the coefficient 8, has thesame
sign for all i, Proposition 3 implies thatconditioning on the selection propensity score controls selection bias in this model.
Finally, it is worth emphasizingthe role played by full independence in obtainingthe result that
conditioningonthe selection propensity score controlsselection bias.
Suppose
that insteadof(1)we
have a heteroscedastic regression model,y,
=
X,'P+
[X,'y]e,=
X,'(3+
£, (9)where
E, is stillassumed
independentofX
;
.
The
heteroscedasticerrorterm,£,, isonly mean-independentofX,. In this case, conditioning on the probability ofselection does not control selection bias because
E[y,|
X„w,=l] =
X
1
'(3
+
E[^|
w=l,
XJ
=
X,'(3+
[X/YMe,
| w,=l,e(X,)]. (10)Equation (10)
means
that E[^,|w
=
l, X,] is a function ofX, and notjust e(X,). In other words,SitfxJ
w„
e(X,).Even
if E[e,Iw
=
l, e(X,)] is fixed at anumber
e, estimates in the selected sample are biased andconverge to (3+ye.
The
identification failure occurs in this case in spite ofthe fact that the selectionmechanism
still satisfies part (i) ofProposition 2, i.e., w, TJ X,| ^,, e(X,).4. Conclusions
This paper lays out
some
ofthe basic conditional independence properties ofsample selectionmodels, focusing on those that justify conditioning on the selection propensity score to control selection
bias.
One
reason I think this approach to sample selectionmodels
is valuable is that it leads to a verygeneral formulationof keyidentifying assumptions. Anotheristhatitleadsto a
weak
sufficientconditionforthe validity ofidentification strategies that either explicitly or implicitly involve conditioning on the
selection propensity score. Finally, the results given here are cast in terms of potentially observable
quantities (w,,
X„
and £,), and not inherently unobservable latent index error terms. Thismeans
thatresearchers could in principle construct an experiment that
would
test the identifying assumptions.An
additional contribution ofthis papermay
be to help bridge the gapbetween
the approachestaken by econometricians' and statisticians' to theproblem ofselection bias.
Use
ofthe propensity scoretocontrol confounding inevaluation research
was
introducedbyRosenbaum
andRubin
(1983) andmany
statisticians are familiar with this idea. In a discussion of econometric models for
program
evaluation,Holland (1989, page 876) notesthat econometricians also use the propensity score, but he suggests that
this use is "quite different forthe
two
approaches." Similarly,Heckman
andRobb
(1986) arguethat theeconometricapproachtoselection
models
usesthepropensity score"inadifferentway
thanthatadvocated byRosenbaum
andRubin
[1983]." This papershows
that differences between the role played by thepropensity score in the econometrics and statistics literature
may
not beas great as previously believed. Inevaluation research, conditioning onthepropensity scorecontrols for the biascaused by anassociationbetween treatmentstatus and observedexogenous covariates, while in selection models, conditioning on
the propensity score controls forthe biascausedby an association
between
selection status and observedexogenous covariates. In both cases, conditioning on the propensity score solves a problem of
confounding that arises because of dependence between a
dummy
endogenous
regressor and othercovariates.
MIT LIBRARIES
3
9080 01428
2708
References
Ahn,
H., and J.L. Powell (1993), "Semi-parametric Estimation ofCensored SelectionModels
with a Nonparametric SelectionMechanism,"
Journal of Econometrics 58, 3-29.Angrist, J.D. (1995), "Conditioning on the Probability ofSelection to Control Selection Bias,"
NBER
Technical
Working
PaperNo.
181, June.Chamberlain, G.C. (1982),
"The
General Equivalence ofGranger
andSims
Causality,"Econometrica
50,569-581.
Chamberlain, G. C. (1986), "Asymptotic Efficiency in Semi-parametric
Models
with Censoring," Journalof
Econometrics 32, 198-218.
Choi,
Kyungsoo
(1992), "Semiparametric Estimation ofSample
SelectionModels Using
a Series Expansion and the Propensity Score," University ofChicagoDepartment
ofEconomics,mimeo.
Dawid,
A.P. (1979), "Conditional Independence in Statistical Theory," Journal ofthe Royal Statistical Society SeriesB
41, 1-31.Florens, J.P, and
M.
Mouchart
(1982),"ANote
on Noncausality," Econometrica 50, p. 583-591.Gronau, R. (1974),
"Wage
Comparisons,A
Selectivity Bias," Journal ofPoliticalEconomy
82, 1119-1144.Ham,
John, and R.J. Lalonde (1996),"The
Effect ofSample
Selection and Initial Conditions in DurationModels: Evidence
from
Experimental Data," Econometrica 64 (January), 175-205.Hartigan, John, and J.
Tukey
(1986), "Discussion 3:Comments
on
'AlternativeMethods
for Evaluating theImpact ofInterventions," by J.
Heckman
and R.Robb,
in H. Wainer, ed.,Drawing
Inferencesfrom
Self-selected Samples,
New
York: Springer-Verlag, pp. 57-62.Heckman,
J. J., (1974),"Shadow
Prices,Market
Wages,
andLabor
Supply," Econometrica 42, 679-693.Heckman,
J. J., (1976),"The
Common
Structure ofStatisticalModels
ofTruncation,Sample
selection, LimitedDependent
variables and a Simple EstimatorforSuch
Models," Annalsof Economic and
SocialMeasurement
5, 475-492.Heckman,
J.J., (1979),"Sample
Selection Bias as a Specification Error," Econometrica 47, 153-161.Heckman,
J.J., (1990), "Varieties ofSelection Bias,"American
Economic Review
80, 313-318.Heckman,
J.J., and RichardRobb
(1986), "AlternativeMethods
for Solving theProblem
ofSelection Bias inEvaluating the Impact of Treatments on Outcomes," in H. Wainer, ed.,
Drawing
Inferencesfrom
Self-selectedSamples,
New
York: Springer-Verlag.Holland, P. (1989), "It's
Very
Clear,"Comment
on J.J.Heckman
and V.J. Hotz, "ChoosingAmong
Alternativemethods
ofEstimating the Impact ofSocial programs:The Case
ofManpower
Training," Journal ofthe
American
StatisticalAssociation 84, 875-877.Lee, Lung-Fei (1982),
"Some
Approaches
to the Correction ofSelectivity Bias,"Review
ofEconomic
Studies49, 355-372.
Little, Roderick J.A.,
"A
Note
About Models
for Selectivity Bias," Econometrica 53, 1469-1488.Mroz, T.A. (1987), "The Sensitivity of an Empirical
Model
ofMarriedWomen's
Hours
ofWork
toEconomic
and Statistical Assumptions,"
Econometrica
55, 765-799.Newey,
W.K,
J.L. Powell, and J.Walker
(1990), "Semi-parametric Estimation ofSelection Models:Some
Empirical Results,"
American
Economic Review
80(2), 324-328.Newey,
W.K.
(1988),"Two-Step
Series Estimation ofSample
Selection Models,"mimeo, Department
of Economics, Princeton University, October.Olsen, R.J., (1980),
"A
Least Squares Correction for Selectivity Bias," Econometrica 48, 1815-1820. Olsen, R.J., (1982), "Distributional Tests for Selectivity Bias and aMore
Robust Likelihood Estimator,"International
Economic Review
23, 223-240.Powell, B., and L.C. Steelman (1984), "Variations in State
SAT
Performance: Meaningful or Misleading?,"Harvard
EducationReview
54, 389-412.Robinson, P.M., (1988), "Root-N-Consistent Semi-parametric Regression," Econometrica 56, 931-954.
Rosenbaum,
P.R., (1984),"The Consequences
ofAdjustment for a Concomitant Variable that hasBeen
Affected by the Treatment," Journal ofthe Royal Statistical Society SeriesA
147 Part 5, 656-666.Rosenbaum,
P.R., and D.B. Rubin, (1983), "The Central Role ofthe Propensity Score in Observational Studiesfor Causal Effects," Biometrika 70, 41-55.
Rosenbaum,
P.R., and D.B. Rubin, (1984), "Reducing Bias in Observational studies Using Subclassification onthe Propensity Score,"
JASA
79, 516-524.Rosenbaum,
P.R., and D.B. Rubin, (1985), "Constructing a ControlGroup
using Multi-variateMatching
Methods
that include the Propensity Score,"American
Statistician 39, 33-38.Wainer,