• Aucun résultat trouvé

12 08

N/A
N/A
Protected

Academic year: 2022

Partager "12 08"

Copied!
41
0
0

Texte intégral

(1)

w o r k i n g

p a p e r

F E D E R A L R E S E R V E B A N K O F C L E V E L A N D

12 08

Local Average Neighborhood Effects from Moving to Opportunity

Dionissi Aliprantis and Francisca G.-C. Richter

(2)

Working papers of the Federal Reserve Bank of Cleveland are preliminary materials circulated to stimulate discussion and critical comment on research in progress. They may not have been subject to the formal editorial review accorded offi cial Federal Reserve Bank of Cleveland publications. The views stated herein are those of the authors and are not necessarily those of the Federal Reserve Bank of Cleveland or of the Board of Governors of the Federal Reserve System.

Working papers are available on the Cleveland Fed’s website at:

www.clevelandfed.org/research .

(3)

Working Paper 12-08 March 2012

Local Average Neighborhood Effects from Moving to Opportunity Dionissi Aliprantis and Francisca G.-C. Richter

This paper estimates Local Average Treatment Effects (LATEs) of neighborhood quality from the Moving to Opportunity (MTO) housing mobility experiment in a generalized model with multiple treatment levels. We propose a new approach to identifying parameters that exploits the identifi cation of unobservables in the multi-level model. The variation in neighborhood quality induced by MTO only allows us to identify LATEs of moving from the fi rst to the second decile of the national distribution of quality, but in other applications the approach may allow for the estimation of Marginal Treatment Effects. Estimated LATEs on employ- ment, labor force participation rates, earnings, income, welfare receipt, and body mass index are consistent with standard theories of neighborhood externalities.

Keywords: Marginal Treatment Effect, Local Average Treatment Effect,

Essential Heterogeneity, Strong Ignorability, Average Causal Response, Moving to Opportunity.

JEL Codes: C31, C36, C50, D04, I38, R23.

Dionissi Aliprantis is at the Federal Reserve Bank of Cleveland (dionissi.alipran- tis@clev.frb.org). Francisca G.-C. Richter is also at the Bank (francisca.g.richter@

clev.frb.org). The authors thank Joel Elvery, Fabian Lange, Jon James, and Ben

Craig for helpful comments. They also thank Emily Burgen and Nelson Oliver for

valuable research assistance.

(4)

1 Introduction

Since neighborhood externalities are widely believed to be a major determinant of outcomes in the United States, empirical evidence on the existence and magnitude of such neighborhood effects has been of great interest to social scientists. However, neighborhood characteristics are examples of treatments that are infeasible to randomly allocate to individuals. Researchers have not agreed on how best to apply non-experimental methods to investigate neighborhood effects, with many researchers broadly distrustful of non-experimental tools.

The results from the Moving to Opportunity (MTO) housing mobility program are a prominent example of the difficulties confronted when studying neighborhood effects. In MTO rental vouchers were randomly assigned to public housing residents of poor neighborhoods that could be used to re- locate to low-poverty neighborhoods. The lack of large effects from the program were disappointing (Kling et al. (2007), Sanbonmatsu et al. (2006)), especially given the strong positive effects from the program serving as its motivation, Gautreaux (Polikoff (2006), Rubinowitz and Rosenbaum (2000)). There has been controversy regarding the most appropriate techniques to learn about neighborhood effects from MTO (Ludwig et al. (2001), Clampet-Lundquist and Massey (2008), Sampson (2008)).

This paper generalizes previous analyses of effects from neighborhood quality using MTO voucher assignment as an instrumental variable. Specifically, we take as our starting point the analysis in Kling et al. (2007) and relax two key identifying assumptions (See Aliprantis (2012) for a full discussion.). We first relax the assumption that neighborhood quality is selected by households under strong ignorability by assuming selection occurs in a model with essential het- erogeneity. We also create and use in our analysis a linear index of neighborhood quality that uses several neighborhood characteristics in addition to the neighborhood poverty rate.

In order to identify neighborhood effects under these relaxed assumptions we specify and esti- mate a joint model of potential outcomes and selection into a multi-valued treatment. Our model adapts the assumptions from Heckman et al. (2006) to the specific application of MTO. We iden- tify individuals’ unobservable component in an ordered choice model, something not possible in the case of a binary treatment. We use these estimates and the exogenous variation induced by MTO on an underlying latent index to identify those individuals contributing to specific Local Average Treatment Effects (LATEs). In the context of our model these LATEs are the components of the Average Causal Response (ACR) from Angrist and Imbens (1995). Although the variation from MTO does not precisely estimate Marginal Treatment Effects (MTEs), we illustrate why we believe this approach also has potential for identifying specific MTEs in other applications with discrete instruments affecting all choice margins simultaneously.

The estimation results from the ordered choice model indicate that MTO had strongly het- erogeneous effects across sites. Those sites with residents living in the lowest quality neighbor- hoods experienced the greatest impact on neighborhood quality from the offer of an MTO voucher.

Ordered choice estimates also show that the Section 8 voucher was more effective at improving

neighborhood quality at very low levels, but that the experimental voucher became more effective

(5)

before the fifth percentile of the national distribution of neighborhood quality.

The variation in neighborhood quality induced by MTO is only able to precisely identify LATE parameters pertaining to moves from the first to the second decile of the national distribution of neighborhood quality. This move is exogenously induced by the MTO experimental voucher for about 13 percent of a restricted subsample of MTO volunteers. The LATE estimates for this subpopulation are consistent with the theory that improving neighborhood characteristics improves outcomes. There was a large increase in household income for those who moved from the first to second decile of neighborhood quality, although this increase was not a result of increases in the earnings of the respondent. The LATE on employment was quite large, and the fact that this estimate was larger than the increase in labor force participation rates is suggestive that employment improved due to changes in the local labor market rather than changes in household behavior alone.

Adult respondents and their children were also less likely to currently be on welfare if the program induced them from the first to second deciles of quality. And the LATE on body mass index also indicates there were improvements in health from exogenously increasing neighborhood quality.

The remainder of the paper is organized as follows: Section 2 develops the ordered choice model, defines the the treatment effect parameters of interest, and discusses assumptions necessary to identify these parameters. Section 3 describes the MTO experiment and Section 4 discusses the data used in estimation. Sections 5 and 6 present the estimation results, and Section 7 concludes.

2 Model

The model we use is based on the Marginal Treatment Effect (MTE) framework developed in Heckman et al. (2006) and Heckman and Vytlacil (2005), henceforth referred to as HUV and HV, respectively, and originally introduced by Bj¨orklund and Moffitt (1987). In a multi-valued treat- ment setting, identification of level-specific MTEs over the whole range of unobserved heterogeneity would typically be achieved using continuous instruments for each transition. Experimental voucher assignment from MTO is not such a set of instruments. Thus while we are able to relax some of the identifying assumptions in Kling et al. (2007), the three-level instrument from MTO only allows us to identify level-specific Local Average Treatment Effects (LATEs) for those transitions that are induced by the instrument.

2.1 Ordered Choice Model

We begin by modeling the way agents select into different levels of a multi-valued treatment.

Consider a model in which there is a treatment D with J levels (ie, D ∈ {1, . . . , J }), a binary instrument Z (ie, Z ∈ {0, 1}), and individuals have observable characteristics X. We assume that for each treatment level j ≤ J − 1 there is a latent index D

j

representing the net marginal benefit of moving to treatment level j + 1 from level j. Agents select level k

that maximizes total utility u(D):

k

= argmax

k∈{1,...,J}

u(k) (1)

(6)

where

u(k) =

 

u(1) if k = 1;

u(1) + $

k−1

j=1

D

j

if k ≥ 2.

We specify the J − 1 latent indeces to be functions of observable characteristics (Z, X) and unob- servable characteristics V with the following structure:

A1: D

ij

= µ(X

i

) + γ

j

Z

i

− V

i

− C

j

for j = 1, . . . , J − 1,

A2: C

j

< C

j+1

for j = 1, . . . , J − 1 with C

0

= −∞ and C

J

= ∞,

A3: C

j

− γ

j

< C

j+1

− γ

j+1

for j = 1, . . . , J − 1 with C

0

− γ

0

= −∞ and C

J

+ γ

J

= ∞.

Here µ(X) is the gain from moving up one level and is independent of the level. C

j

is a transition-specific cost that increases at each level. V is a continuously distributed random variable representing the unobserved cost for individual i from moving up one level. And γ

j

can be inter- preted as a reduction in the transition specific cost brought about by varying the instrument that preserves the cost ranking among levels (A3). Note that the initial level u(1) is important only as a way of indexing the model.

Let D

0

> max{0, D

1

} and define D

J

≡ −∞. Then given Z, the net marginal benefit D

j

is decreasing in j, so for k

∈ {1, . . . , J }

D = k

⇐⇒ D

k

≤ 0 < D

k−1

. (2) The model described in Equation 1, together with assumptions A1-A3, allow us to write condition 2 as an ordered choice model in terms of the unobservable characteristics that determine selection into treatment, V :

D = j ⇐⇒ µ(X) + γ

j

Z − C

j

< V ≤ µ(X) + γ

j−1

Z − C

j−1

. (3) Denoting the CDF of V conditional on X by F

V|X

(·), we can define π

Zj

(X) ≡ P r(D > j|Z, X) = F

V|X

(µ(X) + γ

j

Z − C

j

) and U

D

≡ F

V|X

(V |X). Defining π

Z0

≡ 1 for the sake of exposition, we can alternatively write the model in terms of the conditional quantiles of V , U

D

:

D(Z, X) =

%

J

j=1

j × 1 &

π

jZ

(X) < U

D

≤ π

j−1Z

(X) ' .

Figure 1 shows the ordered choice model in A1-A3 graphically. The top panel reflects marginal

utilities D

j

and the bottom panel illustrates the level of utility. D = k is selected by the agent

with the given realization of (Z, X, V ) since D

k−1

> 0 and D

k

≤ 0.

(7)

2.2 Marginal and Local Average Treatment Effects

We are interested in learning about features of the joint distribution of some outcome Y given treatment levels D ∈ {1, . . . , J}, observable characteristics X, and the unobservable com- ponent of selection into treatment, U

D

. In this analysis we will focus on the conditional mean E[Y (D, X, U

D

)|U

D

= u

D

]. We assume potential outcomes are a function of treatment level, ob- servable characteristics, and level-specific unobservable characteristics:

Y

j

(X, U

j

) = Y (D = j, X, U

j

) for j = 1, . . . , J. (4) As in HUV we add the following assumptions to A1-A3:

A4 (U

ij

, V

i

) ⊥ ⊥ Z

i

|X

i

for all j = 1, . . . , J

A5 γ

j

≥ 0 for all j = 1, . . . , J − 1 and γ

j

> 0 for at least one j ∈ {1, . . . , J − 1}

A6 V

i

is continuously distributed A7 E[ |Y

j

| ] < ∞ for all j = 1, . . . , J

A8 0 < P r(D

i

= j|X

i

) < 1 for all j = 1, . . . , J for all X ⊂ suppX

Given a realization of (X, U

D

) = (x, u

D

), the Marginal Treatment Effect (MTE) for the transition j to j + 1 is defined as the average effect of moving individuals with (x, u

D

) from level j to j + 1:

,

M T Ej,j+1

(x, u

D

) ≡ E[Y

j+1

− Y

j

|X = x, U

D

= u

D

] for j = 1, . . . , J − 1. (5) Given a realization of X = x, the corresponding Local Average Treatment Effect (LATE) for individuals with U

D

∈ (a, b) is defined as the average MTE over that interval:

,

LAT Ej,j+1

(X, a, b) ≡ (

b

a

,

M T Ej,j+1

(X, u

D

)du

D

b − a for j = 1, . . . , J − 1. (6) 2.3 Identification of MTE Parameters

One way of identifying MTE parameters in this model is to augment assumptions A1-A8, as done by HUV, with an assumption about the ordered choice model. HUV assumes there exist instrumental variables W

j

for each j = 1, . . . , J −1 such that the distribution of C

j

(W

j

), conditional on Z, X, and {C

h

: h -= j}, is nondegenerate and continuous. We label this assumption, which does not hold in our model, A9. Under A9 each margin of choice can be varied exogenously and independently of all others.

When evaluated at u

D

= π

jZ

(X), ,

M T Ej,j+1

(X, u

D

= π

Zj

(X)) represents the gross gain of mov-

ing from j to j + 1 for individuals that are indifferent between levels j and j + 1. HUV show

that index sufficiency holds in this model so that E[Y |Z, X] is equivalent to E[Y | π

Z

(X)], where

(8)

π ≡ [π

Z1

(X), π

Z2

(X), . . . , π

ZJ−1

(X)]. HUV further show that E[Y | π

Z

] is differentiable due to As- sumptions A4, A6, and A7. The j to j + 1 MTE can be interpreted as the change in mean outcome due to externally increasing π

jZ

while leaving all other π

kZ

’s fixed for k -= j:

,

M T Ej,j+1

(X, u

D

= π

Zj

) = ∂E[Y |X, π

Z

]

∂π

jZ

. (7)

Identification of MTE parameters is achieved in HUV using the exogenous variation in π

Zj

(X) induced by the W

j

to estimate the right hand side of Equation 7.

It is difficult to imagine a set of instruments {W

j

} each of which exogenously varies one margin of choice while leaving all other margins of choice unaffected in the context of residential choice.

Even if they existed, it is doubtful these instruments could be manipulated to identify the MTE function over the entire support of the distribution of (X, U

D

). It would be more likely that each level-specific instrument W

j

would vary the choice margin over some interval, but not over the entire unit interval. A discussion of related issues can be found in Carneiro et al. (2011) for a binary context.

The discrete instrument of voucher assignment in MTO does not continuously vary π

jZ

(X) for any j nor does it provide an independent source of variation for each of the transitions. Violation of assumption A9 leads us to adopt an alternative approach to identifying parameters that is based on the identification of U

D

in the ordered choice model and is therefore not possible in the binary model. Remembering that D = j iff π

jZ

< U

D

≤ π

Zj−1

and assuming π

0j

(x) < π

1j

(x) < π

0j−1

(x), an implication from the ordered choice model is that when Z = 0 an individual with X = x and u

D

∈ [π

j0

(x), π

1j

(x)] selects into D = j, but exogenously changing Z to 1 induces such individuals into D = j + 1. Thus for u

D

∈ [π

j0

(x), π

j1

(x)],

E[Y |x, u

D

, Z = 1] − E[Y |x, u

D

, Z = 0] = E[Y (j + 1)|x, u

D

] − E[Y (j)|x, u

D

] (8)

= ,

M T Ej,j+1

(x, u

D

).

Given estimates π ) and u )

D

, we can construct an empirical estimate of , )

M T Ej,j+1

(x, u

D

) using the left hand side of Equation 8. Specifically, when ) π

j0

(x) < π )

1j

(x) < π )

0j−1

(x), for ) u

D

∈ [ ) π

j0

(x), π )

j1

(x)]

we can estimate the MTE parameter as

, )

M T Ej,j+1

(x, u

D

) = E[Y |x, ) u

D

, Z = 1] − E[Y |x, ) u

D

, Z = 0]. (9) 2.4 Identification of LATE Parameters

As the building blocks of LATEs and other parameters, MTEs are the ultimate object of interest

in our model. Although we can identify the MTE function over a subset of its support as discussed

above, there are still reasons for estimating LATE parameters. One reason is that LATE parameters

will be more precisely estimated and will be less dependent on functional form and distributional

assumptions than MTE parameters. Another reason for estimating LATEs is that discussing their

(9)

identification will also help to clarify the identification of MTEs in our model.

Turning to the estimation of LATEs, we start by deriving an expression for E[Y |x, u

D

, Z = 1] − E[Y |x, u

D

, Z = 0] general enough to allow for any ordering relationship between π

0

(X) and π

1

(X). Assumption A5 requires that π

1j

(X) ≥ π

j0

(X) for all j and π

j1

(X) > π

j0

(X) for some j, since the instrument monotonically increases individuals’ latent index. And from A4 the variation in average outcomes induced by the instrument can be attributed to changing π

Z=0

to π

Z=1

(from this point forward we keep conditioning on X implicit for the sake of exposition):

E[Y |Z = 1] − E[Y |Z = 0] =

%

J

j=1

*

1

0

1{π

1j

≤ u

D

< π

j−11

}E[Y (D = j, u

D

)]du

D

(10)

%

J

j=1

*

1

0

1{π

j0

≤ u

D

< π

j−10

}E[Y (D = j, u

D

)]du

D

=

%

J

j=1

+*

π1j1 π1j

E[Y (D = j, u

D

)]du

D

(11)

*

π1j1

π1j

%

J

k=1

,

1{π

k0

≤ u

D

< π

k−10

}E[Y (D = k, u

D

)]

- du

D

. .

Given equation 6 and since

E[Y (D = j, u

D

)] − E[Y (D = j − m, u

D

)] =

j−m+1

%

k=j

/

E[Y (D = k, u

D

)] − E[Y (D = k − 1, u

D

)]

0

we can rewrite Equation 10 as:

E[Y |Z = 1] − E[Y |Z = 0] =

J−1

%

j=1

*

π1j

π0j

,

M T Ej,j+1

(u

D

)du

D

=

J−1

%

j=1

/

,

LAT Ej,j+1

0j

, π

1j

) 1

π

1j

− π

0j

2 0

(12)

Given estimates π ) and u )

D

from the ordered choice model we can construct empirical estimates of LATEs. When ) π

j0

< ) π

1j

< π )

0j−1

the LATE is estimated as:

, )

LAT Ej,j+1

j0

, π

j1

) = E[Y | u )

D

∈ [ ) π

j0

, π )

1j

], Z = 1] − E[Y | u )

D

∈ [ ) π

j0

, ) π

j1

], Z = 0]. (13) The precise LATEs identified will be determined by the exogenous variation in the choice probabil- ities π

Z

induced by the instrument. Two examples with D ∈ {1, 2, 3} are illustrated in Figures 2 and 3, and these examples are discussed in the Appendix to clarify issues related to identification.

A related discussion can also be found in Section VI A of HUV.

(10)

2.5 Discussion of Identifying Assumptions

We believe Assumptions A1-A8 represent the weakest assumptions under which we can identify interesting parameters in a model with a multi-valued treatment. We now consider alternative sets of identifying assumptions and discuss why we believe these assumptions are either too strong or too weak relative to A1-A8.

2.5.1 Strengthening A1-A8

One alternative approach would be to strengthen assumptions A1-A8. One particular assump- tion would allow us to estimate average treatment effects over the support of the distribution of a continuous treatment D using the generalized propensity score as developed in Imai and van Dyk (2004) or Hirano and Imbens (2005). However, while relatively standard, the Strong Ignorability (SI) assumption necessary for identification in Imai and van Dyk (2004) is restrictive relative to the framework we have adopted from HV and HUV.

1

When f denotes the distribution of potential outcomes, SI can be written as:

f{Y (D = j)|X} = f {Y (D = j)|u

D

∈ [a, b], X} for all j ∈ {1, . . . , J }. (14) From our specification of potential outcomes in Equation 4, this is the same as:

f {u

j

|X} = f {u

j

|u

D

∈ [a, b], X} for all j ∈ {1, . . . , J }, (15) or

u

j

⊥ ⊥ u

D

|X for all j ∈ {1, . . . , J }. (16) When calculating E[Y (D = j|U

D

)] as in Figures 2 and 3, the expectation is taken over the distri- bution of U

j

conditional on X and U

D

. Thus SI requires that conditional on X, the distribution of the Y

j

, and therefore their expected values as well, would have to be the same for all values of U

D

as shown in Figure 4. This assumption is most likely to hold when observable characteristics in X are able to explain most of the variability in choice, so deciding whether to adopt the SI assumption will depend on the particular application and data available.

Figure 5 shows a binary example to help illustrate the differences between the heterogeneity in treatment effects allowed under our assumptions A1-A8 and under SI. Let θ = µ(X) be an index of observable characteristics. The average effect of treatment varies across θ as shown in the top panel of the Figure. In the center panel we can see a cross section of potential outcomes conditional on θ = θ

. Since E[β |θ] = E[Y (1) − Y (0)|θ] is the same for all u

D

conditional on θ = θ

, any variation in treatment identifies E[β|θ]. We could use variation in treatment induced by an instrument, but we could also simply compare those individuals in the population or control group with u

D

< π

10

and those with u

D

> π

10

to estimate the treatment effect. That is, when matching under SI there is no need for an instrument.

1

See Imbens (2004) for a discussion of this assumption in models with a binary treatment.

(11)

In contrast to the center panel, the bottom panel shows a possible example of MTEs that depend on u

D

even conditional on θ. This is defined in HUV as a model with Essential Heterogeneity (EH).

In this case we need an instrument to generate variation in treatment status, and the variation gen- erated by the instrument determines what part of the distribution of Y (2)−Y (1)|θ

we can identify.

Since E[Y (2) − Y (1)|θ

] = (

1

0

,

M T E1,2

, u

D

)du

D

, in the example in the bottom panel we cannot identify E[Y (2) − Y (1)|θ

], but rather only (

π11

π01

,

M T E1,2

, u

D

)du

D

. That the treatment effects we can identify are determined by the response of individuals to the instrument re-emphasizes that the neighborhood effects identified by MTO, or any other housing mobility experiment, depend on how people endogenously respond to the experiment. That is, under EH it is not possible to clearly interpret the neighborhood effects we observe through MTO without first understanding how the experiment impacted selection into treatment (Aliprantis (2012), Clampet-Lundquist and Massey (2008), Sampson (2008)).

2.5.2 Weakening A1-A8

Another alternative would be to weaken A1-A8, which would allow us to estimate the Average Causal Response (ACR) parameter introduced in Angrist and Imbens (1995). Under Assumptions A1-A8 the ACR is:

,

ACR

(Z = 0, Z = 1) = E[Y |Z = 1] − E[Y |Z = 0]

E[D|Z = 1] − E[D|Z = 0] =

$

J−1

j=1

[ (

π1j

π0j

,

M T Ej,j+1

(u

D

)du

D

]

$

J−1

j=1

(j + 1) × (π

1j

− π

0j

)

By relaxing Assumptions A2 and A3 we could specify a set of identifying assumptions equivalent to the assumptions in Angrist and Imbens (1995) (See Vytlacil (2006) for a proof.), and we would still be allowing for EH. We do not estimate ACRs under these weaker assumptions, however, because they yield only one summary parameter that is quite difficult to interpret. By imposing the structure of the choice model in Section 2.1 we are able to allow for EH while at the same time decomposing the ACR into its contributing LATEs. These components of the ACR are considerably more interesting than the single ACR parameter by itself. Given our current econometric tools we believe assumptions A1-A8 strike the right balance of structure for interpretation.

3 Moving To Opportunity (MTO)

Moving To Opportunity (MTO) was inspired by the promising results of the Gautreaux pro-

gram. Following a class-action lawsuit led by Dorothy Gautreaux, in 1976 the Supreme Court

ordered the Department of Housing and Urban Development (HUD) and the Chicago Housing Au-

thority (CHA) to remedy the extreme racial segregation experienced by public-housing residents in

Chicago. One of the resulting programs gave families awarded Section 8 public housing vouchers

the ability to use them beyond the territory of CHA, giving families the option to be relocated

either to suburbs that were less than 30 percent black or to black neighborhoods in the city that

(12)

were forecast to undergo “revitalization” (Polikoff (2006)).

The initial relocation process of the Gautreaux program created a quasi-experiment, and its results indicated housing mobility could be an effective policy. Relative to city movers, suburban movers from Gautreaux were more likely to be employed (Mendenhall et al. (2006)), and the chil- dren of suburban movers attended better schools, were more likely to complete high school, attend college, be employed, and had higher wages than city movers (Rosenbaum (1995)).

2

MTO was designed to replicate these beneficial effects, offering housing vouchers to eligible households between September 1994 and July 1998 in Baltimore, Boston, Chicago, Los Angeles, and New York (Goering (2003)). Households were eligible to participate in MTO if they were low-income, had at least one child under 18, were residing in either public housing or Section 8 project-based housing located in a census tract with a poverty rate of at least 40%, were current in their rent payment, and all families members were on the current lease and were without criminal records (Orr et al. (2003)).

Families were drawn from the MTO waiting list through a random lottery. After being drawn, families were randomly allocated into one of three treatment groups. The experimental group was offered Section 8 housing vouchers, but were restricted to using them in census tracts with 1990 poverty rates of less than 10 percent. However, after one year had passed, families in the experimental group were then unrestricted in where they used their Section 8 vouchers. Families in this group were also provided with counseling and education through a local non-profit. Families in the Section-8 only comparison group were provided with no counseling, and were offered Section 8 housing vouchers without any restriction on their place of use. And families in the control group received project-based assistance.

3

4 Data

The first source of data we use in our analysis is the MTO Interim Evaluation. The MTO Interim Evaluation contains variables listing the census tracts in which households lived at both the baseline and in 2002, the time the interim evaluation was conducted. These census tracts are used to merge the MTO sample with decennial census data from the National Historical Geographic Information System (NHGIS, Minnesota Population Center (2004)), which provide measures of neighborhood characteristics.

4.1 Variables

A variable measuring neighborhood quality is created as a linear combination of several neigh- borhood characteristics. Neighborhood characteristics measured by NHGIS variables are first trans-

2

It has also been found that suburban movers have much lower male youth mortality rates Votruba and Kling (2009) and tend to stay in high-income suburban neighborhoods many years after their initial placement (DeLuca and Rosenbaum (2003), Keels et al. (2005)).

3

Section 8 vouchers pay part of a tenant’s private market rent. Project-based assistance gives the option of a

reduced-rent unit tied to a specific structure.

(13)

formed into percentiles of the national distribution from the 2000 census. Principal components analysis is then used to determine which single vector combines the most information about the national distribution of the poverty rate, the percent with high school degrees, the percent with BAs, the percent of single-headed households, the male Employed-to-Population Ratio (EPR), and the female unemployment rate. Table 1 shows that the resulting univariate index explains 63 per- cent of the variance of these neighborhood characteristics, and that no additional eigenvector would explain more than 13 percent of the variance of these variables. Table 2 displays the coefficients relating each of these variables to the index vector. Note that the magnitudes of the coefficients for most variables are similar to the magnitude of the coefficient for poverty.

In addition to the quality as measured in 2000 of the neighborhood of residence at the baseline and in 2002, another neighborhood characteristic recorded at the baseline of the MTO data used in the analysis is whether the respondent had family living in their neighborhood of residence. The household characteristic variables used in the analysis were measured at the MTO baseline and include whether the adult respondent had ever applied for a Section 8 voucher, whether a member of the household was a victim of a crime in the previous 6 months, whether the respondent had a car that ran, and whether there were teenage children in the household.

Outcome variables from the MTO Interim Evaluation include the self-reported total household income (all sources summed), the individual earnings in 2001 of the sample adult, the labor market status of the adult at the time of the interim survey (ie, Whether they were employed, unemployed, or not in the labor force at all.), and the respondent’s body mass index (BMI). Weights are used in constructing all estimates.

4

4.2 Sample and Descriptive Statistics

The focus of this analysis is adults in the MTO Interim Evaluation sample. We choose not to include children in our analysis for two reasons. First, careful analysis of program effects from MTO on children’s outcomes has already been conducted (Sanbonmatsu et al. (2006), Kling et al.

(2007)). Second, as shown in Figure 6 and investigated in depth in DeLuca and Rosenblatt (2010), MTO did not induce large changes in school quality.

5

Both intuition and previous findings in the literature suggest schools may be the most relevant “neighborhood” for children (Oreopoulos (2003)).

We further restrict our sample in three ways to ensure that we are focusing on a relatively homogeneous population, facilitating the estimation of the ordered choice model. We first exclude Boston from the analysis because it is clearly an outlier relative to the other MTO sites. In Figure 7a we see that unlike all of the other sites, the baseline neighborhood quality in Boston was

4

Weights are used for two reasons. First, random assignment ratios varied both from site to site and over different time periods of sample recruitment. Randomization ratio weights are used to create samples representing the same number of people across groups within each site-period. This ensures neighborhood effects are not conflated with time trends. Second, sampling weights must be used to account for the sub-sampling procedures used during the interim evaluation data collection.

5

The advent of school choice may be important for these results: Thirty percent of MTO control group children

in Chicago and Los Angeles were attending magnet schools (Sanbonmatsu et al. (2006), p 684).

(14)

not confined to the first percentiles of the national distribution of neighborhood quality. Table 3 quantifies these differences precisely. We can see that the control group in Boston looks more like the experimental group in every other site. Observations from Boston comprised a little less than a quarter of the full sample.

6

Second, we drop all households living at baseline in a neighborhood above the tenth percentile of the national distribution of quality. Returning to Table 3, we can see that the median baseline neighborhood quality for MTO participants was below the first percentile of the national distri- bution. For Chicago, Los Angeles, and New York City, nearly all participants lived at baseline in neighborhoods below the 10th percentile of the national distribution. In Baltimore, however, at baseline about 20 percent of participants lived in much higher quality neighborhoods, driven mainly by the male EPR and the share of adults holding a BA in their neighborhoods. These individuals are also dropped from our sample.

Finally, we drop individuals who moved to neighborhoods in 2002 above the median of the national distribution of quality in 2000. We can see from Figure 8c that this is a small subsample to discard.

The resulting sample used in the analysis has more than 2,500 adults, approximately 65 percent of the original adult sample. Approximately 1,800 are in either the experimental or control groups.

Figure 8 shows the distribution of the sample both at the baseline and in 2002. 75 percent of the sample lives in 2002 in a neighborhood below the 10th percentile of the 2000 national distribution of neighborhood quality. Figure 8a confirms that MTO induced small shares of participants into high quality neighborhoods (Aliprantis (2012)). Neighborhoods in New York City are among the ones with the lowest quality in the sample at baseline. Figure 7f shows that in 2002, neighborhood quality for the New York City control group remained almost unchanged, while at other sites quality experienced small improvements.

5 Ordered Choice Model Estimates

5.1 Specification

In the ordered choice model from Section 2.1 the marginal benefit of choosing to move from treatment level j to j + 1 is

D

j

= µ(X) + Zγ

j

− C

j

− V.

We specify

µ(X) = β

1

X

1

+ · · · + β

m

X

m

(17)

γ

j

= γ

0

+ γ

j

(18)

C

j

= δ

0

+ δ

j

(19)

6

Section 5 will also present estimation results from a model including data from Boston that further confirms the

singular nature of the site.

(15)

where both γ

0

and δ

0

are site-specific, and we assume V ∼ N (0, 1). X is composed of variables measuring baseline neighborhood quality, previous use of a Section 8 voucher, whether a household member was the victim of a certain crimes in the past six months, whether the respondent had a car, whether there were teens in the house, and whether the respondent had family members in the neighborhood. The instrument Z is an indicator variable for assignment to the experimental voucher group in the MTO experiment: Z

i

= 1{i is in experimental group}.

7

An implication of the ordered choice model that was stated in Equation 3 is:

D = j ⇐⇒ µ(X) + Zγ

j

− C

j

< V ≤ µ(X) + Zγ

j−1

− C

j−1

. (20) So if treatment levels are defined such that D = j for q

∈ [q

j

, q

j

], then the probability of observing D = j is

P r(D = j) = F

V

[µ(X) + Zγ(q

j

) − C(q

j

)] − F

V

[µ(X) + Zγ(q

j

) − C(q

j

)], allowing us to write the log-likelihood of the data as

LL =

%

N

i=1

%

J

j=1

1{D

i

= j} ln(P r(D

i

= j)). (21)

This ordered probit model is estimated with the gologit2 procedure in Stata.

5.2 The Identification of Unobservables

We now discuss how we use the parameter estimates from the discrete ordered choice model θ )

J

≡ ( β, ) ) γ, δ) to recover estimates of each individual’s unobservable ) V . If we take the limit of the ordered choice model as J → ∞, the benefit of moving to level j + 1 from j, D

j

, becomes the marginal benefit of moving to a neighborhood with higher quality. We denote the marginal benefit of moving to a higher quality neighborhood when in a neighborhood of quality q as

D

(q) = µ(X) + Zγ(q) − C(q) − V

where the cost and program effect components of marginal benefit are now allowed to be continuous functions of quality:

µ(X) = β

1

X

1

+ · · · + β

m

X

m

(22)

γ (q) = f (q) (23)

C(q) = g(q). (24)

7

We will actually define binary instruments for both the experimental and Section 8 groups, (Z

exp, ZS8

). Although

we will use information from both voucher groups and the control group to estimate the ordered choice model, for

the sake of tractability both the ensuing discussion of the ordered choice model and the estimation of LATEs will

focus on comparing the experimental voucher with the control treatment.

(16)

Given estimates of γ(q

j

) = γ

j

from the ordered choice model, we linearly interpolate γ (q) = γ

0

+ γ

j

+ (q − q

j

)( γ

j+1

− γ

j

q

j+1

− q

j

) for q ∈ (q

j

, q

j+1

) = (q

j+1

, q

j+1

)

and an analogous function g for cost. We set {q

j

}

J−1j=1

to the cutoffs separating the deciles of the sample distribution of neighborhood quality.

If we assume D

is strictly decreasing and households maximize their utility at q

, then q

satisfies the following first order condition equivalent to conditions in Equations 2 and 3 in the multilevel case:

D

(q

) = 0. (25)

Given the estimated parameters and functions θ ≡ ( β, ) ) γ(q), C(q)) constructed from the discrete ) choice model parameters θ )

J

, we use the FOC in Equation 25 to identify unobservables as:

8

V ) = µ(X) + ) Z ) γ (q

) − C(q )

).

For each individual we also estimate )

π

q0

j

= Φ[ β )

1

X

1

+ · · · + β )

m

X

m

− C(q )

j

)] (26) )

π

q1

j

= Φ[ β )

1

X

1

+ · · · + β )

m

X

m

− C(q )

j

) + ) γ(q

j

)]. (27) 5.3 Estimation Results

As a first check on model fit, Figure 9 shows the unconditional average π

qZ

from the data together with fractional polynomial fitting of ) π

qZ

estimates as functions of µ(X). We can see that ) the model fits the data well.

As measured by the estimated µ(X), the distributions of observable characteristics conditional ) on voucher status confirm that voucher assignment was successfully randomized (Figure 10a). The distribution of unobservables follows a standard normal distribution very closely (Figure 10b), and the distributions of unobservables conditional on experimental group status look similar (Figure 10c).

Figure 11a shows the joint distribution of ( µ(X), ) u )

D

) along with fractional polynomial fittings of ) π

qZ

for several quantiles q as functions of ) µ(X). Looking at this joint distribution more closely in Figure 11b, we can see that from one perspective MTO had only modest effects on neighborhood quality, while from another it had relatively large effects.

One perspective looks at Figures 11a and 11b and sees that even for the experimental group, the vast majority of MTO volunteers remained in a neighborhood in the first decile of the national distribution of quality. Even while remembering that our sample does not include those who moved

8

This is a notable departure from HUV. In the binary case, HUV define the support over which MTEs can be

identified as the intersection of the sets of predicted values of the propensity score obtained from the treated and

untreated samples. With linear interpolation we obtain estimates of

UD

throughout the entire interval (0, 1).

(17)

to neighborhoods above the national median, we still see that in our sample (as in the entire MTO population) only a small share of volunteers were induced across the 30th percentile of neighborhood quality.

Another perspective first notes that at baseline the MTO volunteers were representative of the one percent living in the left tail of the distribution of neighborhood quality (Figure 7a). Although MTO volunteers tended to live in better neighborhoods by the time of the interim evaluation, it was still the case that less than 15 percent of the control group in our sample lived in neighborhoods of quality above the 10th percentile of the national distribution (Figure 8d). Given such an initial distribution, the MTO voucher did induce many volunteers into higher quality neighborhoods.

The cutoffs for moving across the 20th and 30th percentiles of the national distribution for the experimental group ( π )

120

(X) and ) π

130

(X)) look similar to the cutoff for the 10th percentile for the control group ( ) π

100

(X)).

Perhaps the most interesting result from the estimation of the ordered choice model is the strong heterogeneity in the effects of the program as displayed in Figure 12. Those sites with the lowest marginal benefit of increasing neighborhood quality in the absence of the program (Los Angeles and New York City, see Figure 12a.) were also the sites where the MTO experimental voucher had the largest effect on the marginal benefit of moving (Figure 12d). And not only were effects heterogeneous by site, but also by voucher type, which is important for the design of future housing mobility programs (Galiani et al. (2012)). The standard Section 8 housing voucher was more effective at reducing moving costs than the experimental voucher at very low levels of neighborhood quality, but this relationship reverses somewhere before the fifth percentile of the national distribution of neighborhood quality.

Figure 12c shows the observed component of the marginal benefit function for the estimated model that includes Boston. We can see again that Boston is a clear outlier relative to the other MTO sites, warranting its removal from the analysis. Table 4 reports the estimated coefficients in µ(X).

6 Neighborhood Effects Estimates

6.1 LATEs

When estimating effects we redefine treatment levels in terms of deciles of the national distri- bution (ie, D = 1 if q

∈ [0, 10), . . . , D = 5 if q

∈ [40, 50).). Referring back to Figure 11b, it is not surprising that either single-level LATEs or combinations of several level LATEs are imprecisely estimated as one moves to higher quality. The only single-level LATE we are able to precisely estimate is ,

LAT E1,2

(max{ π )

120

, ) π

010

}, ) π

101

).

Table 5 shows estimates for several outcomes of ,

LAT E1,2

that are averaged over observables X together with estimates of the interval (max{ ) π

201

, ) π

100

}, π )

101

) to which the LATE pertains. Standard errors for each of these estimates are computed using 400 bootstrap replications.

9

9

Weights were used in constructing point estimates, but were not used in constructing standard errors. Weighted

(18)

A first point we can see in Table 5 is that the identified parameter ,

LAT E1,2

is the average MTE over only 12.5 percent of our subsample of MTO volunteers, with this 12.5 percent being located among those individuals with low unobservables U

D

. Although we are only able to identify a few parameters among many of interest, the estimated LATEs for this subpopulation conform to the theory that improving neighborhood characteristics improves outcomes.

There was a large increase in household income for those who moved from the first to second decile of neighborhood quality, although this increase was not a result of increases in the earnings of the respondent. The LATE on employment was quite large, and the fact this estimate was larger than the increase in labor force participation rates is suggestive that employment improved due to changes in the local labor market rather than changes in household behavior alone. Adult respondents and their children were also less likely to currently be on welfare if the program induced them from the first to second deciles of neighborhood quality. And the LATE on body mass index also indicates there were improvements in health from exogenously increasing neighborhood quality.

6.2 MTEs

Figure 13 shows estimates of E[Y (D = j)|U

D

] when Y (D) is assumed to be linear in U

D

using for identification all ) u

D

∈ [ ) π

jZ

, π )

j−1Z

] for each voucher group. We see the expected pattern of improved outcomes as quality increases. In the case of MTO it is not possible to precisely estimate MTEs using identified unobservables u )

D

and functional form assumptions as illustrated in Figure 13, and as a result this Figure is merely suggestive of the true parameters. We nevertheless show Figure 13 because we believe this approach may be able to estimate MTEs more precisely within an ordered choice framework in other applications.

7 Conclusion

This paper presented a new approach to identifying MTE and LATE parameters that exploits the identification of unobservables in a multi-level model of treatment. Estimation results from an ordered choice model indicated that the Moving to Opportunity (MTO) housing mobility experi- ment had strongly heterogeneous effects across sites. Those sites with residents living in the lowest quality neighborhoods experienced the greatest impact on neighborhood quality from the offer of an MTO voucher. Ordered choice estimates also showed that the Section 8 voucher was more effective at improving neighborhood quality at very low levels, but that the experimental voucher became more effective before the fifth percentile of the national distribution of neighborhood quality.

The variation in neighborhood quality induced by MTO was only able to precisely identify LATE parameters pertaining to a subset of about 13 percent of a restricted subsample of MTO volunteers.

These LATE estimates for employment, labor force participation rates, earnings, income, welfare receipt, and body mass index conformed to standard theories of neighborhood externalities.

and unweighted point estimates were similar.

(19)

References

Aliprantis, D. (2012). Assessing the evidence on neighborhood effects from Moving to Opportunity.

Federal Reserve Bank of Cleveland Working Paper 11-22r .

Angrist, J. D. and G. W. Imbens (1995). Two-stage least squares estimation of average causal effects in models with variable treatment intensity. Journal of the American Statistical Associa- tion 90 (430), 431–442.

Bj¨ orklund, A. and R. Moffitt (1987). The estimation of wage gains and welfare gains in self-selection models. The Review of Economics and Statistics 69 (1), pp. 42–49.

Carneiro, P., J. J. Heckman, and E. J. Vytlacil (2011). Estimating marginal returns to education.

American Economic Review 101 (6), 2754–2781.

Clampet-Lundquist, S. and D. S. Massey (2008). Neighborhood effects on economic self-sufficiency:

A reconsideration of the Moving to Opportunity experiment. American Journal of Sociol- ogy 114 (1), 107–143.

DeLuca, S. and J. E. Rosenbaum (2003). If low-income blacks are given a chance to live in white neighborhoods, will they stay? Examining mobility patterns in a quasi-experimental program with administrative data. Housing Policy Debate 14(3), 305–345.

DeLuca, S. and P. Rosenblatt (2010). Does moving to better neighborhoods lead to better schooling opportunities? Parental school choice in an experimental housing voucher program. Teachers College Record 112 (5), 1443–1491.

Galiani, S., A. Murphy, and J. Pantano (2012). Estimating neighborhood choice models: Lessons from the Moving to Opportunity experiment. Mimeo., Washington University in St. Louis . Goering, J. (2003). The impacts of new neighborhoods on poor families: Evaluating the policy

implications of the Moving to Opportunity demonstration. Economic Policy Review 9(2).

Heckman, J. J., S. Urz´ ua, and E. Vytlacil (2006). Understanding Instrumental Variables in models with essential heterogeneity. The Review of Economics and Statistics 88 (3), 389–432.

Heckman, J. J. and E. Vytlacil (2005). Structural equations, treatment effects, and econometric policy evaluation. Econometrica 73 (3), 669–738.

Hirano, K. and G. W. Imbens (2005). The propensity score with continuous treatments. In A. Gel- man and X.-L. Meng (Eds.), Applied Bayesian Modeling and Causal Inference from Incomplete- Data Perspectives: An Essential Journey with Donald Rubin’s Statistical Family, pp. 73–84. John Wiley & Sons.

Imai, K. and D. A. van Dyk (2004). Causal inference with general treatment regimes: Generalizing

the propensity score. Journal of the American Statistical Association 99 (467), 854–866.

(20)

Imbens, G. W. (2004). Nonparametric estimation of average treatment effects under exogeneity: A review. The Review of Economics and Statistics 86 (1), pp. 4–29.

Keels, M., G. J. Duncan, S. Deluca, R. Mendenhall, and J. Rosenbaum (2005). Fifteen years later:

Can residential mobility programs provide a long-term escape from neighborhood segregation, crime, and poverty? Demography 42 (1), pp. 51–73.

Kling, J. R., J. B. Liebman, and L. F. Katz (2007). Experimental analysis of neighborhood effects.

Econometrica 75 (1), 83–119.

Ludwig, J., G. J. Duncan, and P. Hirschfield (2001). Urban poverty and juvenile crime: Evidence from a randomized housing-mobility experiment. The Quarterly Journal of Economics 116 (2), 655–679.

Mendenhall, R., S. DeLuca, and G. Duncan (2006). Neighborhood resources, racial segregation, and economic mobility: Results from the Gautreaux program. Social Science Research 35 (4), 892–923.

Minnesota Population Center (2004). National Historical Geographic Information System (Pre- release Version 0.1 ed.). Minneapolis, MN: University of Minnesota. http://www.nhgis.org.

Oreopoulos, P. (2003). The long-run consequences of living in a poor neighborhood. The Quarterly Journal of Economics 118 (4), pp. 1533–1575.

Orr, L. L., J. D. Feins, R. Jacob, E. Beecroft, L. Sanbonmatsu, L. F. Katz, J. B. Liebman, and J. R. Kling (2003). Moving to Opportunity: Interim Impacts Evaluation. Washington, DC: US Department of Housing and Urban Development, Office of Policy Development and Research.

Polikoff, A. (2006). Waiting for Gautreaux. Northwestern University Press.

Rosenbaum, J. E. (1995). Changing the geography of opportunity by expanding residential choice:

Lessons from the Gautreaux program. Housing Policy Debate 6 (1), 231–269.

Rubinowitz, L. S. and J. E. Rosenbaum (2000). Crossing the Class and Color Lines: From Public Housing to White Suburbia. University of Chicago Press.

Sampson, R. J. (2008). Moving to inequality: Neighborhood effects and experiments meet social structure. American Journal of Sociology 114 (1), 189–231.

Sanbonmatsu, L., J. R. Kling, G. J. Duncan, and J. Brooks-Gunn (2006). Neighborhoods and academic achievement: Results from the Moving to Opportunity experiment. The Journal of Human Resources 41 (4), 649–691.

Votruba, M. E. and J. R. Kling (2009). Effects of neighborhood characteristics on the mortality

of black male youth: Evidence from Gautreaux, Chicago. Social Science & Medicine 68 (5),

814–823.

(21)

Vytlacil, E. (2006). Ordered discrete-choice selection models and local average treatment effect assumptions: Equivalence, nonequivalence, and representation results. The Review of Economics and Statistics 88 (3), 578–581.

8 Appendix: Two Examples with D ∈ {1, 2, 3}

We illustrate the identification of LATEs in our model for two examples with J = 3 as shown in Figures 2-3. First suppose we tighten Assumption A6 to:

A6* π

1j

> π

j−10

for all j ∈ {1, . . . , J − 1}.

Example I in Figure 2 shows such an ordering when J = 3. Under A6* the right hand side of Equation 12 can be derived quickly since

E[Y |Z = 1] =

%

J

j=1

*

π1j−1

π1j

E[Y (D = j, u

D

)]du

D

=

*

1

π11

E[Y (D = 1, u

D

)]du

D

+

J−1

%

j=1

/*

π1j π0j

E[Y (D = j + 1, u

D

)]du

D

+

*

π0j

π1j+1

E[Y (D = j + 1, u

D

)]du

D

0

and

E[Y |Z = 0] =

%

J

j=0

*

π0j−1

π0j

E[Y (D = j, u

D

)]du

D

=

*

1

π11

E[Y (D = 1, u

D

)]du

D

+

J−1

%

j=1

/*

πj1 π0j

E[Y (D = j, u

D

)]du

D

+

*

πj0

π1j+1

E[Y (D = j + 1, u

D

)]du

D

0

.

Thus the difference in expected outcomes due to changes in the instrument is the sum of integrated MTEs:

E[Y |Z = 1] − E[Y |Z = 0] =

J

%

−1

j=1

3*

π1j π0j

E[Y (D = j + 1, u

D

)]du

D

*

π1j

πj0

E[Y (D = j, u

D

)]du

D

4

=

J

%

−1

j=1

*

πj1

π0j

,

M T Ej,j+1

(u

D

)du

D

=

J−1

%

j=1

/

,

LAT Ej,j+1

0j

, π

1j

) 1

π

j1

− π

0j

2 0 . (28)

Since we can recover π

Z

= [π

Z1

, π

2Z

, . . . , π

ZJ−1

] and we can tell in which [π

0j

, π

j1

] interval u

D

lies from

the data, we can estimate these LATEs using Equation 13. The variation in treatment induced by

(22)

the instrument identifies:

,

LAT E1,2

10

, π

11

) = (

π11

π01

,

M T E1,2

(u

D

)du

D

π

11

− π

01

= E[Y |u

D

∈ [π

01

, π

11

], Z = 1] − E[Y |u

D

∈ [π

01

, π

11

], Z = 0]

and

,

LAT E2,3

02

, π

21

) = (

π12

π02

,

M T E2,3

(u

D

)du

D

π

12

− π

02

= E[Y |u

D

∈ [π

02

, π

12

], Z = 1] − E[Y |u

D

∈ [π

20

, π

12

], Z = 0].

An important issue to remember is that all of the preceding has been implicitly conditioning on observable characteristics X. Figure 14 shows that Example I in Figure 2 is just one cross section taken from an interval of observable characteristics.

Now suppose that we drop A6* and replace it with the less restrictive original Assumption A6.

In this case it is possible that π

1j

> π

j−10

for some j. Let u

D

∈ [π

1m

, π

m−11

] and u

D

∈ [π

n0

, π

n−10

] for some m, n ∈ {1, . . . , J − 1} where m > n. Then Equation 12 implies

E[Y (u

D

)|Z = 1] − E[Y (u

D

)|Z = 0] =

m−1

%

j=n

,

M T Ej,j+1

(u

D

).

Thus if a = max{π

1m

, π

n0

} and b = min{π

1m−1

, π

0n−1

}, we can identify:

E[Y |u

D

∈ [a, b], Z = 1] − E[Y |u

D

∈ [a, b], Z = 0] = (

b

a

$

m−1

j=n

,

M T Ej,j+1

(u

D

)du

D

b − a (29)

This scenario highlights that the precise LATEs identified will be determined by the exogenous variation in the choice probabilities π

Z

induced by the instrument. Note that if π

j1

> π

j−10

for some j, the corresponding LATE parameter is still separately identified over the interval

,

max{π

j0

, π

1j+1

}, min{π

0j−1

, π

j1

} -

.

But if max{π

0j

, π

1j+1

} -= π

0j

or min{π

0j−1

, π

j1

} -= π

j1

, then the level-specific LATE parameters will not be separately identified over the entire interval [π

0j

, π

j1

].

Comparing Example I and Example II in Figure 3 helps to illustrate how the ordering of the π

jZ

determines identification. Since π

12

> π

01

in Example II, the instrument identifies

,

LAT E1,2

21

, π

11

) = (

π11

π12

,

M T E1,2

(u

D

)du

D

π

11

− π

12

= E[Y |u

D

∈ [π

12

, π

11

], Z = 1] − E[Y |u

D

∈ [π

12

, π

11

], Z = 0]

(23)

However, over the interval [π

10

, π

12

] = [π

10

, min{π

21

, π

11

}] we cannot separately identify each level- specific LATE. Instead the instrument identifies:

,

LAT E1,3

10

, π

12

) = ,

LAT E1,2

01

, π

21

) + ,

LAT E2,3

10

, π

12

)

= E[Y |u

D

∈ [π

10

, π

12

], Z = 1] − E[Y |u

D

∈ [π

10

, π

21

], Z = 0]

= (

π12

π10

5 ,

M T E1,2

(u

D

) + ,

M T E2,3

(u

D

) 6 du

D

π

12

− π

01

.

But over the interval [π

20

, π

01

] = [π

20

, min{π

10

, π

12

}] the instrument does again separately identify the LATE parameter:

,

LAT E2,3

02

, π

10

) = E[Y |u

D

∈ [π

02

, π

10

], Z = 1] − E[Y |u

D

∈ [π

02

, π

01

], Z = 0] = (

π10

π02

,

M T E2,3

(u

D

)du

D

π

01

− π

20

.

Figure 15 shows these LATEs graphically.

(24)

Figures

Marginal Utility D

j

| (Z, X, V )

k-3 k-2 k-1 k k+1 j

-3 -2 -1 0 1 2 3 4 D

j

u(D) | (Z, X, V ) where k − 2 = 3 and J = k + 2 = 7

0 j 1 2 3 4 5 6 7 u(D = j)

k-2 k-1 k k+1 k+2

Figure 1: The First Stage Ordered Choice Model

(25)

Observed E[Y (D)|v, Z] when D ∈ {1, 2, 3}

= E[Y |v, Z = 1]

= E[Y |v, Z = 0]

0 v 5 10 15 20 25 30 35 Y

E[Y

(3)|v]

E[Y

(2)|v]

E[Y

(1)|v]

#M T E2,3 (v)

#M T E1,2 (v)

F

V

(·) and π

Zj

1

0.75

0.50

0.25

v 0 π

11

π

10

π

21

π

20

UD=FV(·|X)

Figure 2: Example I: Potential Outcomes and Marginal Treatment Effects Given µ(X)

(26)

Example I: Observed E[Y (D)|u

D

, Z ] when D ∈ {1, 2, 3}

= E[Y |u

D

, Z = 1]

= E[Y |u

D

, Z = 0]

π

11

π

10

π

21

π

02

U

D

0 5 10 15 20 25 30 35 Y

E[Y

(3)|u

D

]

E[Y

(2)|u

D

]

E[Y

(1)|u

D

]

#M T E2,3 (uD)

#M T E1,2 (uD)

Example II: Observed E[Y (D)|u

D

, Z ] when D ∈ {1, 2, 3}

= E[Y |u

D

, Z = 1]

= E[Y |u

D

, Z = 0]

π

11

π

12

π

10

π

02

U

D

0 5 10 15 20 25 30 35 Y

E[Y

(3)|u

D

]

E[Y

(2)|u

D

]

E[Y

(1)|u

D

]

#M T E2,3 (uD)

#M T E1,2 (uD)

Figure 3: Examples I and II: Potential Outcomes and Marginal Treatment Effects Given µ(X)

Références

Documents relatifs

From these plots, it is clear that the spline model allows to recover the asymmetries in the kernel, by allowing the phase to be non-zero in the minimum phase case (right bottom

Applicants have European, Kanak (the indigenous people) and Wallisian name and various signal of stability. Using data from the local census, we link the location of the offer and

to a voucher group has little effect on those families who do not actually move with a voucher. Both types of estimates are of interest. ITT estimates are relevant for public

We show that the input-to-state stability of such systems guarantees some robustness with respect to a class of signals with bounded average-energy, which encompasses the

Table IV presents the amounts of bentazon transferred by mass flow in mobile water, for the six soils studied, according to initial soil moisture (dry, dampened to 50% of RC, or

where data specifies the data set, y the outcome variable, w the binary variable treatment indicator, x the list of covariates to be used in the matching, and M the number of

Checking balance through empirical distribution functions, interval 1 of the estimated propensity score distribution Original sample of treated vs final sample of matched

In addition, considering the fact that neighborhood variables can be used to model the impact of neighborhood effects on housing values in the Metropolitan Area of Paris, we have