357
Views
2
CrossRef citations to date
0
Altmetric
Articles

Estimation and inference for distribution and quantile functions in endogenous treatment effect models

, &
 

Abstract

Given a standard endogenous treatment effect model, we propose nonparametric estimation and inference procedures for the distribution and quantile functions of the potential outcomes among compliers, as well as the local quantile treatment effect function. The preliminary distribution function estimator is a weighted average of indicator functions, but is not monotonically increasing in general. We therefore propose a simple monotonizing method for proper distribution function estimation, and obtain the quantile function estimator by inversion. Our monotonizing method is an alternative to Chernozhukov et al. (Citation2010) and is arguably preferable when the outcome has unbounded support. We show that all the estimators converge weakly to Gaussian processes at the parametric rate, and propose a multiplier bootstrap for uniform inference. Our uniform results thus generalize the pointwise theory developed by Frölich and Melly (Citation2013). Monte Carlo simulations and an application to the effect of fertility on family income distribution illustrate the use of the methods. All results extend to the subpopulation of treated compliers as well.

JEL CLASSIFICATION:

Acknowledgment

We would like to thank Blaise Melly for providing us with the data for the application. Yu-Chin Hsu gratefully acknowledges the research support from National Science Council Grant (101-2410-H-001-109-MY2) and the Career Development Award of Academia Sinica, Taiwan. All errors and omissions are our responsibility.

Notes

1 In the standard framework, one would define Yd,z as the potential outcome in the population that would obtain if one were to set D = d and Z = z exogenously and impose the exclusion of the instrumental assumption: P(Yd,1=Yd,0)=1 for d{0,1}. This is equivalent to our approach where we define Yd directly.

2 As in Donald et al. (Citation2014b), we called q(x) as the instrument propensity score to distinguish from the conventional use of the term propensity score (the conditional probability of being treated).

3 SLE with trimming can nevertheless improve estimation results. See Donald et al. (Citation2014b) for more details.

4 The sign of κ̂1,i also depends on the sign of Γ̂1, which converges in probability to Γ1>0. For simplicity we assume that Γ̂1 is strictly positive. Similarly, we assume that Γ̂0>0.

5 We do impose later in Assumption 3.6 that the conditional densities of the potential outcomes are bounded away from zero. However, this assumption is only needed to ensure that if one inverts the distribution function to estimate the quantile function over the entire [0,1] range, then the inverse still converges at the parametric rate. For example, if one is interested only in an “interior” range of quantiles, say, [0.2,0.8], then this assumption is again unneeded.

6 If Y=R, one can still estimate the quantile functions at the parametric rate over some compact subset of the unit interval, say, [ϵ,1ϵ] for 0<ϵ<1/2, provided that the density functions are strictly positive on the interval.

7 See Comments 6–8 in Donald et al. (Citation2014b) for more details.

8 The weak convergence is in the sense of Definition 1.3.3 of van der Vaart and Wellner (Citation1996), and (Y) denotes the set of all uniformly bounded real functions on Y.

9 This means that for any ϵ>0 and η>0, there exist δ>0 small enough and N > 0 large enough such that for all n > N, P(supνd(y,y)<δ|n(F˜Yd|C(y)FYd|C(y))n(F˜Yd|C(y)FYd|C(y))|ϵ)1η.

10 The nonparametric bootstrap is potentially very time consuming given the objects of interest have to be re-estimated for each bootstrap sample. In contrast, the computational burden of the multiplier bootstrap is substantially reduced as all resampling procedures can be simulated simultaneously. However, applying multiplier bootstrap requires a consistent estimation of the functions involved in the influence function. We therefore provide such estimators in this paper.

11 The family income is reported as zero for 5.1% in 1980, 4.6% in 1990, and 4.0% in 2000 sample.

12 Note that Γ1t is not the ATT of Z on D, but Γ1t/P(Z=1) is. Similarly, E{q(X)[ZD1(Yy)q(X)(1Z)D1(Yy)1q(X)]}/P(Z=1) is the ATT of Z on D1(Yy). Hence, the identification result of FY1|Ct(y) in (Equation7.1) can be interpreted as of the ATT of Z on D1(Yy) over the ATT of Z on D.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.