357
Views
2
CrossRef citations to date
0
Altmetric
Articles

Estimation and inference for distribution and quantile functions in endogenous treatment effect models

, &
Pages 22-50 | Published online: 02 Dec 2020
 

Abstract

Given a standard endogenous treatment effect model, we propose nonparametric estimation and inference procedures for the distribution and quantile functions of the potential outcomes among compliers, as well as the local quantile treatment effect function. The preliminary distribution function estimator is a weighted average of indicator functions, but is not monotonically increasing in general. We therefore propose a simple monotonizing method for proper distribution function estimation, and obtain the quantile function estimator by inversion. Our monotonizing method is an alternative to Chernozhukov et al. (Citation2010) and is arguably preferable when the outcome has unbounded support. We show that all the estimators converge weakly to Gaussian processes at the parametric rate, and propose a multiplier bootstrap for uniform inference. Our uniform results thus generalize the pointwise theory developed by Frölich and Melly (Citation2013). Monte Carlo simulations and an application to the effect of fertility on family income distribution illustrate the use of the methods. All results extend to the subpopulation of treated compliers as well.

JEL CLASSIFICATION:

Acknowledgment

We would like to thank Blaise Melly for providing us with the data for the application. Yu-Chin Hsu gratefully acknowledges the research support from National Science Council Grant (101-2410-H-001-109-MY2) and the Career Development Award of Academia Sinica, Taiwan. All errors and omissions are our responsibility.

Notes

1 In the standard framework, one would define Yd,z as the potential outcome in the population that would obtain if one were to set D = d and Z = z exogenously and impose the exclusion of the instrumental assumption: P(Yd,1=Yd,0)=1 for d{0,1}. This is equivalent to our approach where we define Yd directly.

2 As in Donald et al. (Citation2014b), we called q(x) as the instrument propensity score to distinguish from the conventional use of the term propensity score (the conditional probability of being treated).

3 SLE with trimming can nevertheless improve estimation results. See Donald et al. (Citation2014b) for more details.

4 The sign of κ̂1,i also depends on the sign of Γ̂1, which converges in probability to Γ1>0. For simplicity we assume that Γ̂1 is strictly positive. Similarly, we assume that Γ̂0>0.

5 We do impose later in Assumption 3.6 that the conditional densities of the potential outcomes are bounded away from zero. However, this assumption is only needed to ensure that if one inverts the distribution function to estimate the quantile function over the entire [0,1] range, then the inverse still converges at the parametric rate. For example, if one is interested only in an “interior” range of quantiles, say, [0.2,0.8], then this assumption is again unneeded.

6 If Y=R, one can still estimate the quantile functions at the parametric rate over some compact subset of the unit interval, say, [ϵ,1ϵ] for 0<ϵ<1/2, provided that the density functions are strictly positive on the interval.

7 See Comments 6–8 in Donald et al. (Citation2014b) for more details.

8 The weak convergence is in the sense of Definition 1.3.3 of van der Vaart and Wellner (Citation1996), and (Y) denotes the set of all uniformly bounded real functions on Y.

9 This means that for any ϵ>0 and η>0, there exist δ>0 small enough and N > 0 large enough such that for all n > N, P(supνd(y,y)<δ|n(F˜Yd|C(y)FYd|C(y))n(F˜Yd|C(y)FYd|C(y))|ϵ)1η.

10 The nonparametric bootstrap is potentially very time consuming given the objects of interest have to be re-estimated for each bootstrap sample. In contrast, the computational burden of the multiplier bootstrap is substantially reduced as all resampling procedures can be simulated simultaneously. However, applying multiplier bootstrap requires a consistent estimation of the functions involved in the influence function. We therefore provide such estimators in this paper.

11 The family income is reported as zero for 5.1% in 1980, 4.6% in 1990, and 4.0% in 2000 sample.

12 Note that Γ1t is not the ATT of Z on D, but Γ1t/P(Z=1) is. Similarly, E{q(X)[ZD1(Yy)q(X)(1Z)D1(Yy)1q(X)]}/P(Z=1) is the ATT of Z on D1(Yy). Hence, the identification result of FY1|Ct(y) in (Equation7.1) can be interpreted as of the ATT of Z on D1(Yy) over the ATT of Z on D.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 578.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.