Estimation and inference for distribution and quantile functions in endogenous treatment effect models: Econometric Reviews: Vol 41 , No 1

Abstract

Given a standard endogenous treatment effect model, we propose nonparametric estimation and inference procedures for the distribution and quantile functions of the potential outcomes among compliers, as well as the local quantile treatment effect function. The preliminary distribution function estimator is a weighted average of indicator functions, but is not monotonically increasing in general. We therefore propose a simple monotonizing method for proper distribution function estimation, and obtain the quantile function estimator by inversion. Our monotonizing method is an alternative to Chernozhukov et al. (Citation2010) and is arguably preferable when the outcome has unbounded support. We show that all the estimators converge weakly to Gaussian processes at the parametric rate, and propose a multiplier bootstrap for uniform inference. Our uniform results thus generalize the pointwise theory developed by Frölich and Melly (Citation2013). Monte Carlo simulations and an application to the effect of fertility on family income distribution illustrate the use of the methods. All results extend to the subpopulation of treated compliers as well.

Keywords:

JEL CLASSIFICATION:

Acknowledgment

We would like to thank Blaise Melly for providing us with the data for the application. Yu-Chin Hsu gratefully acknowledges the research support from National Science Council Grant (101-2410-H-001-109-MY2) and the Career Development Award of Academia Sinica, Taiwan. All errors and omissions are our responsibility.

Notes

1 In the standard framework, one would define $Y_{d, z}$ as the potential outcome in the population that would obtain if one were to set D = d and Z = z exogenously and impose the exclusion of the instrumental assumption: $P (Y_{d, 1} = Y_{d, 0}) = 1$ for $d \in {0, 1} .$ This is equivalent to our approach where we define Y_d directly.

2 As in Donald et al. (Citation2014b), we called q(x) as the instrument propensity score to distinguish from the conventional use of the term propensity score (the conditional probability of being treated).

3 SLE with trimming can nevertheless improve estimation results. See Donald et al. (Citation2014b) for more details.

4 The sign of ${\hat{κ}}_{1, i}$ also depends on the sign of ${\hat{Γ}}_{1},$ which converges in probability to $Γ_{1} > 0 .$ For simplicity we assume that ${\hat{Γ}}_{1}$ is strictly positive. Similarly, we assume that ${\hat{Γ}}_{0} > 0 .$

5 We do impose later in Assumption 3.6 that the conditional densities of the potential outcomes are bounded away from zero. However, this assumption is only needed to ensure that if one inverts the distribution function to estimate the quantile function over the entire [0,1] range, then the inverse still converges at the parametric rate. For example, if one is interested only in an “interior” range of quantiles, say, [0.2,0.8], then this assumption is again unneeded.

6 If $Y = R,$ one can still estimate the quantile functions at the parametric rate over some compact subset of the unit interval, say, $[ϵ, 1 - ϵ]$ for $0 < ϵ < 1 / 2,$ provided that the density functions are strictly positive on the interval.

7 See Comments 6–8 in Donald et al. (Citation2014b) for more details.

8 The weak convergence is in the sense of Definition 1.3.3 of van der Vaart and Wellner (Citation1996), and $ℓ^{\infty} (Y)$ denotes the set of all uniformly bounded real functions on $Y .$

9 This means that for any $ϵ > 0$ and $η > 0,$ there exist $δ > 0$ small enough and N > 0 large enough such that for all n > N, $P (\sup_{ν_{d} (y, y') < δ} | \sqrt{n} ({\tilde{F}}_{Y_{d} | C} (y) - F_{Y_{d} | C} (y)) - \sqrt{n} ({\tilde{F}}_{Y_{d} | C} (y') - F_{Y_{d} | C} (y')) | \leq ϵ) \geq 1 - η .$

10 The nonparametric bootstrap is potentially very time consuming given the objects of interest have to be re-estimated for each bootstrap sample. In contrast, the computational burden of the multiplier bootstrap is substantially reduced as all resampling procedures can be simulated simultaneously. However, applying multiplier bootstrap requires a consistent estimation of the functions involved in the influence function. We therefore provide such estimators in this paper.

11 The family income is reported as zero for 5.1% in 1980, 4.6% in 1990, and 4.0% in 2000 sample.

12 Note that $Γ_{1}^{t}$ is not the ATT of Z on D, but $Γ_{1}^{t} / P (Z = 1)$ is. Similarly, $E {q (X) [\frac{Z D 1 (Y \leq y)}{q (X)} - \frac{(1 - Z) D 1 (Y \leq y)}{1 - q (X)}]} / P (Z = 1)$ is the ATT of Z on $D 1 (Y \leq y) .$ Hence, the identification result of $F_{Y_{1} | C}^{t} (y)$ in (Equation7.1(7.1) $\begin{matrix} F_{Y_{1} | C}^{t} (y) = \frac{1}{Γ_{1}^{t}} E {q (X) [\frac{Z D 1 (Y \leq y)}{q (X)} - \frac{(1 - Z) D 1 (Y \leq y)}{1 - q (X)}]}, \\ F_{Y_{0} | C}^{t} (y) = \frac{1}{Γ_{0}^{t}} E {q (X) [\frac{Z (D - 1) 1 (Y \leq y)}{q (X)} - \frac{(1 - Z) (D - 1) 1 (Y \leq y)}{1 - q (X)}]}, \\ Γ_{1}^{t} = E {q (X) [\frac{Z D}{q (X)} - \frac{(1 - Z) D}{1 - q (X)}]}, \\ Γ_{0}^{t} = E {q (X) [\frac{Z (D - 1)}{q (X)} - \frac{(1 - Z) (D - 1)}{1 - q (X)}]}, \end{matrix}$ (7.1) ) can be interpreted as of the ATT of Z on $D 1 (Y \leq y)$ over the ATT of Z on D.

Log in via your institution

Access through your institution

Log in to Taylor & Francis Online

Shibboleth

Log in to Taylor & Francis Online

Restore content access

Restore content access for purchases made as guest

Purchase options * Save for later

PDF download + Online access

48 hours access to article PDF & online version
Article PDF can be downloaded
Article PDF can be printed

USD 61.00 Add to cart

Issue Purchase

30 days online access to complete issue
Article PDFs can be downloaded
Article PDFs can be printed

USD 578.00 Add to cart

* Local tax will be added as applicable

Estimation and inference for distribution and quantile functions in endogenous treatment effect models

Log in via your institution

Log in to Taylor & Francis Online

Restore content access

Related Research

Information for

Open access

Opportunities

Help and information

Estimation and inference for distribution and quantile functions in endogenous treatment effect models

Abstract

Acknowledgment

Notes

Log in via your institution

Log in to Taylor & Francis Online

Log in to Taylor & Francis Online

Restore content access

Related Research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature