30
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Estimation of counterfactual distributions with a continuous endogenous treatment

Pages 595-637 | Published online: 01 Jul 2024
 

Abstract

In this article, I propose a method to estimate the counterfactual distribution of an outcome variable when the treatment is endogenous, continuous, and its effect is heterogeneous. The types of counterfactuals considered are those in which the change in treatment intensity can be correlated with the individual effects or when some of the structural functions are changed by some other group’s counterparts. I characterize the outcome and the treatment with a triangular system of equations in which the unobservables are related by a copula that captures the endogeneity of the treatment, which is nonparametrically identified by inverting the quantile processes that determine the outcome and the treatment. Both processes are estimated using existing quantile regression methods, and I propose a parametric and a nonparametric estimator of the copula. To illustrate these methods, I estimate several counterfactual distributions of the birth weight of children, had their mothers smoked differently during pregnancy.

JEL classification::

Acknowledgments

Departamento de Economía, Universidad de Cantabria, Avenida de los Castros, s/n, 39005 Santander, Spain. I would like to thank the editor, Esfandiar Maasoumi, the associate editor, two anonymous referees, Manuel Arellano, Stéphane Bonhomme, Domenico Depalo, Bryan Graham, Giuseppe Ilardi, Michael Jansson, James Powell, Demian Pouzo, Enrique Sentana, and seminar participants at CEMFI and University of California, Berkeley for their helpful comments and discussion. All remaining errors are my own. I can be reached via email at [email protected]

Notes

1. Other related works include Firpo, Fortin, and Lemieux (Citation2009), who proposed an estimator based on reweighting of the influence function to estimate distributional effects under exogeneity, or Frölich and Melly (Citation2013), who proposed a nonparametric estimator of the unconditional quantile treatment effect for the subpopulation of compliers with an endogenous, binary treatment.

2. Because these variables are unidimensional, the amount of heterogeneity of the model is restricted. In particular, it rules out models that are not monotonic on the unobservables, such as random coefficients models. Unfortunately, these models are not nonparametrically identified (Hahn and Ridder, Citation2011; Hoderlein, Holzmann, and Meister, Citation2017; Kasy, Citation2011; Masten, Citation2018) and they would further complicate the present analysis. Deriving set-identification results with multidimensional unobserved heterogeneity is beyond the scope of this article.

3. This system uses the Skorohod representation, which states that a random variable φi can be written in terms of its quantile function: φi = Q(Ui), where UiU(0,1).

4. By definition, a copula is the multivariate distribution of (U1,...,Um) such that their marginal distributions are uniformly distributed on the unit interval. Sklar (Citation1959) showed that any multivariate distribution of the continuously distributed variables X1,...,Xm, with respective marginal cdfs F1(x1),...,Fm(xm), there exists a unique cdf C, such that P(X1x1,...,Xmxm)=C(F1(x1),...,Fm(xm)). The conditional copula is defined as C(F1(x1),...,Fm1(xm1),Fm+1(xm+1),...,Fm(xm)|Fm(xm))=Fm(xm)C(F1(x1),...,Fm(xm)). Lastly, the copula density is defined as c(F1(x1)),...,Fm(Fm(xm))=mF1(x1)...Fm(xm)C(F1(x1),...,Fm(xm)).

5. The relation of the copula and the MTE can be immediately generalized to the case of multidimensional unobserved heterogeneity. For example, assume that it depends on {Um}m=1M. Define the multidimensional copula by CUV|X2(U1,...,UM,V|x2). Then, the MTE would be calculated as in Eq. (Equation3) by integrating the derivative of gd with respect to the multidimensional copula, holding V constant.

6. This is a particular case of the Bernstein copula.

7. Note that in many circumstances, the ability of the policy maker to implement such counterfactuals may be limited, making them informative of such potential effects theoretically.

8. Chesher (Citation2003) and Imbens and Newey (Citation2009), who studied the nonparametric identification of nonseparable models using a control function approach. Other papers proposed semiparametric methods which do not suffer from the curse of dimensionality, such as Jun (Citation2009), or Lee (Citation2007) who assumed the model to be separable. Alternatively, Ma and Koenker (Citation2006) proposed a parametric model of Chesher (Citation2003). Another recent paper is D’Haultføeuille and Février (Citation2015), which established the identification based on similar conditions to those in Torgovitsky (Citation2015). On the other hand, (Hoderlein and Mammen Citation2007) discussed the identification without monotonicity.

9. This constant serves to avoid the estimation of extreme quantiles. See, for example, Chernozhukov and Hansen (Citation2005).

10. Additionally, the linear specification requires regressors to take either positive or negative values, but not both, as that would make the process non monotonic. See Koenker (Citation2005) for further details.

11. I discuss the properties of the estimator when this assumption is relaxed in Appendix C.

12. With some slight abuse of notation, the metric spaces used for the convergence omit the spaces of group indicators. They should be implicitly included throughout the article.

13. The finite sample performance is shown in a Monte Carlo exercise in Appendix E.

14. See Fang and Santos (Citation2019) for further details.

15. The asymptotic distribution of the counterfactual unconditional distribution and quantile function is obtained analogously to that of the first kind of counterfactual, shown in Theorem 1. Its proof is omitted for the sake of brevity.

16. Even for such estimator it would not be possible to establish the asymptotic normality as in Theorem 1, since the estimator of the conditional copula converges at a rate slower than n. See Appendix B.7 for further details.

17. Note that the weight for each individual is the same in every step.

18. The bootstrap validity for the second kind of counterfactual estimators is analogously proved and therefore omitted.

19. Note that the bootstrapped covariance matrix requires additional conditions to be valid. See Kato (Citation2011) for further details.

20. See, for example, Woutersen and Ham (Citation2013).

21. For a more detailed description of the datasets, see Evans and Ringel (Citation1999).

22. The age variable is introduced in the regressions as actual age minus 18.

23. Note that the extreme quantiles take values outside this range. Because these are less precisely estimated, I do not comment them in the text.

24. As stated in the text, this counterfactual is not Hadamard differentiable, so the bootstrap does not consistently approximate the asymptotic distribution of the estimator. Regardless, because the alternative method reported in Remark 5 is infeasible, I report these inconsistent estimates.

25. I only report the estimates with these two copulas because the Frank and Clayton copulas either yield very similar results or fit the actual data much worse. The results with them are available upon request.

26. Some of this notation is the standard in the literature of empirical processes. See VanDerVaart2000.

27. See, for example, Theorem 1 in Pratt (Citation1960).

28. Notice that it is not possible to apply the extended continuous mapping theorem to conclude that if Ûd,iPUd,i and V̂iPVi, then 1(Ûd,iu)P1(Ud,iu) or 1(V̂i<v)P1(Vi<v) uniformly in (u, v). Hence, a different argument is required for the proof.

Additional information

Funding

This work is part of the I+D+i project Ref. TED2021-131763A-I00 financed by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR. I gratefully acknowledge financial support from the Spanish Ministry of Universities and the European Union-NextGenerationEU (RMZ-18).

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 578.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.