673
Views
0
CrossRef citations to date
0
Altmetric
Statistical Inference

Convolutional Neural Networks for Valid and Efficient Causal Inference

ORCID Icon, ORCID Icon & ORCID Icon
Pages 714-723 | Received 27 Jan 2023, Accepted 29 Aug 2023, Published online: 23 Oct 2023

References

  • Barban, N., de Luna, X., Lundholm, E., Svensson, I., and Billari, F. C. (2020), “Causal Effects of the Timing of Life-Course Events: Age at Retirement and Subsequent Health,” Sociological Methods & Research, 49, 216–249. DOI: 10.1177/0049124117729697.
  • Bartlett, P. L., Bousquet, O., and Mendelson, S. (2005), “Local Rademacher Complexities,” The Annals of Statistics, 33, 1497–1537. DOI: 10.1214/009053605000000282.
  • Bartlett, P. L., Harvey, N., Liaw, C., and Mehrabian, A. (2019), “Nearly-Tight vc-Dimension and Pseudodimension Bounds for Piecewise Linear Neural Networks,” The Journal of Machine Learning Research, 20, 2285–2301.
  • Belloni, A., Chen, D., Chernozhukov, V., and Hansen, C. (2012), “Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain,” Econometrica, 80, 2369–2429.
  • Belloni, A., Chernozhukov, V., and Hansen, C. (2014), “Inference on Treatment Effects After Selection Among High-Dimensional Controls,” The Review of Economic Studies, 81, 608–650. DOI: 10.1093/restud/rdt044.
  • Chen, X., and White, H. (1999), “Improved Rates and Asymptotic Normality for Nonparametric Neural Network Estimators,” IEEE Transactions on Information Theory, 45, 682–691. DOI: 10.1109/18.749011.
  • Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., and Robins, J. (2018), “Double/Debiased Machine Learning for Treatment and Structural Parameters,” The Econometrics Journal, 21, C1–C68. DOI: 10.1111/ectj.12097.
  • Chernozhukov, V., Hansen, C., and Spindler, M. (2016), “hdm: High-Dimensional Metrics,” R Journal, 8, 185–199. DOI: 10.32614/RJ-2016-040.
  • Evans, L. C. (2010), Partial Differential Equations, Providence, RI: American Mathematical Society.
  • Farrell, M. H. (2015), “Robust Inference on Average Treatment Effects with Possibly More Covariates than Observations,” Journal of Econometrics, 189, 1–23. DOI: 10.1016/j.jeconom.2015.06.017.
  • Farrell, M. H. (2018), “Robust Inference on Average Treatment Effects with Possibly more Covariates than Observations,” arXiv:1309.4686v3.
  • Farrell, M. H., Liang, T., and Misra, S. (2021), “Deep Neural Networks for Estimation and Inference,” Econometrica, 89, 181–213. DOI: 10.3982/ECTA16901.
  • Kennedy, E. H. (2016), “Semiparametric Theory and Empirical Processes in Causal Inference,” in Statistical Causal Inferences and their Applications in Public Health Research, eds. H. He, P. Wu, D.-G. Chen, pp. 141–167, Cham: Springer.
  • Klusowski, J. M., and Barron, A. R. (2018), “Approximation by Combinations of Relu and Squared Relu Ridge Functions with l1 and l0 Controls,” IEEE Transactions on Information Theory, 64, 7649–7656.
  • LeCun, Y., and Bengio, Y. (1995), “Convolutional Networks for Images, Speech, and Time Series,” in The Handbook of Brain Theory and Neural Networks, 3361, ed. M. A. Arbib, pp. 255–258, Cambridge, MA: MIT Press.
  • Lindgren, U., Nilsson, K., de Luna, X., and Ivarsson, A. (2016), “Data Resource Profile: Swedish Microdata Research from Childhood into Lifelong Health and Welfare (Umeå SIMSAM Lab),” International Journal of Epidemiology, 45, 1075–1075. DOI: 10.1093/ije/dyv358.
  • Moosavi, N., Häggström, J., and de Luna, X. (2021), “The Costs and Benefits of Uniformly Valid Causal Inference with High-Dimensional Nuisance Parameters,” Statistical Science, to appear ArXiv preprint arXiv:2105.02071.
  • Robins, J. M., Rotnitzky, A., and Zhao, L. P. (1994), “Estimation of Regression Coefficients When Some Regressors are not Always Observed,” Journal of the American statistical Association, 89, 846–866. DOI: 10.1080/01621459.1994.10476818.
  • Rubin, D. B. (1974), “Estimating Causal Effects of Treatments in Randomized and Nonrandomized Studies,” Journal of Educational Psychology, 66, 688–701. DOI: 10.1037/h0037350.
  • Scharfstein, D., Rotnitzky, A., and Robins, J. (1999), “Rejoinder to Comments on “Adjusting for Non-ignorable Drop-Out Using Semiparametric Non-response Models?” Journal of the American Statistical Association, 94, 1121–1146. DOI: 10.2307/2669930.
  • Stein, E. M. (1970), Singular Integrals and Differentiability Properties of Functions (PMS-30), Princeton: Princeton University Press.
  • Tan, Z. (2007), “Comment: Understanding OR, PS and DR,” Statistical Science, 22, 560–568. DOI: 10.1214/07-STS227A.
  • Tsiatis, A. (2006), Semiparametric Theory and Missing Data, New York: Springer.
  • van der Laan, M., and Rose, S. (2011), Targeted Learning: Causal Inference for Observational and Experimental Data. Springer Series in Statistics, New York: Springer.
  • Zhou, D.-X. (2020), “Universality of Deep Convolutional Neural Networks,” Applied and Computational Harmonic Analysis, 48, 787–794. DOI: 10.1016/j.acha.2019.06.004.