414
Views
1
CrossRef citations to date
0
Altmetric
Article

Variable selection in double/debiased machine learning for causal inference: an outcome-adaptive approach

ORCID Icon & ORCID Icon
Pages 5880-5893 | Received 05 Apr 2021, Accepted 29 Oct 2021, Published online: 15 Nov 2021
 

Abstract

Access to high-dimensional data has made the use of machine learning in causal inference more common in recent years. The double/debiased machine learning (DML) estimator for the treatment effect is designed to obtain a valid inference when nuisance functions in the treatment and outcome equations, are estimated using machine learning methods. However, when some covariates in the treatment equation do not appear in the outcome equation, the inclusion of such covariates in the propensity score estimation will result in the increasing bias and variance of the DML estimator. To solve this issue, we introduce an outcome-adaptive DML estimator, which incorporates the outcome-adaptive lasso for the variable selection in the propensity score estimation. We evaluate the performance of the proposed method using Monte Carlo simulation. The results indicate that our proposed method in many cases outperforms other methods.

MATHEMATICS SUBJECT CLASSIFICATION:

Acknowledgments

The authors thank an anonymous referee and participants of the Kansai Econometric Society Meetings at Hitotsubashi University for their helpful comments and discussion.

Disclosure statement

The first author of this article received an honorarium from the Daiichi-Sankyo donation program (https://www.daiichisankyo.co.jp/corporate/ds-shougakukifu/#secI, accessed March 6, 2021).

Notes

1 In general, when confounders are omitted from the outcome equation, a regression coefficient on the treatment suffers from bias because the error term and treatment variable are correlated. If there is a covariate included in the treatment equation but not in the outcome equation, it can be used as an instrumental variable since it is correlated with the regressor (namely, the treatment variable) but not correlated with the error term in the regression model.

Additional information

Funding

This work was supported by the JSPS KAKENHI (20H01482 and 20K20247).

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 1,090.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.