3,597
Views
12
CrossRef citations to date
0
Altmetric
Tutorial

Demystifying Statistical Learning Based on Efficient Influence Functions

ORCID Icon, , & ORCID Icon
Pages 292-304 | Received 01 Jul 2021, Accepted 05 Dec 2021, Published online: 04 Feb 2022
 

Abstract

Evaluation of treatment effects and more general estimands is typically achieved via parametric modeling, which is unsatisfactory since model misspecification is likely. Data-adaptive model building (e.g., statistical/machine learning) is commonly employed to reduce the risk of misspecification. Naïve use of such methods, however, delivers estimators whose bias may shrink too slowly with sample size for inferential methods to perform well, including those based on the bootstrap. Bias arises because standard data-adaptive methods are tuned toward minimal prediction error as opposed to, for example, minimal MSE in the estimator. This may cause excess variability that is difficult to acknowledge, due to the complexity of such strategies. Building on results from nonparametric statistics, targeted learning and debiased machine learning overcome these problems by constructing estimators using the estimand’s efficient influence function under the nonparametric model. These increasingly popular methodologies typically assume that the efficient influence function is given, or that the reader is familiar with its derivation. In this article, we focus on derivation of the efficient influence function and explain how it may be used to construct statistical/machine-learning-based estimators. We discuss the requisite conditions for these estimators to perform well and use diverse examples to convey the broad applicability of the theory.

Supplementary Materials

Supplementary material is available online which contains details regarding the Riesz representation theorem and additional examples of efficient influence function derivations.

Medical Research Council (MRC) International Centre for Genomic Medicine in Neuromuscular Disease;

Notes

1 Note that we have evaluated the plug-in bias term at the true propensity score because the considered plug-in estimator does not rely on an estimated propensity score. One may alternatively evaluate the plug-in bias term at an estimated propensity score, which will then only affect the remainder term.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 106.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.