Imputation of Counterfactual Outcomes when the Errors are Predictable: Journal of Business & Economic Statistics: Vol 0, No 0

Sample our Mathematics & Statistics journals, sign in here to start your FREE access for 14 days

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
Read this article /doi/full/10.1080/07350015.2024.2358900?needAccess=true

Abstract

A crucial input into causal inference is the imputed counterfactual outcome. Imputation error can arise because of sampling uncertainty from estimating the prediction model using the untreated observations, or from out-of-sample information not captured by the model. While the literature has focused on sampling uncertainty, it vanishes with the sample size. Often overlooked is the possibility that the out-of-sample error can be informative about the missing counterfactual outcome if it is mutually or serially correlated. Motivated by the best linear unbiased predictor (BLUP) of Goldberger in a time series setting, we propose an improved predictor of potential outcome when the errors are correlated. The proposed PUP is practical as it is not restricted to linear models, can be used with consistent estimators already developed, and improves mean-squared error for a large class of strong mixing error processes. Ignoring predictability in the errors can distort conditional inference. However, the precise impact will depend on the choice of estimator as well as the realized values of the residuals.

Keywords:

BLUP
Missing values
Synthetic control
Treatment effect

Disclosure Statement

The authors report that there are no competing interests to declare.

Notes

1 We thank Bruce Hansen for this suggestion.

2 Regularization is not necessary to consistently estimate the missing values, but could give a lower rank common component than the one in Bai and Ng (Citation2021).

3 Robinson (Citation1991) provides a survey of its many derivations, including a Kalman filter interpretation, see also Spall (Citation1991). Taub (Citation1979) and Baltagi (Citation2008, 2013) use it in variance components analysis of panel data.

4 Cochrane-Orcutt performs least squares regression of $y_{t} - ϕ_{1} y_{t - 1}$ on $X_{t} - ϕ_{1} X_{t - 1}$ for given $ϕ_{1}$ using data from $t = 2, \dots, T_{0}$ , and then estimates $ϕ_{1}$ from an autoregression in $y_{t} - X_{t}^{'} \hat{β}$ till convergence. The Prais-Winsten estimator additionally exploits information in t = 1. It is also possible to estimate β and $ϕ_{1}$ directly from the Durbin equation $y_{t} = X_{t}^{'} β + y_{t - 1} ϕ_{1} + X_{t - 1}^{'} γ +$ error.

5 Brodersen et al. (Citation2015) consider state space estimation of the counterfactual outcomes in the presence of trends, but serial correlation in idiosyncratic shocks and/or the factors are not allowed. Carvalho, Masini, and Medeiros (Citation2018) and Masini and Medeiros (Citation2021, Citation2022) consider causal inference in a high-dimensional setting when the data are persistent and possibly non-stationary.

6 This follows because $u_{i, T_{0} + h} = v_{i, T_{0} + h} + \dots + ϕ_{i}^{h - 1} v_{i, T_{0}}$ . Thus, $ω_{h, i}^{2} = σ_{v, i}^{2} \sum_{j = 0}^{h - 1} ϕ_{i}^{2 h} = σ_{v, i}^{2} \frac{1 - ϕ_{i}^{2 h}}{1 - ϕ_{i}^{2}} \equiv γ_{0, i} (1 - ϕ_{i}^{2 h})$ , $γ_{0, i} = σ_{v, i}^{2} {(1 - ϕ_{i}^{2})}^{- 1}$ .

7 For large T₁, $\frac{1}{T_{1}} \sum_{h = 1}^{T_{1}} m_{i, T_{0} + h} - {\hat{m}}_{i, T_{0} + h} + \frac{1}{T_{1}} \sum_{h = 1}^{T} e_{i, T_{0} + h} + δ_{i, T_{0} + h} - E [δ_{i, T_{0} + h}]$ , which equals ${\hat{Δ}}_{i, T_{0} + 1 : T} - Δ_{i, T_{0} + 1 : T} + δ_{i, T_{0} + h} - E [δ_{i, T_{0} + h}] .$

Bai, J., and Ng, S. (2021), “Matrix Completion, Counterfactuals, and Factor Analysis of Missing Data,” Journal of the American Statistical Association, 116, 1746–1763. DOI: 10.1080/01621459.2021.1967163.

Web of Science ®Google Scholar

Robinson, G. K. (1991), “That BLUP is a Good Thing: The Estimation of Random Effects,” Statistical Science, 6, 15–32. DOI: 10.1214/ss/1177011926.

Google Scholar

Spall, J. (1991), “Comment: The Kalman Filter and BLUP,” Statistical Science, 6, 39–41.

Google Scholar

Taub, A. (1979), “Prediction in Context of Variance-Components Model,” Journal of Econometrics, 10, 103–107. DOI: 10.1016/0304-4076(79)90068-X.

Google Scholar

Baltagi, B. (2008), “Forecasting with Panel Data,” Journal of Forecasting, 27, 153–173. DOI: 10.1002/for.1047.

Web of Science ®Google Scholar

Brodersen, K., Gallusser, F., Koehler, J., Remy, N., and Scott, S. (2015), “Inferring Causal Impact Using Bayesian Structural Time-Series Models,” Annals of Applied Statistics, 9, 247–274.

Web of Science ®Google Scholar

Carvalho, C., Masini, R., and Medeiros, M. (2018), “ArCo: An Artificial Counterfactual Approach for High-Dimensional Panel Time Series Data,” Journal of Econometrics, 207, 352–380. DOI: 10.1016/j.jeconom.2018.07.005.

Web of Science ®Google Scholar

Masini, R., and Medeiros, M. C. (2021), “Counterfactual Analysis With Artificial Controls: Inference, High Dimensions, and Nonstationarity,” Journal of the American Statistical Association, 116, 1773–1788. DOI: 10.1080/01621459.2021.1964978.

Web of Science ®Google Scholar

Masini, R., and Medeiros, M. (2022), “Counterfactual Analysis and Inference with Nonstationary Data,” Journal of Business and Economic Statistics, 40, 227–239. DOI: 10.1080/07350015.2020.1799814.

Web of Science ®Google Scholar

Additional information

Funding

The authors gratefully acknowledge support from the NSERC grant (Canada) and the National Science Foundation (Ng SES 018369)

Log in via your institution

Access through your institution

Log in to Taylor & Francis Online

Shibboleth

Log in to Taylor & Francis Online

Username Password

Forgot password?

Keep me logged in (not suitable for shared devices).

You will otherwise be logged out automatically, after a limited period, and will need to log in again.

Restore content access

Restore content access for purchases made as guest

Purchase options * Save for later Item saved, go to cart

PDF download + Online access

48 hours access to article PDF & online version
Article PDF can be downloaded
Article PDF can be printed

USD 61.00 Add to cart

PDF download + Online access - Online Checkout

Issue Purchase

30 days online access to complete issue
Article PDFs can be downloaded
Article PDFs can be printed

USD 123.00 Add to cart

Issue Purchase - Online Checkout

* Local tax will be added as applicable

Share icon
Back to Top

Supercharge Your Next Research Paper: Research and write your next paper with Jenni AI.

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

Imputation of Counterfactual Outcomes when the Errors are Predictable

Log in via your institution

Log in to Taylor & Francis Online

Restore content access

Related Research

Information for

Open access

Opportunities

Help and information

Imputation of Counterfactual Outcomes when the Errors are Predictable

Abstract

Disclosure Statement

Notes

Additional information

Funding

Log in via your institution

Log in to Taylor & Francis Online

Log in to Taylor & Francis Online

Restore content access

Related Research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature