652
Views
8
CrossRef citations to date
0
Altmetric
Presidential Address

An Approach to Addressing Multiple Imputation Model Uncertainty Using Bayesian Model Averaging

&
Pages 553-567 | Published online: 20 Sep 2019
 

Abstract

This paper considers the problem of imputation model uncertainty in the context of missing data problems. We argue that so-called “Bayesianly proper” approaches to multiple imputation, although correctly accounting for uncertainty in imputation model parameters, ignore the uncertainty in the imputation model itself. We address imputation model uncertainty by implementing Bayesian model averaging as part of the imputation process. Bayesian model averaging accounts for both model and parameter uncertainty, and thus we argue is fully Bayesianly proper. We apply Bayesian model averaging to multiple imputation under the fully conditional specification approach. An extensive simulation study is conducted comparing our Bayesian model averaging approach against normal theory-based Bayesian imputation not accounting for model uncertainty. Across almost all conditions of the simulation study, the results reveal the extent of model uncertainty in multiple imputation and a consistent advantage to our Bayesian model averaging approach over normal-theory multiple imputation under missing-at-random and missing-completely-at random in terms of Kullback-Liebler divergence and mean squared prediction error. A small case study is also presented. Directions for future research are discussed.

Notes

1 Note: footnote ours. This paper was eventually published as Rubin (Citation1996).

2 Note that the unit information prior is equivalent to Zellner’s g-prior (Zellner et al. Citation1986), where g = 1/N…, and where N is the sample size. See also Fernández et al. (Citation2001).

3 The Laplace method of integrals is based on a Taylor expansion of a function f(u) of a q-dimensional vector u. The approximation is ef(u)du2(π)q/2|A|1/2exp{f(u*)}, where u* is the value of u at which f attains its maximum, and A is minus the inverse of the Hessian of f evaluated at u*. Following Raftery (Citation1996, pg. 253), when the Laplace method is applied to equation (5), we obtain the approximation p(y|Mk)(2π)qk|Ak|1/2p(y|θ˜k,Mk)p(θ˜k,Mk), where qk is the dimension of θk, θ˜k is the posterior mode of θk, and Ak is minus the inverse of the Hessian of log{p(y|θk,Mk)p(θk|Mk)}, evaluated at the posterior mode θ˜k.

4 Note that the “mice” program is flexible enough to allow different imputation methods to be chosen for different scales of variables. We address this issue and its implications for missing data with Bayesian model averaging in the Discussion section.

5 We thank an anonymous reviewer for bringing this paper to our attention.

6 The default number of models in the “BMA” package is 150.

7 In the interest of space, we only show the results for MCAR1, MCAR3, and MCAR5. The results for MCAR2 and MCAR4 results are very similar under all conditions.

8 We recognize that when using plausible values in the analysis of large-scale educational assessments, it is more appropriate to use all plausible values and combine them using Rubin’s (Citation1987) rules. However, extending our miBMA approach to the plausible value framework is beyond the scope of this paper.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 352.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.