2,080
Views
42
CrossRef citations to date
0
Altmetric
Applications and Case Studies

Multiple Imputation of Missing Categorical and Continuous Values via Bayesian Mixture Models With Local Dependence

&
Pages 1466-1479 | Received 01 Sep 2014, Published online: 04 Jan 2017

References

  • Banerjee, A., Murray, J., and Dunson, D. B. (2013), “Bayesian Learning of Joint Distributions of Objects,” in Proceedings of the 16th International Conference on Artificial Intelligence and Statistics (AISTATS), Scottsdale, AZ, USA.
  • Böhning, D., Seidel, W., Alfó, M., Garel, B., Patilea, V., Walther, G., Di Zio, M., Guarnera, U., and Luzi, O. (2007), “Imputation Through Finite Gaussian Mixture Models,” Computational Statistics & Data Analysis, 51, 5305–5316.
  • De Iorio, M., Müller, P., Rosner, G. L., and MacEachern, S. N. (2004), “An ANOVA Model for Dependent Random Measures,” Journal of the American Statistical Association, 99, 205–215.
  • Dunson, D. B., and Bhattacharya, A. (2011), “Nonparametric Bayes Regression and Classification Through Mixtures of Product Kernels,” in Bayesian Statistics 9, Proceedings of Ninth Valencia International Conference on Bayesian Statistics, eds. J. M. Bernardo, M. J. Bayarri, J. O. Berger, A. P. Dawid, D. Heckerman, A. F. M. Smith, and M. West, Oxford, UK: Oxford University Press.
  • Dunson, D. B., and Xing, C. (2009), “Nonparametric Bayes Modeling of Multivariate Categorical Data,” Journal of the American Statistical Association, 104, 1042–1051.
  • Elliott, M. R., and Stettler, N. (2007), “Using a Mixture Model for Multiple Imputation in the Presence of Outliers: The “Healthy for Life” Project,” Journal of the Royal Statistical Society, Series C, 56, 63–78.
  • Gebregziabher, M., and DeSantis, S. M. (2010), “Latent Class Based Multiple Imputation Approach for Missing Categorical Data,” Journal of Statistical Planning and Inference, 140, 3252–3262.
  • Hannah, L. A., Blei, D. M., and Powell, W. B. (2011), “Dirichlet Process Mixtures of Generalized Linear Models,” Journal of Machine Learning Research, 12, 1923–1953.
  • Heeringa, S. G., Little, R. J. A., and Raghunathan, T. E. (2002), “Multivariate Imputation of Coarsened Survey Data on Household Wealth,” in Survey Nonresponse, eds. R. Groves, D. A. Dillman, J. L. Eltinge, and R. J. A. Little, New York: Wiley.
  • Ibrahim, J. G., Chen, M. H., Lipsitz, S. R., and Herring, A. H. (2005), “Missing Data Methods for Generalized Linear Models: A Comparative Review,” Journal of the American Statistical Association, 100, 332–346.
  • Ibrahim, J. G., Lipsitz, S. R., and Chen, M. H. (1999), “Missing Covariates in Generalized Linear Models When the Missing Data Mechanism is Non-Ignorable,” Journal of the Royal Statistical Society, Series B, 61, 173–190.
  • Ishwaran, H., and James, L. F. (2001), “Gibbs Sampling Methods for Stick-Breaking Priors,” Journal of the American Statistical Association, 96, 161–173.
  • Kim, H. J., Reiter, J. P., Wang, Q., Cox, L. H., and Karr, A. F. (2014), “Multiple Imputation of Missing or Faulty Values Under Linear Constraints,” Journal of Business and Economic Statistics, 32, 375–386.
  • Kinney, S. K., and Reiter, J. P. (2010), “Tests of Multivariate Hypotheses When Using Multiple Imputation for Missing Data and Disclosure Limitation,” Journal of Official Statistics, 26, 301.
  • Kropko, J., Goodrich, B., Gelman, A., and Hill, J. (2014), “Multiple Imputation for Continuous and Categorical Data: Comparing Joint Multivariate Normal and Conditional Approaches,” Political Analysis, 22, 497–519.
  • Lee, K. J., and Carlin, J. B. (2010), “Multiple Imputation for Missing Data: Fully Conditional Specification Versus Multivariate Normal Imputation,” American Journal of Epidemiology, 171, 624–632.
  • Lipsitz, S. R., and Ibrahim, J. G. (1996), “A Conditional Model for Incomplete Covariates in Parametric Regression Models,” Biometrika, 83, 916–922.
  • Little, R. J. A. (1988), “Missing-Data Adjustments in Large Surveys,” Journal of Business & Economic Statistics, 6, 287–296.
  • Little, R. J. A., and Schluchter, M. D. (1985), “Maximum Likelihood Estimation for Mixed Continuous and Categorical Data With Missing Values,” Biometrika, 72, 497–512.
  • Liu, C., and Rubin, D. B. (1998), “Ellipsoidally Symmetric Extensions of the General Location Model for Mixed Categorical and Continuous Data,” Biometrika, 85, 673–688.
  • Liu, J., Gelman, A., Hill, J., Su, Y.-S., and Kropko, J. (2014), “On the Stationary Distribution of Iterative Imputations,” Biometrika, 101, 155–173.
  • Lumley, T. (2004), “Analysis of Complex Survey Samples,” Journal of Statistical Software, 09.
  • Manrique-Vallier, D., and Reiter, J. (2014a), “Bayesian Multiple Imputation for Large-scale Categorical Data With Structural Zeros,” Survey Methodology, 40, 125–134.
  • Manrique-Vallier, D., and Reiter, J. P. (2014b), “Bayesian Estimation of Discrete Multivariate Latent Structure Models With Structural Zeros,” Journal of Computational and Graphical Statistics, 23, 1061–1079.
  • Meng, X.-L. (1994), “Multiple-Imputation Inferences With Uncongenial Sources of Input,” Statistical Science, 9, 538–558.
  • Molitor, J., Papathomas, M., Jerrett, M., and Richardson, S. (2010), “Bayesian Profile Regression With an Application to the National Survey Of Children’s Health,” Biostatistics, 11, 484–498.
  • Moore, J., Fields, J., Benedetto, G., Stinson, M., Chan, A., and Maples, J. (2009), The 2008 Survey of Income and Program Participation Event History Calendar Field Test: Study Design and Initial Results Statistical Research Division Research Report Series (Survey Methodology), U.S. Census Bureau.
  • Olkin, I., and Tate, R. F. (1961), “Multivariate Correlation Models With Mixed Discrete and Continuous Variables,” The Annals of Mathematical Statistics, 32, 448–465.
  • Raghunathan, T. E., Lepkowski, J. M., Van Hoewyk, J., and Solenberger, P. (2001), “A Multivariate Technique for Multiply Imputing Missing Values Using a Sequence of Regression Models,” Survey Methodology, 27, 85–96.
  • Reiter, J. P., and Raghunathan, T. E. (2007), “The Multiple Adaptations of Multiple Imputation,” Journal of the American Statistical Association, 102, 1462–1471.
  • Rubin, D. B. (1976), “Inference and Missing Data,” Biometrika, 63, 581–592.
  • Rubin, D. B. (1987), Multiple Imputation for Nonresponse in Surveys, New York: Wiley.
  • Rubin, D. B. (1996), “Multiple Imputation After 18+ Years,” Journal of the American Statistical Association, 91, 473–489.
  • Schafer, J. (1997), Analysis of Incomplete Multivariate Data, Boca Raton, FL: CRC Press.
  • Sethuraman, J. (1994), “A Constructive Definition of Dirichlet Priors,” Statistica Sinica, 4, 639–650.
  • Shahbaba, B., and Neal, R. (2009), “Nonlinear Models Using Dirichlet Process Mixtures,” The Journal of Machine Learning Research, 10, 1829–1850.
  • Si, Y., and Reiter, J. P. (2013), “Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys,” Journal of Educational and Behavioral Statistics, 38, 499–521.
  • Si, Y., Reiter, J. P., and Hillygus, D. S. (2014), “Semi-Parametric Selection Models for Potentially Non-Ignorable Attrition in Panel Studies With Refreshment Samples,” Political Analysis, forthcoming.
  • Suchard, M. A., Wang, Q., Chan, C., Frelinger, J., Cron, A., and West, M. (2010), “Understanding GPU Programming for Statistical Computation: Studies in Massively Parallel Massive Mixtures,” Journal of Computational and Graphical Statistics, 19, 419–438.
  • U. S. Census Bureau. (2013), SIPP-EHC: 2011 and 2012 Field Test Evaluation Statistical Research Division Research Report Series (Survey Methodology), U.S. Census Bureau.
  • Van Buuren, S. (2007), “Multiple Imputation of Discrete and Continuous Data by Fully Conditional Specification,” Statistical Methods in Medical Research, 16, 219–242.
  • Van Buuren, S., and Groothuis-Oudshoorn, K. (2011), “mice: Multivariate Imputation by Chained Equations in R,” Journal of Statistical Software, 45, 1–67.
  • Van Buuren, S., and Oudshoorn, K. (1999), “Flexible Multivariate Imputation by MICE,” Leiden, The Netherlands: TNO Prevention Center.
  • Vermunt, J. K., Van Ginkel, J. R., Van Der Ark, L. A., and Sijtsma, K. (2008), “Multiple Imputation of Incomplete Categorial Data Using Latent Class Analysis,” Sociological Methodology, 38, 369–397.
  • Wade, S., Mongelluzzo, S., and Petrone, S. (2011), “An Enriched Conjugate Prior for Bayesian Nonparametric Inference,” Bayesian Analysis, 6, 359–385.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.