408
Views
0
CrossRef citations to date
0
Altmetric
Bayesian Computation

A Bayesian Singular Value Decomposition Procedure for Missing Data Imputation

ORCID Icon & ORCID Icon
Pages 470-482 | Received 04 Jun 2021, Accepted 22 Jul 2022, Published online: 07 Oct 2022

References

  • Andrews, D. F., and Mallows, C. L. (1974), “Scale Mixtures of Normal Distributions,” Journal of the Royal Statistical Society, Series B, 36, 99–102. DOI: 10.1111/j.2517-6161.1974.tb00989.x.
  • Audigier, V., Husson, F., and Josse, J. (2016), “Multiple Imputation for Continuous Variables Using a Bayesian Principal Component Analysis,” Journal of Statistical Computation and Simulation, 86, 2140–2156. DOI: 10.1080/00949655.2015.1104683.
  • Brick, J. M., and Kalton, G. (1996), “Handling Missing Data in Survey Research,” Statistical Methods in Medical Research, 5, 215–238. DOI: 10.1177/096228029600500302.
  • Burgette, L. F., and Reiter, J. P. (2010), “Multiple Imputation for Missing Data via Sequential Regression Trees,” American Journal of Epidemiology, 172, 1070–1076. DOI: 10.1093/aje/kwq260.
  • Carlin, J. B. (2014), “Multiple Imputation: Perspective and Historical Overview,” in Handbook of Missing Data Methodology, eds. G. Molenberghs, G. Fitzmaurice, M. G. Kenward, A. A. Tsiatis, and G. Verbeke, pp. 239–261, Boca Raton, FL: CRC Press.
  • Carpenter, J., and Kenward, M. (2012), Multiple Imputation and its Application, Chichester: Wiley.
  • Carpenter, J. R., and Kenward, M. G. (2014), “Development of Methods for the Analysis of Partially Observed Data and Critique of ad hoc Methods,” in Handbook of Missing Data Methodology, eds. G. Molenberghs, G. Fitzmaurice, M. G. Kenward, A. A. Tsiatis, and G. Verbeke, pp. 23–46, Boca Raton, FL: CRC Press.
  • Casella, G. (2001), “Empirical Bayes Gibbs Sampling,” Biostatistics, 2, 485–500. DOI: 10.1093/biostatistics/2.4.485.
  • Chikuse, Y. (2003), Statistics on Special Manifolds (Vol. 174), New York: Springer.
  • Doove, L. L., Van Buuren, S., and Dusseldorp, E. (2014), “Recursive Partitioning for Missing Data Imputation in the Presence of Interaction Effects,” Computational Statistics & Data Analysis, 72, 92–104.
  • Dua, D., and Graff, C. (2017), “UCI Machine Learning Repository,” Available at http://archive.ics.uci.edu/ml.
  • Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., and Rubin, D. B. (2013), Bayesian Data Analysis, Boca Raton, FL: CRC Press.
  • Goldstein, H., Carpenter, J. R., and Browne, W. J. (2014), “Fitting Multilevel Multivariate Models with Missing Data in Responses and Covariates that may Include Interactions and Non-linear Terms,” Journal of the Royal Statistical Society, Series A, 177, 553–564. DOI: 10.1111/rssa.12022.
  • Griffin, J. E., and Brown, P. J. (2010), “Inference with Normal-Gamma Prior Distributions in Regression Problems,” Bayesian Analysis, 5, 171–188.
  • Hastie, T., Tibshirani, R., and Wainwright, M. (2019), Statistical Learning with Sparsity: The Lasso and Generalizations, Boca Raton, FL: Chapman and Hall/CRC.
  • Hoeting, J. A., Madigan, D., Raftery, A. E., and Volinsky, C. T. (1999), “Bayesian Model Averaging: A Tutorial,” Statistical Science, 14, 382–401.
  • Hoff, P. D. (2007), “Model Averaging and Dimension Selection for the Singular Value Decomposition,” Journal of the American Statistical Association, 102, 674–685. DOI: 10.1198/016214506000001310.
  • Hoff, P. D. (2009), “Simulation of the Matrix Bingham-von Mises-Fisher Distribution, With Applications to Multivariate and Relational Data,” Journal of Computational and Graphical Statistics, 18, 438–456, papers2://publication/doi/ DOI: 10.1198/jcgs.2009.07177..
  • Hogan, J. W., Roy, J., and Korkontzelou, C. (2004), “Handling Drop-out in Longitudinal Studies,” Statistics in Medicine, 23, 1455–1497. DOI: 10.1002/sim.1728.
  • Honaker, J., and King, G. (2010), “What to do about Missing Values in Time-Series Cross-Section Data,” American Journal of Political Science, 54, 561–581. DOI: 10.1111/j.1540-5907.2010.00447.x.
  • Honaker, J., King, G., and Blackwell, M. (2011a), “Amelia II: A Program for Missing Data,” Journal of Statistical Software, 45, 1–47. Available at http://www.jstatsoft.org/v45/i07/.
  • Honaker, J., King, G., and Blackwell, M. (2011b), “Amelia II: A Program for Missing Data,” Journal of Statistical Software, 45, 1–47.
  • Husson, F., Josse, J., Narasimhan, B., and Robin, G. (2019), “Imputation of Mixed Data with Multilevel Singular Value Decomposition,” Journal of Computational and Graphical Statistics, 28, 552–566. DOI: 10.1080/10618600.2019.1585261.
  • Jannach, D., Resnick, P., Tuzhilin, A., and Zanker, M. (2016), “Recommender Systems-Beyond Matrix Completion,” Communications of the ACM, 59, 94–102. DOI: 10.1145/2891406.
  • Josse, J., and Husson, F. (2016), “missMDA: A Package for Handling Missing Values in Multivariate Data Analysis,” Journal of Statistical Software, 70, 1–31. DOI: 10.18637/jss.v070.i01.
  • Laurent, M. (2009), “Matrix Completion Problems,” Encyclopedia of Optimization, 3, 221–229.
  • Little, R. J. (1988), “Missing-Data Adjustments in Large Surveys,” Journal of Business & Economic Statistics, 6, 287–296.
  • Little, R. J., and Rubin, D. B. (2019), Statistical Analysis with Missing Data (Vol. 793), Hoboken, NJ: Wiley.
  • Little, R. J., and Schluchter, M. D. (1985), “Maximum Likelihood Estimation for Mixed Continuous and Categorical Data with Missing Values,” Biometrika, 72, 497–512. DOI: 10.1093/biomet/72.3.497.
  • Liu, J., Gelman, A., Hill, J., Su, Y.-S., and Kropko, J. (2013), “On the Stationary Distribution of Iterative Imputations,” Biometrika, 101, 155–173. DOI: 10.1093/biomet/ast044.
  • Mangasarian, O. L., and Wolberg, W. H. (1990), “Cancer Diagnosis via Linear Programming,” Technical Report, University of Wisconsin-Madison Department of Computer Sciences.
  • Meng, X.-L. (1994), “Multiple-Imputation Inferences with Uncongenial Sources of Input,” Statistical Science, 9, 538–558.
  • Morita, S., Thall, P. F., and Müller, P. (2008), “Determining the Effective Sample Size of a Parametric Prior,” Biometrics, 64, 595–602. DOI: 10.1111/j.1541-0420.2007.00888.x.
  • Murray, J. S. (2018), “Multiple Imputation: A Review of Practical and Theoretical Findings,” Statistical Science, 33, 142–159. DOI: 10.1214/18-STS644.
  • Park, T., and Casella, G. (2008), “The Bayesian Lasso,” Journal of the American Statistical Association, 103, 681–686. DOI: 10.1198/016214508000000337.
  • Rubin, D. B. (1976), “Inference and Missing Data,” Biometrika, 63, 581–592. DOI: 10.1093/biomet/63.3.581.
  • Rubin, D. B. (1996), “Multiple Imputation after 18+ Years”, Journal of the American Statistical Association, 91, 473–489.
  • Rubin, D. B. (2004), Multiple Imputation for Nonresponse in Surveys (Vol. 81), Hoboken, NJ: Wiley.
  • Rubin, D. B., and Schenker, N. (1986), “Multiple Imputation for Interval Estimation from Simple Random Samples with Ignorable Nonresponse,” Journal of the American Statistical Association, 81, 366–374. DOI: 10.1080/01621459.1986.10478280.
  • Schafer, J. L. (1997), Analysis of Incomplete Multivariate Data, Boca Raton, FL: CRC Press.
  • Shah, A. D., Bartlett, J. W., Carpenter, J., Nicholas, O., and Hemingway, H. (2014), “Comparison of Random Forest and Parametric Imputation Models for Imputing Missing Data Using MICE: A CALIBER Study,” American Journal of Epidemiology, 179, 764–774. DOI: 10.1093/aje/kwt312.
  • Song, Y., and Zhu, J. (2016), “Bayesian Matrix Completion via Adaptive Relaxed Spectral Regularization,” in Thirtieth AAAI Conference on Artificial Intelligence.
  • Tanner, M. A., and Wong, W. H. (1987), “The Calculation of Posterior Distributions by Data Augmentation,” Journal of the American statistical Association, 82, 528–540. DOI: 10.1080/01621459.1987.10478458.
  • Tuncer, Y., Tanik, M. M., and Allison, D. B. (2008), “An Overview of Statistical Decomposition Techniques Applied to Complex Systems,” Computational Statistics & Data Analysis, 52, 2292–2310.
  • Tzikas, D. G., Likas, A. C., and Galatsanos, N. P. (2008), “The Variational Approximation for Bayesian Inference,” IEEE Signal Processing Magazine, 25, 131–146. DOI: 10.1109/MSP.2008.929620.
  • Van Buuren, S. (2018), Flexible Imputation of Missing Data, Boca Raton, FL: Chapman and Hall/CRC.
  • van Buuren, S., Brand, J. P. L., Groothuis-Oudshoorn, C. G. M., and Rubin, D. B. (2006), “Fully Conditional Specification in Multivariate Imputation,” Journal of Statistical Computation and Simulation, 76, 1049–1064. DOI: 10.1080/10629360600810434.
  • van Buuren, S., and Groothuis-Oudshoorn, K. (2011), “mice: Multivariate Imputation by Chained Equations in R,” Journal of Statistical Software, 45, 1–67. Available at https://doi.org/10.18637/jss.v045.i03.
  • White, I. R., Royston, P., and Wood, A. M. (2011), “Multiple Imputation Using Chained Equations: Issues and Guidance for Practice,” Statistics in Medicine, 30, 377–399. DOI: 10.1002/sim.4067.
  • Wold, S., Esbensen, K., and Geladi, P. (1987), “Principal Component Analysis,” Chemometrics and Intelligent Laboratory Systems, 2, 37–52. DOI: 10.1016/0169-7439(87)80084-9.
  • Xu, D., Daniels, M. J., and Winterstein, A. G. (2016), “Sequential BART for Imputation of Missing Covariates,” Biostatistics, 17, 589–602. DOI: 10.1093/biostatistics/kxw009.
  • Yang, L., Fang, J., Duan, H., Li, H., and Zeng, B. (2018), “Fast Low-Rank Bayesian Matrix Completion with Hierarchical Gaussian Prior Models,” IEEE Transactions on Signal Processing, 66, 2804–2817. DOI: 10.1109/TSP.2018.2816575.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.