1,314
Views
6
CrossRef citations to date
0
Altmetric
Theory and Methods

A Geometric Variational Approach to Bayesian Inference

, &
Pages 822-835 | Received 10 Aug 2017, Accepted 29 Jan 2019, Published online: 30 Apr 2019

References

  • Amari, S. (1998), “Natural Gradient Works Efficiently in Learning,” Neural Computation, 10, 251–276. DOI: 10.1162/089976698300017746.
  • Barber, D., and Wiegerinck, W. (1999), “Tractable Variational Structures for Approximating Graphical Models,” in Neural Information Processing Systems (Vol. 11), pp. 183–189.
  • Bauer, M., Joshi, S., and Modin, K. (2015), “Diffeomorphic Density Matching by Optimal Information Transport,” SIAM Journal on Imaging Sciences, 8, 1718–1751. DOI: 10.1137/151006238.
  • Bauer, M., Joshi, S., and Modin, K. (2017), “Diffeomorphic Random Sampling Using Optimal Information Transport,” in Geometric Science of Information: Third International Conference, GSI 2017, pp. 135–142.
  • Beal, M. J. (2003), “Variational Algorithms for Approximate Bayesian Inference,” PhD thesis, University College London.
  • Bhattacharyya, A. (1943), “On a Measure of Divergence Between Two Statistical Population Defined by Their Population Distributions,” Bulletin of the Calcutta Mathematical Society, 35, 99–109.
  • Bishop, C. M. (2006), Pattern Recognition and Machine Learning, New York: Springer.
  • Blei, D. M., Kucukelbir, A., and McAuliffe, J. D. (2017), “Variational Inference: A Review for Statisticians,” Journal of the American Statistical Association, 112, 859–877. DOI: 10.1080/01621459.2017.1285773.
  • Broderick, T., Boyd, N., Wibisono, A., Wilson, A. C., and Jordan, M. I. (2013), “Streaming Variational Bayes,” in Neural Information Processing Systems, pp. 1727–1735.
  • Carlin, B. P., and Louis, T. A. (2008), Bayesian Methods for Data Analysis, London: CRC Press.
  • Cencov, N. N. (2000), Statistical Decision Rules and Optimal Inference, Number 53, Providence, RI: American Mathematical Society.
  • Chen, T., Streets, J., and Shahbaba, B. (2015), “A Geometric View of Posterior Approximation,” arXiv:1510.00861.
  • Cowles, M. K., and Carlin, B. P. (1996), “Markov Chain Monte Carlo Convergence Diagnostics: A Comparative Review,” Journal of the American Statistical Association, 91, 883–904. DOI: 10.1080/01621459.1996.10476956.
  • Dua, D., and Taniskidou, E. K. (2017). “UCI Machine Learning Repository.”
  • Ghahramani, Z., and Beal, M. J. (1999), “Variational Inference for Bayesian Mixtures of Factor Analysers,” in Neural Information Processing Systems (Vol. 12), pp. 449–455.
  • Girolami, M., and Calderhead, B. (2011), “Riemann Manifold Langevin and Hamiltonian Monte Carlo Methods,” Journal of the Royal Statistical Society, Series B, 73, 123–214. DOI: 10.1111/j.1467-9868.2010.00765.x.
  • Hernández-Lobato, J., Li, Y., Rowland, M., Bui, T., Hernández-Lobato, D., and Turner, R. (2016), “Black-Box α-Divergence Minimization,” in International Conference on Machine Learning, pp. 1511–1520.
  • Hoffman, M., and Blei, D. (2015), “Stochastic Structured Variational Inference,” in Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, pp. 361–369.
  • Hoffman, M. D., Blei, D. M., Wang, C., and Paisley, J. W. (2013), “Stochastic Variational Inference,” Journal of Machine Learning Research, 14, 1303–1347.
  • Jaakkola, T., and Jordan, M. I. (1997), “A Variational Approach to Bayesian Logistic Regression Models and Their Extensions,” in International Workshop on Artificial Intelligence and Statistics (Vol. 82).
  • Kass, R. E., and Vos, P. W. (2011), Geometrical Foundations of Asymptotic Inference (Vol. 908), Hoboken, NJ: Wiley.
  • Kingma, D. P., Salimans, T., Jozefowicz, R., Chen, X., Sutskever, I., and Welling, M. (2016), “Improved Variational Inference With Inverse Autoregressive Flow,” in Advances in Neural Information Processing Systems, pp. 4743–4751.
  • Kucukelbir, A., Tran, D., Ranganath, R., Gelman, A., and Blei, D. M. (2017), “Automatic Differentiation Variational Inference,” Journal of Machine Learning Research, 18, 430–474.
  • Kurtek, S. (2017), “A Geometric Approach to Pairwise Bayesian Alignment of Functional Data Using Importance Sampling,” Electronic Journal of Statistics, 11, 502–531. DOI: 10.1214/17-EJS1243.
  • Kurtek, S., and Bharath, K. (2015), “Bayesian Sensitivity Analysis With the Fisher-Rao Metric,” Biometrika, 102, 601–616. DOI: 10.1093/biomet/asv026.
  • Lang, S. (2012), Fundamentals of Differential Geometry (Vol. 191), Berlin: Springer Science & Business Media.
  • Leonard, T. (1978), “Density Estimation, Stochastic Processes and Prior Information,” Journal of the Royal Statistical Society, Series B, 40, 113–146. DOI: 10.1111/j.2517-6161.1978.tb01655.x.
  • Li, Y., and Turner, R. E. (2016), “Rényi Divergence Variational Inference,” in Neural Information Processing Systems, pp. 1073–1081.
  • McGrory, C. A., and Titterington, D. (2007), “Variational Approximations in Bayesian Model Selection for Finite Mixture Distributions,” Computational Statistics & Data Analysis, 51, 5352–5367. DOI: 10.1016/j.csda.2006.07.020.
  • Minka, T. P. (2001), “Expectation Propagation for Approximate Bayesian Inference,” in Seventeenth Conference on Uncertainty in Artificial Intelligence, pp. 362–369.
  • Minka, T. P. (2005), “Divergence Measures and Message Passing,” Technical Report.
  • Olson, J. M., and Weissfeld, L. A. (1991), “Approximation of Certain Multivariate Integrals,” Statistics & Probability Letters, 11, 309–317.
  • Ramsay, J. O., Hooker, G., and Graves, S. (2009), Functional Data Analysis with R and MATLAB, Berlin: Springer Science & Business Media.
  • Rao, C. R. (1945), “Information and Accuracy Attainable in the Estimation of Statistical Parameters,” Bulletin of the Calcutta Mathematical Society, 37, 81–91.
  • Rényi, A. (1961), “On Measures of Entropy and Information,” in Berkeley Symposium on Mathematical Statistics and Probability (Vol. 1), pp. 547–561.
  • Rezende, D., and Mohamed, S. (2015), “Variational Inference With Normalizing Flows,” in International Conference on Machine Learning, pp. 1530–1538.
  • Riihimäki, J., and Vehtari, A. (2014), “Laplace Approximation for Logistic Gaussian Process Density Estimation and Regression,” Bayesian Analysis, 9, 425–448. DOI: 10.1214/14-BA872.
  • Ring, W., and Wirth, B. (2012), “Optimization Methods on Riemannian Manifolds and Their Applications to Shape Space,” SIAM Journal of Optimization, 22, 596–627. DOI: 10.1137/11082885X.
  • Saul, L. K., and Jordan, M. I. (1996), “Exploiting Tractable Substructures in Intractable Networks,” in Neural Information Processing Systems, pp. 486–492.
  • Sigillito, V. G., Wing, S. P., Hutton, L. V., and Baker, K. B. (1989), “Classification of Radar Returns From the Ionosphere Using Neural Networks,” Johns Hopkins APL Technical Digest, 10, 262–266.
  • Srivastava, A., Jermyn, I. H., and Joshi, S. H. (2007), “Riemannian Analysis of Probability Density Functions With Applications in Vision,” in IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8.
  • Srivastava, A., Klassen, E., Joshi, S. H., and Jermyn, I. H. (2011), “Shape Analysis of Elastic Curves in Euclidean Spaces,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 1415–1428. DOI: 10.1109/TPAMI.2010.184.
  • Tan, L. S. L. (2021), “Use of model reparametrization to improve variational Bayes,” Journal of the Royal Statistical Society: Series B (Statistical Methodology), 83, 30–57. https://doi.org/10.1111/rssb.12399 (arXiv:1805.07267).
  • Tokdar, S. T., and Ghosh, J. K. (2007), “Posterior Consistency of Logistic Gaussian Process Priors in Density Estimation,” Journal of Statistical Planning and Inference, 137, 34–42. DOI: 10.1016/j.jspi.2005.09.005.
  • Ueda, N., and Ghahramani, Z. (2002), “Bayesian Model Search for Mixture Models Based on Optimizing Variational Bounds,” Neural Networks, 15, 1223–1241. DOI: 10.1016/S0893-6080(02)00040-0.
  • van der Vaart, A. W., and van Zanten, J. H. (2009), “Adaptive Bayesian Estimation Using a Gaussian Random Field With Inverse Gamma Bandwidth,” Annals of Statistics, 37, 2655–2675. DOI: 10.1214/08-AOS678.
  • Van Erven, T., and Harremos, P. (2014), “Rényi Divergence and Kullback-Leibler Divergence,” IEEE Transactions on Information Theory, 60, 3797–3820. DOI: 10.1109/TIT.2014.2320500.
  • Wang, C., and Blei, D. M. (2013), “Variational Inference in Nonconjugate Models,” Journal of Machine Learning Research, 14, 1005–1031.
  • Yeung, D., Chang, H., Xiong, Y., George, S., Kashi, R., Matsumoto, T., and Rigoll, G. (2004), “SVC2004: First International Signature Verification Competition,” in International Conference on Biometric Authentication, pp. 16–22.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.