330
Views
3
CrossRef citations to date
0
Altmetric
Bayesian and Latent Variable Models

Scalable Hyperparameter Selection for Latent Dirichlet Allocation

&
Pages 875-895 | Received 20 Dec 2018, Accepted 02 Dec 2019, Published online: 15 May 2020

References

  • Asuncion, A., Welling, M., Smyth, P., and Teh, Y. W. (2009), “On Smoothing and Inference for Topic Models,” in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, UAI ‘09, AUAI Press, Arlington, VA.
  • Blei, D. M., and Jordan, M. I. (2003), “Modeling Annotated Data,” in Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ‘03, ACM, New York, NY.
  • Blei, D. M., Kucukelbir, A., and McAuliffe, J. D. (2017), “Variational Inference: A Review for Statisticians,” Journal of the American Statistical Association, 112, 859–877. DOI: 10.1080/01621459.2017.1285773.
  • Blei, D. M., and Lafferty, J. D. (2007), “A Correlated Topic Model of Science,” The Annals of Applied Statistics, 1, 17–35. DOI: 10.1214/07-AOAS114.
  • Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003), “Latent Dirichlet Allocation,” Journal of Machine Learning Research, 3, 993–1022.
  • Celeux, G., Hurn, M., and Robert, C. P. (2000), “Computational And Inferential Difficulties With Mixture Posterior Distributions,” Journal of the American Statistical Association, 95, 957–970. DOI: 10.1080/01621459.2000.10474285.
  • Chen, M.-H. (1994), “Importance-Weighted Marginal Bayesian Posterior Density Estimation,” Journal of the American Statistical Association, 89, 818–824. DOI: 10.1080/01621459.1994.10476815.
  • Chen, Z. (2015), “Inference for the Number of Topics in the Latent Dirichlet Allocation Model via Bayesian Mixture Modelling,” Ph.D. thesis, University of Florida.
  • Chib, S. (1995), “Marginal Likelihood From the Gibbs Output,” Journal of the American Statistical Association, 90, 1313–1321. DOI: 10.1080/01621459.1995.10476635.
  • Chib, S., and Jeliazkov, I. (2001), “Marginal Likelihood From the Metropolis-Hastings Output,” Journal of the American Statistical Association, 96, 270–281. DOI: 10.1198/016214501750332848.
  • Donoho, D. L., and Liu, R. C. (1991), “Geometrizing Rates of Convergence, II,” The Annals of Statistics, 19, 633–667. DOI: 10.1214/aos/1176348114.
  • Doss, H., and George, C. P. (2019), “Theoretical and Empirical Evaluation of a Grouped Gibbs Sampler for Parallel Computation in the LDA Model,” Tech. Rep., Department of Statistics, University of Florida.
  • Doss, H., and Linero, A. (2019), “A Fully-Bayes Approach to Empirical Bayes Inference and Bayesian Sensitivity Analysis,” Tech. Rep., Department of Statistics, University of Florida.
  • Escobar, M. D., and West, M. (1995), “Bayesian Density Estimation and Inference Using Mixtures,” Journal of the American Statistical Association, 90, 577–588. DOI: 10.1080/01621459.1995.10476550.
  • Flegal, J. M., Haran, M., and Jones, G. L. (2008), “Markov Chain Monte Carlo: Can We Trust the Third Significant Figure?,” Statistical Science, 23 250–260. DOI: 10.1214/08-STS257.
  • Flegal, J. M., Hughes, J., and Vats, D. (2016), “mcmcse: Monte Carlo Standard Errors for MCMC,” Riverside, CA and Minneapolis, MN, R Package Version 1.2-1.
  • George, C. P. (2015), “Latent Dirichlet Allocation: Hyperparameter Selection and Applications to Electronic Discovery,” Ph.D. thesis, University of Florida.
  • George, C. P., and Doss, H. (2018), “Principled Selection of Hyperparameters in the Latent Dirichlet Allocation Model,” Journal of Machine Learning Research, 18, 1–38.
  • George, E. I., and Foster, D. P. (2000), “Calibration and Empirical Bayes Variable Selection,” Biometrika, 87, 731–747. DOI: 10.1093/biomet/87.4.731.
  • Geyer, C. J., and Thompson, E. A. (1995), “Annealing Markov Chain Monte Carlo With Applications to Ancestral Inference,” Journal of the American Statistical Association, 90, 909–920. DOI: 10.1080/01621459.1995.10476590.
  • Griffiths, T. L., and Steyvers, M. (2004), “Finding Scientific Topics,” Proceedings of the National Academy of Sciences of the United States of America, 101, 5228–5235. DOI: 10.1073/pnas.0307752101.
  • Hobert, J. P. (2011), “The Data Augmentation Algorithm: Theory and Methodology,” in Handbook of Markov Chain Monte Carlo, eds. S. P. Brooks, A. Gelman, G. L. Jones, and X.-L. Meng, Boca Raton, FL: CRC Press, pp. 253–293.
  • Jasra, A., Holmes, C. C., and Stephens, D. A. (2005), “Markov Chain Monte Carlo Methods and the Label Switching Problem in Bayesian Mixture Modeling,” Statistical Science, 20, 50–67. DOI: 10.1214/088342305000000016.
  • Jones, G. L., Haran, M., Caffo, B. S., and Neath, R. (2006), “Fixed-Width Output Analysis for Markov Chain Monte Carlo,” Journal of the American Statistical Association, 101, 1537–1547. DOI: 10.1198/016214506000000492.
  • Jordan, M. I., Ghahramani, Z., Jaakkola, T. S., and Saul, L. K. (1999), “An Introduction to Variational Methods for Graphical Models,” Machine Learning, 37, 183–233. DOI: 10.1023/A:1007665907178.
  • Marinari, E., and Parisi, G. (1992), “Simulated Tempering: A New Monte Carlo Scheme,” Europhysics Letters, 19, 451–458. DOI: 10.1209/0295-5075/19/6/002.
  • Meng, X.-L., and Van Dyk, D. (1997), “The EM Algorithm—An Old Folk-Song Sung to a Fast New Tune,” Journal of the Royal Statistical Society, Series B, 59, 511–567. DOI: 10.1111/1467-9868.00082.
  • Minka, T. P. (2003), “Estimating a Dirichlet Distribution,” available at http://research.microsoft.com/∼minka/papers/dirichlet/.
  • Nash, J. C., and Varadhan, R. (2011), “Unifying Optimization Algorithms to Aid Software System Users: optimx for R,” Journal of Statistical Software, 43, 1–14. DOI: 10.18637/jss.v043.i09.
  • Neal, R. M. (2003), “Slice Sampling,” The Annals of Statistics, 31, 705–741. DOI: 10.1214/aos/1056562461.
  • Neal, R. M. (2011), “MCMC Using Hamiltonian Dynamics,” in Handbook of Markov Chain Monte Carlo, eds. S. P. Brooks, A. Gelman, G. L. Jones, and X.-L. Meng, Boca Raton, FL: CRC Press, pp. 113–162.
  • Newman, D., Asuncion, A., Smyth, P., and Welling, M. (2009), “Distributed Algorithms for Topic Models,” Journal of Machine Learning Research, 10, 1801–1828.
  • Newton, M., and Raftery, A. (1994), “Approximate Bayesian Inference With the Weighted Likelihood Bootstrap” (with discussion), Journal of the Royal Statistical Society, Series B, 56, 3–48. DOI: 10.1111/j.2517-6161.1994.tb01956.x.
  • Řehůřek, R., and Sojka, P. (2010), “Software Framework for Topic Modelling With Large Corpora,” in Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, ELRA, Valletta, Malta.
  • Robert, C. P. (2001), The Bayesian Choice: From Decision-Theoretic Foundations to Computational Implementation, New York: Springer-Verlag.
  • Rosen-Zvi, M., Griffiths, T., Steyvers, M. and Smyth, P. (2004), “The Author-Topic Model for Authors and Documents,” in Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, UAI ‘04, AUAI Press, Arlington, VA.
  • Teh, Y. W., Jordan, M. I., Beal, M. J., and Blei, D. M. (2006), “Hierarchical Dirichlet Processes,” Journal of the American Statistical Association, 101, 1566–1581. DOI: 10.1198/016214506000000302.
  • Tsybakov, A. B. (1990), “Recursive Estimation of the Mode of a Multivariate distribution,” Problemy Peredachi Informatsii, 26, 38–45.
  • Wallach, H. M. (2006), “Topic Modeling: Beyond Bag-of-Words,” in Proceedings of the 23rd International Conference on Machine Learning, ICML ‘06, ACM, New York, NY.
  • Wallach, H. M. (2008), “Structured Topic Models for Language,” Ph.D. thesis, University of Cambridge.
  • Wallach, H. M., Murray, I., Salakhutdinov, R., and Mimno, D. (2009), “Evaluation Methods for Topic Models,” in Proceedings of the 26th Annual International Conference on Machine Learning, ACM. DOI: 10.1145/1553374.1553515.
  • Wolpert, R. L., and Schmidler, S. C. (2012), “α-Stable Limit Laws for Harmonic Mean Estimators of Marginal Likelihoods,” Statistica Sinica, 22, 1233–1251. DOI: 10.5705/ss.2010.221.
  • Xia, W. (2018), “Scalable Hyperparameter Selection for Latent Dirichlet Allocation,” Ph.D. thesis, University of Florida.
  • Xia, W., and Doss, H. (2020), “Supplement to ‘Scalable Hyperparameter Selection for Latent Dirichlet Allocation’.”
  • Yang, Y. (2005), “Can the Strengths of AIC and BIC Be Shared? A Conflict Between Model Indentification and Regression Estimation,” Biometrika, 92, 937–950. DOI: 10.1093/biomet/92.4.937.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.