123
Views
2
CrossRef citations to date
0
Altmetric
Articles

A data-driven selection of the number of clusters in the Dirichlet allocation model via Bayesian mixture modelling

, &
Pages 2848-2870 | Received 02 Feb 2019, Accepted 10 Jul 2019, Published online: 18 Jul 2019

References

  • MacQueen J. Some methods for classification and analysis of multivariate observations. Proceedings of the fifth Berkeley symposium on mathematical statistics and probability Vol. 1. Berkeley (CA): University of California Press; 1967. p. 281–297.
  • Ward JH. Hierarchical groupings to optimize an objective function. J Am Stat Assoc. 1963;58:234–244. doi: 10.1080/01621459.1963.10500845
  • Sturn A, Quackenbush J, Trajanoski Z. Genesis: cluster analysis of microarray data. Bioinformatics. 2002;18:207–208. doi: 10.1093/bioinformatics/18.1.207
  • Wride M, Mansergh F, Adams S, et al. Expression profiling and gene discovery in the mouse lens. Mol Vis. 2003;9:360–396.
  • Oyelade OJ, Oladipupo OO, Obagbuwa I. Application of k-means clustering algorithm for prediction of students' academic performance. Int J Comput Sci Inf Secur. 2010;7:292–295.
  • Peterson AD, Ghosh AP, Maitra R. Merging k-means with hierarchical clustering for identifying general-shaped groups. Stats. 2018;7:e172. doi: 10.1002/sta4.172
  • Oh MS, Raftery AE. Model-based clustering with dissimilarities: a Bayesian approach. J Comput Graph Stat. 2007;16:559–585. doi: 10.1198/106186007X236127
  • Bouveyron C, Brunet C. Model-based clustering of high-dimensional data: a review. Comput Stat Data Anal. 2014;71:52–78. doi: 10.1016/j.csda.2012.12.008
  • McLachlan G, Basford KE. Mixture models: inference and applications to clustering. New York (NY): Marcel Dekker; 1988.
  • Banfield JD, Raftery AE. Model-based gaussian and non-gaussian clustering. Biometrics. 1993;49:803–821. doi: 10.2307/2532201
  • McLachlan G, Peel D. Finite mixture models. New York: Wiley Interscience; 2000.
  • Fraley C, Raftery A. Model-based clustering, discriminant analysis, and density estimation. J Am Stat Assoc. 2002;97:611–631. doi: 10.1198/016214502760047131
  • Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc, B. 1977;39(1):1–38.
  • Akaike HA. New look at the statistical model identification. IEEE Trans Automat Contr. 1974;19:716–723. doi: 10.1109/TAC.1974.1100705
  • Bozdogan H. Model selection and Akaike's information criterion (AIC): the general theory and its analytical extensions. Psychometrica. 1987;52:345–370. doi: 10.1007/BF02294361
  • Schwarz GE. Estimating the dimension of a model. Ann Stat. 1978;6:461–464. doi: 10.1214/aos/1176344136
  • Spiegelhalter DJ, Best NG, Carlin BP, et al. Bayesian measures of model complexity and fit. J R Stat Soc Ser B. 2002;64:583–616. doi: 10.1111/1467-9868.00353
  • Celeux G, Hurn M, Robert CP. Computational and inferential difficulties with mixture posterior distributions. J Am Stat Assoc. 2000;95:957–970. doi: 10.1080/01621459.2000.10474285
  • Richardson S, Green PJ. On bayesian analysis of mixtures with an unknown number of components. J R Stat Soc Ser B. 1997;59:731–792. doi: 10.1111/1467-9868.00095
  • Kullback S, Leibler RA. On information and sufficiency. Ann Math Stat. 1951;22:79–86. doi: 10.1214/aoms/1177729694
  • Godsill SJ. On the relationship between markov chain monte carlo methods for model uncertainty. J Comput Graph Stat. 2001;10:230–248. doi: 10.1198/10618600152627924
  • Dellaportas P, Forster J, Ntzoufras I. On bayesian model and variable selection using MCMC. Stat Comput. 2002;12:27–36. doi: 10.1023/A:1013164120801
  • Saraiva EF, Louzada F, Milan LA. Mixture models with an unknown number of components via a new posterior split-merge mcmc algorithm. Appl Math Comput. 2014;244:959–975.
  • Fruhwirth-Schnatter S. Finite mixture and Markov switching models. New York (NY): Springer Science & Business Media; 2006.
  • Stephens M. Dealing with label switching in mixture models. J R Stat Soc Ser B. 2000;62:795–809. doi: 10.1111/1467-9868.00265
  • Jasra A, Holmes CC, Stephens DA. Markov chain monte carlo methods and the label switching problem in bayesian mixture modeling. Stat Sci. 2005;20:50–67. doi: 10.1214/088342305000000016
  • Ferguson TS. A Bayesian analysis of some nonparametric problems. Ann Stat. 1973;1:209–230. doi: 10.1214/aos/1176342360
  • Antoniak CE. Mixture of processes Dirichlet with applications to Bayesian nonparametric problems. Ann Stat. 1974;2:1152–1174. doi: 10.1214/aos/1176342871
  • Jain S, Neal R. A split-merge markov chain monte carlo procedure for the dirichlet process mixture models. J Comput Graph Stat. 2004;13(1):158–182. doi: 10.1198/1061860043001
  • Carlin BP, Chib S. Bayesian model choice via markov chain monte carlo methods. J R Stat Soc B. 1995;57:473–484.
  • Chib S, Greenberg E. Understanding the Metropolis-Hastings algorithm. Am Stat. 1995;49:327–335.
  • Casella G, Robert C, Wells M. Mixture models, latent variables and partitioned importance sampling. Paris: CREST, INSEE; 2000. (Technical Report).
  • Nobile A, Fearnside AT. Bayesian finite mixtures with an unknown number of components: the allocation sampler. Stat Comput. 2007;17:147–162. doi: 10.1007/s11222-006-9014-7
  • Saraiva EF, Suzuki AK, Louzada F, et al. Partitioning gene expression data by data-driven markov chain monte carlo. J Appl Stat. 2016;43:1155–1173. doi: 10.1080/02664763.2015.1092113
  • Roeder K, Wasserman L. Practical bayesian density estimation using mixture of normals. J Am Stat Assoc. 1997;92:894–902. doi: 10.1080/01621459.1997.10474044
  • Escobar MD, West M. Bayesian density estimation and inference using mixtures. J Am Stat Assoc. 1995;90:577–588. doi: 10.1080/01621459.1995.10476550
  • Peng F, Dey D. Bayesian analysis of outlier problems using divergence measures. Can J Stat. 1995;23:199–213. doi: 10.2307/3315445

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.