184
Views
7
CrossRef citations to date
0
Altmetric
Articles

Robust inference for parsimonious model-based clustering

&
Pages 414-442 | Received 07 Jul 2018, Accepted 28 Nov 2018, Published online: 05 Dec 2018

References

  • Garcia-Escudero LA, Gordaliza A, Matrán C. Trimming tools in exploratory data analysis. J Comput Graph Stat. 2003;12(2):434–449. doi: 10.1198/1061860031806
  • Farcomeni A, Ventura L. An overview of robust methods in medical research. Stat Methods Med Res. 2012;21:111–133. doi: 10.1177/0962280210385865
  • Farcomeni A, Greco L. Robust methods for data reduction. Boca Raton: Chapman & Hall / CRC Press; 2015.
  • Dotto F. Advances in robust clustering methods with applications [Ph.D. thesis]. Ph.D School of Statistiscal Sciences. “La Sapienza” University of Rome; 2017.
  • García-Escudero LA, Gordaliza A, Matrán C, et al. Avoiding spurious local maximizers in mixture modeling. Stat Comput. 2015;25:619–633. doi: 10.1007/s11222-014-9455-3
  • Ingrassia S, Rocci R. Degeneracy of the EM algorithm for the MLE of multivariate Gaussian mixtures and dynamic constraints. Comput Stat Data Anal. 2011;55:1715–1725. doi: 10.1016/j.csda.2010.10.026
  • Coretto P, Hennig C. Robust improper maximum likelihood: tuning, computation, and a comparison with other methods for robust Gaussian clustering. J Am Stat Assoc. 2016;111:1648–1659. doi: 10.1080/01621459.2015.1100996
  • Cerioli A, Farcomeni A, Riani M. Wild adaptive trimming for robust estimation and cluster analysis. Scand J Statist. 2018. doi: 10.1111/sjos.12349
  • McLachlan G, Peel D. Finite mixture models. New York: John Wiley & Sons; 2004.
  • Zhang J, Liang F. Robust clustering using exponential power mixtures. Biometrics. 2010;66:1078–1086. doi: 10.1111/j.1541-0420.2010.01389.x
  • Bagnato L, Punzo A, Zoia MG. The multivariate leptokurtic-normal distribution and its application in model-based clustering. Can J Stat. 2017;45:95–119. doi: 10.1002/cjs.11308
  • Banfield JD, Raftery AE. Model-based Gaussian and non-Gaussian clustering. Biometrics. 1993;49:803–821. doi: 10.2307/2532201
  • Celeux G, Govaert G. Gaussian parsimonious clustering models. Pattern Recognit. 1995;28:781–793. doi: 10.1016/0031-3203(94)00125-6
  • Fraley C, Raftery AE. Model based clustering, discriminant analysis and density estimation. J Am Stat Assoc. 2002;97:611–631. doi: 10.1198/016214502760047131
  • Fraley C, Raftery AE. Model based methods of classification: using the mclust software in chemometrics. J Stat Softw. 2007;6:1–13.
  • Punzo A, McNicholas PD. Parsimonious mixtures of multivariate contaminated normal distributions. Biometric J. 2016;58:1506–1537. doi: 10.1002/bimj.201500144
  • Gallegos MT, Ritter G. Trimmed ML estimation of contaminated mixtures. Sankhya. 2009a;71:164–220.
  • Gallegos MT, Ritter G. Trimming algorithms for clustering contaminated grouped data and their robustness. Adv Data Anal Classif. 2009b;3:135–167. doi: 10.1007/s11634-009-0044-9
  • Gallegos MT, Ritter G. Using combinatorial optimization in model-based trimmed clustering with cardinality constraints. Comput Statit Data Anal. 2010;54:637–654. doi: 10.1016/j.csda.2009.08.023
  • Hathaway RJ. A constrained EM algorithm for univariate normal mixtures. J Stat Comput Simul. 1986;23:211–230. doi: 10.1080/00949658608810872
  • García-Escudero LA, Gordaliza A, Matrán C, et al. A general trimming approach to robust cluster analysis. Ann Stat. 2008;36:1324–1345. doi: 10.1214/07-AOS515
  • Symons M. Clustering criteria and multivariate normal mixtures. Biometrics. 1981;37:35–43. doi: 10.2307/2530520
  • Bryant P. Large-sample results for optimization-based clustering methods. J Classif. 1991;8:31–44. doi: 10.1007/BF02616246
  • Dykstra RL. An algorithm for restricted least squares regression. J Am Stat Assoc. 1983;78:837–849. doi: 10.1080/01621459.1983.10477029
  • Fritz H, García-Escudero LA, Mayo-Iscar A. tclust: an R package for a trimming approach to Cluster Analysis. J Stat Softw. 2012;47. Available from: http://www.jstatsoft.org/v47/i12 doi: 10.18637/jss.v047.i12
  • Ruwet C, Garcia-Escudero LA, Gordaliza A, et al. On the breakdown behavior of TCLUST clustering procedure. TEST. 2013;22:466–487. doi: 10.1007/s11749-012-0312-4
  • Farcomeni A. Robust constrained clustering in presence of entry-wise outliers. Technometrics. 2014a;56:102–111. doi: 10.1080/00401706.2013.826148
  • Farcomeni A. Snipping for robust k-means clustering under component-wise contamination. Stat Comput. 2014b;24:909–917. doi: 10.1007/s11222-013-9410-8
  • Ritter G. Robust cluster analysis and variable selection. New York: CRC Press; 2014.
  • García-Escudero LA, Gordaliza A, Matrán C, et al. Exploring the number of groups in robust model-based clustering. Stat Comput. 2011;21(4):585–599. doi: 10.1007/s11222-010-9194-z
  • Cerioli A, García-Escudero LA, Mayo-Iscar A, et al. Finding the number of normal groups in model-based clustering via constrained likelihoods. J Comput Graph Stat. 2018a; available online.
  • Fraley C, Raftery AE, Murphy B, et al. mclust version 4 for r: normal mixture modeling for model-based clustering, classification, and density estimation. Washington, DC: Department of Statistics, University of Washington; 2012. (Tech. rep.).
  • Punzo A, Mazza A, McNicholas P. ContaminatedMixt: an R package for fitting parsimonious mixtures of multivariate contaminated normal distributions. J Stat Softw. 2018;85:1–25. doi: 10.18637/jss.v085.i10
  • Andrews JL, Wickins JR, Boers NM, et al. teigen: an R package for model-based clustering and classification via the multivariate T distribution. J Stat Softw. 2018; available online.
  • Dotto F, Farcomeni A, García-Escudero LA, et al. A reweighting approach to robust clustering. Stat Comput. 2018;28:477–493. doi: 10.1007/s11222-017-9742-x
  • Cerioli A, Riani M, Atkinson AC, et al. The power of monitoring: how to make the most of a contaminated multivariate sample. Stat Methods Appt. 2018b; available online.
  • Farcomeni A, Dotto F. The power of (extended) monitoring in robust clustering. Stat Methods Appt. 2018; available online.
  • Hardin J, Rocke DM. Outlier detection in the multiple cluster setting using the minimum covariance determinant estimator. Comput Stat Data Anal. 2004;44:625–638. doi: 10.1016/S0167-9473(02)00280-3
  • Rousseeuw PJ, Leroy AM. Robust regression and outlier detection. New York: John Wiley & Sons; 2005; Vol. 589.
  • Cerioli A, Farcomeni A. Error rates for multivariate outlier detection. Comput Stat Data Anal. 2011;55:544–553. doi: 10.1016/j.csda.2010.05.021
  • García-Escudero LA, Gordaliza A, Matrán C, et al. A review of robust clustering methods. Adv Data Anal Classif. 2010;4:89–109. doi: 10.1007/s11634-010-0064-5

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.