3,082
Views
40
CrossRef citations to date
0
Altmetric
Theory and Methods

Robust Improper Maximum Likelihood: Tuning, Computation, and a Comparison With Other Methods for Robust Gaussian Clustering

&
Pages 1648-1659 | Received 01 Jul 2014, Published online: 04 Jan 2017

References

  • Banfield, J. D., and Raftery, A. E. (1993), “Model-based Gaussian and Non-Gaussian Clustering,” Biometrics, 49, 803–821.
  • Becker, C., and Gather, U. (1999), “The Masking Breakdown Point of Multivariate Outlier Identification Rules,” Journal of the American Statistical Association, 94, 947–955.
  • Byers, S., and Raftery, A. E. (1998), “Nearest-Neighbor Clutter Removal for Estimating Features in Spatial Point Processes,” Journal of the American Statistical Association, 93, 577–584.
  • Cator, E. A., and Lopuhaä, H. P. (2012), “Central Limit Theorem and Influence Function for the MCD Estimators at General Multivariate Distributions,” Bernoulli, 18, 520–551.
  • Coretto, P., and Hennig, C. (2010), “A Simulation Study to Compare Robust Clustering Methods Based on Mixtures,” Advances in Data Analysis and Classification, 4, 111–135.
  • Coretto, P. (in press a), “A Consistent and Breakdown Robust Model-based Clustering Method,” available at http://arxiv.org/pdf/1309.6895.
  • Coretto, P. (in press b), Supplement to: “Robust Improper Maximum Likelihood: Tuning, Computation, and a Comparison With Other Methods for Robust Gaussian Clustering,” available at http://arxiv.org/src/1406.0808/anc/supplement.pdf.
  • Cuesta-Albertos, J. A., Gordaliza, A., and Matrán, C. (1997), “Trimmed k-means: An Attempt to Robustify Quantizers,” Annals of Statistics, 25, 553–576.
  • Croux, C., and Haesbroeck, G. (1999), “Influence Function and Efficiency of the Minimum Covariance Determinant Scatter Matrix Estimator,” Journal of Multivariate Analysis, 71, 161–190.
  • Davies, L., and Gather, U. (1993), “The Identification of Multiple Outliers,” Journal of the American Statistical Association, 88, 782–792.
  • Day, N. E. (1969), “Estimating the Components of a Mixture of Normal Distributions,” Biometrika, 56, 463–474.
  • Dempster, A. P., Laird, N. M., and Rubin, D. B. (1977), “Maximum Likelihood From Incomplete Data via the EM Algorithm,” Journal of the Royal Statistical Society, Series B, 39, 1–47.
  • Dennis, J. E. J. (ed.) (1981), Algorithms for Nonlinear Fitting, (NATO Advanced Research Symposium), Cambridge, UK: Cambridge University Press.
  • Fraley, C., and Raftery, A. E. (1998), “How Many Clusters? Which Clustering Method? Answers via Model-based Cluster Analysis,” The Computer Journal, 41, 578–588.
  • Fraley, C., Raftery, A. E., Murphy, T. B., and Scrucca, L. (2012), mclust Version 4 for R: Normal Mixture Modeling for Model-Based Clustering, Classification, and Density Estimation, Technical Report no. 597, Department of Statistics, University of Washington, Seattle, WA.
  • Fritz, H., García-Escudero, L. A., and Mayo-Iscar, A. (2012), “tclust: An R Package for a Trimming Approach to Cluster Analysis,” Journal of Statistical Software, 47, 1–26.
  • Gallegos, M. T. (2002), “Maximum Likelihood Clustering With Outliers,” in Classification, Clustering, and Data Analysis, eds. K. Jajuga, A. Sokolowski, and H.-H. Bock, Berlin: Springer, pp. 247–255.
  • Gallegos, M. T., and Ritter, G. (2005), “A Robust Method for Cluster Analysis,” Annals of Statistics, 33, 347–380.
  • Gallegos, M. T. (2009), “Trimmed ML Estimation of Contaminated Mixtures,” Sankhya (Series A), 71, 164–220.
  • García-Escudero, L. A., Gordaliza, A., Matrán, C., and Mayo-Iscar, A. (2008), “A General Trimming Approach to Robust Cluster Analysis,” Annals of Statistics, 38, 1324–1345.
  • García-Escudero, L. A. (2010), “A Review of Robust Clustering Methods,” Advances in Data Analysis and Classification, 4, 89–109.
  • García-Escudero, L. A. (2011), “Exploring the Number of Groups in Robust Model-Based Clustering,” Statistics and Computing, 21, 585–599.
  • Hathaway, R. J. (1985), “A Constrained Formulation of Maximum-Likelihood Estimation for Normal Mixture Distributions,” The Annals of Statistics, 13, 795–800.
  • Hennig, C. (2004), “Breakdown Points for Maximum Likelihood Estimators of Location-Scale Mixtures,” The Annals of Statistics, 32, 1313–1340.
  • Hennig, C. (2010), “Methods for Merging Gaussian Mixture Components,” Advances in Data Analysis and Classification, 4, 3–34.
  • Hennig, C., and Liao, T. F. (2013), “How to Find an Appropriate Clustering for Mixed Type Variables With Application to Socioeconomic Stratification” ( with discussion), Journal of the Royal Statistical Science, Series C, 62, 309–369.
  • Hubert, L., and Arabie, P. (1985), “Comparing Partitions,” Journal of Classification, 2, 193–218.
  • Ingrassia, S. (2004), “A Likelihood-based Constrained Algorithm for Multivariate Normal Mixture Models,” Statistical Methods & Applications, 13, 151–166.
  • Ingrassia, S., and Rocci, R. (2007), “Constrained Monotone EM Algorithms for Finite Mixture of Multivariate Gaussians,” Computational Statistics and Data Analysis, 51, 5339–5351.
  • Ingrassia, S. (2011), “Degeneracy of the EM Algorithm for the MLE of Multivariate Gaussian Mixtures and Dynamic Constraints,” Computational Statistics & Data Analysis, 55, 1715–1725.
  • Kiefer, J. (1953), “Sequential Minimax Search for a Maximum,” Proceedings of the American Mathematical Society, 4, 502–506.
  • McLachlan, G. J., and Peel, D. (2000), “Robust Mixture Modelling Using the t–distribution,” Statistics and Computing, 10, 339–348.
  • Müllensiefen, D. (2009), Fantastic: Feature ANalysis Technology Accessing STatistics (In a Corpus): Technical Report v1.5, London: Goldsmiths University of London.
  • Neykov, N., Filzmoser, P., Dimova, R., and Neytchev, P. (2007), “Robust Fitting of Mixtures Using the Trimmed Likelihood Estimator,” Computational Statistics and Data Analysis, 17, 299–308.
  • Pison, G., Aelst, S. V., and Willems, G. (2002), “Small Sample Corrections for LTS and MCD,” Metrika, 55, 111–123.
  • Qin, Y., and Priebe, C. E. (2013), “Maximum lq-likelihood Estimation via the Expectation-Maximization Algorithm: A Robust Estimation of Mixture Models,” Journal of the American Statistical Association, 108, 914–928.
  • Rousseeuw, P. J. (1985), “Multivariate Estimation With High Breakdown Point,” Mathematical Statistics and Applications, 8, 283–297.
  • Schaffrath, H. (1992), “The esac Databases and Mappet Software,” Computing in Musicology, 8, 66.
  • Sommerer, E.-O., and Weihs, C. (2005), “Introduction to the Contest “Social Milieus in Dortmund,” in Classification - the Ubiquitous Challenge, Berlin: Springer, pp. 667–673.