371
Views
0
CrossRef citations to date
0
Altmetric
Articles

Fast Robust Location and Scatter Estimation: A Depth-based Method

, &
Pages 14-27 | Received 19 Sep 2022, Accepted 09 May 2023, Published online: 27 Jun 2023

References

  • Aggarwal, C. C., and Sathe, S. (2015), “Theoretical Foundations and Algorithms for Outlier Ensembles,” ACM SIGKDD Explorations Newsletter, 17, 24–47. DOI: 10.1145/2830544.2830549.
  • Agulló, J., Croux, C., and Van Aelst, S. (2008), “The Multivariate Least-Trimmed Squares Estimator,” Journal of Multivariate Analysis, 99, 311–338. DOI: 10.1016/j.jmva.2006.06.005.
  • Alqallaf, F., Van Aelst, S., Yohai, V. J., and Zamar, R. H. (2009), “Propagation of Outliers in Multivariate Data,” The Annals of Statistics, 37, 311–331. DOI: 10.1214/07-AOS588.
  • Boudt, K., Rousseeuw, P. J., Vanduffel, S., and Verdonck, T. (2020), “The Minimum Regularized Covariance Determinant Estimator,” Statistics and Computing, 30, 113–128. DOI: 10.1007/s11222-019-09869-x.
  • Butler, R., Davies, P., and Jhun, M. (1993), “Asymptotics for the Minimum Covariance Determinant Estimator,” The Annals of Statistics, 21, 1385–1400. DOI: 10.1214/aos/1176349264.
  • Cator, E. A., and Lopuhaä, H. P. (2012), “Central Limit Theorem and Influence Function for the MCD Estimators at General Multivariate Distributions,” Bernoulli, 18, 520–551. DOI: 10.3150/11-BEJ353.
  • Coakley, C. W., and Hettmansperger, T. P. (1993), “A Bounded Influence, High Breakdown, Efficient Regression Estimator,” Journal of the American Statistical Association, 88, 872–880. DOI: 10.1080/01621459.1993.10476352.
  • Croux, C., and Haesbroeck, G. (2000), “Principal Component Analysis based on Robust Estimators of the Covariance or Correlation Matrix: Influence Functions and Efficiencies,” Biometrika, 87, 603–618. DOI: 10.1093/biomet/87.3.603.
  • De Ketelaere, B., Hubert, M., Raymaekers, J., Rousseeuw, P. J., and Vranckx, I. (2020), “Real-Time Outlier Detection for Large Datasets by RT-DetMCD,” Chemometrics and Intelligent Laboratory Systems, 199, 103957. DOI: 10.1016/j.chemolab.2020.103957.
  • Debruyne, M., and Hubert, M. (2009), “The Influence Function of the Stahel–Donoho Covariance Estimator of Smallest Outlyingness,” Statistics & Probability Letters, 79, 275–282. DOI: 10.1016/j.spl.2008.08.006.
  • Donoho, D. L. (1982), “Breakdown Properties of Multivariate Location Estimators,” Ph. D. Qualifying Paper, Harvard University.
  • Dyckerhoff, R., Mozharovskyi, P., and Nagy, S. (2021), “Approximate Computation of Projection Depths,” Computational Statistics & Data Analysis, 157, 107166. DOI: 10.1016/j.csda.2020.107166.
  • Fekri, M., and Ruiz-Gazen, A. (2004), “Robust Weighted Orthogonal Regression in the Errors-in-Variables Model,” Journal of Multivariate Analysis, 88, 89–108. DOI: 10.1016/S0047-259X(03)00057-5.
  • Hardin, J., and Rocke, D. M. (2004), “Outlier Detection in the Multiple Cluster Setting Using the Minimum Covariance Determinant Estimator,” Computational Statistics & Data Analysis, 44, 625–638. DOI: 10.1016/S0167-9473(02)00280-3.
  • Hastie, T., Tibshirani, R., and Friedman, J. (2009), The Elements of Statistical Learning: Data Mining, Inference, and Prediction, New York: Springer.
  • Hubert, M., Debruyne, M., and Rousseeuw, P. J. (2018), “Minimum Covariance Determinant and Extensions,” Wiley Interdisciplinary Reviews: Computational Statistics, 10, e1421.
  • Hubert, M., and Rousseeuw, P. J. (1997), “Robust Regression with both Continuous and Binary Regressors,” Journal of Statistical Planning and Inference, 57, 153–163. DOI: 10.1016/S0378-3758(96)00041-9.
  • Hubert, M., Rousseeuw, P. J., and Vanden Branden, K. V. (2005), “ROBPCA: A New Approach to Robust Principal Component Analysis,” Technometrics, 47, 64–79. DOI: 10.1198/004017004000000563.
  • Hubert, M., Rousseeuw, P. J., and Van Aelst, S. (2008), “High-Breakdown Robust Multivariate Methods,” Statistical Science, 23, 92–119. DOI: 10.1214/088342307000000087.
  • Hubert, M., Rousseeuw, P. J., and Verdonck, T. (2012), “A Deterministic Algorithm for Robust Location and Scatter,” Journal of Computational and Graphical Statistics, 21, 618–637. DOI: 10.1080/10618600.2012.672100.
  • Hubert, M., and Van Driessen, K. (2004), “Fast and Robust Discriminant Analysis,” Computational Statistics & Data Analysis, 45, 301–320. DOI: 10.1016/S0167-9473(02)00299-2.
  • Kalina, J., and Tichavskỳ, J. (2021), “The Minimum Weighted Covariance Determinant Estimator for High-Dimensional Data,” Advances in Data Analysis and Classification, 16, 977–999. DOI: 10.1007/s11634-021-00471-6.
  • Liu, R. Y. (1990), “On a Notion of Data Depth based on Random Simplices,” The Annals of Statistics, 18, 405–414. DOI: 10.1214/aos/1176347507.
  • Lopuhaa, H. P., and Rousseeuw, P. J. (1991), “Breakdown Points of Affine Equivariant Estimators of Multivariate Location and Covariance Matrices,” The Annals of Statistics, 19, 229–248. DOI: 10.1214/aos/1176347978.
  • Mahalanobis, P. C. (1936), “On the Generalized Distance in Statistics,” National Institute of Science of India, 12, 49–55.
  • Maronna, R. A., and Zamar, R. H. (2002), “Robust Estimates of Location and Dispersion for High-Dimensional Datasets,” Technometrics, 44, 307–317. DOI: 10.1198/004017002188618509.
  • Milo, W. (1990), “Multivariate Statistics: A Practical Approach,” Journal of the Royal Statistical Society, Series A, 153, 112–113. DOI: 10.2307/2983115.
  • Mosler, K., and Mozharovskyi, P. (2022), “Choosing Among Notions of Multivariate Depth Statistics,” Statistical Science, 37, 348–368. DOI: 10.1214/21-STS827.
  • Niinimaa, A., and Oja, H. (1995), “On the Influence Functions of Certain Bivariate Medians,” Journal of the Royal Statistical Society, Series B, 57, 565–574. DOI: 10.1111/j.2517-6161.1995.tb02048.x.
  • Oja, H. (1983), “Descriptive Statistics for Multivariate Distributions,” Statistics & Probability Letters, 1, 327–332. DOI: 10.1016/0167-7152(83)90054-8.
  • Pison, G., Rousseeuw, P. J., Filzmoser, P., and Croux, C. (2003), “Robust Factor Analysis,” Journal of Multivariate Analysis, 84, 145–172. DOI: 10.1016/S0047-259X(02)00007-6.
  • Porwal, U., and Mukund, S. (2017), “Outlier Detection by Consistent Data Selection Method,” arXiv preprint arXiv:1712.04129.
  • Roelant, E., Van Aelst, S., and Willems, G. (2009), “The Minimum Weighted Covariance Determinant Estimator,” Metrika, 70, 177–204. DOI: 10.1007/s00184-008-0186-3.
  • Rousseeuw, P., Van Aelst, S., Driessen, K., and Agullo, J. (2004), “Robust Multivariate Regression,” Technometrics, 46, 293–305. DOI: 10.1198/004017004000000329.
  • Rousseeuw, P. J. (1984), “Least Median of Squares Regression,” Journal of the American Statistical Association, 79, 871–880. DOI: 10.1080/01621459.1984.10477105.
  • Rousseeuw, P. J., and Croux, C. (1993), “Alternatives to the Median Absolute Deviation,” Journal of the American Statistical Association, 88, 1273–1283. DOI: 10.1080/01621459.1993.10476408.
  • Rousseeuw, P. J., and Driessen, K. V. (1999), “A Fast Algorithm for the Minimum Covariance Determinant Estimator,” Technometrics, 41, 212–223. DOI: 10.1080/00401706.1999.10485670.
  • Rousseeuw, P. J., and Van Zomeren, B. C. (1990), “Unmasking Multivariate Outliers and Leverage Points,” Journal of the American Statistical Association, 85, 633–639. DOI: 10.1080/01621459.1990.10474920.
  • Schreurs, J., Vranckx, I., Ketelaere, B. D., Hubert, M., Suykens, J. A. K., and Rousseeuw, P. J. (2021), “Outlier Detection in Non-Elliptical Data by Kernel MRCD,” Statistics and Computing, 31, 1–18. DOI: 10.1007/s11222-021-10041-7.
  • Tukey, J. W. (1975), “Mathematics and the Picturing of Data,” in Proceedings of the International Congress of Mathematicians, Vancouver, 1975 (Vol. 2), pp. 523–531.
  • Vardi, Y., and Zhang, C.-H. (2000), “The Multivariate L1-median and Associated Data Depth,” Proceedings of the National Academy of Sciences, 97, 1423–1426. DOI: 10.1073/pnas.97.4.1423.
  • Willems, G., Joe, H., and Zamar, R. (2009), “Diagnosing Multivariate Outliers Detected by Robust Estimators,” Journal of Computational and Graphical Statistics, 18, 73–91. DOI: 10.1198/jcgs.2009.0005.
  • Yohai, V. J., and Zamar, R. H. (1988), “High Breakdown-Point Estimates of Regression by Means of the Minimization of an Efficient Scale,” Journal of the American Statistical Association, 83, 406–413. DOI: 10.1080/01621459.1988.10478611.
  • Zuo, Y. (2006), “Multidimensional Trimming based on Projection Depth,” The Annals of Statistics, 34, 2211–2251. DOI: 10.1214/009053606000000713.
  • Zuo, Y., and Serfling, R. (2000a), “General Notions of Statistical Depth Function,” The Annals of Statistics, 28, 461–482. DOI: 10.1214/aos/1016218226.
  • Zuo, Y., and Serfling, R. (2000b), “Structural Properties and Convergence Results for Contours of Sample Statistical Depth Functions,” The Annals of Statistics, 28, 483–499.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.