Publication Cover
Statistics
A Journal of Theoretical and Applied Statistics
Volume 58, 2024 - Issue 1
159
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Elegant robustification of sparse partial least squares by robustness-inducing transformations

, ORCID Icon & ORCID Icon
Pages 44-64 | Received 15 Feb 2023, Accepted 16 Jan 2024, Published online: 12 Feb 2024

References

  • Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc B. 1996;58(1):267–288.
  • Bühlmann P, Kalisch M, Meier L. High-dimensional statistics with a view toward applications in biology. Annu Rev Stat Appl. 2014;1(1):255–278. doi: 10.1146/statistics.2013.1.issue-1
  • Bunea F, She Y, Ombao H, et al. Penalized least squares regression methods and applications to neuroimaging. Neuroimage. 2011;55(4):1519–1527. doi: 10.1016/j.neuroimage.2010.12.028
  • Zeng ISL, Lumley T. Review of statistical learning methods in integrated omics studies (an integrated information science). Bioinform Biol Insights. 2018;12:1177932218759292. doi: 10.1177/1177932218759292
  • Belloni A, Chernozhukov V. High dimensional sparse econometric models: an introduction. Berlin, Heidelberg: Springer; 2011. p. 121–156.
  • Fan J, Feng Y, Wu Y. Network exploration via the adaptive lasso and scad penalties. Ann Appl Stat. 2009;3:521–541. doi: 10.1214/08-AOAS215
  • Ding J, Tarokh V, Yang Y. Model selection techniques: an overview. IEEE Signal Process Mag. 2018;35(6):16–34. doi: 10.1109/MSP.2018.2867638
  • Huber PJ. Robust estimation of a location parameter. Ann Math Stat. 1964;35(1):73–101. doi: 10.1214/aoms/1177703732
  • Hampel FR, Ronchetti EM, Rousseeuw PJ, et al. Robust statistics: the approach based on influence functions. New York: John Wiley & Sons; 1986.
  • Huber PJ, Ronchetti EM. Robust statistics. New Jersey: John Wiley & Sons; 2009.
  • Maronna RA, Martin RD, Yohai VJ. Robust statistics: theory and methods. New York: John Wiley & Sons; 2006.
  • Alfons A, Croux C, Gelper S. Sparse least trimmed squares regression for analyzing high-dimensional large data sets. Ann Appl Stat. 2013;7(1):226–248. doi: 10.1214/12-AOAS575
  • Freue GVC, Kepplinger D, Salibián-Barrera M, et al. Robust elastic net estimators for variable selection and identification of proteomic biomarkers. Ann Appl Stat. 2019;13(4):2065–2090.
  • Insolia L, Kenney A, Chiaromonte F, et al. Simultaneous feature selection and outlier detection with optimality guarantees. Biometrics. 2022;78(4):1592–1603. doi: 10.1111/biom.v78.4
  • Kurnaz FS, Hoffmann I, Filzmoser P. Robust and sparse estimation methods for high-dimensional linear and logistic regression. Chemometr Intell Lab Syst. 2018;172:211–222. doi: 10.1016/j.chemolab.2017.11.017
  • Smucler E, Yohai VJ. Robust and sparse estimators for linear regression models. Comput Stat Data Anal. 2017;111:116–130. doi: 10.1016/j.csda.2017.02.002
  • Bels IG, Vrinssen I. Robust estimation and variable selection in heteroscedastic linear regression. Statistics. 2019;53(3):489–532. doi: 10.1080/02331888.2019.1579215
  • Cook RD, Forzani L. Big data and partial least-squares prediction. Can J Stat. 2018;46(1):62–78. doi: 10.1002/cjs.v46.1
  • Wold H. Soft modelling by latent variables: the non-linear iterative partial least squares (nipals) approach. J Appl Probab. 1975;12(S1):117–142. doi: 10.1017/S0021900200047604
  • Chun H, Keleş S. Sparse partial least squares regression for simultaneous dimension reduction and variable selection. J R Stat Soc B. 2010;72:3–25. doi: 10.1111/j.1467-9868.2009.00723.x
  • Serneels S, Croux C, Filzmoser P, et al. Partial robust m-regression. Chemometr Intell Lab Syst. 2005;79(1-2):55–64. doi: 10.1016/j.chemolab.2005.04.007
  • Hoffmann I, Serneels S, Filzmoser P, et al. Sparse partial robust m regression. Chemometr Intell Lab Syst. 2015;149:50–59. doi: 10.1016/j.chemolab.2015.09.019
  • Hoffmann I, Filzmoser P, Serneels S, et al. Sparse and robust pls for binary classification. J Chemom. 2016;30(4):153–162. doi: 10.1002/cem.v30.4
  • Serneels S, De Nolf E, Van Espen PJ. Spatial sign preprocessing: a simple way to impart moderate robustness to multivariate estimators. J Chem Inf Model. 2006;46(3):1402–1409. doi: 10.1021/ci050498u
  • Raymaekers J, Rousseeuw PJ. A generalized spatial sign covariance matrix. J Multivar Anal. 2019;171:94–111. doi: 10.1016/j.jmva.2018.11.010
  • Raymaekers J, Rousseeuw PJ. Fast robust correlation for high-dimensional data. Technometrics. 2021;63:184–198. doi: 10.1080/00401706.2019.1677270
  • Gil JA, Romera R. On robust partial least squares (pls) methods. J Chemom J Chemom Soc. 1998;12(6):365–378.
  • Vanden Branden K, Hubert M. Robustness properties of a robust partial least squares regression method. Anal Chim Acta. 2004;515(1):229–241. doi: 10.1016/j.aca.2004.01.004
  • Filzmoser P, Serneels S, Maronna R, et al. Robust multivariate methods in chemometrics. In Brown S, Tauler R, Walczak B, editors. Comprehensive chemometrics. 2nd ed. Oxford: Elsevier; 2020. p. 393–430.
  • Locantore N, Marron J, Simpson D, et al. Robust principal component analysis for functional data. Test. 1999;8(1):1–73. doi: 10.1007/BF02595862
  • Menvouta EJ, Serneels S, Verdonck T. direpack: a python 3 package for state-of-the-art statistical dimensionality reduction methods. SoftwareX. 2023;21:101282. doi: 10.1016/j.softx.2022.101282
  • Serneels S, Croux C, Van Espen P. Influence properties of partial least squares regression. Chemometr Intell Lab Syst. 2004;71:13–20. doi: 10.1016/j.chemolab.2003.10.009
  • Cook RD, Helland IS, Su Z. Envelopes and partial least squares regression. J R Stat Soc B. 2013;75(5):851–877. doi: 10.1111/rssb.12018
  • Debruyne M, Höppner S, Serneels S, et al. Outlyingness: which variables contribute most?. Stat Comput. 2019;29(4):707–723. doi: 10.1007/s11222-018-9831-5
  • Donoho DL, Huber PJ. The notion of breakdown point. In: Bickel P, Doksum KA, Hodges JL, editors. A festschrift for Erich L. Lehmann. Belmont, MA: Wadsworth; 1983. p. 157–184.
  • Hampel FR, Rousseeuw PJ, Ronchetti E. The change-of-variance curve and optimal redescending m-estimators. J Am Stat Assoc. 1981;76(375):643–648.
  • Rousseeuw PJ, Leroy AM. Robust regression and outlier detection. New York: John Wiley & Sons; 1987.
  • Yohai VJ, Zamar RH. High breakdown-point estimates of regression by means of the minimization of an efficient scale. J Am Stat Assoc. 1988;83:406–413. doi: 10.1080/01621459.1988.10478611
  • Oshima RG, Baribault H, Caulín C. Oncogenic regulation and function of keratins 8 and 18. Cancer Metastasis Rev. 1996;15(4):445–471. doi: 10.1007/BF00054012
  • Alfons A. robusthd: an r package for robust regression with high-dimensional data. J Open Source Softw. 2021;6(67):3786. doi: 10.21105/joss
  • Janssens KH, De Raedt I, Schalm O, et al. Composition of 15–17th century archaeological glass vessels excavated in Antwerp, Belgium. In: Modern developments and applications in microbeam analysis. Vienna: Springer; 1998. p. 253–267.
  • Lemberge P, De Raedt I, Janssens KH, et al. Quantitative analysis of 16–17th century archaeological glass vessels using PLS regression of epxma and μ-XRF data. J Chemom J Chemom Soc. 2000;14(5–6):751–763.
  • Christidis A-A, Lakshmanan L, Smucler E, et al. Split regularized regression. Technometrics. 2020;62(3):330–338. doi: 10.1080/00401706.2019.1635533

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.