1,592
Views
23
CrossRef citations to date
0
Altmetric
Theory and Methods

ECA: High-Dimensional Elliptical Component Analysis in Non-Gaussian Distributions

&
Pages 252-268 | Received 01 Aug 2015, Accepted 01 Oct 2016, Published online: 26 Sep 2017
 

ABSTRACT

We present a robust alternative to principal component analysis (PCA)—called elliptical component analysis (ECA)—for analyzing high-dimensional, elliptically distributed data. ECA estimates the eigenspace of the covariance matrix of the elliptical data. To cope with heavy-tailed elliptical distributions, a multivariate rank statistic is exploited. At the model-level, we consider two settings: either that the leading eigenvectors of the covariance matrix are nonsparse or that they are sparse. Methodologically, we propose ECA procedures for both nonsparse and sparse settings. Theoretically, we provide both nonasymptotic and asymptotic analyses quantifying the theoretical performances of ECA. In the nonsparse setting, we show that ECA’s performance is highly related to the effective rank of the covariance matrix. In the sparse setting, the results are twofold: (i) we show that the sparse ECA estimator based on a combinatoric program attains the optimal rate of convergence; (ii) based on some recent developments in estimating sparse leading eigenvectors, we show that a computationally efficient sparse ECA estimator attains the optimal rate of convergence under a suboptimal scaling. Supplementary materials for this article are available online.

Supplementary Materials

The online supplementary materials contain the appendices for the article.

Funding

Fang Han’s research was supported by NIBIB-EB012547, NSF DMS-1712536, and a UW faculty start-up grant. Han Liu’s research was supported by the NSF CAREER Award DMS-1454377, NSF IIS-1546482, NSF IIS-1408910, NSF IIS-1332109, NIH R01-MH102339, NIH R01-GM083084, and NIH R01-HG06841.

Notes

1 The Tyler’s M estimator cannot be directly applied to study high-dimensional data because of both theoretical and empirical reasons. Theoretically, to the authors’ knowledge, the sharpest sufficient condition to guarantee its consistency still requires d = o(n1/2) (Duembgen Citation1997). Empirically, our simulations show that Tyler’s M estimator always fails to converge when d > n.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 343.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.