Sparse PCA for High-Dimensional Data With Outliers

Authors: Hubert Mia*; Reynkens Tom; Schmitt Eric; Verdonck Tim
Source: Technometrics, 2016, 58(4): 424-434.
DOI: 10.1080/00401706.2015.1093962

Abstract

A new sparse PCA algorithm is presented that is robust against outliers. The approach is based on the ROBPCA algorithm, which generates robust but nonsparse loadings. The construction of the new ROSPCA method is detailed, as well as a selection criterion for the sparsity parameter. An extensive simulation study and a real data example show that ROSPCA accurately finds the sparse structure of datasets, even when challenging outliers are present. In comparison with a projection pursuit-based algorithm, ROSPCA demonstrates superior robustness properties, comparable sparsity estimation capability, and significantly shorter computation time.

  • Publication date: 2016-11