Sparse exponential family Principal Component Analysis

Lu, Meng<sup>*</sup>; Huang, Jianhua Z.; Qian, Xiaoning

doi:10.1016/j.patcog.2016.05.024

摘要

We propose a Sparse exponential family Principal Component Analysis (SePCA) method suitable for any type of data following exponential family distributions to achieve simultaneous dimension reduction and variable selection for better interpretation of the results. Because of the generality of exponential family distributions, the method can be applied to a wide range of applications, in particular when analyzing high dimensional next-generation sequencing data and genetic mutation data in genomics. The use of sparsity-inducing penalty helps produce sparse principal component loading vectors such that the principal components can focus on informative variables. By using an equivalent dual form of the formulated optimization problem for SePCA, we derive optimal solutions with efficient iterative closed form updating rules. The results from both simulation experiments and real-world applications have demonstrated the superiority of our SePCA in reconstruction accuracy and computational efficiency over traditional exponential family PCA (ePCA), the existing Sparse PCA (SPCA) and Sparse Logistic PCA (SLPCA) algorithms.

出版日期2016-12
单位天津大学

全文

访问全文

收藏分享被引(13) 浏览

更新时间：2024-05-13 20:39

Sparse exponential family Principal Component Analysis

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友