Title of article
Consistency of sparse PCA in High Dimension, Low Sample Size contexts
Author/Authors
Shen، نويسنده , , Dan-Hua Shen، نويسنده , , Haipeng and Marron، نويسنده , , J.S.، نويسنده ,
Issue Information
دوفصلنامه با شماره پیاپی سال 2013
Pages
17
From page
317
To page
333
Abstract
Sparse Principal Component Analysis (PCA) methods are efficient tools to reduce the dimension (or number of variables) of complex data. Sparse principal components (PCs) are easier to interpret than conventional PCs, because most loadings are zero. We study the asymptotic properties of these sparse PC directions for scenarios with fixed sample size and increasing dimension (i.e. High Dimension, Low Sample Size (HDLSS)). We consider the previously studied single spike covariance model and assume in addition that the maximal eigenvector is sparse. We extend the existing HDLSS asymptotic consistency and strong inconsistency results of conventional PCA in an entirely new direction. We find a large set of sparsity assumptions under which sparse PCA is still consistent even when conventional PCA is strongly inconsistent. The consistency of sparse PCA is characterized along with rates of convergence. Furthermore, we clearly identify the mathematical boundaries of the sparse PCA consistency, by showing strong inconsistency for an oracle version of sparse PCA beyond the consistent region, as well as its inconsistency on the boundaries of the consistent region. Simulation studies are performed to validate the asymptotic results in finite samples.
Keywords
Sparse PCA , Consistency , high dimension , Low sample size
Journal title
Journal of Multivariate Analysis
Serial Year
2013
Journal title
Journal of Multivariate Analysis
Record number
1566146
Link To Document