Title :
Phase transitions in sparse PCA
Author :
Thibault Lesieur;Florent Krzakala;Lenka Zdeborová
Author_Institution :
Institut de Physique Thé
fDate :
6/1/2015 12:00:00 AM
Abstract :
We study optimal estimation for sparse principal component analysis when the number of non-zero elements is small but on the same order as the dimension of the data. We employ approximate message passing (AMP) algorithm and its state evolution to analyze what is the information theoretically minimal mean-squared error and the one achieved by AMP in the limit of large sizes. For a special case of rank one and large enough density of non-zeros Deshpande and Montanari [1] proved that AMP is asymptotically optimal. We show that both for low density and for large rank the problem undergoes a series of phase transitions suggesting existence of a region of parameters where estimation is information theoretically possible, but AMP (and presumably every other polynomial algorithm) fails. The analysis of the large rank limit is particularly instructive.
Keywords :
"Principal component analysis","Mathematical model","Noise","Sparse matrices","Estimation","Message passing","Approximation algorithms"
Conference_Titel :
Information Theory (ISIT), 2015 IEEE International Symposium on
Electronic_ISBN :
2157-8117
DOI :
10.1109/ISIT.2015.7282733