DocumentCode
1350851
Title
Accuracy of Pseudo-Inverse Covariance Learning—A Random Matrix Theory Analysis
Author
Hoyle, David C.
Author_Institution
Fac. of Life Sci., Univ. of Manchester, Manchester, UK
Volume
33
Issue
7
fYear
2011
fDate
7/1/2011 12:00:00 AM
Firstpage
1470
Lastpage
1481
Abstract
For many learning problems, estimates of the inverse population covariance are required and often obtained by inverting the sample covariance matrix. Increasingly for modern scientific data sets, the number of sample points is less than the number of features and so the sample covariance is not invertible. In such circumstances, the Moore-Penrose pseudo-inverse sample covariance matrix, constructed from the eigenvectors corresponding to nonzero sample covariance eigenvalues, is often used as an approximation to the inverse population covariance matrix. The reconstruction error of the pseudo-inverse sample covariance matrix in estimating the true inverse covariance can be quantified via the Frobenius norm of the difference between the two. The reconstruction error is dominated by the smallest nonzero sample covariance eigenvalues and diverges as the sample size becomes comparable to the number of features. For high-dimensional data, we use random matrix theory techniques and results to study the reconstruction error for a wide class of population covariance matrices. We also show how bagging and random subspace methods can result in a reduction in the reconstruction error and can be combined to improve the accuracy of classifiers that utilize the pseudo-inverse sample covariance matrix. We test our analysis on both simulated and benchmark data sets.
Keywords
covariance matrices; eigenvalues and eigenfunctions; learning (artificial intelligence); Frobenius norm; covariance eigenvalues; inverse population covariance; pseudo-inverse covariance learning; random matrix theory analysis; reconstruction error; Accuracy; Bagging; Covariance matrix; Eigenvalues and eigenfunctions; Machine learning; Simulation; Upper bound; Pseudo-inverse; bagging; linear discriminants; peaking phenomenon; random matrix theory; random subspace method.;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2010.186
Filename
5601741
Link To Document