DocumentCode :
2707807
Title :
Matrix-based Kernel Principal Component analysis for large-scale data set
Author :
Shi, Weiya ; Guo, Yue-Fei ; Xue, Xiangyang
Author_Institution :
Sch. of Comput. Sci., Fudan Univ., Shanghai, China
fYear :
2009
fDate :
14-19 June 2009
Firstpage :
2908
Lastpage :
2913
Abstract :
Kernel Principal Component Analysis (KPCA) is a nonlinear feature extraction approach, which generally needs to eigen-decompose the kernel matrix. But the size of kernel matrix scales with the number of data points, it is infeasible to store and compute the kernel matrix when faced with the large-scale data set. To overcome computational and storage problem for large-scale data set, a new framework, Matrixbased Kernel Principal Component Analysis (M-KPCA), is proposed. By dividing the large scale data set into small subsets, we could treat the autocorrelation matrix of each subset as the special computational unit. A novel polynomial-matrix kernel function is adopted to compute the similarity between the data matrices in place of vectors. It is also proved that the polynomial kernel is the extreme case of the polynomial-matrix one. The proposed M-KPCA can greatly reduce the size of kernel matrix, which makes its computation possible. The effectiveness is demonstrated by the experimental results on the artificial and real data set.
Keywords :
feature extraction; polynomial matrices; principal component analysis; eigen-decompose; kernel matrix; kernel principal component analysis; large-scale data set; nonlinear feature extraction; polynomial-matrix kernel function; Autocorrelation; Data mining; Feature extraction; Iterative algorithms; Kernel; Large-scale systems; Neural networks; Polynomials; Principal component analysis; Sampling methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2009. IJCNN 2009. International Joint Conference on
Conference_Location :
Atlanta, GA
ISSN :
1098-7576
Print_ISBN :
978-1-4244-3548-7
Electronic_ISBN :
1098-7576
Type :
conf
DOI :
10.1109/IJCNN.2009.5178692
Filename :
5178692
Link To Document :
بازگشت