DocumentCode :
3601171
Title :
Noise Level Estimation for Model Selection in Kernel PCA Denoising
Author :
Varon, Carolina ; Alzate, Carlos ; Suykens, Johan A. K.
Author_Institution :
Dept. of Electr. Eng., STADIUS Center for Dynamical Syst., Signal Process. & Data Anal., KU Leuven, Leuven, Belgium
Volume :
26
Issue :
11
fYear :
2015
Firstpage :
2650
Lastpage :
2663
Abstract :
One of the main challenges in unsupervised learning is to find suitable values for the model parameters. In kernel principal component analysis (kPCA), for example, these are the number of components, the kernel, and its parameters. This paper presents a model selection criterion based on distance distributions (MDDs). This criterion can be used to find the number of components and the σ2 parameter of radial basis function kernels by means of spectral comparison between information and noise. The noise content is estimated from the statistical moments of the distribution of distances in the original dataset. This allows for a type of randomization of the dataset, without actually having to permute the data points or generate artificial datasets. After comparing the eigenvalues computed from the estimated noise with the ones from the input dataset, information is retained and maximized by a set of model parameters. In addition to the model selection criterion, this paper proposes a modification to the fixed-size method and uses the incomplete Cholesky factorization, both of which are used to solve kPCA in large-scale applications. These two approaches, together with the model selection MDD, were tested in toy examples and real life applications, and it is shown that they outperform other known algorithms.
Keywords :
eigenvalues and eigenfunctions; principal component analysis; signal denoising; statistical analysis; unsupervised learning; σ2 parameter; Cholesky factorization; MDD; eigenvalues; kPCA; kernel PCA denoising; kernel principal component analysis; model selection criterion; noise level estimation; radial basis function kernel; unsupervised learning; Eigenvalues and eigenfunctions; Estimation; Kernel; Noise; Noise level; Noise reduction; Principal component analysis; Kernel principal component analysis (kPCA); least squares support vector machines (LS-SVMs); noise level estimation; unsupervised learning; unsupervised learning.;
fLanguage :
English
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
2162-237X
Type :
jour
DOI :
10.1109/TNNLS.2015.2388696
Filename :
7012106
Link To Document :
بازگشت