DocumentCode
13203
Title
Improved Bounds for the Nyström Method With Application to Kernel Classification
Author
Rong Jin ; Tianbao Yang ; Mahdavi, Mehdi ; Yu-Feng Li ; Zhi-Hua Zhou
Author_Institution
Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
Volume
59
Issue
10
fYear
2013
fDate
Oct. 2013
Firstpage
6939
Lastpage
6949
Abstract
We develop two approaches for analyzing the approximation error bound for the Nyström method that approximates a positive semidefinite (PSD) matrix by sampling a small set of columns, one based on a concentration inequality for integral operators, and one based on random matrix theory. We show that the approximation error, measured in the spectral norm, can be improved from O(N/√m) to O(N/m1-ρ) in the case of large eigengap, where N is the total number of data points, m is the number of sampled data points, and ρ ∈ (0, 1/2) is a positive constant that characterizes the eigengap. When the eigenvalues of the kernel matrix follow a p-power law, our analysis based on random matrix theory further improves the bound to O(N/mp-1) under an incoherence assumption. We present a kernel classification approach based on the Nyström method and derive its generalization performance using the improved bound. We show that when the eigenvalues of the kernel matrix follow a p-power law, we can reduce the number of support vectors to N2p/(p2 - 1), which is sublinear in N when p > 1+√2, without seriously sacrificing its generalization performance.
Keywords
approximation theory; computational complexity; eigenvalues and eigenfunctions; learning (artificial intelligence); matrix algebra; pattern classification; Nyström method; PSD matrix; approximation error bound; concentration inequality; eigengap; kernel classification approach; machine learning; positive semidefinite matrix; random matrix theory; sampled data points; Approximation error; Coherence; Eigenvalues and eigenfunctions; Integral equations; Kernel; Linear matrix inequalities; Approximation error; Nyström method; concentration inequality; kernel methods; random matrix theory;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/TIT.2013.2271378
Filename
6547995
Link To Document