DocumentCode
1465868
Title
NeNMF: An Optimal Gradient Method for Nonnegative Matrix Factorization
Author
Guan, Naiyang ; Tao, Dacheng ; Luo, Zhigang ; Yuan, Bo
Author_Institution
Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
Volume
60
Issue
6
fYear
2012
fDate
6/1/2012 12:00:00 AM
Firstpage
2882
Lastpage
2898
Abstract
Nonnegative matrix factorization (NMF) is a powerful matrix decomposition technique that approximates a nonnegative matrix by the product of two low-rank nonnegative matrix factors. It has been widely applied to signal processing, computer vision, and data mining. Traditional NMF solvers include the multiplicative update rule (MUR), the projected gradient method (PG), the projected nonnegative least squares (PNLS), and the active set method (AS). However, they suffer from one or some of the following three problems: slow convergence rate, numerical instability and nonconvergence. In this paper, we present a new efficient NeNMF solver to simultaneously overcome the aforementioned problems. It applies Nesterov´s optimal gradient method to alternatively optimize one factor with another fixed. In particular, at each iteration round, the matrix factor is updated by using the PG method performed on a smartly chosen search point, where the step size is determined by the Lipschitz constant. Since NeNMF does not use the time consuming line search and converges optimally at rate in optimizing each matrix factor, it is superior to MUR and PG in terms of efficiency as well as approximation accuracy. Compared to PNLS and AS that suffer from numerical instability problem in the worst case, NeNMF overcomes this deficiency. In addition, NeNMF can be used to solve -norm, -norm and manifold regularized NMF with the optimal convergence rate. Numerical experiments on both synthetic and real-world datasets show the efficiency of NeNMF for NMF and its variants comparing to representative NMF solvers. Extensive experiments on document clustering suggest the effectiveness of NeNMF.
Keywords
approximation theory; computer vision; convergence of numerical methods; data mining; gradient methods; iterative methods; least squares approximations; matrix decomposition; numerical analysis; pattern clustering; Lipschitz constant; MUR; NeNMF solver; Nesterov optimal gradient method; PG; PG method; PNLS; active set method; computer vision; data mining; iteration round; low-rank nonnegative matrix factors; matrix decomposition technique; matrix factor; multiplicative update rule; nonconvergence; nonnegative matrix factorization; numerical instability; numerical instability problem; optimal convergence rate; projected gradient method; projected nonnegative least squares; real-world datasets; signal processing; slow convergence rate; synthetic datasets; time consuming line search; Convergence; Educational institutions; Gradient methods; Least squares approximation; Matrix decomposition; Sparse matrices; $L_{1}$ -norm; $L_{2}$ -norm; manifold regularization; nonnegative matrix factorization (NMF); optimal gradient method;
fLanguage
English
Journal_Title
Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
1053-587X
Type
jour
DOI
10.1109/TSP.2012.2190406
Filename
6166359
Link To Document