NeNMF: An Optimal Gradient Method for Nonnegative Matrix Factorization

Author

Guan, Naiyang ; Tao, Dacheng ; Luo, Zhigang ; Yuan, Bo

Author_Institution

Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China

Volume

60

Issue

6

fYear

2012

fDate

6/1/2012 12:00:00 AM

Firstpage

2882

Lastpage

2898

Abstract

Nonnegative matrix factorization (NMF) is a powerful matrix decomposition technique that approximates a nonnegative matrix by the product of two low-rank nonnegative matrix factors. It has been widely applied to signal processing, computer vision, and data mining. Traditional NMF solvers include the multiplicative update rule (MUR), the projected gradient method (PG), the projected nonnegative least squares (PNLS), and the active set method (AS). However, they suffer from one or some of the following three problems: slow convergence rate, numerical instability and nonconvergence. In this paper, we present a new efficient NeNMF solver to simultaneously overcome the aforementioned problems. It applies Nesterov´s optimal gradient method to alternatively optimize one factor with another fixed. In particular, at each iteration round, the matrix factor is updated by using the PG method performed on a smartly chosen search point, where the step size is determined by the Lipschitz constant. Since NeNMF does not use the time consuming line search and converges optimally at rate in optimizing each matrix factor, it is superior to MUR and PG in terms of efficiency as well as approximation accuracy. Compared to PNLS and AS that suffer from numerical instability problem in the worst case, NeNMF overcomes this deficiency. In addition, NeNMF can be used to solve -norm, -norm and manifold regularized NMF with the optimal convergence rate. Numerical experiments on both synthetic and real-world datasets show the efficiency of NeNMF for NMF and its variants comparing to representative NMF solvers. Extensive experiments on document clustering suggest the effectiveness of NeNMF.

Keywords

approximation theory; computer vision; convergence of numerical methods; data mining; gradient methods; iterative methods; least squares approximations; matrix decomposition; numerical analysis; pattern clustering; Lipschitz constant; MUR; NeNMF solver; Nesterov optimal gradient method; PG; PG method; PNLS; active set method; computer vision; data mining; iteration round; low-rank nonnegative matrix factors; matrix decomposition technique; matrix factor; multiplicative update rule; nonconvergence; nonnegative matrix factorization; numerical instability; numerical instability problem; optimal convergence rate; projected gradient method; projected nonnegative least squares; real-world datasets; signal processing; slow convergence rate; synthetic datasets; time consuming line search; Convergence; Educational institutions; Gradient methods; Least squares approximation; Matrix decomposition; Sparse matrices; $L_{1}$-norm; $L_{2}$-norm; manifold regularization; nonnegative matrix factorization (NMF); optimal gradient method;

fLanguage

English

Journal_Title

Signal Processing, IEEE Transactions on

Publisher

ieee

ISSN

1053-587X

Type

jour

DOI

10.1109/TSP.2012.2190406

Filename

6166359