Title :
Online graph regularized non-negative matrix factorization for streamming data
Author :
Fudong Liu ; Naiyang Guan ; Yuhua Tang
Author_Institution :
State Key Lab. of High Performance Comput., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Nonnegative matrix factorization (NMF) has been widely used to reduce dimensionality of data in image processing and various applications. Incorporating the geometric structure into NMF, graph regularized nonnegative matrix factorization (GNMF) has shown significant performance improvement in comparison to conventional NMF. However, both NMF and GNMF require the data matrix to reside in the memory, which gives rise to tremendous pressure for computation and storage. Moreover, this problem becomes serious if the datasets increase dramatically. In this paper, we propose an online GNMF (OGNMF) algorithm to process the incoming data in an incremental manner, i.e., OGNMF processes one data point or one chunk of data points one by one. By utilizing a smart buffering technique, OGNMF scales gracefully to large-scale datasets. Experimental results on text corpora demonstrate that OGNMF achieves better performance than the existing online NMF algorithms in terms of both accuracy and normalized mutual information, and outperforms the existing batch GNMF algorithms in terms of time overhead.
Keywords :
data handling; graph theory; matrix decomposition; OGNMF algorithm; OGNMF processes; batch GNMF algorithms; data matrix; datasets; image processing; normalized mutual information; online GNMF algorithm; online graph regularized nonnegative matrix factorization; smart buffering; streamming data; Accuracy; Approximation algorithms; Clustering algorithms; Convergence; Linear programming; Mutual information; Vectors; graph regularized nonnegative matrix factorization (GNMF); large-scale datasets; nonnegative matrix factorization (NMF); online algorithm;
Conference_Titel :
Security, Pattern Analysis, and Cybernetics (SPAC), 2014 International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4799-5352-3
DOI :
10.1109/SPAC.2014.6982683