DocumentCode
3065549
Title
New Methods of Data Clustering and Classification Based on NMF
Author
Tang, Jie ; Xinyu Ceng ; Peng, Bo
Author_Institution
Southwest Pet. Univ., Chengdu, China
fYear
2011
fDate
29-31 July 2011
Firstpage
432
Lastpage
435
Abstract
Nonnegative matrix factorization method is a kind of new matrix decomposition method. It is an effective tool for large data processing and analysis. At the same time, NMF has an important performance on intelligent information processing and pattern recognition. This paper first analyses and discusses the NMF algorithms based on its basic theory. We then propose new methods of data clustering and classification based on NMF separately. NMF method is applied to reduce the dimension of the original matrix. We run clustering algorithms on the encoded matrix after NMF processing instead of on the original matrix. Running clustering algorithms on smaller encoded matrix can save more time and storage space. After that, we bring in a series of improvement methods of classification on the basis of clustering. Finally we have done experiments to test and verify them, and gotten good results.
Keywords
data analysis; data mining; matrix decomposition; pattern classification; pattern clustering; NMF; data analysis; data classification method; data clustering method; data mining; data processing; intelligent information processing; nonnegative matrix factorization method; pattern recognition; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Matrix decomposition; Support vector machines; Training; Data classification; Data clustering; K-means; NMF; encoded matrix;
fLanguage
English
Publisher
ieee
Conference_Titel
Business Computing and Global Informatization (BCGIN), 2011 International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4577-0788-9
Electronic_ISBN
978-0-7695-4464-9
Type
conf
DOI
10.1109/BCGIn.2011.114
Filename
6003916
Link To Document