• DocumentCode
    3065549
  • Title

    New Methods of Data Clustering and Classification Based on NMF

  • Author

    Tang, Jie ; Xinyu Ceng ; Peng, Bo

  • Author_Institution
    Southwest Pet. Univ., Chengdu, China
  • fYear
    2011
  • fDate
    29-31 July 2011
  • Firstpage
    432
  • Lastpage
    435
  • Abstract
    Nonnegative matrix factorization method is a kind of new matrix decomposition method. It is an effective tool for large data processing and analysis. At the same time, NMF has an important performance on intelligent information processing and pattern recognition. This paper first analyses and discusses the NMF algorithms based on its basic theory. We then propose new methods of data clustering and classification based on NMF separately. NMF method is applied to reduce the dimension of the original matrix. We run clustering algorithms on the encoded matrix after NMF processing instead of on the original matrix. Running clustering algorithms on smaller encoded matrix can save more time and storage space. After that, we bring in a series of improvement methods of classification on the basis of clustering. Finally we have done experiments to test and verify them, and gotten good results.
  • Keywords
    data analysis; data mining; matrix decomposition; pattern classification; pattern clustering; NMF; data analysis; data classification method; data clustering method; data mining; data processing; intelligent information processing; nonnegative matrix factorization method; pattern recognition; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Matrix decomposition; Support vector machines; Training; Data classification; Data clustering; K-means; NMF; encoded matrix;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Business Computing and Global Informatization (BCGIN), 2011 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4577-0788-9
  • Electronic_ISBN
    978-0-7695-4464-9
  • Type

    conf

  • DOI
    10.1109/BCGIn.2011.114
  • Filename
    6003916