• DocumentCode
    3467486
  • Title

    Chinese text classification method based on NMF

  • Author

    Zhang, Lei ; Xiang, Xuezhi

  • Author_Institution
    Inf. & Commun. Eng. Coll., Harbin Eng. Univ., Harbin, China
  • Volume
    2
  • fYear
    2009
  • fDate
    5-6 Dec. 2009
  • Firstpage
    240
  • Lastpage
    243
  • Abstract
    Text document classification based on the semantic level is a hot issue in text processing presently. In this paper, a method based on NMF for Chinese text classification is presented. According to NMF, the term-document matrix is decomposed to capture the relation between terms. This method settled effectively the problems of synonym and polysemy. It experimentally shows that, compared with LSI based on SVD, this method has advantages of faster computing speed, less memory occupancy and improvement of classification precision when the dimension reduces markedly.
  • Keywords
    information retrieval; matrix decomposition; singular value decomposition; text analysis; NMF; SVD; chinese text document classification method; nonnegative matrix factorization; polysemy; semantic level; singular value decomposition; synonym; term-document matrix; text processing; Dictionaries; Educational institutions; Image retrieval; Indexing; Information retrieval; Matrix decomposition; Surveillance; Testing; Text categorization; Text processing; NMF; SVD; text document classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Test and Measurement, 2009. ICTM '09. International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-4699-5
  • Type

    conf

  • DOI
    10.1109/ICTM.2009.5413065
  • Filename
    5413065