• DocumentCode
    2873165
  • Title

    Topic Discovery in Research Literature Based on Non-negative Matrix Factorization and Testor Theory

  • Author

    Li, Fang ; Zhu, Qunxiong ; Lin, Xiaoyong

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Beijing Univ. of Chem. Technol., Beijing, China
  • Volume
    2
  • fYear
    2009
  • fDate
    18-19 July 2009
  • Firstpage
    266
  • Lastpage
    269
  • Abstract
    The paper proposes a new way of comprising the Non-negative matrix factorization (NMF) and Testor theory to make topic discovery. NMF method is good at dealing with high dimensional documents and clustering, while Testor theory is used to find the topic of each cluster. By an example of ten abstracts of Chinese science literature from magazines relative to environmental science, the whole process is described in detail. In the end, a case study about automatic classification of a conference proceeding (in Chinese) is given. The result shows the effectiveness of the whole method.
  • Keywords
    classification; data mining; data reduction; literature; matrix decomposition; pattern clustering; text analysis; Chinese science literature; Testor theory; automatic conference proceeding classification; environmental science; high dimensional document clustering; nonnegative matrix factorization; research literature; text data dimensionality reduction; text mining; topic discovery; Abstracts; Chemical technology; Clustering algorithms; Clustering methods; Computer science; Conference proceedings; Information processing; Paper technology; Partitioning algorithms; Testing; Document Clustering; NMF; Term-Document Matrix; Testor theory; Topic Discovery;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Processing, 2009. APCIP 2009. Asia-Pacific Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-0-7695-3699-6
  • Type

    conf

  • DOI
    10.1109/APCIP.2009.202
  • Filename
    5197187