DocumentCode
2873165
Title
Topic Discovery in Research Literature Based on Non-negative Matrix Factorization and Testor Theory
Author
Li, Fang ; Zhu, Qunxiong ; Lin, Xiaoyong
Author_Institution
Sch. of Comput. Sci. & Technol., Beijing Univ. of Chem. Technol., Beijing, China
Volume
2
fYear
2009
fDate
18-19 July 2009
Firstpage
266
Lastpage
269
Abstract
The paper proposes a new way of comprising the Non-negative matrix factorization (NMF) and Testor theory to make topic discovery. NMF method is good at dealing with high dimensional documents and clustering, while Testor theory is used to find the topic of each cluster. By an example of ten abstracts of Chinese science literature from magazines relative to environmental science, the whole process is described in detail. In the end, a case study about automatic classification of a conference proceeding (in Chinese) is given. The result shows the effectiveness of the whole method.
Keywords
classification; data mining; data reduction; literature; matrix decomposition; pattern clustering; text analysis; Chinese science literature; Testor theory; automatic conference proceeding classification; environmental science; high dimensional document clustering; nonnegative matrix factorization; research literature; text data dimensionality reduction; text mining; topic discovery; Abstracts; Chemical technology; Clustering algorithms; Clustering methods; Computer science; Conference proceedings; Information processing; Paper technology; Partitioning algorithms; Testing; Document Clustering; NMF; Term-Document Matrix; Testor theory; Topic Discovery;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Processing, 2009. APCIP 2009. Asia-Pacific Conference on
Conference_Location
Shenzhen
Print_ISBN
978-0-7695-3699-6
Type
conf
DOI
10.1109/APCIP.2009.202
Filename
5197187
Link To Document