DocumentCode :
3467486
Title :
Chinese text classification method based on NMF
Author :
Zhang, Lei ; Xiang, Xuezhi
Author_Institution :
Inf. & Commun. Eng. Coll., Harbin Eng. Univ., Harbin, China
Volume :
2
fYear :
2009
fDate :
5-6 Dec. 2009
Firstpage :
240
Lastpage :
243
Abstract :
Semantic-level text document classification is currently an active topic in text processing. This paper presents a method for Chinese text classification based on nonnegative matrix factorization (NMF). The term-document matrix is decomposed with NMF to capture the relations between terms, which effectively addresses the problems of synonymy and polysemy. Experiments show that, compared with latent semantic indexing (LSI) based on singular value decomposition (SVD), the method computes faster, uses less memory, and improves classification precision when the dimension is reduced markedly.
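The decomposition step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which NMF algorithm, rank, or term weighting was used, so the multiplicative update rule, the toy 4x4 term-document matrix `V`, the rank of 2, and the helper name `nmf` are all assumptions made here for illustration.

```python
import numpy as np

def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Factor a nonnegative matrix V (terms x docs) as W @ H.

    Uses multiplicative updates for the Frobenius-norm objective.
    This update rule is an assumption; the abstract does not name one.
    """
    rng = np.random.default_rng(seed)
    n_terms, n_docs = V.shape
    # Random nonnegative initialisation keeps both factors nonnegative.
    W = rng.random((n_terms, rank)) + eps
    H = rng.random((rank, n_docs)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Hypothetical toy term-document matrix: rows are terms, columns are documents.
V = np.array([
    [2, 1, 0, 0],
    [1, 2, 0, 0],
    [0, 0, 3, 1],
    [0, 0, 1, 2],
], dtype=float)

W, H = nmf(V, rank=2)

# Each column of H is a reduced, 2-dimensional representation of one document;
# these vectors could then be passed to any classifier.
doc_features = H.T
rel_error = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print(doc_features.shape, round(float(rel_error), 3))
```

In this sketch the columns of `W` play the role of latent term groupings and the columns of `H` give the reduced document vectors, which is the sense in which the factorization captures relations between terms.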
Keywords :
information retrieval; matrix decomposition; singular value decomposition; text analysis; NMF; SVD; Chinese text document classification method; nonnegative matrix factorization; polysemy; semantic level; synonym; term-document matrix; text processing; Dictionaries; Educational institutions; Image retrieval; Indexing; Surveillance; Testing; Text categorization; text document classification
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Test and Measurement, 2009. ICTM '09. International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-4699-5
Type :
conf
DOI :
10.1109/ICTM.2009.5413065
Filename :
5413065