DocumentCode :
2917463
Title :
Using NMF-based text summarization to improve supervised and unsupervised classification
Author :
Tsarev, Dmitry ; Petrovskiy, Mikhail ; Mashechkin, Igor
Author_Institution :
Comput. Sci. Dept., Lomonosov Moscow State Univ., Moscow, Russia
fYear :
2011
fDate :
5-8 Dec. 2011
Firstpage :
185
Lastpage :
189
Abstract :
This paper presents a new generic text summarization method using Non-negative Matrix Factorization (NMF) to estimate sentence relevance. Proposed sentence relevance estimation is based on normalization of NMF topic space and further weighting of each topic using sentences representation in topic space. The proposed method shows better summarization quality and performance than state of the art methods on DUC 2002 standard dataset. In addition, we study how this method can improve the performance of supervised and unsupervised text classification tasks. In our experiments with Reuters-21578 and Classic4 benchmark datasets we apply developed text summarization method as a preprocessing step for further multi-label classification and clustering. As a result, the quality of classification and clustering has been significantly improved.
Keywords :
matrix decomposition; pattern classification; pattern clustering; text analysis; Classic4; NMF based text summarization; Reuters-21578; multilabel classification; multilabel clustering; nonnegative matrix factorization; sentence relevance estimation; sentences representation; supervised classification; unsupervised classification; Benchmark testing; Clustering algorithms; Hybrid intelligent systems; Matrix decomposition; Semantics; Training; Vectors; clustering; generic text summarization; latent semantic analysis; multi-label classification; non-negative matrix factorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hybrid Intelligent Systems (HIS), 2011 11th International Conference on
Conference_Location :
Melacca
Print_ISBN :
978-1-4577-2151-9
Type :
conf
DOI :
10.1109/HIS.2011.6122102
Filename :
6122102
Link To Document :
بازگشت