DocumentCode
3401301
Title
Generic multi-document summarization using cluster refinement and NMF
Author
Park, Sun ; An, Dong Un ; Cho, Youn Jeong
Author_Institution
Adv. Grad. Educ. Center of Jeonbuk for Electron. & Inf. Technol.-BK21, Chonbuk Nat. Univ., Jeonju, South Korea
fYear
2009
fDate
14-17 Dec. 2009
Firstpage
65
Lastpage
70
Abstract
In this paper, a generic summarization method that uses cluster refinement and NMF is introduced to extract meaningful sentences from documents. The proposed method uses cluster refinement to improve the quality of document clustering since it helps us to remove dissimilarity information easily and avoid biased inherent semantics of documents to be reflected in clusters by NMF. In addition, it uses the weighted semantic variable to select meaningful sentences because the extracted sentences are well covered with the major topics of document. The experimental results demonstrate that the proposed method has better performance than other methods that use the other methods.
Keywords
feature extraction; pattern clustering; text analysis; cluster refinement; document clustering; generic multidocument summarization; meaningful sentence extraction; Clustering algorithms; Data mining; Diversity methods; Diversity reception; Educational technology; Feature extraction; Internet; Sun; Vectors; NMF; cluster coherence; cluster refinement; geneirc multi-document summarization;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Information Technology (ISSPIT), 2009 IEEE International Symposium on
Conference_Location
Ajman
Print_ISBN
978-1-4244-5949-0
Type
conf
DOI
10.1109/ISSPIT.2009.5407492
Filename
5407492
Link To Document