Title :
Weighted graph model based sentence clustering and ranking for document summarization
Author :
Ge, Shuzhi Sam ; Zhang, Zhengchen ; He, Hongsheng
Author_Institution :
Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
This paper proposes a sentence ranking and clustering based summarization method that extracts essential sentences from a document. To discover central sentences, a weighted undirected graph that takes sentence similarities and the discourse relationship between sentences as the weights of edges is constructed for the given document. A graph-ranking algorithm is implemented to calculate the scores of sentences. We also build a matrix for the document, and an algorithm based on Sparse Non-negative Matrix Factorization is introduced to cluster the sentences in the document. High ranked sentences of each cluster are selected to comprise the summarization of the document. The experimental results on the Document Understanding Conference (DUC) 2001 data set demonstrate the effectiveness of the document summarization algorithm.
Keywords :
document handling; graph theory; matrix decomposition; clustering based summarization method; document summarization; document understanding conference; essential sentence extraction; graph-ranking algorithm; sentence clustering; sentence ranking; sparse nonnegative matrix factorization; weighted graph model; weighted undirected graph; Clustering algorithms; Connectors; Feature extraction; Semantics; Sparse matrices; Strontium; Symmetric matrices;
Conference_Titel :
Interaction Sciences (ICIS), 2011 4th International Conference on
Conference_Location :
Busan
Print_ISBN :
978-1-4577-0480-2
Electronic_ISBN :
978-89-88678-45-9