DocumentCode :
3310369
Title :
Automatically extracting summaries with a novel unsupervised framework
Author :
Peng Li ; Yinglin Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
Volume :
3
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
1778
Lastpage :
1782
Abstract :
In this paper, we propose a novel approach to automatic generation of aspect-oriented summaries from multiple documents. We first develop an event-aspect LDA model to cluster sentences into aspects. We then use extended LexRank algorithm to rank the sentences in each cluster. We use Integer Linear Programming for sentence selection. Key features of our method include automatic grouping of semantically related sentences and sentence ranking based on extension of random walk model. Also, we implement a new sentence compression algorithm which use dependency tree instead of parser tree. We compare our method with three baseline methods. Quantitative evaluation based on Rouge metric demonstrate the effectiveness and advantages of our method.
Keywords :
aspect-oriented programming; document handling; grammars; integer programming; linear programming; pattern clustering; unsupervised learning; LexRank algorithm; aspect-oriented summaries; cluster sentences; event-aspect LDA model; integer linear programming; multidocument summarization; parser tree; random walk model; sentence compression algorithm; sentence ranking; unsupervised framework; Clustering algorithms; Compression algorithms; Computational linguistics; Computational modeling; Integer linear programming; Pragmatics; Redundancy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
Type :
conf
DOI :
10.1109/FSKD.2011.6019843
Filename :
6019843
Link To Document :
بازگشت