DocumentCode
2975316
Title
Graph-based submodular selection for extractive summarization
Author
Lin, Hui ; Bilmes, Jeff ; Xie, Shasha
Author_Institution
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
fYear
2009
fDate
Nov. 13 2009-Dec. 17 2009
Firstpage
381
Lastpage
386
Abstract
We propose a novel approach for unsupervised extractive summarization. Our approach builds a semantic graph for the document to be summarized. Summary extraction is then formulated as optimizing submodular functions defined on the semantic graph. The optimization is theoretically guaranteed to be near-optimal under the framework of submodularity. Extensive experiments on the ICSI meeting summarization task on both human transcripts and automatic speech recognition (ASR) outputs show that the graph-based submodular selection approach consistently outperforms the maximum marginal relevance (MMR) approach, a concept-based approach using integer linear programming (ILP), and a recursive graph-based ranking algorithm using Google´s PageRank.
Keywords
graph theory; integer programming; linear programming; speech recognition; text analysis; PageRank; automatic speech recognition; document summarization; graph-based submodular selection; human transcript; integer linear programming; optimization; recursive graph-based ranking algorithm; semantic graph; unsupervised extractive summarization; Automatic speech recognition; Computer science; Game theory; Greedy algorithms; Humans; Integer linear programming; Polynomials; Redundancy;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location
Merano
Print_ISBN
978-1-4244-5478-5
Electronic_ISBN
978-1-4244-5479-2
Type
conf
DOI
10.1109/ASRU.2009.5373486
Filename
5373486
Link To Document