DocumentCode
3542827
Title
Improving coherence by reordering the output of extractive summarization using Centering Theory through genetic algorithm
Author
Yuliawati, Arlisa ; Manurung, Ruli
Author_Institution
Lab. of Inf. Retrieval, Univ. Indonesia, Depok, Indonesia
fYear
2013
fDate
28-29 Sept. 2013
Firstpage
213
Lastpage
218
Abstract
Extractive summarization is a widely studied and fairly easy to implement technique. It works by choosing the most important parts of a document(s) as a summary. However, this can lead to a lack of coherence in the summary itself. In this study, the principle of continuity in Centering Theory is used to maintain the entity coherence between subsequent sentences obtained from extractive news summarizer. Simultaneously, the relative order of sentences belonging to the same source document is maintained. These two considerations are implemented as fitness functions for a genetic algorithm that is used to obtain the optimal ordering of sentences in the summary. Based on the results of our study involving human judgment, a weighted fitness function combining 75% continuity and 25% relative order yields the most acceptable sentence ordering.
Keywords
genetic algorithms; text analysis; centering theory; coherence improvement; continuity principle; document source; extractive news summarizer; extractive summarization output reordering; genetic algorithm; human judgment; optimal sentence ordering; relative sentence order; weighted fitness function; Coherence; Equations; Genetic algorithms; Redundancy; Semantics; Sociology; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Computer Science and Information Systems (ICACSIS), 2013 International Conference on
Conference_Location
Bali
Type
conf
DOI
10.1109/ICACSIS.2013.6761578
Filename
6761578
Link To Document