• DocumentCode
    3542827
  • Title

    Improving coherence by reordering the output of extractive summarization using Centering Theory through genetic algorithm

  • Author

    Yuliawati, Arlisa ; Manurung, Ruli

  • Author_Institution
    Lab. of Inf. Retrieval, Univ. Indonesia, Depok, Indonesia
  • fYear
    2013
  • fDate
    28-29 Sept. 2013
  • Firstpage
    213
  • Lastpage
    218
  • Abstract
    Extractive summarization is a widely studied and fairly easy to implement technique. It works by choosing the most important parts of a document(s) as a summary. However, this can lead to a lack of coherence in the summary itself. In this study, the principle of continuity in Centering Theory is used to maintain the entity coherence between subsequent sentences obtained from extractive news summarizer. Simultaneously, the relative order of sentences belonging to the same source document is maintained. These two considerations are implemented as fitness functions for a genetic algorithm that is used to obtain the optimal ordering of sentences in the summary. Based on the results of our study involving human judgment, a weighted fitness function combining 75% continuity and 25% relative order yields the most acceptable sentence ordering.
  • Keywords
    genetic algorithms; text analysis; centering theory; coherence improvement; continuity principle; document source; extractive news summarizer; extractive summarization output reordering; genetic algorithm; human judgment; optimal sentence ordering; relative sentence order; weighted fitness function; Coherence; Equations; Genetic algorithms; Redundancy; Semantics; Sociology; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Science and Information Systems (ICACSIS), 2013 International Conference on
  • Conference_Location
    Bali
  • Type

    conf

  • DOI
    10.1109/ICACSIS.2013.6761578
  • Filename
    6761578