• DocumentCode
    3145417
  • Title

    Discourse segmentation in aid of document summarization

  • Author

    Boguraev, Branimir K. ; Neff, Mary S.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2000
  • fDate
    4-7 Jan. 2000
  • Abstract
    This paper describes work to enhance a sentence-based summarizer with notions of salience, dynamically adjustable summary size, discourse segmentation, and awareness of topic shifts. Our experiments study strategies to diversify the application of a baseline summarizer, by making it aware of finer-grained 'aboutness', capable of discerning changes of topic, and sensitive to longer-than-usual documents. Evaluated against the corpus used in the development of the baseline summarizer, summaries derived either by means of segmentation analysis alone, or by a mix of strategies for combining salience calculation and topic shift detection, are shown to be of comparable, and under certain conditions even better, quality. We describe the summarization and segmentation procedures, outline a number of strategies for mixing the two, evaluate the overall impact of discourse segmentation, and suggest an interface design capable of using the notion of topic shifts to contextualize a summary and facilitate the mediation between it and the full document source.
  • Keywords
    text analysis; discourse segmentation; document summarization; dynamically adjustable summary size; salience; sentence-based summarizer; topic shifts; Character recognition; Computational modeling; Conferences; Context awareness; Degradation; Information management; Mediation; Statistics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    System Sciences, 2000. Proceedings of the 33rd Annual Hawaii International Conference on
  • Print_ISBN
    0-7695-0493-0
  • Type

    conf

  • DOI
    10.1109/HICSS.2000.926687
  • Filename
    926687