• Title of article

    Single-document and multi-document summarization techniques for email threads using sentence compression

  • Author/Authors

    David M. Zajic، نويسنده , , Bonnie J. Dorr، نويسنده , , Jimmy Lin، نويسنده ,

  • Issue Information
    دوماهنامه با شماره پیاپی سال 2008
  • Pages
    11
  • From page
    1600
  • To page
    1610
  • Abstract
    We present two approaches to email thread summarization: collective message summarization (CMS) applies a multi-document summarization approach, while individual message summarization (IMS) treats the problem as a sequence of single-document summarization tasks. Both approaches are implemented in our general framework driven by sentence compression. Instead of a purely extractive approach, we employ linguistic and statistical methods to generate multiple compressions, and then select from those candidates to produce a final summary. We demonstrate these ideas on the Enron email collection – a very challenging corpus because of the highly technical language. Experimental results point to two findings: that CMS represents a better approach to email thread summarization, and that current sentence compression techniques do not improve summarization performance in this genre.
  • Keywords
    Email summarization , Sentence compression , trimming , Enron , Informal media
  • Journal title
    Information Processing and Management
  • Serial Year
    2008
  • Journal title
    Information Processing and Management
  • Record number

    1228850