• DocumentCode
    3102179
  • Title

    Subtopic-based multi-document summarization

  • Author

    Dai, Lin ; Tang, Ji-liang ; Xia, Yun-qing

  • Author_Institution
    Sch. of Comput. Sci., Beijing Inst. of Technol., Beijing, China
  • Volume
    6
  • fYear
    2009
  • fDate
    12-15 July 2009
  • Firstpage
    3505
  • Lastpage
    3510
  • Abstract
    This paper proposes a novel approach for multi-document summarization based on subtopic segmentation. It firstly detects the subtopics in a topic, and then finds the central sentence for each subtopic. The sentences are scored based on their importance in the document and in the subtopic. Two anti-redundancy strategies are used to extract sentences to form summarization. Since our approach is intrinsically incremental, it is effective when new documents are added to the document set. Experimental results indicate that the proposed approach is effective and efficient.
  • Keywords
    text analysis; antiredundancy strategy; subtopic segmentation; subtopic-based multidocument summarization; Computer science; Cybernetics; Data mining; Information science; Information technology; Laboratories; Learning systems; Machine learning; Natural language processing; Partial response channels; Anti-redundancy strategy; Multi-document summarization; Topic Detection and Tracking; Topic segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2009 International Conference on
  • Conference_Location
    Baoding
  • Print_ISBN
    978-1-4244-3702-3
  • Electronic_ISBN
    978-1-4244-3703-0
  • Type

    conf

  • DOI
    10.1109/ICMLC.2009.5212767
  • Filename
    5212767