• DocumentCode
    3126028
  • Title

    Supervised Lazy Random Walk for Topic-Focused Multi-document Summarization

  • Author

    Du, Pan ; Guo, Jiafeng ; Cheng, Xueqi

  • Author_Institution
    Inst. of Comput. Technol., Beijing, China
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    1026
  • Lastpage
    1031
  • Abstract
    Topic-focused multi-document summarization aims to produce a summary given a specific topic description and a set of related documents. It has become a crucial text processing task in many real applications that can help users consume the massive information. This paper presents a novel extractive approach based on supervised lazy random walk (Super Lazy). This approach naturally combines the rich features of sentences with the intrinsic sentence graph structure in a principled way, and thus enjoys the advantages of both the existing supervised and unsupervised approaches. Moreover, our approach can achieve the three major goals of topic-focused multi-document summarization (i.e. relevance, salience and diversity) simultaneously with a unified ranking process. Experiments on the benchmark dataset TAC2008 and TAC2009 are performed and the ROUGE evaluation results demonstrate that our approach can significantly outperform both the state-of-the-art supervised and unsupervised methods.
  • Keywords
    information retrieval; learning (artificial intelligence); random processes; text analysis; ROUGE evaluation; SuperLazy; benchmark dataset TAC2008; benchmark dataset TAC2009; intrinsic sentence graph structure; supervised lazy random walk; text processing task; topic focused multidocument summarization; unified ranking process; Feature extraction; Humans; Lead; Measurement; Support vector machines; Training; Vectors; diversity; relevance; salience; supervised lazy random walk; topic-focused multi-document summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver,BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2011.140
  • Filename
    6137309