• DocumentCode
    2223580
  • Title

    Personalized PageRank Based Multi-document Summarization

  • Author

    Liu, Yong ; Wang, Xiaolei ; Zhang, Jin ; Xu, Hongbo

  • Author_Institution
    Sch. of Mech. Eng. & Autom., Northeastern Univ., Shenyang
  • fYear
    2008
  • fDate
    14-15 July 2008
  • Firstpage
    169
  • Lastpage
    173
  • Abstract
    This paper presents a novel multi-document summarization approach based on personalized pagerank (PPRSum). In this algorithm, we uniformly integrate various kinds of information in the corpus. At first, we train a salience model of sentence global features based on Naive Bayes Model. Secondly, we generate a relevance model for each corpus utilizing the query of it. Then, we compute the personalized prior probability for each sentence in the corpus utilizing the salience model and the relevance model both. With the help of personalized prior probability, a Personalized PageRank ranking process is performed depending on the relationships among all sentences in the corpus. Additionally, the redundancy penalty is imposed on each sentence. The summary is produced by choosing the sentences with both high query-focused information richness and high information novelty. Experiments on DUC2007 are performed and the ROUGE evaluation results show that PPRSum ranks between the 1st and the 2nd systems on DUC2007 main task.
  • Keywords
    Bayes methods; probability; query processing; search engines; text analysis; multidocument summarization; naive Bayes model; personalized pagerank; personalized prior probability; query processing; redundancy penalty; relevance model; salience model; text analysis; Automation; Clustering algorithms; Computers; Conferences; Data mining; Frequency; Mechanical engineering; Partitioning algorithms; Performance evaluation; Naïve Bayes model; Personalized PageRank; personalized prior probability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing and Systems, 2008. WSCS '08. IEEE International Workshop on
  • Conference_Location
    Huangshan
  • Print_ISBN
    978-0-7695-3316-2
  • Electronic_ISBN
    978-0-7695-3316-2
  • Type

    conf

  • DOI
    10.1109/WSCS.2008.32
  • Filename
    4570834