• Title of article

    Mutual-reinforcement document summarization using embedded graph based sentence clustering for storytelling

  • Author/Authors

    Zhengchen Zhang، نويسنده , , Shuzhi Sam Ge، نويسنده , , Hongsheng He، نويسنده ,

  • Issue Information
    دوماهنامه با شماره پیاپی سال 2012
  • Pages
    12
  • From page
    767
  • To page
    778
  • Abstract
    In this paper, a document summarization framework for storytelling is proposed to extract essential sentences from a document by exploiting the mutual effects between terms, sentences and clusters. There are three phrases in the framework: document modeling, sentence clustering and sentence ranking. The story document is modeled by a weighted graph with vertexes that represent sentences of the document. The sentences are clustered into different groups to find the latent topics in the story. To alleviate the influence of unrelated sentences in clustering, an embedding process is employed to optimize the document model. The sentences are then ranked according to the mutual effect between terms, sentence as well as clusters, and high-ranked sentences are selected to comprise the summarization of the document. The experimental results on the Document Understanding Conference (DUC) data sets demonstrate the effectiveness of the proposed method in document summarization. The results also show that the embedding process for sentence clustering render the system more robust with respect to different cluster numbers.
  • Keywords
    Space embedding , sentence clustering , Storytelling , Document summarization , Sentence ranking
  • Journal title
    Information Processing and Management
  • Serial Year
    2012
  • Journal title
    Information Processing and Management
  • Record number

    1229272