• DocumentCode
    658376
  • Title

    A Four Dimension Graph Model for Automatic Text Summarization

  • Author

    Ferreira, Ricardo ; Freitas, Fred ; De Souza Cabral, Luciano ; Dueire Lins, Rafael ; Lima, Raphaela ; Franca, Gabriel ; Simskez, Steven J. ; Favaro, Luciano

  • Author_Institution
    Inf. Center, Fed. Univ. of Pernambuco, Recife, Brazil
  • Volume
    1
  • fYear
    2013
  • fDate
    17-20 Nov. 2013
  • Firstpage
    389
  • Lastpage
    396
  • Abstract
    Text summarization is the process of automatically creating a shorter version of one or more text documents. In this context, word-based, sentence-based and graph-based methods approaches are largely used. Among these, graph based methods for automatic text summarization produce summaries based on the relationships between sentences. These relationships may also support the creation of several text processing applications such as extractive and abstractive summaries, question-answering and information retrieval systems, among others. A new graph model for text processing applications is proposed in this paper. It relies on four dimensions (similarity, semantic similarity, co reference, discourse information) to create the graph. The rationale behind the proposal presented here is resorting to more dimensions than previous works, and taking into account co reference resolution, taking into account to the role of pronouns in connecting the sentences. Co reference was not used in any previous graph based summarization technique. An experiment was performed using the Text Rank algorithm with the presented approach, on the CNN corpus. The results show that the model proposed here outperforms the current approaches both quantitatively and qualitatively.
  • Keywords
    graph theory; text analysis; word processing; CNN corpus; TextRank algorithm; abstractive summaries; automatic text summarization; discourse information; extractive summaries; four dimension graph model; graph-based methods; graph-based summarization technique; information retrieval systems; question-answering systems; semantic similarity; sentence-based methods; text documents; text processing applications; word-based methods; Measurement uncertainty; Proposals; Semantics; Silicon; Text processing; Vectors; Graph-Model; Summarization; TextRank;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4799-2902-3
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2013.55
  • Filename
    6690041