• DocumentCode
    675007
  • Title

    Improve the Automatic Summarization of Arabic Text Depending on Rhetorical Structure Theory

  • Author

    Ibrahim, Amin ; Elghazaly, Tarek

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Cairo Univ., Cairo, Egypt
  • fYear
    2013
  • fDate
    24-30 Nov. 2013
  • Firstpage
    223
  • Lastpage
    227
  • Abstract
    This paper uses a semantic technique by adopting a Rhetorical Structure Theory (RST) for summarization purpose, to discover the most significant paragraphs based on functional and semantic criteria. However, the quality of RST summarization suffers when dealing with large documents. This paper proposes a new hybrid summarization model for Arabic text, which mingles two sub-models: The first sub-model produces a primary summary by using Rhetorical Structure Theory for identifying a range of the most significant parts of the text (the nucleus). Then the second sub-model ranks the significant parts in the primary rhetorical-summary based on the cosine similarity feature. To evaluate the proposed model, a prototype was developed on a range of articles, which have been classified into three groups different in size. The final output summary was evaluated in relation to its manual counterpart. In terms of enhancement of the rhetorical-summary precision, the experiment shows that proposed model HSM average precision is 71.6%, superior over the primary rhetorical-summary precision 56.3%.
  • Keywords
    computational linguistics; natural language processing; text analysis; Arabic text; HSM average precision; RST summarization; automatic summarization; cosine similarity feature; functional criteria; hybrid summarization model; paragraphs; primary rhetorical-summary precision; rhetorical structure theory; semantic criteria; semantic technique; Connectors; Data visualization; Educational institutions; Semantics; Support vector machine classification; Vectors; XML; Arabic text summarization; RST; Rhetorical Structure Theory; VSM; Vector Space Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence (MICAI), 2013 12th Mexican International Conference on
  • Conference_Location
    Mexico City
  • Print_ISBN
    978-1-4799-2604-6
  • Type

    conf

  • DOI
    10.1109/MICAI.2013.35
  • Filename
    6714672