Title :
Improve the Automatic Summarization of Arabic Text Depending on Rhetorical Structure Theory
Author :
Ibrahim, Amin ; Elghazaly, Tarek
Author_Institution :
Dept. of Comput. & Inf. Sci., Cairo Univ., Cairo, Egypt
Abstract :
This paper uses a semantic technique by adopting a Rhetorical Structure Theory (RST) for summarization purpose, to discover the most significant paragraphs based on functional and semantic criteria. However, the quality of RST summarization suffers when dealing with large documents. This paper proposes a new hybrid summarization model for Arabic text, which mingles two sub-models: The first sub-model produces a primary summary by using Rhetorical Structure Theory for identifying a range of the most significant parts of the text (the nucleus). Then the second sub-model ranks the significant parts in the primary rhetorical-summary based on the cosine similarity feature. To evaluate the proposed model, a prototype was developed on a range of articles, which have been classified into three groups different in size. The final output summary was evaluated in relation to its manual counterpart. In terms of enhancement of the rhetorical-summary precision, the experiment shows that proposed model HSM average precision is 71.6%, superior over the primary rhetorical-summary precision 56.3%.
Keywords :
computational linguistics; natural language processing; text analysis; Arabic text; HSM average precision; RST summarization; automatic summarization; cosine similarity feature; functional criteria; hybrid summarization model; paragraphs; primary rhetorical-summary precision; rhetorical structure theory; semantic criteria; semantic technique; Connectors; Data visualization; Educational institutions; Semantics; Support vector machine classification; Vectors; XML; Arabic text summarization; RST; Rhetorical Structure Theory; VSM; Vector Space Model;
Conference_Titel :
Artificial Intelligence (MICAI), 2013 12th Mexican International Conference on
Conference_Location :
Mexico City
Print_ISBN :
978-1-4799-2604-6
DOI :
10.1109/MICAI.2013.35