Title :
Keyphrase based Arabic summarizer (KPAS)
Author :
El-Shishtawy, Tarek ; El-Ghannam, Fatma
Abstract :
This paper describes a computationally inexpensive and efficient generic summarization algorithm for Arabic texts. The algorithm belongs to extractive summarization family, which reduces the problem into representative sentences identification and extraction sub-problems. Important keyphrases of the document to be summarized are identified employing combinations of statistical and linguistic features. The sentence extraction algorithm exploits keyphrases as the primary attributes to rank a sentence. The present experimental work, demonstrates different techniques for achieving various summarization goals including: informative richness, coverage of both main and auxiliary topics, and keeping redundancy to a minimum. A scoring scheme is then adopted that balances between these summarization goals. To evaluate the resulted Arabic summaries with well-established systems, aligned English/Arabic texts are used through the experiments.
Keywords :
feature extraction; natural language processing; redundancy; statistical analysis; text analysis; KPAS; aligned Arabic text; aligned English text; auxiliary topics; extraction sub-problems; generic summarization algorithm; informative richness; keyphrase based Arabic summarizer; linguistic features; main topics; redundancy; representative sentence identification; scoring scheme; sentence extraction algorithm; sentence ranking; statistical features; Algorithm design and analysis; Computers; Educational institutions; Feature extraction; Informatics; Pragmatics; Semantics;
Conference_Titel :
Informatics and Systems (INFOS), 2012 8th International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4673-0828-1