DocumentCode
3289686
Title
Arabic Topic Detection using automatic text summarisation
Author
Koulali, Rim ; El-Haj, Mahmoud ; Meziane, Abdelkrim
Author_Institution
LARI Lab., Mohammed I Univ., Oujda, Morocco
fYear
2013
fDate
27-30 May 2013
Firstpage
1
Lastpage
4
Abstract
With the exponential growth of the online available Arabic documents, classifying and processing large Arabic corpora has became a challenging task. The presence of noisy information embedded in these documents has made it even more difficult to get accurate results when applying a Topic Detection (TD) process. To address this problem, a proper features selection approach is needed to enhance the topic detection accuracy. In this paper, we explore the impact of using automatic summarisation technique along with a feature-selection process to enhance Arabic Topic Detection. In our work we show that using automatic summarisation reduces noisy information and results in a significant enhancement to the topic detection process and therefore increases the performance of our TD system. This was achieved by the ability of our summariser system in reducing documents size to speed up the detection process.
Keywords
natural language processing; text analysis; Arabic corpora; Arabic topic detection; TD process; TD system; automatic summarisation technique; automatic text summarisation; feature selection approach; feature-selection process; noisy information; online available Arabic documents; summariser system; Educational institutions; Electronic mail; Feature extraction; Natural language processing; Noise measurement; Training; Vectors; Automatic Summarisation; Cosine Similarity; Natural Language Processing; TF-IDF; Topic Detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Systems and Applications (AICCSA), 2013 ACS International Conference on
Conference_Location
Ifrane
ISSN
2161-5322
Type
conf
DOI
10.1109/AICCSA.2013.6616460
Filename
6616460
Link To Document