DocumentCode :
2774527
Title :
Enhanced Arabic Information Retrieval System based on Arabic Text Classification
Author :
Ghwanmeh, Sameh ; Kanaan, Ghassan ; Al-Shalabi, Riyad ; Ababneh, Ahmad
Author_Institution :
Yarmouk Univ., Irbid
fYear :
2007
fDate :
18-20 Nov. 2007
Firstpage :
461
Lastpage :
465
Abstract :
The paper presents an enhanced, effective, and simple approach to text classification. The approach uses an algorithm to classify documents automatically. The main idea of the algorithm is to select feature words from each document; those words cover all the ideas in the document. The result of the algorithm is a list of the main subjects found in the document. The paper also investigates the effect of Arabic text classification on information retrieval. The system was evaluated on precision/recall criteria in two cases: without Arabic text classification and with it. A series of experiments was carried out to test the algorithm using 242 Arabic abstracts. Additionally, automatic phrase indexing was implemented. The experiments showed that the system with text classification performs better than the system without it.
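The precision/recall criteria mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation; the function name `precision_recall` and the document identifiers are hypothetical, and the only assumption is the standard set-based definition of the two measures for a single query.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    retrieved: document IDs returned by the system (hypothetical IDs)
    relevant:  document IDs judged relevant for the query
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = len(retrieved & relevant)  # relevant documents that were retrieved
    # Guard against empty sets so the ratios are always defined.
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Illustrative data only: 4 documents retrieved, 3 relevant, 2 in common.
p, r = precision_recall(["d1", "d2", "d3", "d4"], ["d1", "d3", "d7"])
print(p, r)
```

Comparing the two cases described in the abstract would amount to computing these values for the same queries once without and once with classification applied.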
Keywords :
information retrieval; text analysis; Arabic text classification; classifying documents; enhanced Arabic information retrieval system; Abstracts; Animals; Banking; Indexing; Information retrieval; Particle measurements; Prototypes; Testing; Text categorization; Web pages;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Innovations in Information Technology, 2007. IIT '07. 4th International Conference on
Conference_Location :
Dubai
Print_ISBN :
978-1-4244-1840-4
Electronic_ISBN :
978-1-4244-1841-1
Type :
conf
DOI :
10.1109/IIT.2007.4430469
Filename :
4430469