DocumentCode
2774527
Title
Enhanced Arabic Information Retrieval System based on Arabic Text Classification
Author
Ghwanmeh, Sameh ; Kanaan, Ghassan ; Al-Shalabi, Riyad ; Ababneh, Ahmad
Author_Institution
Yarmouk Univ., Irbid
fYear
2007
fDate
18-20 Nov. 2007
Firstpage
461
Lastpage
465
Abstract
The paper presents an enhanced, effective, and simple approach to text classification. The approach uses an algorithm to classify documents automatically. The main idea of the algorithm is to select feature words from each document; these words cover all the ideas in the document. The result of the algorithm is a list of the main subjects found in the document. The paper also investigates the effect of Arabic text classification on information retrieval. The system was evaluated on precision/recall criteria in two cases: without Arabic text classification and with Arabic text classification. A series of experiments was carried out to test the algorithm using 242 Arabic abstracts. Additionally, automatic phrase indexing was implemented. The experiments showed that the system with text classification performs better than the system without it.
Keywords
information retrieval; text analysis; Arabic text classification; classifying documents; enhanced Arabic information retrieval system; Abstracts; Animals; Banking; Indexing; Information retrieval; Particle measurements; Prototypes; Testing; Text categorization; Web pages;
fLanguage
English
Publisher
IEEE
Conference_Titel
Innovations in Information Technology, 2007. IIT '07. 4th International Conference on
Conference_Location
Dubai
Print_ISBN
978-1-4244-1840-4
Electronic_ISBN
978-1-4244-1841-1
Type
conf
DOI
10.1109/IIT.2007.4430469
Filename
4430469