Title :
Arabic stemming with two dictionaries
Author :
Kchaou, Zied ; Kanoun, Slim
Author_Institution :
Res. Group on Intell. Machines, Univ. of Sfax, Sfax
Abstract :
We propose an approach to stemming Arabic words similar to the approach of Khoja, but with two dictionaries, one of roots and another of radicals. Our approach has the advantage of reducing the words that are inspired by their radicals to their radical and words which are inspired by their roots to their roots with great reliability and consistency and solves the problem of the handicapped radicals and roots in Khoja. We tested our approach on a large corpus of Arabic texts covering several areas.
Keywords :
dictionaries; natural language processing; text analysis; Arabic text corpus; Arabic word stemming; Khoja approach; dictionary; handicapped radical; handicapped root; Dictionaries; Electric breakdown; Indexing; Information retrieval; Machine intelligence; Natural languages; Pattern matching; Testing; Vocabulary;
Conference_Titel :
Innovations in Information Technology, 2008. IIT 2008. International Conference on
Conference_Location :
Al Ain
Print_ISBN :
978-1-4244-3396-4
Electronic_ISBN :
978-1-4244-3397-1
DOI :
10.1109/INNOVATIONS.2008.4781780