DocumentCode :
2719108
Title :
Arabic stemming with two dictionaries
Author :
Kchaou, Zied ; Kanoun, Slim
Author_Institution :
Res. Group on Intell. Machines, Univ. of Sfax, Sfax
fYear :
2008
fDate :
16-18 Dec. 2008
Firstpage :
688
Lastpage :
691
Abstract :
We propose an approach to stemming Arabic words similar to the approach of Khoja, but with two dictionaries, one of roots and another of radicals. Our approach has the advantage of reducing the words that are inspired by their radicals to their radical and words which are inspired by their roots to their roots with great reliability and consistency and solves the problem of the handicapped radicals and roots in Khoja. We tested our approach on a large corpus of Arabic texts covering several areas.
Keywords :
dictionaries; natural language processing; text analysis; Arabic text corpus; Arabic word stemming; Khoja approach; dictionary; handicapped radical; handicapped root; Dictionaries; Electric breakdown; Indexing; Information retrieval; Machine intelligence; Natural languages; Pattern matching; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovations in Information Technology, 2008. IIT 2008. International Conference on
Conference_Location :
Al Ain
Print_ISBN :
978-1-4244-3396-4
Electronic_ISBN :
978-1-4244-3397-1
Type :
conf
DOI :
10.1109/INNOVATIONS.2008.4781780
Filename :
4781780
Link To Document :
بازگشت