DocumentCode
2432729
Title
A new stemmer for Farsi language
Author
Estahbanati, Somayye ; Javidan, Reza
Author_Institution
Sci. & Res. Branch, Dept. of Comput. Eng., Islamic Azad Univ., Khoozestan, Iran
fYear
2011
fDate
15-16 June 2011
Firstpage
25
Lastpage
29
Abstract
In this paper, we report on the design and implementation of a stemmer for the Farsi language, according to combination of Kazem Taghva´s method and improved Krovetz´s method. The first method removes the suffixes and prefixes according to the word´s structure. And the second method is based on saving the information in a Database. This paper reports a kind of combination of these methods. The results of our evaluation on a small Farsi document collection show a significant improvement in precision/recall.
Keywords
document handling; natural language processing; Farsi document collection; Farsi language; Kazem Taghva method; Krovetz method; stemmer; Algorithm design and analysis; Computers; Databases; Europe; Information retrieval; Internet; Morphology; Farsi language; Persian Language; Stemming; algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Software Engineering (CSSE), 2011 CSI International Symposium on
Conference_Location
Tehran
Print_ISBN
978-1-61284-206-6
Type
conf
DOI
10.1109/CSICSSE.2011.5963993
Filename
5963993
Link To Document