DocumentCode :
2760967
Title :
A structural rule-based stemmer for Persian
Author :
Rahimtoroghi, Elaheh ; Faili, Hesham ; Shakery, Azadeh
Author_Institution :
Sch. of Electr. & Comput. Eng., Univ. of Tehran, Tehran, Iran
fYear :
2010
fDate :
4-6 Dec. 2010
Firstpage :
574
Lastpage :
578
Abstract :
This paper presents a new stemmer for Persian language. We used a structural approach for stemming which uses the structure of words and morphological rules of the language to recognize the stem of each word. We composed 33 rules to describe a structural rule-based stemmer. The rules are written based on the morphology of Persian language and its word derivation structure. For evaluation, we used our stemmer in an information retrieval system. The results demonstrated that by enhancing the system with this stemmer, the information retrieval system´s precision increases, by the factor of 4.78% and the indexing file size decreases by the factor of 6%.
Keywords :
information retrieval systems; natural language processing; Persian language; Persian word derivation structure; information retrieval system; structural rule-based stemmer; Computers; Educational institutions; Indexing; Information retrieval; Morphology; Speech; Information Retrieval; Natural Language Processing; Persian Language; Stemming;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Telecommunications (IST), 2010 5th International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4244-8183-5
Type :
conf
DOI :
10.1109/ISTEL.2010.5734090
Filename :
5734090
Link To Document :
بازگشت