  • DocumentCode
    2432729
  • Title
    A new stemmer for Farsi language
  • Author
    Estahbanati, Somayye ; Javidan, Reza
  • Author_Institution
    Sci. & Res. Branch, Dept. of Comput. Eng., Islamic Azad Univ., Khoozestan, Iran
  • fYear
    2011
  • fDate
    15-16 June 2011
  • Firstpage
    25
  • Lastpage
    29
  • Abstract
    In this paper, we report on the design and implementation of a stemmer for the Farsi language that combines Kazem Taghva's method with an improved version of Krovetz's method. The first method removes suffixes and prefixes according to the word's structure; the second relies on information stored in a database. The results of our evaluation on a small Farsi document collection show a significant improvement in precision/recall.
  • Keywords
    document handling; natural language processing; Farsi document collection; Farsi language; Kazem Taghva method; Krovetz method; stemmer; Algorithm design and analysis; Computers; Databases; Europe; Information retrieval; Internet; Morphology; Persian Language; Stemming; algorithm
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Science and Software Engineering (CSSE), 2011 CSI International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-61284-206-6
  • Type
    conf
  • DOI
    10.1109/CSICSSE.2011.5963993
  • Filename
    5963993