• Title of article

    Design, implementation, and evaluation of a methodology for automatic stemmer generation

  • Author/Authors

    Massimo Melucci، نويسنده , , Massimo Melucci and Nicola Orio، نويسنده ,

  • Issue Information
    ماهنامه با شماره پیاپی سال 2007
  • Pages
    14
  • From page
    673
  • To page
    686
  • Abstract
    The authors describe a statistical approach based on hidden Markov models (HMMs), for generating stemmers automatically. The proposed approach requires little effort to insert new languages in the system even if minimal linguistic knowledge is available. This is a key advantage especially for digital libraries, which are often developed for a specific institution or government because the program can manage a great amount of documents written in local languages. The evaluation described in the article shows that the stemmers implemented by means of HMMs are as effective as those based on linguistic rules.
  • Journal title
    Journal of the American Society for Information Science and Technology
  • Serial Year
    2007
  • Journal title
    Journal of the American Society for Information Science and Technology
  • Record number

    993482