• DocumentCode
    3693954
  • Title

    Suffix sequences based morphological segmentation for Afaan Oromo

  • Author

    Getachew Mamo Wegari;Massimo Melucci;Solomon Teferra

  • Author_Institution
    IT Doctoral Program, Addis Ababa University, Addis Ababa, Ethiopia
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper reports on a morphological segmentation model for Afaan Oromo based on suffix sequences approach. Understanding and identifying the suffix sequences of a language allow us to detect morpheme boundaries of many words of Afaan Oromo. Morphological segmentation models can be used in many Natural Language Processing applications such as machine translation, speech recognition, information retrieval and part-of-speech tagging. A divisive hierarchical clustering and frequency distribution were used to build a tree of candidate stems from which segmented suffix sequences can be modeled. The proposed morphological segmentation model was evaluated with test word-lists. The accuracy obtained by our morphological segmentation model is encouraging.
  • Keywords
    "Computational modeling","Training","Natural language processing","Algorithm design and analysis","Clustering algorithms","Pragmatics"
  • Publisher
    ieee
  • Conference_Titel
    AFRICON, 2015
  • Electronic_ISBN
    2153-0033
  • Type

    conf

  • DOI
    10.1109/AFRCON.2015.7331956
  • Filename
    7331956