• Title of article

    Part of Speech Taggers for Morphologically Rich Indian Languages: A Survey

  • Author/Authors

    Dinesh Kumar، نويسنده , , Gurpreet Singh Josan، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2010
  • Pages
    9
  • From page
    1
  • To page
    9
  • Abstract
    The problem of tagging in natural language processing is to find a way to tag every word in a text as a particular part of speech, e.g., proper pronoun. POS tagging is a very important preprocessing task for language processing activities. This paper reports about the Part of Speech (POS) taggers proposed for various Indian Languages like Hindi, Punjabi, Malayalam, Bengali and Telugu. Various part of speech tagging approaches like Hidden Markov Model (HMM), Support Vector Model (SVM), Rule based approaches, Maximum Entropy (ME) and Conditional Random Field (CRF) have been used for POS tagging. Accuracy is the prime factor in evaluating any POS tagger so the accuracy of every proposed tagger is also discussed in this paper
  • Keywords
    Tagging , HMM , Finite state automata , Support vector machines , stochastic , Stemming , Maximum entropy , Corpora , tags , morphology , Suffix , Tagset , Prefix
  • Journal title
    International Journal of Computer Applications
  • Serial Year
    2010
  • Journal title
    International Journal of Computer Applications
  • Record number

    660046