Title of article
Part of Speech Taggers for Morphologically Rich Indian Languages: A Survey
Author/Authors
Dinesh Kumar، نويسنده , , Gurpreet Singh Josan، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2010
Pages
9
From page
1
To page
9
Abstract
The problem of tagging in natural language processing is to find a way to tag every word in a text as a particular part of speech, e.g., proper pronoun. POS tagging is a very important preprocessing task for language processing activities. This paper reports about the Part of Speech (POS) taggers proposed for various Indian Languages like Hindi, Punjabi, Malayalam, Bengali and Telugu. Various part of speech tagging approaches like Hidden Markov Model (HMM), Support Vector Model (SVM), Rule based approaches, Maximum Entropy (ME) and Conditional Random Field (CRF) have been used for POS tagging. Accuracy is the prime factor in evaluating any POS tagger so the accuracy of every proposed tagger is also discussed in this paper
Keywords
Tagging , HMM , Finite state automata , Support vector machines , stochastic , Stemming , Maximum entropy , Corpora , tags , morphology , Suffix , Tagset , Prefix
Journal title
International Journal of Computer Applications
Serial Year
2010
Journal title
International Journal of Computer Applications
Record number
660046
Link To Document