Title of article

Part of Speech Taggers for Morphologically Rich Indian Languages: A Survey

Author/Authors

Dinesh Kumar، نويسنده , , Gurpreet Singh Josan، نويسنده ,

Issue Information

روزنامه با شماره پیاپی سال 2010

Pages

From page

To page

Abstract

The problem of tagging in natural language processing is to find a way to tag every word in a text as a particular part of speech, e.g., proper pronoun. POS tagging is a very important preprocessing task for language processing activities. This paper reports about the Part of Speech (POS) taggers proposed for various Indian Languages like Hindi, Punjabi, Malayalam, Bengali and Telugu. Various part of speech tagging approaches like Hidden Markov Model (HMM), Support Vector Model (SVM), Rule based approaches, Maximum Entropy (ME) and Conditional Random Field (CRF) have been used for POS tagging. Accuracy is the prime factor in evaluating any POS tagger so the accuracy of every proposed tagger is also discussed in this paper

Keywords

Tagging , HMM , Finite state automata , Support vector machines , stochastic , Stemming , Maximum entropy , Corpora , tags , morphology , Suffix , Tagset , Prefix

Journal title

International Journal of Computer Applications

Serial Year

2010

Journal title

International Journal of Computer Applications

Record number

660046

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=660046