DocumentCode :
3113553
Title :
Bengali parts-of-speech tagging using Global Linear Model
Author :
Mukherjee, Sayan ; Das Mandal, Shyamal Kumar
Author_Institution :
Center for Educ. Technol., Indian Inst. of Technol., Kharagpur, Kharagpur, India
fYear :
2013
fDate :
13-15 Dec. 2013
Firstpage :
1
Lastpage :
4
Abstract :
The paper describes an automatic parts-of-speech tagging for Bengali sentences using Global Linear Model (GLM) which learns to represent the whole sentence through a feature vector called Global feature. Tagger has been trained using averaged perceptron algorithm. Performance of this tagger has been compared to Conditional Random Field (CRF), Support Vector Machine (SVM), Hidden Markov Model (HMM) and Maximum Entropy (ME) based Bengali POS tagger. Experimental results show that GLM based Bengali POS tagger has the accuracy of 93.12 %.
Keywords :
natural language processing; vectors; Bengali parts-of-speech tagging; Bengali sentences; automatic parts-of-speech tagging; averaged perceptron algorithm; feature vector; global feature; global linear model; Accuracy; Hidden Markov models; Natural language processing; Support vector machines; Tagging; Training; Vectors; Bengali-POS tagger; Global Linear Model; Indian Languages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2013 Annual IEEE
Conference_Location :
Mumbai
Print_ISBN :
978-1-4799-2274-1
Type :
conf
DOI :
10.1109/INDCON.2013.6726132
Filename :
6726132
Link To Document :
بازگشت