DocumentCode :
1782488
Title :
Developing an automated Bangla parts of speech tagged dictionary
Author :
Ismail, Sabir ; Rahman, Md Saifur ; Al Mumin, Md Abdullah
fYear :
2014
fDate :
8-10 March 2014
Firstpage :
355
Lastpage :
359
Abstract :
This paper develops an algorithm for making an automated Bangla Parts Of Speech (POS) tagged dictionary. Natural Language Processing is one of the most vigorous research areas of computer science. It enables to communicate and retrieve information form computer based system more effectively and efficiently. Researches on Bangla language processing have started long back. However, this research area still suffers from resource scarcity. A POS tagged corpus is a cardinal element for language processing. POS tagging is the process of categorizing a particular word to a particular part of speech or syntactic category. In Bangla, we do not have any large POS tagged dictionary. In this paper we develop an automated way to make a POS tagged dictionary of Noun, Verb and Adjective. Initially, a suffix (or postfix) list is created manually for Bangla language. Based on this suffix list the POS tagged dictionary is developed. The proposed algorithm is evaluated using a paragraph consisting of manually tagged 10,000 words with 11 tags. We found that POS tagging is obtained more accurately for Verb than Noun and Adjective.
Keywords :
dictionaries; learning (artificial intelligence); natural language processing; Adjective; Bangla language processing; Noun; POS tagged corpus; Verb; automated Bangla POS tagged dictionary; automated Bangla parts of speech tagged dictionary; computer based system; computer science; information communication; information retrieval; natural language processing; syntactic category; Computers; Dictionaries; Hidden Markov models; Information technology; Natural language processing; Speech; Tagging; Bangla Corpus; Bangla Language Processing; Machine Learning; Parts Of Speech Tagging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology (ICCIT), 2013 16th International Conference on
Conference_Location :
Khulna
Type :
conf
DOI :
10.1109/ICCITechn.2014.6997347
Filename :
6997347
Link To Document :
بازگشت