DocumentCode
1776387
Title
Frequency based named entity recognition system for under resource language
Author
Debbarma, Abhijit ; Bhattacharya, Pallab ; Purkayastha, B.S.
Author_Institution
Dept. of IT, Ramkrishna Mahavidyalaya Kailashahar, Unakoti, India
fYear
2014
fDate
10-11 July 2014
Firstpage
847
Lastpage
849
Abstract
This paper tries to study the issues and challenges for developing a Named Entity Recognition (NER) system for a resource scarce language of north east India. Kokborok a language spoken in the state of Tripura is taken as the target language in developing our NER system. Kokborok is an under resource language and not much digital work is available. We have used the frequency based approach to test our work which gave us a satisfactory result. As this is the first NER system being studied upon for this language we consider this to be our baseline NER system for future research in this area.
Keywords
natural language processing; Kokborok; Tripura; frequency based NER system; frequency based named entity recognition system; northeast India; resource scarce language; under resource language; Dictionaries; Educational institutions; Hidden Markov models; Instruments; Natural language processing; Support vector machines; Tagging; Kokborok; NER; NLP; Named entity recognition; Under resourse language;
fLanguage
English
Publisher
ieee
Conference_Titel
Control, Instrumentation, Communication and Computational Technologies (ICCICCT), 2014 International Conference on
Conference_Location
Kanyakumari
Print_ISBN
978-1-4799-4191-9
Type
conf
DOI
10.1109/ICCICCT.2014.6993076
Filename
6993076
Link To Document