• DocumentCode
    3767548
  • Title

    Named Entity Recognizer for less resourced language Kokborok

  • Author

    Braja Gopal Patra;Nuna Debbarma; Aby Abahai T.;Dipankar Das;Sivaji Bandyopadhyay

  • Author_Institution
    Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
  • fYear
    2015
  • Firstpage
    164
  • Lastpage
    168
  • Abstract
    Named Entity Recognition refers to the process of classifying text elements into predefined categories such as person names, organizations, locations, date, quantities etc. In this paper, we described the development of a rule based and a supervised Named Entity Recognizer for the Kokborok language which is less computerized and agglutinative. We used suffix information and Named Entity dictionary for the rule based system, while features like parts-of-speech (POS), context information and suffix etc. were used to develop the supervised system. Margin Infused Relaxed Machine Learning Algorithm is used for developing the supervised system. We achieved the maximum F-score of 83.18% after inclusion of the post-processing technique.
  • Keywords
    Dictionaries
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2015 International Conference on
  • Print_ISBN
    978-1-4673-9595-3
  • Type

    conf

  • DOI
    10.1109/IALP.2015.7451557
  • Filename
    7451557