Title :
An experience in developing the Nepali sense tagged corpus
Author :
Sarkar, Sunita ; Paul, Abhijit ; Roy, Arindam ; Purkayastha, Bipul Syam
Author_Institution :
Comput. Sci. Dept., Assam Univ., Silchar, India
Abstract :
A key resource that aids in several NLP tasks is WordNet. Wordnet is used as the sense inventory for sense tagging of corpus. Sense tagging is the task of tagging each word in the sentence with the correct sense of the word in the given context. Sense tagging activity helps in validation of WordNet and improvement of Wordnet quality. Sense tagging is one of the toughest annotation works and this paper discusses about the Sense Tagging tool, procedures involved in sense tagging the Nepali corpus and the challenges involved in sense tagging. Nepali WordNet is used as the sense inventory for sense tagging of Nepali corpus. For accurately sense tagging voluminous data, a standard and definitive lexicon is required. In this work the corpus in Nepali language is taken from newspaper domain.
Keywords :
natural language processing; NLP tasks; Nepali WordNet; Nepali language; Nepali sense tagged corpus; annotation works; natural language processing; newspaper domain; sense tagging; Automation; Compounds; Computer science; Context; Databases; Rivers; Tagging; Nepali; Sensetagging; Synset; WordNet;
Conference_Titel :
Computing, Communication & Automation (ICCCA), 2015 International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-8889-1
DOI :
10.1109/CCAA.2015.7148388