DocumentCode :
719082
Title :
An experience in developing the Nepali sense tagged corpus
Author :
Sarkar, Sunita ; Paul, Abhijit ; Roy, Arindam ; Purkayastha, Bipul Syam
Author_Institution :
Comput. Sci. Dept., Assam Univ., Silchar, India
fYear :
2015
fDate :
15-16 May 2015
Firstpage :
279
Lastpage :
282
Abstract :
WordNet is a key resource for several NLP tasks, and it serves as the sense inventory for sense tagging a corpus. Sense tagging is the task of tagging each word in a sentence with its correct sense in the given context. Sense tagging also helps validate WordNet and improve its quality. It is one of the hardest annotation tasks. This paper discusses the sense tagging tool, the procedures involved in sense tagging the Nepali corpus, and the challenges encountered. Nepali WordNet is used as the sense inventory for sense tagging the Nepali corpus. Accurately sense tagging voluminous data requires a standard and definitive lexicon. In this work, the Nepali corpus is taken from the newspaper domain.
Keywords :
natural language processing; NLP tasks; Nepali WordNet; Nepali language; Nepali sense tagged corpus; annotation works; newspaper domain; sense tagging; Automation; Compounds; Computer science; Context; Databases; Rivers; Tagging; Nepali; Sensetagging; Synset; WordNet
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computing, Communication & Automation (ICCCA), 2015 International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-8889-1
Type :
conf
DOI :
10.1109/CCAA.2015.7148388
Filename :
7148388